- Categories:
Aggregate functions (General), Window functions (General)
CORR
Returns the correlation coefficient for the non-null pairs in a group. Pairs in which either value is NULL are ignored, and the result is computed from the remaining pairs using the following formula:
COVAR_POP(y, x) / (STDDEV_POP(x) * STDDEV_POP(y))
Where x is the independent variable and y is the dependent variable.
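As a rough illustration of the formula, the following sketch compares CORR with the expression written out by hand. The inline table alias t, its column names y and x, and the three sample pairs are assumptions made for this example only. The pairs contain no NULL values, so both columns should agree, at approximately 0.9988.

```sql
-- Hypothetical inline data: three (y, x) pairs with no NULL values.
SELECT CORR(y, x) AS corr_builtin,
       -- The same quantity written out from the formula above.
       COVAR_POP(y, x) / (STDDEV_POP(x) * STDDEV_POP(y)) AS corr_by_formula
FROM (VALUES (11, 10), (22, 20), (35, 30)) AS t (y, x);
```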
Syntax
Syntax when used as an aggregate function:
CORR( y , x )
Syntax when used as a window function:
CORR( y , x ) OVER ( [ PARTITION BY <expr3> ] )
Usage notes
DISTINCT is not supported for this function.
When this function is called as a window function, it does not support:
- An ORDER BY clause within the OVER clause.
- Explicit window frames.
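A minimal sketch of the window form, consistent with the notes above: the OVER clause contains only PARTITION BY, with no ORDER BY and no window frame. The alias t, the columns grp, y, and x, and the sample rows are hypothetical. Every row in a partition should receive the same coefficient. The pairs in group 'b' lie on a straight line with negative slope, so the value there should be at or very close to -1.

```sql
-- Hypothetical inline data: two groups of three (y, x) pairs each.
SELECT grp, y, x,
       -- One coefficient per partition, repeated on every row of that partition.
       CORR(y, x) OVER (PARTITION BY grp) AS corr_in_group
FROM (VALUES ('a', 11, 10), ('a', 22, 20), ('a', 35, 30),
             ('b', 5, 1), ('b', 3, 2), ('b', 1, 3)) AS t (grp, y, x);
```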
Examples
CREATE OR REPLACE TABLE aggr(k int, v decimal(10,2), v2 decimal(10, 2));
INSERT INTO aggr VALUES(1, 10, NULL);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, NULL), (2, 30, 35);
SELECT * FROM aggr;
+---+-------+-------+
| K |     V |    V2 |
|---+-------+-------|
| 1 | 10.00 |  NULL |
| 2 | 10.00 | 11.00 |
| 2 | 20.00 | 22.00 |
| 2 | 25.00 |  NULL |
| 2 | 30.00 | 35.00 |
+---+-------+-------+
SELECT k, CORR(v, v2) FROM aggr GROUP BY k;
+---+--------------+
| K |  CORR(V, V2) |
|---+--------------|
| 1 |         NULL |
| 2 | 0.9988445981 |
+---+--------------+
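The effect of NULL handling can be checked against the same aggr table. The sketch below is an illustration rather than part of the original example set, and the column aliases corr_builtin and corr_by_formula are assumptions. It filters to rows where both values are present before applying the formula by hand. Without the WHERE clause, STDDEV_POP(v) would also include the row with v = 25.00, whose v2 is NULL, so the hand-written expression would no longer match CORR.

```sql
-- Keep only complete pairs so each aggregate sees the same rows that CORR uses.
SELECT k,
       CORR(v, v2) AS corr_builtin,
       COVAR_POP(v, v2) / (STDDEV_POP(v2) * STDDEV_POP(v)) AS corr_by_formula
FROM aggr
WHERE v IS NOT NULL
  AND v2 IS NOT NULL
GROUP BY k;
```

Only k = 2 should appear in the result, because the single row for k = 1 has a NULL v2 and is filtered out. Both columns should be approximately 0.9988.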