hanaml.PearsonrMatrix.Rd
hanaml.PearsonrMatrix is a R wrapper for SAP HANA PAL Multivariate Analysis.
hanaml.PearsonrMatrix(data, cols = NULL)
DataFrame
DataFrame containting the data.
list of characters, optional
Name of the columns to analyze.
If not given, it defaults to all columns.
Dataframe
Pearsonr.matrix: Pearson's correlation coefficient between any two data samples(columns),
structured as follows:
ID:
type NVARCHAR. The values of this column are the column names
from cols.
Covariance columns:
, type DOUBLE, named after
the columns in cols.
The covariance between variables X and Y is in column X, in the
row with ID value Y.
Computes a correlation matrix using Pearson's correlation coefficient.
Input DataFrame data:
> data$Collect()
X Y Z
1 1 2.4 5.1
2 5 3.5 6.2
3 3 8.9 8.3
4 10 -1.4 9.4
5 -4 -3.5 11.5
6 11 32.8 12.6
Call the function and obtain the result:
> hanaml.PearsonrMatrix(df)
ID X Y Z
1 X 1.0000000 0.5927077 0.2119775
2 Y 0.5927077 1.0000000 0.4618840
3 Z 0.2119775 0.4618840 1.0000000