hanaml.Kmeans {hana.ml.r} | R Documentation |
hanaml.Kmeans is a R wrapper for PAL K-means and accelerated K-Means algorithm.
hanaml.Kmeans(conn.context, data = NULL, key = NULL, features = NULL, n.clusters = NULL, n.clusters.min = NULL, n.clusters.max = NULL, init = NULL, max.iter = NULL, tol = NULL, thread.ratio = NULL, distance.level = NULL, minkowski.power = NULL, category.weights = NULL, normalization = NULL, categorical.variable = NULL, memory.mode = NULL, accelerated = FALSE)
conn.context |
|
data |
|
key |
|
features |
|
n.clusters |
|
n.clusters.min |
|
n.clusters.max |
|
init |
|
max.iter |
|
thread.ratio |
|
distance.level |
|
minkowski.power |
|
category.weights |
|
normalization |
Defaults to 'no'. |
categorical.variable |
|
tol |
|
memory.mode |
Only valid when accelerated is TRUE. Defaults to 'auto'. |
accelerated |
|
R6Class
object.
labels : DataFrame
Label assigned to each sample.
cluster.centers : DataFrame
Coordinates of cluster centers.
model : DataFrame
Model content.
statistics : DataFrame
Statistic value.
## Not run: Input DataFrame data for training: ID V000 V001 V002 1 0 0.5 A 0.5 2 1 1.5 A 0.5 3 2 1.5 A 1.5 4 3 0.5 A 1.5 5 4 1.1 B 1.2 6 5 0.5 B 15.5 7 6 1.5 B 15.5 8 7 1.5 B 16.5 9 8 0.5 B 16.5 10 9 1.2 C 16.1 11 10 15.5 C 15.5 12 11 16.5 C 15.5 13 12 16.5 C 16.5 14 13 15.5 C 16.5 15 14 15.6 D 16.2 16 15 15.5 D 0.5 17 16 16.5 D 0.5 18 17 16.5 D 1.5 19 18 15.5 D 1.5 20 19 15.7 A 1.6 Model traning and a "Kmeans" object km is returned: > km <- hanaml.Kmeans(conn.context = conn, data = data, features = NULL, n.clusters = 4, init = "first_k", max.iter = 100, tol = 1.0E-6, thread.ratio = 0.2, distance.level = "euclidean", category.weights = 0.5) Expected output: > km$labels$Collect() ID CLUSTER_ID DISTANCE SLIGHT_SILHOUETTE 1 0 0 0.891088 0.944370 2 1 0 0.863917 0.942478 3 2 0 0.806252 0.946288 4 3 0 0.835684 0.944942 5 4 0 0.744571 0.950234 6 5 3 0.891088 0.940733 ## End(Not run)