hanaml.Text.Collector is a R wrapper for SAP Text Mining text collector algorithm.

hanaml.Text.Collector(data)

Arguments

data

DataFrame
Data to be analysis.

Value

DataFrame

  • DataFrame 1: Inverse document frequency of documents.

  • DataFrame 2: Extended table.

Examples

Input DataFrame data:


> data$collect()
       ID                                CONTENT
1    doc1    term1 term2 term2 term3 term3 term3
2    doc2    term2 term3 term3 term4 term4 term4
3    doc3    term3 term4 term4 term5 term5 term5
4    doc5    term3 term4 term4 term5 term5 term5 term5 term5 term5
5    doc4    term4 term6
6    doc6    term4 term6 term6 term6

Call the function:


> result <- hanaml.Text.Collector(data)

Output:


> result[[1]]$Collect()
     TM_TERMS    TM_TERM_IDF_VALUE
1       term1             1.791759
2       term2             1.098612
3       term3             0.405465
4       term4             0.182322
5       term5             1.098612
6       term6             1.098612