hanaml.Get.Related.Doc is an R wrapper for the SAP HANA PAL get related doc algorithm.
hanaml.Get.Related.Doc(pred.data, ref.data = NULL, top = NULL, threshold = NULL)
| pred.data | |
|---|---|
| ref.data | |
| top | |
| threshold | |
DataFrame
DataFrame containing the get related doc result.
Input DataFrame data:
> ref_df$Collect()
ID CONTENT CATEGORY
0 doc1 term1 term2 term2 term3 term3 term3 CATEGORY_1
1 doc2 term2 term3 term3 term4 term4 term4 CATEGORY_1
2 doc3 term3 term4 term4 term5 term5 term5 CATEGORY_2
3 doc4 term3 term4 term4 term5 term5 term5 term5 term5 term5 CATEGORY_2
4 doc5 term4 term6 CATEGORY_3
5 doc6 term4 term6 term6 term6 CATEGORY_3
> pred_df$Collect()
CONTENT
0 term2 term2 term3 term3
Call the function:
> result <- hanaml.Get.Related.Doc(pred_df, ref_df)
Output:
> result$Collect()
ID SCORE
0 doc2 0.891550
1 doc1 0.804670
2 doc3 0.042024
3 doc4 0.021225
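The result is ordered by SCORE, and the low-scoring rows (doc3, doc4) are only weakly related to the query. The following is a minimal base R sketch of trimming a collected result locally. It works on a plain data.frame rebuilt from the values printed in the output, so it needs no HANA connection. The helper `filter_related` and its parameters `min.score` and `n` are hypothetical names introduced for illustration and are not part of the package. The `top` and `threshold` arguments of hanaml.Get.Related.Doc may offer comparable filtering on the server side, but their exact semantics are not described on this page, so treat that as an assumption.

```r
# Local copy of the example result (values copied from the output shown above).
result_local <- data.frame(
  ID = c("doc2", "doc1", "doc3", "doc4"),
  SCORE = c(0.891550, 0.804670, 0.042024, 0.021225),
  stringsAsFactors = FALSE
)

# Hypothetical helper (not part of the package): keep rows whose score is
# strictly greater than min.score, sort by descending score, and optionally
# keep only the first n rows.
filter_related <- function(df, min.score = 0, n = NULL) {
  kept <- df[df$SCORE > min.score, , drop = FALSE]
  kept <- kept[order(-kept$SCORE), , drop = FALSE]
  if (!is.null(n)) {
    kept <- head(kept, n)
  }
  kept
}

# Keep only documents scoring above 0.5.
strong <- filter_related(result_local, min.score = 0.5)

# Keep only the single best match.
best <- filter_related(result_local, n = 1)

print(strong)
print(best)
```

Filtering locally like this is convenient for small results. For large reference sets it is usually preferable to limit the rows before collecting them.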