Properties that can be configured for the HANA R-Apriori algorithm.
Use this algorithm to find frequent itemset patterns in large transactional datasets and to generate association rules using the "arules" R package. It is typically used to understand which products and services customers tend to purchase at the same time. Analyzing customers' purchasing trends with association analysis lets you predict their future behavior.
For example, the information that a customer who buys shoes is more likely to buy socks at the same time can be represented as an association rule (with a given minimum support and minimum confidence): Shoes => Socks [support = 0.1, confidence = 0.5]
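
The measures attached to such a rule can be illustrated with a minimal sketch in plain Python. This is not the "arules" implementation; the transactions and the helper names (`support`, `confidence`, `lift`) are hypothetical and exist only to show how the three values are defined.

```python
# Toy transactions (hypothetical data, for illustration only).
transactions = [
    {"Shoes", "Socks"},
    {"Shoes", "Socks", "Belt"},
    {"Shoes"},
    {"Socks"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(lhs, rhs, transactions):
    """Support of lhs and rhs together, divided by the support of lhs."""
    return support(set(lhs) | set(rhs), transactions) / support(lhs, transactions)


def lift(lhs, rhs, transactions):
    """Confidence of the rule divided by the support of rhs."""
    return confidence(lhs, rhs, transactions) / support(rhs, transactions)


# Measures for the rule Shoes => Socks on the toy data.
rule_support = support({"Shoes", "Socks"}, transactions)
rule_confidence = confidence({"Shoes"}, {"Socks"}, transactions)
rule_lift = lift({"Shoes"}, {"Socks"}, transactions)

print(f"Shoes => Socks [support = {rule_support:.2f}, "
      f"confidence = {rule_confidence:.2f}, lift = {rule_lift:.2f}]")
```

On this toy data, two of the four transactions contain both items and three contain shoes, so the rule has a support of 2/4 and a confidence of 2/3. A lift below 1 means the two items occur together less often than would be expected if they were independent.
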
| Property | Description |
|---|---|
| Output Mode | Select the mode in which you want to use the output of this algorithm. |
| Input Format | Select the format of the input data. |
| Item Column(s) | Select the columns containing the items to which you want to apply the algorithm. |
| TransactionID Column | Select the column containing the transaction IDs to which you want to apply the algorithm. |
| Support | Enter a value for the minimum support of an itemset. |
| Confidence | Enter a value for the minimum confidence of the association rules. |
| Rules | Enter a name for the new column that contains the apriori rules for the given dataset. |
| Support Values | Enter a name for the new column that contains the support for the corresponding rules. |
| Confidence Values | Enter a name for the new column that contains the confidence values for the corresponding rules. |
| Lift Values | Enter a name for the new column that contains the lift values for the corresponding rules. |
| Transaction ID | Enter a name for the new column that contains the transaction ID. |
| Items | Enter a name for the new column that contains the names of the items. |
| Matching Rules | Enter a name for the new column that contains the matching rules. |
| Lhs Item(s) | Enter comma-separated labels for the items that should appear on the left-hand side of rules or itemsets. |
| Rhs Item(s) | Enter comma-separated labels for the items that should appear on the right-hand side of rules or itemsets. |
| Both Item(s) | Enter comma-separated labels for the items which should appear on both sides of rules or itemsets. |
| None Item(s) | Enter comma-separated labels for the items that must not appear in the rules or itemsets. |
| Default Appearance | Enter the default appearance for items that are not explicitly mentioned. |
| Sort Type | Select the sort option to sort items with respect to their frequency. |
| Filter Criteria | Enter a numerical value that indicates how to filter unused items from transactions. The default value is 0.1. |
| Use Tree Structure | To organize transactions as a prefix tree, select True. |
| Use HeapSort | To use heapsort instead of quicksort for sorting transactions, select True. |
| Optimize Memory | To minimize memory usage instead of maximizing speed, select True. |
| Load Transactions into Memory | To load transactions into memory, select True. |
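
To make the interplay of the main properties more concrete, the following sketch shows, in plain Python, how a transaction ID column and an item column can be grouped into transactions, and how minimum support, minimum confidence, and appearance constraints narrow down the resulting rules. It is a simplified level-wise Apriori search written for illustration, not the "arules" implementation. The function names and the `rhs_items` and `none_items` parameters are hypothetical stand-ins for the Support, Confidence, Rhs Item(s), and None Item(s) properties.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical input: one row per (transaction ID, item) pair.
rows = [(1, "Shoes"), (1, "Socks"), (2, "Shoes"), (2, "Socks"), (2, "Belt"),
        (3, "Shoes"), (3, "Belt"), (4, "Socks")]

# Group the item column by the transaction ID column.
by_id = defaultdict(set)
for tid, item in rows:
    by_id[tid].add(item)
transactions = list(by_id.values())


def frequent_itemsets(transactions, min_support):
    """Level-wise search: grow itemsets one item at a time and keep
    only those whose support reaches min_support."""
    n = len(transactions)

    def supp(itemset):
        return sum(itemset <= t for t in transactions) / n

    singles = {frozenset([i]) for t in transactions for i in t}
    level = {s: supp(s) for s in singles if supp(s) >= min_support}
    result = {}
    while level:
        result.update(level)
        # Join step: combine frequent k-itemsets into (k+1)-item candidates.
        candidates = {a | b for a in level for b in level
                      if len(a | b) == len(a) + 1}
        next_level = {}
        for c in candidates:
            # Prune step: every k-item subset must itself be frequent.
            if all(frozenset(s) in level for s in combinations(c, len(c) - 1)):
                value = supp(c)
                if value >= min_support:
                    next_level[c] = value
        level = next_level
    return result


def mine_rules(transactions, min_support, min_confidence,
               rhs_items=None, none_items=()):
    """Derive single-consequent rules from the frequent itemsets."""
    freq = frequent_itemsets(transactions, min_support)
    found = []
    for itemset, value in freq.items():
        # Skip single items and itemsets containing an excluded item.
        if len(itemset) < 2 or itemset & set(none_items):
            continue
        for rhs in itemset:
            if rhs_items is not None and rhs not in rhs_items:
                continue
            lhs = itemset - {rhs}
            conf = value / freq[lhs]  # lhs is frequent because itemset is
            if conf >= min_confidence:
                rule_lift = conf / freq[frozenset([rhs])]
                found.append((sorted(lhs), rhs, value, conf, rule_lift))
    return found


# Only rules with Socks on the right-hand side are kept.
found = mine_rules(transactions, min_support=0.5, min_confidence=0.6,
                   rhs_items={"Socks"})
for lhs, rhs, value, conf, rule_lift in found:
    print(f"{lhs} => {rhs} [support = {value:.2f}, confidence = {conf:.2f}, "
          f"lift = {rule_lift:.2f}]")
```

Raising the minimum support shrinks the set of frequent itemsets before any rule is formed, while the minimum confidence and the appearance constraints only filter the rules derived from those itemsets. In this sketch, passing `none_items={"Belt"}` instead drops every rule that involves a belt.
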