HANA CHAID

Properties that can be configured for the HANA CHAID algorithm.

CHAID stands for CHi-squared Automatic Interaction Detection. It is a classification method that builds decision trees by using chi-square statistics to identify optimal splits.
Note: The data type of the columns used during model scoring should be the same as the data type of the columns used while building the model.
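To make the idea of a chi-square based split concrete, the following minimal Python sketch computes the Pearson chi-square statistic between each candidate input column and an integer target, and picks the column with the largest statistic. It is an illustration only, not the HANA implementation: the function names (chi_square, contingency, best_feature) and the sample data are hypothetical. A complete CHAID procedure also merges similar categories and compares adjusted p-values rather than raw statistics; comparing raw statistics is only meaningful here because both sample columns have the same number of categories.

```python
from collections import Counter


def chi_square(rows):
    """Pearson chi-square statistic for a contingency table given as rows of counts."""
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(rows):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            if expected > 0:  # skip empty rows or columns
                stat += (observed - expected) ** 2 / expected
    return stat


def contingency(feature_values, target_values):
    """Cross-tabulate one categorical column against the target column."""
    counts = Counter(zip(feature_values, target_values))
    categories = sorted(set(feature_values))
    classes = sorted(set(target_values))
    return [[counts[(c, k)] for k in classes] for c in categories]


def best_feature(features, target):
    """Return the column with the largest chi-square statistic, plus all scores."""
    scores = {
        name: chi_square(contingency(values, target))
        for name, values in features.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical sample data: an integer target and two categorical input columns.
target = [1, 1, 1, 1, 0, 0, 0, 0]
features = {
    "owns_car": ["y", "y", "y", "y", "n", "n", "n", "n"],
    "region": ["a", "b", "a", "b", "a", "b", "a", "b"],
}

name, scores = best_feature(features, target)
print(name, scores)
```

In this sample, owns_car separates the two target classes perfectly while region is independent of the target, so owns_car is the column chosen for the split.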
HANA CHAID Properties
Table 1: Algorithm Properties
Property Description
Output Mode Select the mode in which you want to use the output of this algorithm.
Possible values:
  • Trend: Predicts the values for the dependent column and adds an extra column in the output containing the predicted values.
  • Fill: Fills missing values in the target column.
Features Select the input columns with which you want to perform the analysis.
Target Variable Select the target column for which you want to perform the analysis.
Note: Only columns with an integer data type are accepted.
Missing Values Select the method for handling missing values.
Possible values:
  • Ignore: The algorithm skips the records containing missing values in the independent or dependent columns.
  • Keep: The algorithm retains the records containing missing values during calculation.
Percentage of Input Data Enter the percentage of data to be considered for analysis.
Minimum split Enter the minimum number of records that a node must contain in order to be split. A node with fewer records is not split further. The default value is 0.
Maximum Depth Enter the maximum depth of the tree.
Column Name Select the name of the independent column containing numerical values.
Enter Bin Ranges Enter the bin ranges for the selected numerical column.
Predicted Column name Enter a name for the new column that contains the predicted values.
Number of Threads Enter the number of threads that the algorithm should use during execution.
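The sketch below illustrates, in plain Python, how two groups of properties in Table 1 could be interpreted: bin ranges that turn a numerical column into categories, and the Minimum split and Maximum Depth limits that stop a node from being split. It is a sketch under stated assumptions, not the product's behavior: the function names (assign_bin, can_split), the representation of bin ranges as a sorted list of cut points, and the exact comparison rules are hypothetical.

```python
from bisect import bisect_right


def assign_bin(value, cut_points):
    """Return the 0-based bin index for value.

    cut_points is a sorted list of boundaries (an assumed format). A value
    equal to a cut point is placed in the higher bin.
    """
    return bisect_right(cut_points, value)


def can_split(n_records, depth, min_split, max_depth):
    """Assumed stopping rule: a node may be split only if it holds at least
    min_split records and has not yet reached max_depth."""
    return n_records >= min_split and depth < max_depth


# Hypothetical numerical column binned as: below 30, 30 to below 50, 50 and above.
ages = [18, 30, 49, 50, 72]
bins = [assign_bin(a, [30, 50]) for a in ages]
print(bins)

# A node with 40 records is not split when the minimum split is 50.
print(can_split(40, 2, min_split=50, max_depth=5))
```

With a minimum split of 0, the record-count condition in this sketch is always satisfied, so only the depth limit would stop further splitting.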