Nearest Neighbor Outlier

Properties that can be configured for the Nearest Neighbor Outlier algorithm.

Syntax
Use this algorithm to find outlying values based on the number of neighbors (N) and the average distance from each value to its N nearest neighbors.
Note: Creating models using the Nearest Neighbor Outlier algorithm is not supported.
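The scoring idea described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the product's implementation: the function names `knn_outlier_scores` and `top_outliers` are hypothetical, the distance on a single numeric column is assumed to be the absolute difference, and the values with the largest average distance are assumed to be the ones reported as outliers.

```python
def knn_outlier_scores(values, neighborhood_count=5):
    """Average absolute distance from each value to its N nearest other values.

    Assumption: absolute difference is used as the distance on one numeric column.
    """
    if neighborhood_count < 1 or neighborhood_count >= len(values):
        raise ValueError("neighborhood_count must be between 1 and len(values) - 1")
    scores = []
    for i, v in enumerate(values):
        # Distances to every other value; the value itself is excluded by index.
        distances = sorted(abs(v - w) for j, w in enumerate(values) if j != i)
        nearest = distances[:neighborhood_count]
        scores.append(sum(nearest) / neighborhood_count)
    return scores


def top_outliers(values, neighborhood_count=5, number_of_outliers=1):
    """Indices of the values with the largest average neighbor distance."""
    scores = knn_outlier_scores(values, neighborhood_count)
    ranked = sorted(range(len(values)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:number_of_outliers])


data = [10.0, 11.0, 9.5, 10.5, 50.0, 10.2, 9.8]
print(top_outliers(data, neighborhood_count=3, number_of_outliers=1))  # prints [4]
```

In this sample, 50.0 lies about 39 units from its three nearest neighbors, while every other value is within about 1 unit of its neighbors, so index 4 is the only value ranked as an outlier.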
Nearest Neighbor Outlier Properties
Table 1: Algorithm Properties
Property: Description
Output Mode: Select the mode in which you want to use the output of this algorithm.
Possible values:
  • Show Outliers: Adds a Boolean column to the input data specifying whether the corresponding value is an outlier.
  • Remove Outliers: Removes outlying values from the input data.
Feature: Select the input column on which you want to perform the analysis.
Missing Values: Select the method for handling missing values.
Possible methods:
  • Ignore: The algorithm skips the records containing missing values in the independent or dependent columns.
  • Stop: The algorithm stops the execution if a value is missing in the independent column or the dependent column.
Neighborhood Count: Enter the number of neighbors (N) for finding distances. The default value is 5.
Number of Outliers: Enter the number of outliers that you want to remove.
Predicted Column Name: Enter a name for the new column that contains the predicted values.
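To show how the properties in Table 1 might interact, here is a minimal Python sketch. It is not the product's API: the function `apply_nearest_neighbor_outlier`, its parameter names, and the default column name `IsOutlier` are hypothetical stand-ins for the properties above. It also assumes that Ignore drops records with a missing feature value from the result, that the distance is the absolute difference, and that the values with the largest average neighbor distance are treated as outliers.

```python
import math


def apply_nearest_neighbor_outlier(rows, feature, output_mode="Show Outliers",
                                   missing_values="Ignore", neighborhood_count=5,
                                   number_of_outliers=1,
                                   predicted_column_name="IsOutlier"):
    """Hypothetical stand-in mapping the Table 1 properties to parameters."""

    def is_missing(x):
        return x is None or (isinstance(x, float) and math.isnan(x))

    # Missing Values: "Stop" aborts, "Ignore" skips the affected records (assumed).
    if any(is_missing(r.get(feature)) for r in rows):
        if missing_values == "Stop":
            raise ValueError(f"missing value in column {feature!r}")
        rows = [r for r in rows if not is_missing(r.get(feature))]

    values = [r[feature] for r in rows]
    if not 1 <= neighborhood_count < len(values):
        raise ValueError("neighborhood_count must be between 1 and len(values) - 1")

    # Score each value by its average distance to the N nearest other values.
    scores = []
    for i, v in enumerate(values):
        distances = sorted(abs(v - w) for j, w in enumerate(values) if j != i)
        scores.append(sum(distances[:neighborhood_count]) / neighborhood_count)

    # Number of Outliers: keep the highest-scoring records as outliers.
    ranked = sorted(range(len(values)), key=lambda i: scores[i], reverse=True)
    outliers = set(ranked[:number_of_outliers])

    if output_mode == "Show Outliers":
        # Add a Boolean column instead of changing the set of records.
        return [{**r, predicted_column_name: i in outliers} for i, r in enumerate(rows)]
    if output_mode == "Remove Outliers":
        return [dict(r) for i, r in enumerate(rows) if i not in outliers]
    raise ValueError(f"unknown output mode: {output_mode!r}")


rows = [
    {"id": 1, "amount": 10.0}, {"id": 2, "amount": 11.0}, {"id": 3, "amount": 9.5},
    {"id": 4, "amount": None}, {"id": 5, "amount": 50.0}, {"id": 6, "amount": 10.2},
    {"id": 7, "amount": 9.8},
]
shown = apply_nearest_neighbor_outlier(rows, "amount", neighborhood_count=3)
removed = apply_nearest_neighbor_outlier(rows, "amount", neighborhood_count=3,
                                         output_mode="Remove Outliers")
print([r["id"] for r in shown if r["IsOutlier"]])  # prints [5]
print([r["id"] for r in removed])                  # prints [1, 2, 3, 6, 7]
```

With these sample rows, the record with the missing amount (id 4) is skipped under Ignore, and the record with amount 50.0 (id 5) is either flagged or removed depending on the output mode. Passing `missing_values="Stop"` with the same rows raises an error instead.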