What’s New in hana.ml.r v2.17.240111
Bug Fixed:
- Resolved the issue preventing the use of a generic function
‘CallPalAutoWithConnection’ to call native PAL SQL Procedure.
- Rectified the ‘evaluation.metric’ check failure that occurred when
‘family’ was set to ordinal in ‘hanaml.GLM’.
- Corrected the precompute parameters check in massive mode for
‘hanaml.UnifiedClustering’.
What’s New in hana.ml.r v2.17.231018
Enhancement:
- Enhanced the support of data type of DECIMAL.
Bug Fixed:
- Fixed the type check issue of pmml.export in
hanaml.LogisticRegression.
What’s New in hana.ml.r v2.17.230615
New Functions:
- Added a new function hanaml.ROCKET.
- Added a new function hanaml.ClassificationTS for time series
Classification.
Enhancement:
- Enhanced precalculated/precomputed distances Matrix input for
KMedoid in hanaml.UnifiedClustering.
- Enabled interval prediction for Linear Regression, GLM and Random
Decision Trees in UnifiedRegression.
- Added SPEC as a new accuracy measure for time-series
forecasting.
- Enhanced auto-ml with reason code and successive halving.
What’s New in hana.ml.r v2.16.230316
New Functions:
- Added a new function hanaml.Periodogram.
- Added a Long-Term Series Forecasting method hanaml.LTSF.
- Added a new function hanaml.AutomaticTimeSeries.
Enhancement:
- Enhanced hanaml.OutlierDetectionTS with new outlier.method
“isolationforest”, “dbscan”.
- Enhanced hanaml.UnifiedClustering with Massive mode.
- Enhanced SVM and MLP in Unified Regression with sha/hyperband
params.
- Enhanced automl with new parameter model.table.name to define the
table name
- Enabled sha/hyperband param select for NB, MLP, FRM and ALS.
- Enhanced HGBT in Unified Classification with early stop
parameter.
Bug Fixed:
- Correction for warn suppression in DataFrame Collect.
What’s New in hana.ml.r v2.15.221216
New Functions:
- Added time series outlier detection called
hanaml.OutlierDetectionTS.
- Added Kolmogorov-Smirnov Test called hanaml.KsTest.
- Added time series imputation called hanaml.ImputeTS.
Enhancement:
- Enhanced text mining with French and Russian support.
- Enhanced JSON model format support in hanaml.LogisticRegression and
hanaml.LinearRegression.
- Ehhanced hanaml.Correlation with confidence interval
calculation.
- Enhanced the support of pre-defined period setting in
hanaml.SeasonalDecompose with a new parameter ‘periods’.
- Enhanced HGBT, KNN, SVM and MLR with Successive halving.
- Enhanced auto-ml with multiple config_dict templates.
- Enhanced multi-class LogisticRegression with AFL state.
API Changes:
- Added a parameter “json.export” in hanaml.LogisticRegression and
hanaml.LinearRegression for exporting model in JSON format.
- Added new parameters “calculate.confint”, “alpha”, “bartlett” in
hanaml.Correlation for confidence interval calculation.
- Added a parameter ‘periods’ in hanaml.SeasonalDecompose.
What’s New in hana.ml.r v2.14.220918
New Functions:
- Added Stationarity Test called hanaml.StationarityTest.
- Added AutoML called hanaml.AutomaticClassification and
hanaml.AutomaticRegression.
- Added Croston TSB method.
Enhancement:
- Enhanced missing value handling in hanaml.UnifiedClassification and
hanaml.UnifiedRegression.
- Enhanced hanaml.UnifiedClassification and hanaml.UnifiedRegression
with Massive mode.
- Enhanced additive model forecst, ARIMA, AutoARIMA with massive
mode.
- Enhanced garch to support more model types.
- Enhanced text mining with Spanish language.
Bug Fixed:
- Fixed null data check in isolation forest.
- Fixed parameter mapping error in UnifiedClassification.
- Fixed the bug for parsing ‘c’ param in svm for unified
interfaces.
- Fixed the incompatibleness between afl state enabling and
ModelStorage.
What’s New in hana.ml.r v2.13.220511
New Functions:
- Added Isolation Forest called hanaml.IsolationForest.
- Added AFL state in unified API.
Enhancement:
- Enhanced hanaml.WhiteNoiseTest with a new parameter “model.df” to
specify the degrees of freedom occuppied by a model.
- Enhanced HGBT with more loss functions.
- Enhanced discrete wavelet transform with threshold method and
compression.
- Enhanced text mining with German support.
- Enhanced expalinability in mechanism LSTM/ATTENTION.
API Changes:
- Added a new parameter “model.df” in hanaml.WhiteNoiseTest for
selecting the degree of freedom.
What’s New in hana.ml.r v2.12.220325
New Functions:
- Added PCA categorical support.
- Added Bayesian Structured Time Series(BSTS) forecast.
- Added Feature Selection.
- Added Continous HybridGradientBoostingTrees.
- Added pivot in dataframe.
Enhancement:
- Supports Explainer in hanaml.ARIMA and hanaml.AutoARIMA.
- Supports Explainer in hanaml.AdditiveModelForecast.
- Supports warm start mode in hanaml.HGBTClassifier and
hanaml.HGBTRegressor.
- Supports data compression for discrete wavelet transform and wavelet
packet transform.
- Supports feature-wise explain mode in hanaml.GRUAttention.
- Supports more types of objective functions in hanaml.HGBT.
- Supports reference column and ordering as inputs when adding an ID
column for DataFrame.
- Supports precomputed affinity in
hanaml.AgglomerateHierarchical.
- Supports window in hanaml.FFT.
API Changes:
- Added a new parameter “background.size” in hanaml.ARIMA and
AutoARIMA for generating explainer in predict.ARIMA and
predict.AutoARIMA.
- Added new parameters “show.explainer”, “thread.ratio”,
“top.k.attributions”, “trend.mod”, “trend.width”, “seasonal.width” in
predict.ARIMA and predict.AutoARIMA.
- Added new parameters “show.explainer”, “decompose.seasonality”,
“decompose.holiday” in predict.AdditiveModelForecast.
- Added new parameters “model” and “warm.start” in
hanaml.HGBTClassifier and hanaml.HGBTRegressor to support warm start
mode.
- Added new parameters “compression”, “method”, “threshold” and
“level.thresholds”(only valid for hanaml.DWT) in hanaml.DWT and
hanaml.WPT.
- Added new parameter “explain.mode” in hanaml.GRUAttention.
- Added new parameters “ref.col” and “order” for the AddID() function
in DataFrame.
- Added new parameters “window”, “window.start”, “window.length”,
“alpha”, “beta”, “attenuation”, “flattop.mode”, “flattop.precision” and
“tukey.r” in hanaml.FFT.
Bug Fixed:
- Fixed the bug of inconsistent mappings of solvers between logistic
regression and multi-class logistic regression.
What’s New in hana.ml.r v2.11.211211
New Functions:
- Added unifed exponential smoothing.
- Added GARCH.
- Added spectral clustering.
- Added LSTM and attention.
- Added generic DTW.
- Added wavelet transforms.
Enhancement:
- Supports more distributions in MCMC.
API Changes:
- Added categorical support in SMOTE/SMOTETomek.
What’s New in hana.ml.r v2.10.210918
New Functions:
- TextMining : TF.Analysis, Text.Classification, Get.Related.Doc,
Get.Relevant.Doc, Get.Related.Term, Get.Relevant.Term,
Get.Suggested.Term, Text.Collector, Text.TFIDF.
- New function hanaml.OnlineLinearRegression(), an online version of
linear regression.
- New function hanaml.OnlineMultiLogisticRegression(), an online
version of multi logistic regression.
- New Bayesian Change-Point Detection function hanaml.BCPD().
Enhancement:
- Categorical feature support in AdditiveModelForecast().
- Supports SAP HANA Data Lake in ModelStorage.
API Changes:
- Added a new parameter
categorical.variable
in
hanaml.AdditiveModelForecast().
What’s New in hana.ml.r v2.9.210619
Bug Fixed:
- Fixed missing default value of parameter ‘cols’ in
hanaml.CovarianceMatrix and hanaml.PearsonrMatrix.
- Fixed missing schema setting issue in hanaml.ConvertToHANADataFrame
using JDBC methods to write the HANA table.
- Fixed lowercase error of ACCURACY_MEASURE spelling.
- Fixed the corrected the value of param intercept in linear
regression.
What’s New in hana.ml.r v2.8.210321
Version 2.8.210321 supports SAP HANA SPS05 and
SAP HANA Cloud
New Functions:
- Added tbl() to support dbplyr
- Added hana2dbplyr() and dbplyr2hana() to subpport dbplyr
- New Unified functions: hanaml.UnifiedRegression,
hanaml.UnifiedClustering.
- New Markov Chain Monte Carlo function: hanaml.mcmc.
- New ‘hana.version’ function is provided by
hanaml.ConnectionContext.
- New function hanaml.OnlineARIMA.
- New function hanaml.VectorARIMA.
- New function hanaml.sqltrace.
API Changes:
- hanaml.KMeans with two added parameters ‘use.fast.library’ and
‘use.float’.
- Added a parameter ‘distance.level’ in hanaml.UnifiedClustering when
‘func’ is AgglomerateHierarchicalClustering and DBSCAN. Please refer to
documentation for details.
- Added a parameter ‘range.penalty’ in hanaml.CPD.
- Provides more methods of ‘decomposition’ in various regression
functions, please refer to the documentation of the specific
algorithm.
- Added a parameter ‘key’ in prediction() function of hanaml.ARIMA and
hanaml.AutoARIMA.
- Added 2 parameters ‘min.measure’, ‘max.consequent’ in
hanaml.KORD.
- Added a parameter ‘output.threshold’ in hanaml.AUC to enable the
output of threshold values in roc table.
- Removed RODBC dependency.
Enhancement:
- Improved the data upload in ConvertToHANADataFrame() for odbc and
jdbc connection.
- Enhanced hanaml.UnifiedClustering to support ‘distance.level’ in
AgglomerateHierarchicalClustering and DBSCAN functions. Please refer to
the documentation for details.
- Enhanced the speed of data import functionality when odbc is
applied.
- Enhanced the validation of types of parameters.
- Enhanced the parameter check of season.start and allow.linear in
hanaml.AutoARIMA.
- Enhanced the model storage for hanaml.ARIMA, hanaml.AutoARIMA,
hanaml.VectorARIMA and hanaml.OnlineARIMA.
Bug Fixed:
- Fixed the displacement of parameter ‘dispersion’ in hanaml.CPD.
- Fixed the displacement of parameter ‘category.weight’ in
hanaml.GaussianMixture.
What’s New in hana.ml.r v2.6.201016
API Changes:
- HybridGradientBoostingClassifier, HybridGradientBoostingRegressor:
added a parameter ‘adopt_prior’ to indicate whether to adopt the prior
distribution as the initial point.
- LinearRegression: added a parameter ‘features.must.select’ to
specifies the column name that needs to be included in the final
training model when executing the variable selection.
What’s New in hana.ml.r v2.6.200928
API Changes:
- Added encrypt, validateCertificate and autocommit options to JDBC
ConnectionContext.
- SVC, SVR, OneClassSVM, SVRanking: added parameters ‘compression’,
‘max.bits’, ‘max.quantization.iter’ for model compression.
- RandomForestClassifier: added parameters ‘compression’, ‘max.bits’,
‘quantize.rate’ for model compression.
- RandomForestRegressor: added parameters ‘compression’, ‘max.bits’,
‘quantize.rate’, ‘fittings.quantization’ for model compression.
- In prediction function ARIMA and AutoARIMA, new value
‘truncation.algorithm’ of forecast_method is introduced to improve the
prediction performance.
- New parameters ‘string.variable’, ‘variable.weight’ added in
KNNClassifier, KNNRegressor and DBSCAN to enable distance calculation
based on String distance.
- New parameters ‘extrapolation’, ‘smooth.width’,
‘auxiliary.normalitytest’ are added in SeasonalDecompose.
New Functions:
- Clustering: SlightSilhouette.
- Preprocessing : SMOTETomek, TomekLinks.
Bug Fixed:
- Fixed version check to support HANA cloud.
- Fixed the error in random distribution sampling when full
distribution parameter set is specified.
- Fixed HAS_ID error in TomekLinks.
- Fixed the Collect() method that returns ‘No Data’ when contraining a
column of type ST_GEOMETRY(in particular, for GeoDBSCAN).
What’s New in hana.ml.r v2.5.200626
API Changes:
- Removed parameter ConnectionContext in functions.
- Added a new parameter ‘decay’ to replace ‘learning.rate’ as the
later one is misleading in SOM().
- Added a new parameter ‘stratified.columns’ in hanaml.Sampling() to
replace ‘features’ whose name is misleading.
- Added a new parameter ‘col.types’ in ConvertToHANADataFrame() to
replace ‘clob.columns’ for enhancement.
- Changed parameter ‘seed’ to ‘random.state’ for parameter name
consistency in MLPClassifier() and MLPRegressor().
- Update function names for consistency: Arima -> ARIMA, AutoArima
-> AutoARIMA, Auc -> AUC, Kmeans -> KMeans, Kmedian ->
KMedian, Kmedoid -> KMedoid, Kord -> KORD.
New Functions:
- Recommender System Algorithms : Alternating Least Square, Factorized
Polynomial Regression Models, Field-Aware Factorization Machine.
- Regression : Cox Proportional Hazard Model.
- Statistics Functions : Cumulative Distribution Function (CDF),
Distribution Fitting, Distribution Quantile, Entropy, Equal Variance
Test, Factor Analysis, Wilcoxon Signed Rank Test, Grubbs’ Test,
Kaplan-Meier Survival Analysis, Kernel Density Estimation, One-Sample
Median Test.
- Time Series : Fast DTW
- Preprocessing : SMOTE
- Miscellaneous : ABC Analysis, T-Distributed Stochastic Neighbour
Embedding(TSNE), Weighted Score Table.
- Unified classification
- Model Selection
Bug Fixed:
- Fixed parameter ‘formula’ parsing issue when a single feature is
entered.
- Fixed falsely recognizing the type of query statement with JDBC in
sqlQueryMix().
- Fixed phrasing error of parameter ‘timeout’ in Decision Tree, Linear
Regression, Logistic Regression, Naive Bayes and SVM functions.
Enhancement:
- Added cross-validation options to some functions (Decision Tree
Classifier/Regressor, Gradient Boosting Classifier/Regressor, Hybrid
Gradient Boosting Classifier/Regressor, Generalised Linear Models(GLM),
Naive Bayes, Linear Regression, Logistic Regression Multi-Layer
Perceptron Classifier/Regressor, Support Vector Machines functions,
K-Nearest Neighbors Classifier/Regressor, Alternating Least Square(ALS),
Factorized Polynomial Regression Models(FRM), Polynomial
Regression).
- Improved the robustness of ConvertToHANADataFrame().
- Enhancement of ConvertToHANADataFrame() function: support data.frame
input with missing (NA) values.
What’s New in hana.ml.r v1.0.8
Bug Fixed:
- Fixed wrong error message with RJDBC connection. Add two additional
error message for missing RJDBC connection and wrong RJDBC
connection.context type.
- Fix label type error in RandomForest. Add process to check
continuous label type for regression and categorical label type for
classification.
- Fixed wrong API for DecisionTreeRegressor. Remove type conversion
for result.
- Fixed wrong cast error in Neighbors. Use NVARCHAR instead of DOUBLE
for result.
- Fixed wrong cast error in HGBT. Use NVARCHAR instead of DOUBLE for
result.
What’s New in hana.ml.r v1.0.7
New Algorithms:
- Association : Apriori, Apriorilite, FP-Growth, KORD, Sequential
Pattern Mining (SPM)
- Clustering : Affinity Propagation, Agglomerate Hierarchical
Clustering, DBSCAN, Geometry DBSCAN, Latent Dirichlet Allocation,
Self-Organizing Maps (SOM)
- Classification : Conditional Random Field (CRF), Confusion Matrix,
Hybrid Gradient Boosting Tree (HGBT), Logistic Regression, Multilayer
Perceptron (MLP).
- Regression : Bi-Variate Geometric Regression, Bi-Variate Natural
Logarithmic Regression, Exponential Regression.
- Time Series : ARIMA, Auto ARIMA, FFT, Seasonal Decompose, Trend
Test, White Noise Test Single/Double/Triple/Auto/Brown Exponential
Smoothing, Change-Point Detection, Croston’s Method, Linear Regression
With Damped Trend And Seasonal Adjust, Additive Model Forecast,
Hierarchical Forecast, Correlation Function.
- Preprocessing : Discretize, Inter-Quartile Range (IQR), Missing
Value Handing, Multidimensional Scaling (MDS), Partition, Random
Distribution Sampling, Variance Test function.
- Statistics Functions : Chi-Squared Test Functions, T-Test
Functions, Analysis Of Variance Functions (ANOVA),
Univariate/Multivariate Analysis Functions.
- Random Distribution Sampling Functions : Bernoulli, Beta, Binomial,
Cauchy, Chi_Squared, Exponential, Extreme_Value, F, Gamma, Geometric,
Gumbel, Lognormal, Negative_Binomial, Normal, Pert, Poisson, Student_T,
Uniform, Weibull, Multinomial).
- Social Networks : Link Prediction, Pagerank.
- Connection Context : JDBC Option.
- Model Storage Services.
- Dataframe Functions : ConvertToHANADataFrame.
Quality Improvements:
- Use Anonymous Block.
- Remove Unnecessary Temporary Table.
- Error Message Enhancement.
- Fix Documentation.
- Code Quality Improvement.