Release v2.0.0 · jvalegre/robert

Adaptation of the code to avoid overfitting and to use with low-data problems

Fixed a bug in one-hot encoding in the one-hot test
Adding the possibility to disable the automatic standarization of descriptors (--std False)
Changing CV_test (now it standardizes the full database with sklearn functions)
Fixing a bug with the sklearn-intelex accelerator
Fixing a threading bug with matplotlib in SHAP
train:validation split was replaced by a repeated k-fold CV
The program always holds out a test set
The average results of the repeated k-fold CV are used to measure predictive ability and to predict new results
The BayesianOptimization() is used to find the bets model, using a combined metric that depends on interpolation and extrapolation of diferent types of CVs
This version does not work with classification problems and the AQME and EVALUATE modules were disabled until v2.0.1.
Updated ROBERT score, which is more robust towards small data problems

Provide feedback