Releases: tlverse/sl3
Releases · tlverse/sl3
v1.4.4: blitzen
v1.4.4 ("Blitzen") is a major release, featuring numerous updates and bugfixes
(totaling 400+ commits spread across ~8 months), including
- Updates to
Lrnr_nnlsto support binary outcomes, including support for convexity of the resultant model fit and warnings on prediction quality. - Changes to
Lrnr_cv_selectorto support improved computation of the CV-risk, averaging the risk strictly across validation/holdout sets. - Update
Lrnr_slby adding a new private slot.cv_riskto store the risk estimates, using this to avoid unnecessary re-computation in theprintmethod (the.cv_riskslot is populated on the firstprintcall, and only ever re-printed thereafter). - Fix
Lrnr_screener_importance's pairing of (a) covariates returned by the importance function with (b) covariates as they are defined in the task. This issue only arose when discrete covariates were automatically one-hot encoded upon task initiation (i.e., whencolnames(task$X) != task$nodes$covariates). - Enhanced functionality in
sl3task'sadd_interactionsmethod to support interactions that involve factors. This method is most commonly used byLrnr_define_interactions, which is intended for use with another learner (e.g.,Lrnr_glmnetorLrnr_glm) in aPipeline. - Modified
Lrnr_gamformula (if not specified by user) to not usemgcv's defaultk=10degrees of freedom for each smoothsterm when there are less thank=10degrees of freedom. This bypasses anmgcv::gamerror, and tends to be relevant only for small n. - Incorporated
min_screenargumentLrnr_screener_coefs, which tries to ensure that at leastmin_screennumber of covariates are selected. If this argument is specified and thelearnerargument inLrnr_screener_coefsis aLrnr_glmnet, thenlambdais increased untilmin_screennumber of covariates are selected and a warning is produced. Ifmin_screenis specified and thelearnerargument inLrnr_screener_coefsis not aLrnr_glmnetthen it will error. - Added
formulaparameter andprocess_formulafunction to the base learner,Lrnr_base, whose methods carry over to all other learners. When aformulais supplied as a learner parameter, theprocess_formula function constructs a design matrix by supplying theformulatomodel.matrix. This implementation allowsformulato be supplied to all learners, even those without nativeformulasupport. Theformulashould be an object of class "formula`", or a character string that can be coerced to that class. - Added factory function for performance-based risks for binary outcomes with
ROCRperformance measurescustom_ROCR_risk. Supports cutoff-dependent and scalarROCRperformance measures. The risk is defined as 1 - performance, and is transformed back to the performance measure incv_riskandimportancefunctions. This change prompted the revision of argument nameloss_funandloss_functiontoeval_funandeval_function, respectively, since the evaluation of predictions relative to the observations can be either a risk or a loss function. This argument name change impacted the following:Lrnr_solnp,Lrnr_optim,Lrnr_cv_selector,cv_risk,importance, andCV_Lrnr_sl. - Incorporated stratified cross-validation when
foldsare not supplied to thesl3_Taskand the outcome is a discrete (i.e., binary or categorical) variable. - Added to the
importancemethod the option to evaluate importance overcovariate_groups, by removing/permuting all covariates in the same group together. - Added
Lrnr_gaas another metalearner.
See the NEWS file for complete details.
v1.4.2
v1.4.2 of sl3 is a major release, featuring numerous updates to core functionality and improvements to available learners, including
- Updates to variable importance functionality, including calculation of risk ratio and risk differences under covariate deletion or permutation.
- Addition of a
importance_plotto summarize variable importance findings. - Additions of new methods
reparameterizeandretraintoLrnr_base, which allows modification of the covariate set while training on a conserved task and prediction on a new task using previously trained learners, respectively. - Updates to variable importance functionality, including use of risk ratios.
- Change
Lrnr_hal9001andLrnr_glmnetto respect observation-level IDs. - Removal of
Remotesand deprecation ofLrnr_rfcdeandLrnr_condensier.
v1.3.7
v1.3.7 of sl3 is a major release, which features updated functionality:
- sampling methods for Monte Carlo integration and related procedures
- a metalearner for the cross-validation selector (discrete super learner)
- a learner for bounding, including support for bounded losses
- resolution of a number of older issues (#264)
- relaxation of checks inside
Stackobjects for time series learners - addition of a learner property table to
README.Rmd - maintenance and documentation updates
v1.3.5
v1.3.5 of sl3 is a minor release, featuring several important updates to the core software:
- New screening methods and convex combination in
Lrnr_nnlsby @rachaelvphillips - Overhaul of data preprocessing by @Zyx0Wu
- Bug fixes by @jeremyrcoyle, including covariate subsetting and better handling of NAs
- Package and documentation cleanup, continuous integration and testing fixes, reproducibility updates (including new versioning and DOI minting) by @nhejazi
v1.3.0
v1.3.0 of sl3 represents a set of major updates to the core software. An inexhaustive list of the included changes include
- fixing incorrect handling of missingness in the automatic imputation procedure
- addition of new standard learners, including from the
gamandcaretpackages - addition of custom learners for conditional density estimation, including semiparametric methods based on conditional mean and conditional mean/variance estimation as well as generalized functionality for density estimation via a pooled hazards approach
v1.2.0
v1.2.0 of sl3 represents a set of major updates to the core software. An inexhaustive list of the included changes include
- default metalearners based on task outcome types
- handling of imputation internally in task objects
- addition of new learners, including from the
gbm,earth,polsplinepackages - fixing errors in existing learners (e.g., subtle parallelization in
xgboostandranger) - support for multivariate outcomes and (default) revere-style cross-validation
- support for cross-validated super learner and variable importance