You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add argument for one hot encoding to parsnip (#332)
* Add one hot option to encoding options
* one_hot = FALSE for almost all models, one_hot = TRUE for glmnet models
* changed one_hot to logical; less confusing
* revert glmnet encodings to one_hot
* Switch from logical to none/traditional/one_hot
* Update predictor_indicators in model infrastructure
* change objective function name for xgboost regression
* more encoding updates related to intercepts
* set defaults for parsnip objects with no encoding information
* "one-hot" not "one_hot"
* apply encoding changes to form_xy and xy_form paths
* fully export contrast function
* "one_hot" not "one-hot"
* fixed a few bugs
* revert xgboost change (in another PR)
* updated news
* two more global variable false positives
* updates for how many engines handle dummy variables (if at all)
* details on encoding options
* one_hot documentation
* Update R/aaa_models.R
Co-authored-by: Julia Silge <julia.silge@gmail.com>
* Update R/aaa_models.R
Co-authored-by: Julia Silge <julia.silge@gmail.com>
* Update R/aaa_models.R
Co-authored-by: Julia Silge <julia.silge@gmail.com>
* Update R/aaa_models.R
Co-authored-by: Julia Silge <julia.silge@gmail.com>
* Update R/aaa_models.R
Co-authored-by: Julia Silge <julia.silge@gmail.com>
* Update R/contr_one_hot.R
Co-authored-by: Julia Silge <julia.silge@gmail.com>
* Update man/rmd/one-hot.Rmd
Co-authored-by: Julia Silge <julia.silge@gmail.com>
* Update man/rmd/one-hot.Rmd
Co-authored-by: Julia Silge <julia.silge@gmail.com>
* documentation updates for one-hot
* Update man/rmd/one-hot.Rmd
Co-authored-by: Julia Silge <julia.silge@gmail.com>
* Update man/rmd/one-hot.Rmd
Co-authored-by: Julia Silge <julia.silge@gmail.com>
Co-authored-by: Max Kuhn <mxkuhn@gmail.com>
Copy file name to clipboardExpand all lines: NEWS.md
+8-2Lines changed: 8 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,14 +1,20 @@
1
1
# parsnip (development version)
2
2
3
+
## Breaking Changes
4
+
5
+
*`parsnip` now has options to set specific types of predictor encodings for different models. For example, `ranger` models run using `parsnip` and `workflows` do the same thing by _not_ creating indicator variables. These encodings can be overridden using the `blueprint` options in `workflows`. As a consequence, it is possible to get a different model fit that previous versions of `parsnip`. More details about specific encoding changes are below. (#326)
6
+
3
7
## Other Changes
4
8
5
9
*`tidyr` >= 1.0.0 is now required.
6
10
7
-
* SVM models produced by `kernlab` now use the formula method. This change was due to how `ksvm()` made indicator variables for factor predictors (with one-hot encodings). Since the ordinary formula method did not do this, the data are passed as-is to `ksvm()` so that the results are closer to what one would get if `ksmv()` were called directly.
11
+
* SVM models produced by `kernlab` now use the formula method (see breaking change notice above). This change was due to how `ksvm()` made indicator variables for factor predictors (with one-hot encodings). Since the ordinary formula method did not do this, the data are passed as-is to `ksvm()` so that the results are closer to what one would get if `ksmv()` were called directly.
8
12
9
13
* MARS models produced by `earth` now use the formula method.
10
14
11
-
* Under-the-hood changes were made so that non-standard data arguments in the modeling packages can be accomodated. (#315)
15
+
* For `xgboost`, a one-hot encoding is used when indicator variables are created.
16
+
17
+
* Under-the-hood changes were made so that non-standard data arguments in the modeling packages can be accommodated. (#315)
0 commit comments