|
11 | 11 | #' randomly sampled at each split when creating the tree models. |
12 | 12 | #' \item \code{trees}: The number of trees contained in the ensemble. |
13 | 13 | #' \item \code{min_n}: The minimum number of data points in a node |
14 | | -#' that are required for the node to be split further. |
| 14 | +#' that is required for the node to be split further. |
15 | 15 | #' \item \code{tree_depth}: The maximum depth of the tree (i.e. number of |
16 | 16 | #' splits). |
17 | 17 | #' \item \code{learn_rate}: The rate at which the boosting algorithm adapts |
|
23 | 23 | #' stopping. |
24 | 24 | #' } |
25 | 25 | #' These arguments are converted to their specific names at the |
26 | | -#' time that the model is fit. Other options and argument can be |
27 | | -#' set using the `set_engine()` function. If left to their defaults |
| 26 | +#' time that the model is fit. Other options and arguments can be |
| 27 | +#' set using the `set_engine()` function. If left to their defaults |
28 | 28 | #' here (`NULL`), the values are taken from the underlying model |
29 | | -#' functions. If parameters need to be modified, `update()` can be used |
| 29 | +#' functions. If parameters need to be modified, `update()` can be used |
30 | 30 | #' in lieu of recreating the object from scratch. |
31 | 31 | #' |
32 | 32 | #' @param mode A single character string for the type of model. |
|
38 | 38 | #' @param trees An integer for the number of trees contained in |
39 | 39 | #' the ensemble. |
40 | 40 | #' @param min_n An integer for the minimum number of data points |
41 | | -#' in a node that are required for the node to be split further. |
| 41 | +#' in a node that is required for the node to be split further. |
42 | 42 | #' @param tree_depth An integer for the maximum depth of the tree (i.e. number |
43 | 43 | #' of splits) (`xgboost` only). |
44 | 44 | #' @param learn_rate A number for the rate at which the boosting algorithm adapts |
45 | 45 | #' from iteration-to-iteration (`xgboost` only). |
46 | 46 | #' @param loss_reduction A number for the reduction in the loss function required |
47 | | -#' to split further (`xgboost` only). |
| 47 | +#' to split further (`xgboost` only). |
48 | 48 | #' @param sample_size A number for the number (or proportion) of data that is |
49 | | -#' exposed to the fitting routine. For `xgboost`, the sampling is done at at |
| 49 | +#' exposed to the fitting routine. For `xgboost`, the sampling is done at |
50 | 50 | #' each iteration while `C5.0` samples once during training. |
51 | 51 | #' @param stop_iter The number of iterations without improvement before |
52 | | -#' stopping (`xgboost` only). |
| 52 | +#' stopping (`xgboost` only). |
53 | 53 | #' @details |
54 | 54 | #' The data given to the function are not saved and are only used |
55 | 55 | #' to determine the _mode_ of the model. For `boost_tree()`, the |
|
58 | 58 | #' The model can be created using the `fit()` function using the |
59 | 59 | #' following _engines_: |
60 | 60 | #' \itemize{ |
61 | | -#' \item \pkg{R}: `"xgboost"` (the default), `"C5.0"` |
| 61 | +#' \item \pkg{R}: `"xgboost"` (the default), `"C5.0"` |
62 | 62 | #' \item \pkg{Spark}: `"spark"` |
63 | 63 | #' } |
64 | 64 | #' |
@@ -278,7 +278,7 @@ check_args.boost_tree <- function(object) { |
278 | 278 | #' of training set samples use for these purposes. |
279 | 279 | #' @param early_stop An integer or `NULL`. If not `NULL`, it is the number of |
280 | 280 | #' training iterations without improvement before stopping. If `validation` is |
281 | | -#' used, performance is base on the validation set; otherwise the training set |
| 281 | +#' used, performance is base on the validation set; otherwise, the training set |
282 | 282 | #' is used. |
283 | 283 | #' @param ... Other options to pass to `xgb.train`. |
284 | 284 | #' @return A fitted `xgboost` object. |
|
0 commit comments