Rank the results by a metric — rank_results • workflowsets

This function sorts the results by a specific performance metric.

rank_results(x, rank_metric = NULL, eval_time = NULL, select_best = FALSE)

Arguments

x: A workflow_set object that has been evaluated with workflow_map().
rank_metric: A character string for a metric.
eval_time: A single numeric time point where dynamic event time metrics should be chosen (e.g., the time-dependent ROC curve, etc). The values should be consistent with the values used to create x. The NULL default will automatically use the first evaluation time used by x.
select_best: A logical giving whether the results should only contain the numerically best submodel per workflow.

Value

A tibble with columns: wflow_id, .config, .metric, mean, std_err, n, preprocessor, model, and rank.

Details

If some models have the exact same performance, rank(value, ties.method = "random") is used (with a reproducible seed) so that all ranks are integers.

No columns are returned for the tuning parameters since they are likely to be different (or not exist) for some models. The wflow_id and .config columns can be used to determine the corresponding parameter values.

Note

The package supplies two pre-generated workflow sets, two_class_set and chi_features_set, and associated sets of model fits two_class_res and chi_features_res.

The two_class_* objects are based on a binary classification problem using the two_class_dat data from the modeldata package. The six models utilize either a bare formula or a basic recipe utilizing recipes::step_YeoJohnson() as a preprocessor, and a decision tree, logistic regression, or MARS model specification. See ?two_class_set for source code.

The chi_features_* objects are based on a regression problem using the Chicago data from the modeldata package. Each of the three models utilize a linear regression model specification, with three different recipes of varying complexity. The objects are meant to approximate the sequence of models built in Section 1.3 of Kuhn and Johnson (2019). See ?chi_features_set for source code.

Examples

chi_features_res
#> # A workflow set/tibble: 3 × 4
#>   wflow_id         info             option    result   
#>   <chr>            <list>           <list>    <list>   
#> 1 date_lm          <tibble [1 × 4]> <opts[2]> <rsmp[+]>
#> 2 plus_holidays_lm <tibble [1 × 4]> <opts[2]> <rsmp[+]>
#> 3 plus_pca_lm      <tibble [1 × 4]> <opts[3]> <tune[+]>

rank_results(chi_features_res)
#> # A tibble: 40 × 9
#>    wflow_id    .config      .metric  mean std_err     n preprocessor model  rank
#>    <chr>       <chr>        <chr>   <dbl>   <dbl> <int> <chr>        <chr> <int>
#>  1 plus_pca_lm Preprocesso… rmse    0.574      NA     1 recipe       line…     1
#>  2 plus_pca_lm Preprocesso… rsq     0.989      NA     1 recipe       line…     1
#>  3 plus_pca_lm Preprocesso… rmse    0.586      NA     1 recipe       line…     2
#>  4 plus_pca_lm Preprocesso… rsq     0.989      NA     1 recipe       line…     2
#>  5 plus_pca_lm Preprocesso… rmse    0.590      NA     1 recipe       line…     3
#>  6 plus_pca_lm Preprocesso… rsq     0.988      NA     1 recipe       line…     3
#>  7 plus_pca_lm Preprocesso… rmse    0.591      NA     1 recipe       line…     4
#>  8 plus_pca_lm Preprocesso… rsq     0.988      NA     1 recipe       line…     4
#>  9 plus_pca_lm Preprocesso… rmse    0.594      NA     1 recipe       line…     5
#> 10 plus_pca_lm Preprocesso… rsq     0.989      NA     1 recipe       line…     5
#> # ℹ 30 more rows
rank_results(chi_features_res, select_best = TRUE)
#> # A tibble: 6 × 9
#>   wflow_id         .config  .metric  mean std_err     n preprocessor model  rank
#>   <chr>            <chr>    <chr>   <dbl>   <dbl> <int> <chr>        <chr> <int>
#> 1 plus_pca_lm      Preproc… rmse    0.574      NA     1 recipe       line…     1
#> 2 plus_pca_lm      Preproc… rsq     0.989      NA     1 recipe       line…     1
#> 3 plus_holidays_lm Preproc… rmse    0.646      NA     1 recipe       line…     2
#> 4 plus_holidays_lm Preproc… rsq     0.986      NA     1 recipe       line…     2
#> 5 date_lm          Preproc… rmse    0.733      NA     1 recipe       line…     3
#> 6 date_lm          Preproc… rsq     0.982      NA     1 recipe       line…     3
rank_results(chi_features_res, rank_metric = "rsq")
#> # A tibble: 40 × 9
#>    wflow_id    .config      .metric  mean std_err     n preprocessor model  rank
#>    <chr>       <chr>        <chr>   <dbl>   <dbl> <int> <chr>        <chr> <int>
#>  1 plus_pca_lm Preprocesso… rmse    0.594      NA     1 recipe       line…     1
#>  2 plus_pca_lm Preprocesso… rsq     0.989      NA     1 recipe       line…     1
#>  3 plus_pca_lm Preprocesso… rmse    0.574      NA     1 recipe       line…     2
#>  4 plus_pca_lm Preprocesso… rsq     0.989      NA     1 recipe       line…     2
#>  5 plus_pca_lm Preprocesso… rmse    0.586      NA     1 recipe       line…     3
#>  6 plus_pca_lm Preprocesso… rsq     0.989      NA     1 recipe       line…     3
#>  7 plus_pca_lm Preprocesso… rmse    0.591      NA     1 recipe       line…     4
#>  8 plus_pca_lm Preprocesso… rsq     0.988      NA     1 recipe       line…     4
#>  9 plus_pca_lm Preprocesso… rmse    0.590      NA     1 recipe       line…     5
#> 10 plus_pca_lm Preprocesso… rsq     0.988      NA     1 recipe       line…     5
#> # ℹ 30 more rows