Summarize mark results.
# S3 method for class 'bench_mark'
summary(object, filter_gc = TRUE, relative = FALSE, time_unit = NULL, ...)bench_mark object to summarize.
If TRUE remove iterations that contained at least one
garbage collection before summarizing. If TRUE but an expression had
a garbage collection in every iteration, filtering is disabled, with a warning.
If TRUE all summaries are computed relative to the minimum
execution time rather than absolute time.
If NULL the times are reported in a human readable
fashion depending on each value. If one of 'ns', 'us', 'ms', 's', 'm', 'h',
'd', 'w' the time units are instead expressed as nanoseconds, microseconds,
milliseconds, seconds, hours, minutes, days or weeks respectively.
Additional arguments ignored.
A tibble with the additional summary columns. The following summary columns are computed
expression - bench_expr The deparsed expression that was evaluated
(or its name if one was provided).
min - bench_time The minimum execution time.
median - bench_time The sample median of execution time.
itr/sec - double The estimated number of executions performed per
second.
mem_alloc - bench_bytes Total amount of memory allocated by R while
running the expression. Memory allocated outside the R heap, e.g. by
malloc() or new directly is not tracked, take care to avoid
misinterpreting the results if running code that may do this.
gc/sec - double The number of garbage collections per second.
n_itr - integer Total number of iterations after filtering
garbage collections (if filter_gc == TRUE).
n_gc - double Total number of garbage collections performed over all
iterations. This is a psudo-measure of the pressure on the garbage collector, if
it varies greatly between to alternatives generally the one with fewer
collections will cause fewer allocation in real usage.
total_time - bench_time The total time to perform the benchmarks.
result - list A list column of the object(s) returned by the
evaluated expression(s).
memory - list A list column with results from Rprofmem().
time - list A list column of bench_time vectors for each evaluated
expression.
gc - list A list column with tibbles containing the level of
garbage collection (0-2, columns) for each iteration (rows).
If filter_gc == TRUE (the default) runs that contain a garbage
collection will be removed before summarizing. This is most useful for fast
expressions when the majority of runs do not contain a gc. Call
summary(filter_gc = FALSE) if you would like to compute summaries with
these times, such as expressions with lots of allocations when all or most
runs contain a gc.
dat <- data.frame(x = runif(10000, 1, 1000), y=runif(10000, 1, 1000))
# `bench::mark()` implicitly calls summary() automatically
results <- bench::mark(
dat[dat$x > 500, ],
dat[which(dat$x > 500), ],
subset(dat, x > 500))
# However you can also do so explicitly to filter gc differently.
summary(results, filter_gc = FALSE)
#> # A tibble: 3 × 13
#> expression min median `itr/sec` mem_alloc `gc/sec` n_itr n_gc total_time
#> <bch:expr> <bch> <bch:> <dbl> <bch:byt> <dbl> <int> <dbl> <bch:tm>
#> 1 dat[dat$x > … 193µs 310µs 2096. 375KB 18.0 1048 9 500ms
#> 2 dat[which(da… 149µs 190µs 3240. 258KB 18.0 1620 9 500ms
#> 3 subset(dat, … 273µs 438µs 1692. 493KB 17.6 867 9 513ms
#> # ℹ 4 more variables: result <list>, memory <list>, time <list>, gc <list>
# Or output relative times
summary(results, relative = TRUE)
#> # A tibble: 3 × 13
#> expression min median `itr/sec` mem_alloc `gc/sec` n_itr n_gc total_time
#> <bch:expr> <dbl> <dbl> <dbl> <dbl> <dbl> <int> <dbl> <bch:tm>
#> 1 dat[dat$x > … 1.30 1.63 1.30 1.45 1.07 1039 9 328ms
#> 2 dat[which(da… 1 1 1.96 1 1.04 1611 9 338ms
#> 3 subset(dat, … 1.84 2.31 1 1.91 1 858 9 352ms
#> # ℹ 4 more variables: result <list>, memory <list>, time <list>, gc <list>