unique returns a vector like x but with duplicate elements/rows removed.
a vector or a data frame or an array or NULL.
ignored
The order in which unique values will be returned, see details
NULL or the number of unique values (including NA). Providing
nunique can speed-up matching when x has no cache. Note that a wrong
`nunique“ can cause undefined behaviour up to a crash.
NULL for automatic method selection or a suitable low-level method, see details
ignored
For a vector, an object of the same type of x, but with only
one copy of each duplicated element. No attributes are copied (so
the result has no names).
This function automatically chooses from several low-level functions
considering the size of x and the availability of a cache.
Suitable methods are
hashmapuni (simultaneously creating and using a hashmap)
hashuni (first creating a hashmap then using it)
sortuni (fast sorting for sorted order only)
sortorderuni (fast ordering for original order only)
orderuni (memory saving ordering).
The default order="original" returns unique values in the order of the
first appearance in x like in unique(), this costs extra processing.
order="values" returns unique values in sorted order like in table(),
this costs extra processing with the hash methods but comes for free.
order="any" returns unique values in undefined order, possibly faster.
For hash methods this will be a quasi random order, for sort methods this
will be sorted order.
unique() for the generic, unipos() which gives the indices
of the unique elements and table.integer64() which gives frequencies
of the unique elements.
x <- as.integer64(sample(c(rep(NA, 9), 1:9), 32, TRUE))
unique(x)
#> integer64
#> [1] <NA> 8 7 9 2 3 1 4 6
unique(x, order="values")
#> integer64
#> [1] <NA> 1 2 3 4 6 7 8 9
stopifnot(identical(unique(x), x[!duplicated(x)]))
stopifnot(identical(unique(x), as.integer64(unique(as.integer(x)))))
stopifnot(identical(unique(x, order="values")
, as.integer64(sort(unique(as.integer(x)), na.last=FALSE))))