Skip to contents

This dynamically determines the names of the languages for which stemming is currently supported by this package.

Usage

getStemLanguages()

Details

The language names in lower case are returned, though please note that two- and three- letter ISO-639 codes are also accepted by wordStem (see references for the list of codes).

This queries the C code for the list of languages that were compiled when the package was installed which in turn is determined by the code that was included in the distributed package itself.

Value

A character vector giving the names of the languages.

Author

Milan Bouchet-Valat

See also

Examples

  getStemLanguages()
#>  [1] "arabic"     "basque"     "catalan"    "danish"     "dutch"     
#>  [6] "english"    "finnish"    "french"     "german"     "greek"     
#> [11] "hindi"      "hungarian"  "indonesian" "irish"      "italian"   
#> [16] "lithuanian" "nepali"     "norwegian"  "porter"     "portuguese"
#> [21] "romanian"   "russian"    "spanish"    "swedish"    "tamil"     
#> [26] "turkish"