Package org.egothor.stemmer
Class Compile
java.lang.Object
org.egothor.stemmer.Compile
Command-line compiler of stemmer dictionary files into compressed binary
FrequencyTrie artifacts.
The CLI reads an input file in the same syntax as the project's stemmer
resource files, compiles it into a read-only FrequencyTrie of patch
commands, applies the selected subtree reduction strategy, and writes the
resulting trie in the project binary format under GZip compression.
Remarks introduced by # or // are supported through
StemmerDictionaryParser.
Supported arguments:
--input <file> --output <file> --reduction-mode <mode> [--store-original] [--dominant-winner-min-percent <1..100>] [--dominant-winner-over-second-ratio <1..n>] [--overwrite] [--help]
-
Method Summary
-
Method Details
-
main
CLI entry point.- Parameters:
arguments- command-line arguments
-