Index
All Classes and Interfaces|All Packages|Constant Field Values
A
- API Documentation - Search tag in Overview
- Section
- apply(String, String) - Static method in class org.egothor.stemmer.PatchCommandEncoder
-
Applies a compact patch command to the supplied source word.
B
- build() - Method in class org.egothor.stemmer.FrequencyTrie.Builder
-
Builds a compiled read-only trie.
- Builder(IntFunction<V[]>, ReductionMode) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
-
Creates a new builder using default thresholds for the supplied reduction mode.
- Builder(IntFunction<V[]>, ReductionSettings) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
-
Creates a new builder with the provided settings.
C
- canonicalNodeCount() - Method in class org.egothor.stemmer.trie.ReductionContext
-
Returns the number of canonical nodes.
- childNodeIds() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns the internal child-node identifier array.
- children() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns the internal child-node array.
- children() - Method in class org.egothor.stemmer.trie.MutableNode
-
Returns the internal child-node map indexed by transition character.
- children() - Method in class org.egothor.stemmer.trie.ReducedNode
-
Returns the internal canonical child map indexed by transition character.
- Compile - Class in org.egothor.stemmer
-
Command-line compiler of stemmer dictionary files into compressed binary
FrequencyTrieartifacts. - CompiledNode<V> - Record Class in org.egothor.stemmer.trie
-
Immutable compiled trie node optimized for read access.
- CompiledNode(char[], CompiledNode<V>[], V[], int...) - Constructor for record class org.egothor.stemmer.trie.CompiledNode
-
Creates one validated compiled node.
- copyOf(FrequencyTrie<V>, IntFunction<V[]>, ReductionMode) - Static method in class org.egothor.stemmer.FrequencyTrieBuilders
-
Reconstructs a new writable builder from a compiled read-only trie using default settings for the supplied reduction mode.
- copyOf(FrequencyTrie<V>, IntFunction<V[]>, ReductionSettings) - Static method in class org.egothor.stemmer.FrequencyTrieBuilders
-
Reconstructs a new writable builder from a compiled read-only trie.
- count() - Method in record class org.egothor.stemmer.ValueCount
-
Returns the value of the
countrecord component. - create(LocalValueSummary<V>, Map<Character, ReducedNode<V>>, ReductionSettings) - Static method in class org.egothor.stemmer.trie.ReductionSignature
-
Creates a subtree signature according to the selected reduction mode.
D
- DA_DK - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Danish.
- DE_DE - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
German.
- DEFAULT_DOMINANT_WINNER_MIN_PERCENT - Static variable in record class org.egothor.stemmer.ReductionSettings
-
Default minimum dominant winner percentage.
- DEFAULT_DOMINANT_WINNER_OVER_SECOND_RATIO - Static variable in record class org.egothor.stemmer.ReductionSettings
-
Default minimum winner-over-second ratio.
- dominantWinnerMinPercent() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns the value of the
dominantWinnerMinPercentrecord component. - dominantWinnerOverSecondRatio() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns the value of the
dominantWinnerOverSecondRatiorecord component.
E
- edgeLabels() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns the internal edge-label array.
- edgeLabels() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns the internal edge-label array.
- encode(String, String) - Method in class org.egothor.stemmer.PatchCommandEncoder
-
Produces a compact patch command that transforms
sourceintotarget. - entryCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns the value of the
entryCountrecord component. - equals(Object) - Method in record class org.egothor.stemmer.ReductionSettings
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in record class org.egothor.stemmer.trie.NodeData
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in class org.egothor.stemmer.trie.ReductionSignature
- equals(Object) - Method in record class org.egothor.stemmer.ValueCount
-
Indicates whether some other object is "equal to" this one.
- ES_ES - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Spanish.
F
- findChild(char) - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Finds a child for the supplied edge character.
- FR_FR - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
French.
- FrequencyTrie<V> - Class in org.egothor.stemmer
-
Read-only trie mapping
Stringkeys to one or more values with frequency tracking. - FrequencyTrie.Builder<V> - Class in org.egothor.stemmer
-
Builder of
FrequencyTrie. - FrequencyTrie.ValueStreamCodec<V> - Interface in org.egothor.stemmer
-
Codec used to persist values stored in the trie.
- FrequencyTrieBuilders - Class in org.egothor.stemmer
-
Factory utilities related to
FrequencyTrie.Builder.
G
- get(String) - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns the most frequent value stored at the node addressed by the supplied key.
- getAll(String) - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns all values stored at the node addressed by the supplied key, ordered by descending frequency.
- getEntries(String) - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns all values stored at the node addressed by the supplied key together with their occurrence counts, ordered by the same rules as
FrequencyTrie.getAll(String).
H
- hashCode() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns a hash code value for this object.
- hashCode() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns a hash code value for this object.
- hashCode() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns a hash code value for this object.
- hashCode() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns a hash code value for this object.
- hashCode() - Method in class org.egothor.stemmer.trie.ReductionSignature
- hashCode() - Method in record class org.egothor.stemmer.ValueCount
-
Returns a hash code value for this object.
I
- ignoredLineCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns the value of the
ignoredLineCountrecord component. - IT_IT - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Italian.
L
- License - Search tag in Overview
- Section
- lineCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns the value of the
lineCountrecord component. - load(String, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path string using default settings for the supplied reduction mode.
- load(String, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path string using explicit reduction settings.
- load(Path, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path using default settings for the supplied reduction mode.
- load(Path, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path using explicit reduction settings.
- load(StemmerPatchTrieLoader.Language, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a bundled dictionary using default settings for the supplied reduction mode.
- load(StemmerPatchTrieLoader.Language, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a bundled dictionary using explicit reduction settings.
- loadBinary(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a GZip-compressed binary patch-command trie from an input stream.
- loadBinary(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a GZip-compressed binary patch-command trie from a filesystem path string.
- loadBinary(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a GZip-compressed binary patch-command trie from a filesystem path.
- localCounts() - Method in class org.egothor.stemmer.trie.ReducedNode
-
Returns the internal aggregated local value-count map.
- LocalValueSummary<V> - Class in org.egothor.stemmer.trie
-
Local terminal value summary of a node.
- LocalValueSummary(V[], int[], int, V, int, int) - Constructor for class org.egothor.stemmer.trie.LocalValueSummary
-
Creates a summary.
- lookup(ReductionSignature<V>) - Method in class org.egothor.stemmer.trie.ReductionContext
-
Looks up a canonical node.
M
- main(String[]) - Static method in class org.egothor.stemmer.Compile
-
CLI entry point.
- MERGE_SUBTREES_WITH_EQUIVALENT_DOMINANT_GET_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
-
Merges subtrees whose preferred
get()results are equivalent for every reachable key suffix, provided that the locally dominant winner satisfies the configured dominance constraints. - MERGE_SUBTREES_WITH_EQUIVALENT_RANKED_GET_ALL_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
-
Merges subtrees whose
getAll()results are equivalent for every reachable key suffix and whose local result ordering is the same. - MERGE_SUBTREES_WITH_EQUIVALENT_UNORDERED_GET_ALL_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
-
Merges subtrees whose
getAll()results are equivalent for every reachable key suffix, regardless of the local ordering of values. - mergeChildren(Map<Character, ReducedNode<V>>) - Method in class org.egothor.stemmer.trie.ReducedNode
-
Merges child references into this node.
- mergeLocalCounts(Map<V, Integer>) - Method in class org.egothor.stemmer.trie.ReducedNode
-
Merges additional local counts into this node.
- MutableNode<V> - Class in org.egothor.stemmer.trie
-
Mutable build-time node.
- MutableNode() - Constructor for class org.egothor.stemmer.trie.MutableNode
-
Creates an empty node.
N
- NL_NL - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Dutch.
- NO_NO - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Norwegian.
- NodeData<V> - Record Class in org.egothor.stemmer.trie
-
Intermediate node data used during deserialization before child references are resolved.
- NodeData(char[], int[], V[], int...) - Constructor for record class org.egothor.stemmer.trie.NodeData
-
Creates one validated node-data holder.
O
- of(Map<V, Integer>, IntFunction<V[]>) - Static method in class org.egothor.stemmer.trie.LocalValueSummary
-
Builds a summary from local counts.
- onEntry(String, String[], int) - Method in interface org.egothor.stemmer.StemmerDictionaryParser.EntryHandler
-
Accepts one parsed dictionary entry.
- orderedCounts() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns the internal ordered-counts array.
- orderedCounts() - Method in class org.egothor.stemmer.trie.LocalValueSummary
-
Returns ordered counts.
- orderedCounts() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns the internal ordered-counts array.
- orderedValues() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns the internal ordered-values array.
- orderedValues() - Method in class org.egothor.stemmer.trie.LocalValueSummary
-
Returns ordered values.
- orderedValues() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns the internal ordered-values array.
- org.egothor.stemmer - package org.egothor.stemmer
-
Provides the core Egothor-style stemming infrastructure based on compact patch-command tries.
- org.egothor.stemmer.trie - package org.egothor.stemmer.trie
-
Provides internal trie infrastructure used by
FrequencyTriecompilation, reduction, canonicalization, and binary reconstruction.
P
- Packages - Search tag in Overview
- Section
- parse(Reader, String, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
-
Parses a dictionary from a reader.
- parse(String, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
-
Parses a dictionary file from a path string.
- parse(Path, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
-
Parses a dictionary file from a filesystem path.
- ParseStatistics(String, int, int, int) - Constructor for record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Creates parsing statistics.
- PatchCommandEncoder - Class in org.egothor.stemmer
-
Encodes a compact patch command that transforms one word form into another and applies such commands back to source words.
- PatchCommandEncoder() - Constructor for class org.egothor.stemmer.PatchCommandEncoder
-
Creates an encoder with the traditional Egothor cost model: insert = 1, delete = 1, replace = 1, match = 0.
- PatchCommandEncoder(int, int, int, int) - Constructor for class org.egothor.stemmer.PatchCommandEncoder
-
Creates an encoder with explicit operation costs.
- Project Scope - Search tag in Overview
- Section
- PT_PT - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Portuguese.
- put(String, V) - Method in class org.egothor.stemmer.FrequencyTrie.Builder
-
Stores a value for the supplied key and increments its local frequency.
- put(String, V, int) - Method in class org.egothor.stemmer.FrequencyTrie.Builder
-
Stores a value for the supplied key and increments its local frequency by the specified positive count.
R
- Radixor - Search tag in Overview
- Section
- read(DataInputStream) - Method in interface org.egothor.stemmer.FrequencyTrie.ValueStreamCodec
-
Reads one value from the supplied data input.
- read(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Reads a GZip-compressed binary patch-command trie from an input stream.
- read(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Reads a GZip-compressed binary patch-command trie from a filesystem path string.
- read(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Reads a GZip-compressed binary patch-command trie from a filesystem path.
- readFrom(InputStream, IntFunction<V[]>, FrequencyTrie.ValueStreamCodec<V>) - Static method in class org.egothor.stemmer.FrequencyTrie
-
Reads a compiled trie from the supplied input stream.
- ReducedNode<V> - Class in org.egothor.stemmer.trie
-
Canonical reduced node used during subtree merging.
- ReducedNode(ReductionSignature<V>, Map<V, Integer>, Map<Character, ReducedNode<V>>) - Constructor for class org.egothor.stemmer.trie.ReducedNode
-
Creates a new reduced node.
- ReductionContext<V> - Class in org.egothor.stemmer.trie
-
Reduction context used while canonicalizing mutable nodes.
- ReductionContext(ReductionSettings) - Constructor for class org.egothor.stemmer.trie.ReductionContext
-
Creates a new context.
- reductionMode() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns the value of the
reductionModerecord component. - ReductionMode - Enum Class in org.egothor.stemmer
-
Defines the subtree reduction strategy applied during trie compilation.
- ReductionSettings - Record Class in org.egothor.stemmer
-
Immutable reduction configuration used by
FrequencyTrie.Builder. - ReductionSettings(ReductionMode, int, int) - Constructor for record class org.egothor.stemmer.ReductionSettings
-
Creates a new instance.
- ReductionSignature<V> - Class in org.egothor.stemmer.trie
-
Immutable reduction signature of a full subtree.
- register(ReductionSignature<V>, ReducedNode<V>) - Method in class org.egothor.stemmer.trie.ReductionContext
-
Registers a canonical node.
- resourceDirectory() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Returns the resource directory name.
- resourcePath() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Returns the classpath resource path of the stemmer dictionary.
- RU_RU - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Russian.
S
- saveBinary(FrequencyTrie<String>, String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Saves a compiled patch-command trie as a GZip-compressed binary file.
- saveBinary(FrequencyTrie<String>, Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Saves a compiled patch-command trie as a GZip-compressed binary file.
- settings() - Method in class org.egothor.stemmer.trie.ReductionContext
-
Returns the settings.
- signature() - Method in class org.egothor.stemmer.trie.ReducedNode
-
Returns the reduction signature of this canonical node.
- size() - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns the number of canonical compiled nodes reachable from the root.
- sourceDescription() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns the value of the
sourceDescriptionrecord component. - StemmerDictionaryParser - Class in org.egothor.stemmer
-
Parser of line-oriented stemmer dictionary files.
- StemmerDictionaryParser.EntryHandler - Interface in org.egothor.stemmer
-
Callback receiving one parsed dictionary line.
- StemmerDictionaryParser.ParseStatistics - Record Class in org.egothor.stemmer
-
Immutable parsing statistics.
- StemmerPatchTrieBinaryIO - Class in org.egothor.stemmer
-
Binary persistence helper for patch-command stemmer tries.
- StemmerPatchTrieLoader - Class in org.egothor.stemmer
-
Loader of patch-command tries from bundled stemmer dictionaries.
- StemmerPatchTrieLoader.Language - Enum Class in org.egothor.stemmer
-
Supported bundled stemmer dictionaries.
- SV_SE - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Swedish.
T
- toString() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.ValueCount
-
Returns a string representation of this record class.
U
- US_UK - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
English.
- US_UK_PROFI - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
English professional dictionary.
V
- value() - Method in record class org.egothor.stemmer.ValueCount
-
Returns the value of the
valuerecord component. - ValueCount<V> - Record Class in org.egothor.stemmer
-
Immutable value-count pair returned by read-only trie queries.
- ValueCount(V, int) - Constructor for record class org.egothor.stemmer.ValueCount
-
Creates a new value-count pair.
- valueCounts() - Method in class org.egothor.stemmer.trie.MutableNode
-
Returns the internal local terminal value-count map.
- valueOf(String) - Static method in enum class org.egothor.stemmer.ReductionMode
-
Returns the enum constant of this class with the specified name.
- valueOf(String) - Static method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Returns the enum constant of this class with the specified name.
- values() - Static method in enum class org.egothor.stemmer.ReductionMode
-
Returns an array containing the constants of this enum class, in the order they are declared.
- values() - Static method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Returns an array containing the constants of this enum class, in the order they are declared.
W
- withDefaults(ReductionMode) - Static method in record class org.egothor.stemmer.ReductionSettings
-
Creates settings with default dominance thresholds.
- write(DataOutputStream, V) - Method in interface org.egothor.stemmer.FrequencyTrie.ValueStreamCodec
-
Writes one value to the supplied data output.
- write(FrequencyTrie<String>, OutputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Writes a GZip-compressed binary patch-command trie to an output stream.
- write(FrequencyTrie<String>, String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Writes a GZip-compressed binary patch-command trie to a filesystem path string.
- write(FrequencyTrie<String>, Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Writes a GZip-compressed binary patch-command trie to a filesystem path.
- writeTo(OutputStream, FrequencyTrie.ValueStreamCodec<V>) - Method in class org.egothor.stemmer.FrequencyTrie
-
Writes this compiled trie to the supplied output stream.
All Classes and Interfaces|All Packages|Constant Field Values