Index
A
- API Documentation - Search tag in Overview
- Section
- apply(String, String) - Static method in class org.egothor.stemmer.PatchCommandEncoder
-
Applies a compact patch command to the supplied source word using the historical backward traversal direction.
- apply(String, String, WordTraversalDirection) - Static method in class org.egothor.stemmer.PatchCommandEncoder
-
Applies a compact patch command to the supplied source word using the specified traversal direction.
- applyWithConfiguredDirection(String, String) - Method in class org.egothor.stemmer.PatchCommandEncoder
-
Applies a compact patch command using this encoder instance's traversal direction.
- AS_IS - Enum constant in enum class org.egothor.stemmer.CaseProcessingMode
-
Preserves input character casing exactly as provided by the dictionary source.
- AS_IS - Enum constant in enum class org.egothor.stemmer.DiacriticProcessingMode
-
Preserves dictionary entries and lookup keys exactly as provided.
- AS_IS_AND_STRIPPED_FALLBACK - Enum constant in enum class org.egothor.stemmer.DiacriticProcessingMode
-
Planned dual-path mode where lookup may continue along both the original diacritic edge and a normalized non-diacritic alternative.
- averageUniqueCandidateCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the averageUniqueCandidateCount record component.
B
- BACKWARD - Enum constant in enum class org.egothor.stemmer.WordTraversalDirection
-
Traverses a word from its logical end toward its logical beginning.
- build() - Method in class org.egothor.stemmer.FrequencyTrie.Builder
-
Builds a compiled read-only trie.
- build() - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
-
Builds a direction-specialized encoder instance.
- builder() - Static method in class org.egothor.stemmer.PatchCommandEncoder
-
Creates a fluent builder for constructing a direction-specialized encoder.
- Builder() - Constructor for class org.egothor.stemmer.PatchCommandEncoder.Builder
-
Creates a builder initialized with the default Egothor-compatible cost model and backward traversal.
- Builder(IntFunction<V[]>, ReductionMode) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
-
Creates a new builder using default thresholds for the supplied reduction mode.
- Builder(IntFunction<V[]>, ReductionMode, WordTraversalDirection) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
-
Creates a new builder using default thresholds for the supplied reduction mode and explicit traversal direction.
- Builder(IntFunction<V[]>, ReductionSettings) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
-
Creates a new builder with the provided settings.
- Builder(IntFunction<V[]>, ReductionSettings, WordTraversalDirection) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
-
Creates a new builder with the provided settings and explicit traversal direction.
- Builder(IntFunction<V[]>, ReductionSettings, WordTraversalDirection, CaseProcessingMode) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
-
Creates a new builder with the provided settings, explicit traversal direction, and explicit case processing mode.
- Builder(IntFunction<V[]>, ReductionSettings, WordTraversalDirection, CaseProcessingMode, DiacriticProcessingMode) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
-
Creates a new builder with the provided settings, explicit traversal direction, explicit case processing mode, and explicit diacritic processing mode.
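The BACKWARD entry above describes traversal from a word's logical end toward its beginning, and several builder constructors accept an explicit traversal direction. The following is a minimal, self-contained sketch of that idea only. The class `TraversalSketch`, its nested `Direction` enum, and both helper methods are hypothetical stand-ins written for illustration; they are not the library's `WordTraversalDirection`, and the parameter order is an assumption.

```java
/** Hypothetical illustration; not the library's WordTraversalDirection. */
final class TraversalSketch {

    /** Assumed stand-in for the two documented directions. */
    enum Direction { FORWARD, BACKWARD }

    /** Maps a traversal offset (0 = first character visited) to an index in the word. */
    static int logicalIndex(Direction direction, int length, int offset) {
        return direction == Direction.FORWARD ? offset : length - 1 - offset;
    }

    /** Returns the characters of the word in the order the direction visits them. */
    static String visitOrder(Direction direction, String word) {
        StringBuilder visited = new StringBuilder(word.length());
        for (int offset = 0; offset < word.length(); offset++) {
            visited.append(word.charAt(logicalIndex(direction, word.length(), offset)));
        }
        return visited.toString();
    }

    public static void main(String[] args) {
        System.out.println(visitOrder(Direction.FORWARD, "running"));  // running
        System.out.println(visitOrder(Direction.BACKWARD, "running")); // gninnur
    }
}
```

Backward traversal visits suffix characters first, so words that share an ending share the first edges of a trie keyed in that order.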
C
- canonicalNodeCount() - Method in class org.egothor.stemmer.trie.ReductionContext
-
Returns the number of canonical nodes.
- caseProcessingMode() - Method in record class org.egothor.stemmer.TrieMetadata
-
Returns the value of the caseProcessingMode record component.
- CaseProcessingMode - Enum Class in org.egothor.stemmer
-
Defines how dictionary items are normalized with respect to letter casing.
- childNodeIds() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns the internal child-node identifier array.
- children() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns the internal child-node array.
- children() - Method in class org.egothor.stemmer.trie.MutableNode
-
Returns the internal child-node map indexed by transition character.
- children() - Method in class org.egothor.stemmer.trie.ReducedNode
-
Returns the internal canonical child map indexed by transition character.
- Compile - Class in org.egothor.stemmer
-
Command-line compiler of stemmer dictionary files into compressed binary FrequencyTrie artifacts.
- CompiledNode<V> - Record Class in org.egothor.stemmer.trie
-
Immutable compiled trie node optimized for read access.
- CompiledNode(char[], CompiledNode<V>[], V[], int...) - Constructor for record class org.egothor.stemmer.trie.CompiledNode
-
Creates one validated compiled node.
- copyOf(FrequencyTrie<V>, IntFunction<V[]>, ReductionMode) - Static method in class org.egothor.stemmer.FrequencyTrieBuilders
-
Reconstructs a new writable builder from a compiled read-only trie using default settings for the supplied reduction mode.
- copyOf(FrequencyTrie<V>, IntFunction<V[]>, ReductionSettings) - Static method in class org.egothor.stemmer.FrequencyTrieBuilders
-
Reconstructs a new writable builder from a compiled read-only trie.
- count() - Method in record class org.egothor.stemmer.ValueCount
-
Returns the value of the count record component.
- create(LocalValueSummary<V>, Map<Character, ReducedNode<V>>, ReductionSettings) - Static method in class org.egothor.stemmer.trie.ReductionSignature
-
Creates a subtree signature according to the selected reduction mode.
- CS_CZ - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Czech.
- csvHeader() - Static method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the stable CSV header of this result format.
- current(int, WordTraversalDirection, ReductionSettings) - Static method in record class org.egothor.stemmer.TrieMetadata
-
Creates metadata populated with current-format defaults for freshly compiled tries.
- currentFormatVersion() - Static method in class org.egothor.stemmer.FrequencyTrie
-
Returns the current persisted binary stream format version.
D
- DA_DK - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Danish.
- DE_DE - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
German.
- DEFAULT_DOMINANT_WINNER_MIN_PERCENT - Static variable in record class org.egothor.stemmer.ReductionSettings
-
Default minimum dominant winner percentage.
- DEFAULT_DOMINANT_WINNER_OVER_SECOND_RATIO - Static variable in record class org.egothor.stemmer.ReductionSettings
-
Default minimum winner-over-second ratio.
- deleteCost(int) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
-
Sets cost of a delete operation.
- diacriticProcessingMode() - Method in record class org.egothor.stemmer.TrieMetadata
-
Returns the value of the diacriticProcessingMode record component.
- DiacriticProcessingMode - Enum Class in org.egothor.stemmer
-
Defines how dictionary loading and trie traversal should treat diacritics.
- dictionaryEntryCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the dictionaryEntryCount record component.
- dominantWinnerMinPercent() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns the value of the dominantWinnerMinPercent record component.
- dominantWinnerOverSecondRatio() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns the value of the dominantWinnerOverSecondRatio record component.
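The DiacriticProcessingMode entry above covers how dictionary loading and lookup treat diacritics. This index does not say how the library strips them, so the sketch below shows one common JDK-only technique as an assumption: canonical decomposition (NFD) followed by removal of combining marks. The class `DiacriticSketch` and its method are hypothetical names used only for illustration.

```java
import java.text.Normalizer;

/** Hypothetical illustration of diacritic removal; not the library's implementation. */
final class DiacriticSketch {

    /** Decomposes characters (NFD) and drops the resulting combining marks. */
    static String stripDiacritics(String text) {
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // "prilis" with caron and acute marks, written with Unicode escapes.
        System.out.println(stripDiacritics("p\u0159\u00edli\u0161")); // prilis
        // "cafe" with an acute accent on the final letter.
        System.out.println(stripDiacritics("caf\u00e9"));             // cafe
    }
}
```

This approach only affects characters that have a canonical decomposition. Letters such as U+0142 (Polish l with stroke) do not decompose and pass through unchanged, so a production mapping may need an explicit table.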
E
- edgeLabels() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns the internal edge-label array.
- edgeLabels() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns the internal edge-label array.
- encode(String, String) - Method in class org.egothor.stemmer.PatchCommandEncoder
-
Produces a compact patch command that transforms source into target.
- entryCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns the value of the entryCount record component.
- equals(Object) - Method in record class org.egothor.stemmer.ReductionSettings
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in record class org.egothor.stemmer.trie.NodeData
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in class org.egothor.stemmer.trie.ReductionSignature
- equals(Object) - Method in record class org.egothor.stemmer.TrieMetadata
-
Indicates whether some other object is "equal to" this one.
- equals(Object) - Method in record class org.egothor.stemmer.ValueCount
-
Indicates whether some other object is "equal to" this one.
- ES_ES - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Spanish.
- evaluate(Reader, String, String, long) - Method in class org.egothor.stemmer.StemmerKnowledgeExperiment
-
Evaluates a dictionary provided through an arbitrary reader.
- evaluateAllBundledLanguages(long) - Method in class org.egothor.stemmer.StemmerKnowledgeExperiment
-
Evaluates all supported bundled dictionaries using the supplied seed.
- evaluateBundledLanguage(StemmerPatchTrieLoader.Language, long) - Method in class org.egothor.stemmer.StemmerKnowledgeExperiment
-
Evaluates one bundled dictionary across all supported experiment configurations.
- evaluatedInputCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the evaluatedInputCount record component.
- evaluatePath(Path, long) - Method in class org.egothor.stemmer.StemmerKnowledgeExperiment
-
Evaluates one filesystem dictionary across all supported experiment configurations.
F
- FA_IR - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Persian.
- FI_FI - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Finnish.
- findChild(char) - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Finds a child for the supplied edge character.
- forCompilation(WordTraversalDirection, ReductionSettings, DiacriticProcessingMode, CaseProcessingMode) - Static method in record class org.egothor.stemmer.TrieMetadata
-
Creates metadata for a newly compiled trie using the currently persisted binary stream format version.
- formatVersion() - Method in record class org.egothor.stemmer.TrieMetadata
-
Returns the value of the formatVersion record component.
- FORWARD - Enum constant in enum class org.egothor.stemmer.WordTraversalDirection
-
Traverses a word from its logical beginning toward its logical end.
- FR_FR - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
French.
- FrequencyTrie<V> - Class in org.egothor.stemmer
-
Read-only trie mapping String keys to one or more values with frequency tracking.
- FrequencyTrie.Builder<V> - Class in org.egothor.stemmer
-
Builder of FrequencyTrie.
- FrequencyTrie.ValueStreamCodec<V> - Interface in org.egothor.stemmer
-
Codec used to persist values stored in the trie.
- FrequencyTrieBuilders - Class in org.egothor.stemmer
-
Factory utilities related to FrequencyTrie.Builder.
- fromTextBlock(int, String) - Static method in record class org.egothor.stemmer.TrieMetadata
-
Parses metadata from a text block produced by TrieMetadata.toTextBlock().
G
- get(String) - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns the most frequent value stored at the node addressed by the supplied key.
- getAccuracy() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the getAccuracy record component.
- getAll(String) - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns all values stored at the node addressed by the supplied key, ordered by descending frequency.
- getAllCoveredInputCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the getAllCoveredInputCount record component.
- getAllF1() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the getAllF1 record component.
- getAllFalsePositiveCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the getAllFalsePositiveCount record component.
- getAllPrecision() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the getAllPrecision record component.
- getAllRecall() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the getAllRecall record component.
- getAllTruePositiveCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the getAllTruePositiveCount record component.
- getCorrectCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the getCorrectCount record component.
- getEntries(String) - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns all values stored at the node addressed by the supplied key together with their occurrence counts, ordered by the same rules as FrequencyTrie.getAll(String).
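The get(String), getAll(String), and getEntries(String) entries above describe lookups ordered by descending frequency. The sketch below imitates that contract with plain maps so the ordering is easy to see. It is not the library's trie: the class `FrequencySketch`, the placeholder values, the insertion-order tie-break, and the `null` result for an unknown key are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical map-based imitation of frequency-ordered lookup; not the library's trie. */
final class FrequencySketch {

    // key -> (value -> occurrence count); insertion order is kept for tie-breaking.
    private final Map<String, Map<String, Integer>> counts = new LinkedHashMap<>();

    /** Stores a value for the key and increments its local frequency by one. */
    void put(String key, String value) {
        counts.computeIfAbsent(key, k -> new LinkedHashMap<>()).merge(value, 1, Integer::sum);
    }

    /** Returns all values for the key, most frequent first (ties: first inserted wins). */
    List<String> getAll(String key) {
        Map<String, Integer> local = counts.getOrDefault(key, Map.of());
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(local.entrySet());
        // List.sort is stable, so equal counts keep their insertion order.
        entries.sort(Map.Entry.<String, Integer>comparingByValue().reversed());
        List<String> values = new ArrayList<>(entries.size());
        for (Map.Entry<String, Integer> entry : entries) {
            values.add(entry.getKey());
        }
        return values;
    }

    /** Returns the most frequent value, or null when the key is unknown (sketch assumption). */
    String get(String key) {
        List<String> all = getAll(key);
        return all.isEmpty() ? null : all.get(0);
    }

    public static void main(String[] args) {
        FrequencySketch sketch = new FrequencySketch();
        sketch.put("walking", "value-A"); // placeholder values, not real patch commands
        sketch.put("walking", "value-B");
        sketch.put("walking", "value-B");
        System.out.println(sketch.get("walking"));    // value-B
        System.out.println(sketch.getAll("walking")); // [value-B, value-A]
    }
}
```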
H
- hashCode() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns a hash code value for this object.
- hashCode() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns a hash code value for this object.
- hashCode() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns a hash code value for this object.
- hashCode() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns a hash code value for this object.
- hashCode() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns a hash code value for this object.
- hashCode() - Method in class org.egothor.stemmer.trie.ReductionSignature
- hashCode() - Method in record class org.egothor.stemmer.TrieMetadata
-
Returns a hash code value for this object.
- hashCode() - Method in record class org.egothor.stemmer.ValueCount
-
Returns a hash code value for this object.
- HE_IL - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Hebrew.
- HU_HU - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Hungarian.
I
- ignoredLineCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns the value of the ignoredLineCount record component.
- includeStemInEvaluation() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the includeStemInEvaluation record component.
- insertCost(int) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
-
Sets cost of an insert operation.
- isRightToLeft() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Returns whether the language is written right-to-left.
- IT_IT - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Italian.
K
- KNOWLEDGE_PERCENT_STEP - Static variable in class org.egothor.stemmer.StemmerKnowledgeExperiment
-
Step between adjacent evaluated knowledge percentages.
- knowledgePercent() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the knowledgePercent record component.
L
- language() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the language record component.
- legacy(int, WordTraversalDirection) - Static method in record class org.egothor.stemmer.TrieMetadata
-
Creates metadata compatible with a legacy artifact version that did not store the full configuration explicitly.
- License - Search tag in Overview
- Section
- lineCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns the value of the lineCount record component.
- load(String, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path string using default settings for the supplied reduction mode.
- load(String, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path string using explicit reduction settings.
- load(String, boolean, ReductionSettings, WordTraversalDirection) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path string using explicit reduction settings and explicit traversal direction.
- load(String, boolean, ReductionSettings, WordTraversalDirection, CaseProcessingMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path string using explicit reduction settings, explicit traversal direction, and explicit case processing mode.
- load(String, boolean, ReductionSettings, WordTraversalDirection, CaseProcessingMode, DiacriticProcessingMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path string using explicit reduction settings, explicit traversal direction, explicit case processing mode, and explicit diacritic processing mode.
- load(String, boolean, TrieMetadata) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path string using explicit trie compilation metadata.
- load(Path, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path using default settings for the supplied reduction mode.
- load(Path, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path using explicit reduction settings.
- load(Path, boolean, ReductionSettings, WordTraversalDirection) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path using explicit reduction settings and explicit traversal direction.
- load(Path, boolean, ReductionSettings, WordTraversalDirection, CaseProcessingMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path using explicit reduction settings, explicit traversal direction, and explicit case processing mode.
- load(Path, boolean, ReductionSettings, WordTraversalDirection, CaseProcessingMode, DiacriticProcessingMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path using explicit reduction settings, traversal direction, case processing mode, and diacritic processing mode.
- load(Path, boolean, TrieMetadata) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a dictionary from a filesystem path using explicit trie compilation metadata.
- load(StemmerPatchTrieLoader.Language, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a bundled dictionary using default settings for the supplied reduction mode.
- load(StemmerPatchTrieLoader.Language, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a bundled dictionary using explicit reduction settings.
- load(StemmerPatchTrieLoader.Language, boolean, TrieMetadata) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a bundled dictionary using explicit trie compilation metadata.
- loadBinary(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a GZip-compressed binary patch-command trie from an input stream.
- loadBinary(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a GZip-compressed binary patch-command trie from a filesystem path string.
- loadBinary(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads a GZip-compressed binary patch-command trie from a filesystem path.
- loadBinaryMetadata(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads only persisted metadata from a GZip-compressed binary patch-command trie stream.
- loadBinaryMetadata(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads only persisted metadata from a GZip-compressed binary patch-command trie file.
- loadBinaryMetadata(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Loads only persisted metadata from a GZip-compressed binary patch-command trie file.
- localCounts() - Method in class org.egothor.stemmer.trie.ReducedNode
-
Returns the internal aggregated local value-count map.
- LocalValueSummary<V> - Class in org.egothor.stemmer.trie
-
Local terminal value summary of a node.
- LocalValueSummary(V[], int[], int, V, int, int) - Constructor for class org.egothor.stemmer.trie.LocalValueSummary
-
Creates a summary.
- logicalIndex(int, int) - Method in enum class org.egothor.stemmer.WordTraversalDirection
-
Returns the logical character index addressed by the supplied traversal offset.
- lookup(ReductionSignature<V>) - Method in class org.egothor.stemmer.trie.ReductionContext
-
Looks up a canonical node.
- LOWERCASE_WITH_LOCALE_ROOT - Enum constant in enum class org.egothor.stemmer.CaseProcessingMode
-
Normalizes all dictionary content to lower case using Locale.ROOT.
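The LOWERCASE_WITH_LOCALE_ROOT entry above names Locale.ROOT explicitly. The sketch below illustrates why a fixed locale matters for reproducible dictionary keys: the JDK's `String.toLowerCase` is locale-sensitive, and a Turkish locale maps the capital letter I to a dotless lowercase letter. The class `LowercaseSketch` and its `normalize` helper are hypothetical names for illustration, not library code.

```java
import java.util.Locale;

/** Hypothetical illustration of locale-independent lowercasing; not library code. */
final class LowercaseSketch {

    /** Lowercases text with Locale.ROOT so the result does not depend on the JVM default locale. */
    static String normalize(String text) {
        return text.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        String word = "TITLE";
        // Locale.ROOT gives the same result on every machine.
        System.out.println(normalize(word)); // title
        // A Turkish locale lowercases 'I' to U+0131 (dotless i), which changes the key.
        String turkish = word.toLowerCase(Locale.forLanguageTag("tr"));
        System.out.println(turkish.equals("title")); // false
    }
}
```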
M
- main(String[]) - Static method in class org.egothor.stemmer.Compile
-
CLI entry point.
- main(String[]) - Static method in class org.egothor.stemmer.StemmerKnowledgeExperimentCli
-
Executes the CLI as a standalone process.
- matchCost(int) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
-
Sets cost of a match operation.
- MAXIMUM_KNOWLEDGE_PERCENT - Static variable in class org.egothor.stemmer.StemmerKnowledgeExperiment
-
Maximum supported knowledge percentage.
- MERGE_SUBTREES_WITH_EQUIVALENT_DOMINANT_GET_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
-
Merges subtrees whose preferred get() results are equivalent for every reachable key suffix, provided that the locally dominant winner satisfies the configured dominance constraints.
- MERGE_SUBTREES_WITH_EQUIVALENT_RANKED_GET_ALL_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
-
Merges subtrees whose getAll() results are equivalent for every reachable key suffix and whose local result ordering is the same.
- MERGE_SUBTREES_WITH_EQUIVALENT_UNORDERED_GET_ALL_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
-
Merges subtrees whose getAll() results are equivalent for every reachable key suffix, regardless of the local ordering of values.
- mergeChildren(Map<Character, ReducedNode<V>>) - Method in class org.egothor.stemmer.trie.ReducedNode
-
Merges child references into this node.
- mergeLocalCounts(Map<V, Integer>) - Method in class org.egothor.stemmer.trie.ReducedNode
-
Merges additional local counts into this node.
- metadata() - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns immutable persisted metadata associated with this trie.
- MINIMUM_KNOWLEDGE_PERCENT - Static variable in class org.egothor.stemmer.StemmerKnowledgeExperiment
-
Minimum supported knowledge percentage.
- MutableNode<V> - Class in org.egothor.stemmer.trie
-
Mutable build-time node.
- MutableNode() - Constructor for class org.egothor.stemmer.trie.MutableNode
-
Creates an empty node.
N
- NB_NO - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Norwegian Bokmål.
- NL_NL - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Dutch.
- NN_NO - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Norwegian Nynorsk.
- NodeData<V> - Record Class in org.egothor.stemmer.trie
-
Intermediate node data used during deserialization before child references are resolved.
- NodeData(char[], int[], V[], int...) - Constructor for record class org.egothor.stemmer.trie.NodeData
-
Creates one validated node-data holder.
O
- of(Map<V, Integer>, IntFunction<V[]>) - Static method in class org.egothor.stemmer.trie.LocalValueSummary
-
Builds a summary from local counts.
- onEntry(String, String[], int) - Method in interface org.egothor.stemmer.StemmerDictionaryParser.EntryHandler
-
Accepts one parsed dictionary entry.
- orderedCounts() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns the internal ordered-counts array.
- orderedCounts() - Method in class org.egothor.stemmer.trie.LocalValueSummary
-
Returns ordered counts.
- orderedCounts() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns the internal ordered-counts array.
- orderedValues() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns the internal ordered-values array.
- orderedValues() - Method in class org.egothor.stemmer.trie.LocalValueSummary
-
Returns ordered values.
- orderedValues() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns the internal ordered-values array.
- org.egothor.stemmer - package org.egothor.stemmer
-
Provides the core Egothor-style stemming infrastructure based on compact patch-command tries.
- org.egothor.stemmer.trie - package org.egothor.stemmer.trie
-
Provides internal trie infrastructure used by FrequencyTrie compilation, reduction, canonicalization, and binary reconstruction.
P
- Packages - Search tag in Overview
- Section
- parse(Reader, String, CaseProcessingMode, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
-
Parses a dictionary from a reader.
- parse(Reader, String, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
-
Parses a dictionary from a reader.
- parse(String, CaseProcessingMode, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
-
Parses a dictionary file from a path string.
- parse(String, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
-
Parses a dictionary file from a path string.
- parse(Path, CaseProcessingMode, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
-
Parses a dictionary file from a filesystem path.
- parse(Path, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
-
Parses a dictionary file from a filesystem path.
- ParseStatistics(String, int, int, int) - Constructor for record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Creates parsing statistics.
- PatchCommandEncoder - Class in org.egothor.stemmer
-
Encodes a compact patch command that transforms one word form into another and applies such commands back to source words.
- PatchCommandEncoder.Builder - Class in org.egothor.stemmer
-
Fluent builder for creating direction-specialized PatchCommandEncoder instances.
- PL_PL - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Polish.
- Project Scope - Search tag in Overview
- Section
- PT_PT - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Portuguese.
- put(String, V) - Method in class org.egothor.stemmer.FrequencyTrie.Builder
-
Stores a value for the supplied key and increments its local frequency.
- put(String, V, int) - Method in class org.egothor.stemmer.FrequencyTrie.Builder
-
Stores a value for the supplied key and increments its local frequency by the specified positive count.
R
- Radixor - Search tag in Overview
- Section
- read(DataInputStream) - Method in interface org.egothor.stemmer.FrequencyTrie.ValueStreamCodec
-
Reads one value from the supplied data input.
- read(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Reads a GZip-compressed binary patch-command trie from an input stream.
- read(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Reads a GZip-compressed binary patch-command trie from a filesystem path string.
- read(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Reads a GZip-compressed binary patch-command trie from a filesystem path.
- readFrom(InputStream, IntFunction<V[]>, FrequencyTrie.ValueStreamCodec<V>) - Static method in class org.egothor.stemmer.FrequencyTrie
-
Reads a compiled trie from the supplied input stream.
- readMetadata(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Reads only metadata from a GZip-compressed binary patch-command trie from an input stream.
- readMetadata(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Reads only metadata from a GZip-compressed binary patch-command trie stored at a filesystem path string.
- readMetadata(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Reads only metadata from a GZip-compressed binary patch-command trie stored at a filesystem path.
- ReducedNode<V> - Class in org.egothor.stemmer.trie
-
Canonical reduced node used during subtree merging.
- ReducedNode(ReductionSignature<V>, Map<V, Integer>, Map<Character, ReducedNode<V>>) - Constructor for class org.egothor.stemmer.trie.ReducedNode
-
Creates a new reduced node.
- ReductionContext<V> - Class in org.egothor.stemmer.trie
-
Reduction context used while canonicalizing mutable nodes.
- ReductionContext(ReductionSettings) - Constructor for class org.egothor.stemmer.trie.ReductionContext
-
Creates a new context.
- reductionMode() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns the value of the reductionMode record component.
- reductionMode() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the reductionMode record component.
- ReductionMode - Enum Class in org.egothor.stemmer
-
Defines the subtree reduction strategy applied during trie compilation.
- reductionSettings() - Method in record class org.egothor.stemmer.TrieMetadata
-
Returns the value of the reductionSettings record component.
- ReductionSettings - Record Class in org.egothor.stemmer
-
Immutable reduction configuration used by FrequencyTrie.Builder.
- ReductionSettings(ReductionMode, int, int) - Constructor for record class org.egothor.stemmer.ReductionSettings
-
Creates a new instance.
- ReductionSignature<V> - Class in org.egothor.stemmer.trie
-
Immutable reduction signature of a full subtree.
- register(ReductionSignature<V>, ReducedNode<V>) - Method in class org.egothor.stemmer.trie.ReductionContext
-
Registers a canonical node.
- REMOVE - Enum constant in enum class org.egothor.stemmer.DiacriticProcessingMode
-
Removes diacritics from dictionary entries before trie construction and removes diacritics from lookup keys before traversal.
- replaceCost(int) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
-
Sets cost of a replace operation.
- resourceDirectory() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Returns the resource directory name.
- resourcePath() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Returns the classpath resource path of the bundled stemmer dictionary.
- ResultRow(String, String, boolean, boolean, int, long, int, long, long, long, double, long, long, long, double, double, double, double) - Constructor for record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Creates one immutable result row.
- RU_RU - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Russian.
S
- saveBinary(FrequencyTrie<String>, String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Saves a compiled patch-command trie as a GZip-compressed binary file.
- saveBinary(FrequencyTrie<String>, Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
-
Saves a compiled patch-command trie as a GZip-compressed binary file.
- seed() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the seed record component.
- settings() - Method in class org.egothor.stemmer.trie.ReductionContext
-
Returns the settings.
- signature() - Method in class org.egothor.stemmer.trie.ReducedNode
-
Returns the reduction signature of this canonical node.
- size() - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns the number of canonical compiled nodes reachable from the root.
- sourceDescription() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns the value of the sourceDescription record component.
- startIndex(int) - Method in enum class org.egothor.stemmer.WordTraversalDirection
-
Returns the traversal start index for a character sequence of the supplied length.
- StemmerDictionaryParser - Class in org.egothor.stemmer
-
Parser of line-oriented stemmer dictionary files.
- StemmerDictionaryParser.EntryHandler - Interface in org.egothor.stemmer
-
Callback receiving one parsed dictionary line.
- StemmerDictionaryParser.ParseStatistics - Record Class in org.egothor.stemmer
-
Immutable parsing statistics.
- StemmerKnowledgeExperiment - Class in org.egothor.stemmer
-
Evaluates how stemming quality degrades when the compiled trie is built from only a deterministic subset of the available dictionary knowledge.
- StemmerKnowledgeExperiment() - Constructor for class org.egothor.stemmer.StemmerKnowledgeExperiment
-
Creates a new experiment harness.
- StemmerKnowledgeExperiment.ResultRow - Record Class in org.egothor.stemmer
-
One immutable result row of the knowledge experiment.
- StemmerKnowledgeExperimentCli - Class in org.egothor.stemmer
-
Command-line entry point for the stemmer knowledge experiment.
- StemmerPatchTrieBinaryIO - Class in org.egothor.stemmer
-
Binary persistence helper for patch-command stemmer tries.
- StemmerPatchTrieLoader - Class in org.egothor.stemmer
-
Loader of patch-command tries from bundled stemmer dictionaries.
- StemmerPatchTrieLoader.Language - Enum Class in org.egothor.stemmer
-
Supported bundled stemmer dictionaries.
- storeOriginal() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the storeOriginal record component.
- SV_SE - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Swedish.
T
- toCsvRow() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Serializes this row as one CSV record.
- toString() - Method in record class org.egothor.stemmer.ReductionSettings
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.trie.CompiledNode
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.trie.NodeData
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.TrieMetadata
-
Returns a string representation of this record class.
- toString() - Method in record class org.egothor.stemmer.ValueCount
-
Returns a string representation of this record class.
- toTextBlock() - Method in record class org.egothor.stemmer.TrieMetadata
-
Returns metadata encoded as a deterministic human-readable text block.
- toTraversalCharacters(String) - Method in enum class org.egothor.stemmer.WordTraversalDirection
-
Returns the characters of the supplied word in this traversal order.
- trainingEntryCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
-
Returns the value of the trainingEntryCount record component.
- traversalDirection() - Method in class org.egothor.stemmer.FrequencyTrie
-
Returns the logical key traversal direction used by this trie.
- traversalDirection() - Method in record class org.egothor.stemmer.TrieMetadata
-
Returns the value of the traversalDirection record component.
- traversalDirection(WordTraversalDirection) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
-
Sets the traversal direction used by the created encoder.
- traversalPathToLogicalKey(CharSequence) - Method in enum class org.egothor.stemmer.WordTraversalDirection
-
Converts a path represented in traversal order back to the logical key form.
- TrieMetadata - Record Class in org.egothor.stemmer
-
Immutable metadata persisted together with a compiled trie artifact.
- TrieMetadata(int, WordTraversalDirection, ReductionSettings, DiacriticProcessingMode, CaseProcessingMode) - Constructor for record class org.egothor.stemmer.TrieMetadata
-
Creates a new metadata instance.
U
- UK_UA - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Ukrainian.
- US_UK - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
English.
V
- value() - Method in record class org.egothor.stemmer.ValueCount
-
Returns the value of the value record component.
- ValueCount<V> - Record Class in org.egothor.stemmer
-
Immutable value-count pair returned by read-only trie queries.
- ValueCount(V, int) - Constructor for record class org.egothor.stemmer.ValueCount
-
Creates a new value-count pair.
- valueCounts() - Method in class org.egothor.stemmer.trie.MutableNode
-
Returns the internal local terminal value-count map.
- valueOf(String) - Static method in enum class org.egothor.stemmer.CaseProcessingMode
-
Returns the enum constant of this class with the specified name.
- valueOf(String) - Static method in enum class org.egothor.stemmer.DiacriticProcessingMode
-
Returns the enum constant of this class with the specified name.
- valueOf(String) - Static method in enum class org.egothor.stemmer.ReductionMode
-
Returns the enum constant of this class with the specified name.
- valueOf(String) - Static method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Returns the enum constant of this class with the specified name.
- valueOf(String) - Static method in enum class org.egothor.stemmer.WordTraversalDirection
-
Returns the enum constant of this class with the specified name.
- values() - Static method in enum class org.egothor.stemmer.CaseProcessingMode
-
Returns an array containing the constants of this enum class, in the order they are declared.
- values() - Static method in enum class org.egothor.stemmer.DiacriticProcessingMode
-
Returns an array containing the constants of this enum class, in the order they are declared.
- values() - Static method in enum class org.egothor.stemmer.ReductionMode
-
Returns an array containing the constants of this enum class, in the order they are declared.
- values() - Static method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Returns an array containing the constants of this enum class, in the order they are declared.
- values() - Static method in enum class org.egothor.stemmer.WordTraversalDirection
-
Returns an array containing the constants of this enum class, in the order they are declared.
W
- withDefaults(ReductionMode) - Static method in record class org.egothor.stemmer.ReductionSettings
-
Creates settings with default dominance thresholds.
- WordTraversalDirection - Enum Class in org.egothor.stemmer
-
Defines the logical direction in which word characters are traversed.
- write(DataOutputStream, V) - Method in interface org.egothor.stemmer.FrequencyTrie.ValueStreamCodec
-
Writes one value to the supplied data output.
- write(FrequencyTrie<String>, OutputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Writes a GZip-compressed binary patch-command trie to an output stream.
- write(FrequencyTrie<String>, String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Writes a GZip-compressed binary patch-command trie to a filesystem path string.
- write(FrequencyTrie<String>, Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
-
Writes a GZip-compressed binary patch-command trie to a filesystem path.
- writeCsv(Path, List<StemmerKnowledgeExperiment.ResultRow>) - Static method in class org.egothor.stemmer.StemmerKnowledgeExperiment
-
Writes result rows as UTF-8 CSV with a stable fixed header.
- writeTo(OutputStream, FrequencyTrie.ValueStreamCodec<V>) - Method in class org.egothor.stemmer.FrequencyTrie
-
Writes this compiled trie to the supplied output stream.
Y
- YI - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
-
Yiddish.