Index

A B C D E F G H I K L M N O P R S T U V W Y 
All Classes and Interfaces|All Packages|Constant Field Values

A

API Documentation - Search tag in Overview
Section
apply(String, String) - Static method in class org.egothor.stemmer.PatchCommandEncoder
Applies a compact patch command to the supplied source word using the historical backward traversal direction.
apply(String, String, WordTraversalDirection) - Static method in class org.egothor.stemmer.PatchCommandEncoder
Applies a compact patch command to the supplied source word using the specified traversal direction.
applyWithConfiguredDirection(String, String) - Method in class org.egothor.stemmer.PatchCommandEncoder
Applies a compact patch command using this encoder instance traversal direction.
AS_IS - Enum constant in enum class org.egothor.stemmer.CaseProcessingMode
Preserves input character casing exactly as provided by the dictionary source.
AS_IS - Enum constant in enum class org.egothor.stemmer.DiacriticProcessingMode
Preserves dictionary entries and lookup keys exactly as provided.
AS_IS_AND_STRIPPED_FALLBACK - Enum constant in enum class org.egothor.stemmer.DiacriticProcessingMode
Planned dual-path mode where lookup may continue along both the original diacritic edge and a normalized non-diacritic alternative.
averageUniqueCandidateCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the averageUniqueCandidateCount record component.

B

BACKWARD - Enum constant in enum class org.egothor.stemmer.WordTraversalDirection
Traverses a word from its logical end toward its logical beginning.
build() - Method in class org.egothor.stemmer.FrequencyTrie.Builder
Builds a compiled read-only trie.
build() - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
Builds a direction-specialized encoder instance.
builder() - Static method in class org.egothor.stemmer.PatchCommandEncoder
Creates a fluent builder for constructing a direction-specialized encoder.
Builder() - Constructor for class org.egothor.stemmer.PatchCommandEncoder.Builder
Creates a builder initialized with the default Egothor-compatible cost model and backward traversal.
Builder(IntFunction<V[]>, ReductionMode) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
Creates a new builder using default thresholds for the supplied reduction mode.
Builder(IntFunction<V[]>, ReductionMode, WordTraversalDirection) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
Creates a new builder using default thresholds for the supplied reduction mode and explicit traversal direction.
Builder(IntFunction<V[]>, ReductionSettings) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
Creates a new builder with the provided settings.
Builder(IntFunction<V[]>, ReductionSettings, WordTraversalDirection) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
Creates a new builder with the provided settings and explicit traversal direction.
Builder(IntFunction<V[]>, ReductionSettings, WordTraversalDirection, CaseProcessingMode) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
Creates a new builder with the provided settings, explicit traversal direction, and explicit case processing mode.
Builder(IntFunction<V[]>, ReductionSettings, WordTraversalDirection, CaseProcessingMode, DiacriticProcessingMode) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
Creates a new builder with the provided settings, explicit traversal direction, explicit case processing mode, and explicit diacritic processing mode.

C

canonicalNodeCount() - Method in class org.egothor.stemmer.trie.ReductionContext
Returns the number of canonical nodes.
caseProcessingMode() - Method in record class org.egothor.stemmer.TrieMetadata
Returns the value of the caseProcessingMode record component.
CaseProcessingMode - Enum Class in org.egothor.stemmer
Defines how dictionary items are normalized with respect to letter casing.
childNodeIds() - Method in record class org.egothor.stemmer.trie.NodeData
Returns the internal child-node identifier array.
children() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns the internal child-node array.
children() - Method in class org.egothor.stemmer.trie.MutableNode
Returns the internal child-node map indexed by transition character.
children() - Method in class org.egothor.stemmer.trie.ReducedNode
Returns the internal canonical child map indexed by transition character.
Compile - Class in org.egothor.stemmer
Command-line compiler of stemmer dictionary files into compressed binary FrequencyTrie artifacts.
CompiledNode<V> - Record Class in org.egothor.stemmer.trie
Immutable compiled trie node optimized for read access.
CompiledNode(char[], CompiledNode<V>[], V[], int...) - Constructor for record class org.egothor.stemmer.trie.CompiledNode
Creates one validated compiled node.
copyOf(FrequencyTrie<V>, IntFunction<V[]>, ReductionMode) - Static method in class org.egothor.stemmer.FrequencyTrieBuilders
Reconstructs a new writable builder from a compiled read-only trie using default settings for the supplied reduction mode.
copyOf(FrequencyTrie<V>, IntFunction<V[]>, ReductionSettings) - Static method in class org.egothor.stemmer.FrequencyTrieBuilders
Reconstructs a new writable builder from a compiled read-only trie.
count() - Method in record class org.egothor.stemmer.ValueCount
Returns the value of the count record component.
create(LocalValueSummary<V>, Map<Character, ReducedNode<V>>, ReductionSettings) - Static method in class org.egothor.stemmer.trie.ReductionSignature
Creates a subtree signature according to the selected reduction mode.
CS_CZ - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Czech.
csvHeader() - Static method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the stable CSV header of this result format.
current(int, WordTraversalDirection, ReductionSettings) - Static method in record class org.egothor.stemmer.TrieMetadata
Creates metadata populated with current-format defaults for freshly compiled tries.
currentFormatVersion() - Static method in class org.egothor.stemmer.FrequencyTrie
Returns the current persisted binary stream format version.

D

DA_DK - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Danish.
DE_DE - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
German.
DEFAULT_DOMINANT_WINNER_MIN_PERCENT - Static variable in record class org.egothor.stemmer.ReductionSettings
Default minimum dominant winner percentage.
DEFAULT_DOMINANT_WINNER_OVER_SECOND_RATIO - Static variable in record class org.egothor.stemmer.ReductionSettings
Default minimum winner-over-second ratio.
deleteCost(int) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
Sets cost of a delete operation.
diacriticProcessingMode() - Method in record class org.egothor.stemmer.TrieMetadata
Returns the value of the diacriticProcessingMode record component.
DiacriticProcessingMode - Enum Class in org.egothor.stemmer
Defines how dictionary loading and trie traversal should treat diacritics.
dictionaryEntryCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the dictionaryEntryCount record component.
dominantWinnerMinPercent() - Method in record class org.egothor.stemmer.ReductionSettings
Returns the value of the dominantWinnerMinPercent record component.
dominantWinnerOverSecondRatio() - Method in record class org.egothor.stemmer.ReductionSettings
Returns the value of the dominantWinnerOverSecondRatio record component.

E

edgeLabels() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns the internal edge-label array.
edgeLabels() - Method in record class org.egothor.stemmer.trie.NodeData
Returns the internal edge-label array.
encode(String, String) - Method in class org.egothor.stemmer.PatchCommandEncoder
Produces a compact patch command that transforms source into target.
entryCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns the value of the entryCount record component.
equals(Object) - Method in record class org.egothor.stemmer.ReductionSettings
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in record class org.egothor.stemmer.trie.CompiledNode
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in record class org.egothor.stemmer.trie.NodeData
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in class org.egothor.stemmer.trie.ReductionSignature
 
equals(Object) - Method in record class org.egothor.stemmer.TrieMetadata
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in record class org.egothor.stemmer.ValueCount
Indicates whether some other object is "equal to" this one.
ES_ES - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Spanish.
evaluate(Reader, String, String, long) - Method in class org.egothor.stemmer.StemmerKnowledgeExperiment
Evaluates a dictionary provided through an arbitrary reader.
evaluateAllBundledLanguages(long) - Method in class org.egothor.stemmer.StemmerKnowledgeExperiment
Evaluates all supported bundled dictionaries using the supplied seed.
evaluateBundledLanguage(StemmerPatchTrieLoader.Language, long) - Method in class org.egothor.stemmer.StemmerKnowledgeExperiment
Evaluates one bundled dictionary across all supported experiment configurations.
evaluatedInputCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the evaluatedInputCount record component.
evaluatePath(Path, long) - Method in class org.egothor.stemmer.StemmerKnowledgeExperiment
Evaluates one filesystem dictionary across all supported experiment configurations.

F

FA_IR - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Persian.
FI_FI - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Finnish.
findChild(char) - Method in record class org.egothor.stemmer.trie.CompiledNode
Finds a child for the supplied edge character.
forCompilation(WordTraversalDirection, ReductionSettings, DiacriticProcessingMode, CaseProcessingMode) - Static method in record class org.egothor.stemmer.TrieMetadata
Creates metadata for a newly compiled trie using the currently persisted binary stream format version.
formatVersion() - Method in record class org.egothor.stemmer.TrieMetadata
Returns the value of the formatVersion record component.
FORWARD - Enum constant in enum class org.egothor.stemmer.WordTraversalDirection
Traverses a word from its logical beginning toward its logical end.
FR_FR - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
French.
FrequencyTrie<V> - Class in org.egothor.stemmer
Read-only trie mapping String keys to one or more values with frequency tracking.
FrequencyTrie.Builder<V> - Class in org.egothor.stemmer
Builder of FrequencyTrie.
FrequencyTrie.ValueStreamCodec<V> - Interface in org.egothor.stemmer
Codec used to persist values stored in the trie.
FrequencyTrieBuilders - Class in org.egothor.stemmer
Factory utilities related to FrequencyTrie.Builder.
fromTextBlock(int, String) - Static method in record class org.egothor.stemmer.TrieMetadata
Parses metadata from a text block produced by TrieMetadata.toTextBlock().

G

get(String) - Method in class org.egothor.stemmer.FrequencyTrie
Returns the most frequent value stored at the node addressed by the supplied key.
getAccuracy() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the getAccuracy record component.
getAll(String) - Method in class org.egothor.stemmer.FrequencyTrie
Returns all values stored at the node addressed by the supplied key, ordered by descending frequency.
getAllCoveredInputCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the getAllCoveredInputCount record component.
getAllF1() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the getAllF1 record component.
getAllFalsePositiveCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the getAllFalsePositiveCount record component.
getAllPrecision() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the getAllPrecision record component.
getAllRecall() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the getAllRecall record component.
getAllTruePositiveCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the getAllTruePositiveCount record component.
getCorrectCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the getCorrectCount record component.
getEntries(String) - Method in class org.egothor.stemmer.FrequencyTrie
Returns all values stored at the node addressed by the supplied key together with their occurrence counts, ordered by the same rules as FrequencyTrie.getAll(String).

H

hashCode() - Method in record class org.egothor.stemmer.ReductionSettings
Returns a hash code value for this object.
hashCode() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns a hash code value for this object.
hashCode() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns a hash code value for this object.
hashCode() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns a hash code value for this object.
hashCode() - Method in record class org.egothor.stemmer.trie.NodeData
Returns a hash code value for this object.
hashCode() - Method in class org.egothor.stemmer.trie.ReductionSignature
 
hashCode() - Method in record class org.egothor.stemmer.TrieMetadata
Returns a hash code value for this object.
hashCode() - Method in record class org.egothor.stemmer.ValueCount
Returns a hash code value for this object.
HE_IL - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Hebrew.
HU_HU - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Hungarian.

I

ignoredLineCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns the value of the ignoredLineCount record component.
includeStemInEvaluation() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the includeStemInEvaluation record component.
insertCost(int) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
Sets cost of an insert operation.
isRightToLeft() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Returns whether the language is written right-to-left.
IT_IT - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Italian.

K

KNOWLEDGE_PERCENT_STEP - Static variable in class org.egothor.stemmer.StemmerKnowledgeExperiment
Step between adjacent evaluated knowledge percentages.
knowledgePercent() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the knowledgePercent record component.

L

language() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the language record component.
legacy(int, WordTraversalDirection) - Static method in record class org.egothor.stemmer.TrieMetadata
Creates metadata compatible with a legacy artifact version that did not store the full configuration explicitly.
License - Search tag in Overview
Section
lineCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns the value of the lineCount record component.
load(String, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path string using default settings for the supplied reduction mode.
load(String, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path string using explicit reduction settings.
load(String, boolean, ReductionSettings, WordTraversalDirection) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path string using explicit reduction settings and explicit traversal direction.
load(String, boolean, ReductionSettings, WordTraversalDirection, CaseProcessingMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path string using explicit reduction settings, explicit traversal direction, and explicit case processing mode.
load(String, boolean, ReductionSettings, WordTraversalDirection, CaseProcessingMode, DiacriticProcessingMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path string using explicit reduction settings, explicit traversal direction, explicit case processing mode, and explicit diacritic processing mode.
load(String, boolean, TrieMetadata) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path string using explicit trie compilation metadata.
load(Path, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path using default settings for the supplied reduction mode.
load(Path, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path using explicit reduction settings.
load(Path, boolean, ReductionSettings, WordTraversalDirection) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path using explicit reduction settings and explicit traversal direction.
load(Path, boolean, ReductionSettings, WordTraversalDirection, CaseProcessingMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path using explicit reduction settings, explicit traversal direction, and explicit case processing mode.
load(Path, boolean, ReductionSettings, WordTraversalDirection, CaseProcessingMode, DiacriticProcessingMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path using explicit reduction settings, traversal direction, case processing mode, and diacritic processing mode.
load(Path, boolean, TrieMetadata) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path using explicit trie compilation metadata.
load(StemmerPatchTrieLoader.Language, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a bundled dictionary using default settings for the supplied reduction mode.
load(StemmerPatchTrieLoader.Language, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a bundled dictionary using explicit reduction settings.
load(StemmerPatchTrieLoader.Language, boolean, TrieMetadata) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a bundled dictionary using explicit trie compilation metadata.
loadBinary(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a GZip-compressed binary patch-command trie from an input stream.
loadBinary(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a GZip-compressed binary patch-command trie from a filesystem path string.
loadBinary(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a GZip-compressed binary patch-command trie from a filesystem path.
loadBinaryMetadata(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads only persisted metadata from a GZip-compressed binary patch-command trie stream.
loadBinaryMetadata(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads only persisted metadata from a GZip-compressed binary patch-command trie file.
loadBinaryMetadata(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads only persisted metadata from a GZip-compressed binary patch-command trie file.
localCounts() - Method in class org.egothor.stemmer.trie.ReducedNode
Returns the internal aggregated local value-count map.
LocalValueSummary<V> - Class in org.egothor.stemmer.trie
Local terminal value summary of a node.
LocalValueSummary(V[], int[], int, V, int, int) - Constructor for class org.egothor.stemmer.trie.LocalValueSummary
Creates a summary.
logicalIndex(int, int) - Method in enum class org.egothor.stemmer.WordTraversalDirection
Returns the logical character index addressed by the supplied traversal offset.
lookup(ReductionSignature<V>) - Method in class org.egothor.stemmer.trie.ReductionContext
Looks up a canonical node.
LOWERCASE_WITH_LOCALE_ROOT - Enum constant in enum class org.egothor.stemmer.CaseProcessingMode
Normalizes all dictionary content to lower case using Locale.ROOT.

M

main(String[]) - Static method in class org.egothor.stemmer.Compile
CLI entry point.
main(String[]) - Static method in class org.egothor.stemmer.StemmerKnowledgeExperimentCli
Executes the CLI as a standalone process.
matchCost(int) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
Sets cost of a match operation.
MAXIMUM_KNOWLEDGE_PERCENT - Static variable in class org.egothor.stemmer.StemmerKnowledgeExperiment
Maximum supported knowledge percentage.
MERGE_SUBTREES_WITH_EQUIVALENT_DOMINANT_GET_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
Merges subtrees whose preferred get() results are equivalent for every reachable key suffix, provided that the locally dominant winner satisfies the configured dominance constraints.
MERGE_SUBTREES_WITH_EQUIVALENT_RANKED_GET_ALL_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
Merges subtrees whose getAll() results are equivalent for every reachable key suffix and whose local result ordering is the same.
MERGE_SUBTREES_WITH_EQUIVALENT_UNORDERED_GET_ALL_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
Merges subtrees whose getAll() results are equivalent for every reachable key suffix, regardless of the local ordering of values.
mergeChildren(Map<Character, ReducedNode<V>>) - Method in class org.egothor.stemmer.trie.ReducedNode
Merges child references into this node.
mergeLocalCounts(Map<V, Integer>) - Method in class org.egothor.stemmer.trie.ReducedNode
Merges additional local counts into this node.
metadata() - Method in class org.egothor.stemmer.FrequencyTrie
Returns immutable persisted metadata associated with this trie.
MINIMUM_KNOWLEDGE_PERCENT - Static variable in class org.egothor.stemmer.StemmerKnowledgeExperiment
Minimum supported knowledge percentage.
MutableNode<V> - Class in org.egothor.stemmer.trie
Mutable build-time node.
MutableNode() - Constructor for class org.egothor.stemmer.trie.MutableNode
Creates an empty node.

N

NB_NO - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Norwegian Bokmål.
NL_NL - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Dutch.
NN_NO - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Norwegian Nynorsk.
NodeData<V> - Record Class in org.egothor.stemmer.trie
Intermediate node data used during deserialization before child references are resolved.
NodeData(char[], int[], V[], int...) - Constructor for record class org.egothor.stemmer.trie.NodeData
Creates one validated node-data holder.

O

of(Map<V, Integer>, IntFunction<V[]>) - Static method in class org.egothor.stemmer.trie.LocalValueSummary
Builds a summary from local counts.
onEntry(String, String[], int) - Method in interface org.egothor.stemmer.StemmerDictionaryParser.EntryHandler
Accepts one parsed dictionary entry.
orderedCounts() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns the internal ordered-counts array.
orderedCounts() - Method in class org.egothor.stemmer.trie.LocalValueSummary
Returns ordered counts.
orderedCounts() - Method in record class org.egothor.stemmer.trie.NodeData
Returns the internal ordered-counts array.
orderedValues() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns the internal ordered-values array.
orderedValues() - Method in class org.egothor.stemmer.trie.LocalValueSummary
Returns ordered values.
orderedValues() - Method in record class org.egothor.stemmer.trie.NodeData
Returns the internal ordered-values array.
org.egothor.stemmer - package org.egothor.stemmer
Provides the core Egothor-style stemming infrastructure based on compact patch-command tries.
org.egothor.stemmer.trie - package org.egothor.stemmer.trie
Provides internal trie infrastructure used by FrequencyTrie compilation, reduction, canonicalization, and binary reconstruction.

P

Packages - Search tag in Overview
Section
parse(Reader, String, CaseProcessingMode, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
Parses a dictionary from a reader.
parse(Reader, String, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
Parses a dictionary from a reader.
parse(String, CaseProcessingMode, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
Parses a dictionary file from a path string.
parse(String, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
Parses a dictionary file from a path string.
parse(Path, CaseProcessingMode, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
Parses a dictionary file from a filesystem path.
parse(Path, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
Parses a dictionary file from a filesystem path.
ParseStatistics(String, int, int, int) - Constructor for record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Creates parsing statistics.
PatchCommandEncoder - Class in org.egothor.stemmer
Encodes a compact patch command that transforms one word form into another and applies such commands back to source words.
PatchCommandEncoder.Builder - Class in org.egothor.stemmer
Fluent builder for creating direction-specialized PatchCommandEncoder instances.
PL_PL - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Polish.
Project Scope - Search tag in Overview
Section
PT_PT - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Portuguese.
put(String, V) - Method in class org.egothor.stemmer.FrequencyTrie.Builder
Stores a value for the supplied key and increments its local frequency.
put(String, V, int) - Method in class org.egothor.stemmer.FrequencyTrie.Builder
Stores a value for the supplied key and increments its local frequency by the specified positive count.

R

Radixor - Search tag in Overview
Section
read(DataInputStream) - Method in interface org.egothor.stemmer.FrequencyTrie.ValueStreamCodec
Reads one value from the supplied data input.
read(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Reads a GZip-compressed binary patch-command trie from an input stream.
read(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Reads a GZip-compressed binary patch-command trie from a filesystem path string.
read(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Reads a GZip-compressed binary patch-command trie from a filesystem path.
readFrom(InputStream, IntFunction<V[]>, FrequencyTrie.ValueStreamCodec<V>) - Static method in class org.egothor.stemmer.FrequencyTrie
Reads a compiled trie from the supplied input stream.
readMetadata(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Reads only metadata from a GZip-compressed binary patch-command trie from an input stream.
readMetadata(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Reads only metadata from a GZip-compressed binary patch-command trie stored at a filesystem path string.
readMetadata(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Reads only metadata from a GZip-compressed binary patch-command trie stored at a filesystem path.
ReducedNode<V> - Class in org.egothor.stemmer.trie
Canonical reduced node used during subtree merging.
ReducedNode(ReductionSignature<V>, Map<V, Integer>, Map<Character, ReducedNode<V>>) - Constructor for class org.egothor.stemmer.trie.ReducedNode
Creates a new reduced node.
ReductionContext<V> - Class in org.egothor.stemmer.trie
Reduction context used while canonicalizing mutable nodes.
ReductionContext(ReductionSettings) - Constructor for class org.egothor.stemmer.trie.ReductionContext
Creates a new context.
reductionMode() - Method in record class org.egothor.stemmer.ReductionSettings
Returns the value of the reductionMode record component.
reductionMode() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the reductionMode record component.
ReductionMode - Enum Class in org.egothor.stemmer
Defines the subtree reduction strategy applied during trie compilation.
reductionSettings() - Method in record class org.egothor.stemmer.TrieMetadata
Returns the value of the reductionSettings record component.
ReductionSettings - Record Class in org.egothor.stemmer
Immutable reduction configuration used by FrequencyTrie.Builder.
ReductionSettings(ReductionMode, int, int) - Constructor for record class org.egothor.stemmer.ReductionSettings
Creates a new instance.
ReductionSignature<V> - Class in org.egothor.stemmer.trie
Immutable reduction signature of a full subtree.
register(ReductionSignature<V>, ReducedNode<V>) - Method in class org.egothor.stemmer.trie.ReductionContext
Registers a canonical node.
REMOVE - Enum constant in enum class org.egothor.stemmer.DiacriticProcessingMode
Removes diacritics from dictionary entries before trie construction and removes diacritics from lookup keys before traversal.
replaceCost(int) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
Sets cost of a replace operation.
resourceDirectory() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Returns the resource directory name.
resourcePath() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Returns the classpath resource path of the bundled stemmer dictionary.
ResultRow(String, String, boolean, boolean, int, long, int, long, long, long, double, long, long, long, double, double, double, double) - Constructor for record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Creates one immutable result row.
RU_RU - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Russian.

S

saveBinary(FrequencyTrie<String>, String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Saves a compiled patch-command trie as a GZip-compressed binary file.
saveBinary(FrequencyTrie<String>, Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Saves a compiled patch-command trie as a GZip-compressed binary file.
seed() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the seed record component.
settings() - Method in class org.egothor.stemmer.trie.ReductionContext
Returns the settings.
signature() - Method in class org.egothor.stemmer.trie.ReducedNode
Returns the reduction signature of this canonical node.
size() - Method in class org.egothor.stemmer.FrequencyTrie
Returns the number of canonical compiled nodes reachable from the root.
sourceDescription() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns the value of the sourceDescription record component.
startIndex(int) - Method in enum class org.egothor.stemmer.WordTraversalDirection
Returns the traversal start index for a character sequence of the supplied length.
StemmerDictionaryParser - Class in org.egothor.stemmer
Parser of line-oriented stemmer dictionary files.
StemmerDictionaryParser.EntryHandler - Interface in org.egothor.stemmer
Callback receiving one parsed dictionary line.
StemmerDictionaryParser.ParseStatistics - Record Class in org.egothor.stemmer
Immutable parsing statistics.
StemmerKnowledgeExperiment - Class in org.egothor.stemmer
Evaluates how stemming quality degrades when the compiled trie is built from only a deterministic subset of the available dictionary knowledge.
StemmerKnowledgeExperiment() - Constructor for class org.egothor.stemmer.StemmerKnowledgeExperiment
Creates a new experiment harness.
StemmerKnowledgeExperiment.ResultRow - Record Class in org.egothor.stemmer
One immutable result row of the knowledge experiment.
StemmerKnowledgeExperimentCli - Class in org.egothor.stemmer
Command-line entry point for the stemmer knowledge experiment.
StemmerPatchTrieBinaryIO - Class in org.egothor.stemmer
Binary persistence helper for patch-command stemmer tries.
StemmerPatchTrieLoader - Class in org.egothor.stemmer
Loader of patch-command tries from bundled stemmer dictionaries.
StemmerPatchTrieLoader.Language - Enum Class in org.egothor.stemmer
Supported bundled stemmer dictionaries.
storeOriginal() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the storeOriginal record component.
SV_SE - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Swedish.

T

toCsvRow() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Serializes this row as one CSV record.
toString() - Method in record class org.egothor.stemmer.ReductionSettings
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.trie.NodeData
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.TrieMetadata
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.ValueCount
Returns a string representation of this record class.
toTextBlock() - Method in record class org.egothor.stemmer.TrieMetadata
Returns metadata encoded as a deterministic human-readable text block.
toTraversalCharacters(String) - Method in enum class org.egothor.stemmer.WordTraversalDirection
Returns the characters of the supplied word in this traversal order.
trainingEntryCount() - Method in record class org.egothor.stemmer.StemmerKnowledgeExperiment.ResultRow
Returns the value of the trainingEntryCount record component.
traversalDirection() - Method in class org.egothor.stemmer.FrequencyTrie
Returns the logical key traversal direction used by this trie.
traversalDirection() - Method in record class org.egothor.stemmer.TrieMetadata
Returns the value of the traversalDirection record component.
traversalDirection(WordTraversalDirection) - Method in class org.egothor.stemmer.PatchCommandEncoder.Builder
Sets traversal direction used by the created encoder.
traversalPathToLogicalKey(CharSequence) - Method in enum class org.egothor.stemmer.WordTraversalDirection
Converts a path represented in traversal order back to the logical key form.
TrieMetadata - Record Class in org.egothor.stemmer
Immutable metadata persisted together with a compiled trie artifact.
TrieMetadata(int, WordTraversalDirection, ReductionSettings, DiacriticProcessingMode, CaseProcessingMode) - Constructor for record class org.egothor.stemmer.TrieMetadata
Creates a new metadata instance.

U

UK_UA - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Ukrainian.
US_UK - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
English.

V

value() - Method in record class org.egothor.stemmer.ValueCount
Returns the value of the value record component.
ValueCount<V> - Record Class in org.egothor.stemmer
Immutable value-count pair returned by read-only trie queries.
ValueCount(V, int) - Constructor for record class org.egothor.stemmer.ValueCount
Creates a new value-count pair.
valueCounts() - Method in class org.egothor.stemmer.trie.MutableNode
Returns the internal local terminal value-count map.
valueOf(String) - Static method in enum class org.egothor.stemmer.CaseProcessingMode
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class org.egothor.stemmer.DiacriticProcessingMode
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class org.egothor.stemmer.ReductionMode
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class org.egothor.stemmer.WordTraversalDirection
Returns the enum constant of this class with the specified name.
values() - Static method in enum class org.egothor.stemmer.CaseProcessingMode
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class org.egothor.stemmer.DiacriticProcessingMode
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class org.egothor.stemmer.ReductionMode
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class org.egothor.stemmer.WordTraversalDirection
Returns an array containing the constants of this enum class, in the order they are declared.

W

withDefaults(ReductionMode) - Static method in record class org.egothor.stemmer.ReductionSettings
Creates settings with default dominance thresholds.
WordTraversalDirection - Enum Class in org.egothor.stemmer
Defines the logical direction in which word characters are traversed.
write(DataOutputStream, V) - Method in interface org.egothor.stemmer.FrequencyTrie.ValueStreamCodec
Writes one value to the supplied data output.
write(FrequencyTrie<String>, OutputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Writes a GZip-compressed binary patch-command trie to an output stream.
write(FrequencyTrie<String>, String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Writes a GZip-compressed binary patch-command trie to a filesystem path string.
write(FrequencyTrie<String>, Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Writes a GZip-compressed binary patch-command trie to a filesystem path.
writeCsv(Path, List<StemmerKnowledgeExperiment.ResultRow>) - Static method in class org.egothor.stemmer.StemmerKnowledgeExperiment
Writes result rows as UTF-8 CSV with a stable fixed header.
writeTo(OutputStream, FrequencyTrie.ValueStreamCodec<V>) - Method in class org.egothor.stemmer.FrequencyTrie
Writes this compiled trie to the supplied output stream.

Y

YI - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Yiddish.
A B C D E F G H I K L M N O P R S T U V W Y 
All Classes and Interfaces|All Packages|Constant Field Values