Index

A B C D E F G H I L M N O P R S T U V W 
All Classes and Interfaces|All Packages|Constant Field Values

A

apply(String, String) - Static method in class org.egothor.stemmer.PatchCommandEncoder
Applies a compact patch command to the supplied source word.

B

build() - Method in class org.egothor.stemmer.FrequencyTrie.Builder
Builds a compiled read-only trie.
Builder(IntFunction<V[]>, ReductionMode) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
Creates a new builder using default thresholds for the supplied reduction mode.
Builder(IntFunction<V[]>, ReductionSettings) - Constructor for class org.egothor.stemmer.FrequencyTrie.Builder
Creates a new builder with the provided settings.

C

canonicalNodeCount() - Method in class org.egothor.stemmer.trie.ReductionContext
Returns the number of canonical nodes.
childNodeIds() - Method in record class org.egothor.stemmer.trie.NodeData
Returns the internal child-node identifier array.
children() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns the internal child-node array.
children() - Method in class org.egothor.stemmer.trie.MutableNode
Returns the internal child-node map indexed by transition character.
children() - Method in class org.egothor.stemmer.trie.ReducedNode
Returns the internal canonical child map indexed by transition character.
Compile - Class in org.egothor.stemmer
Command-line compiler of stemmer dictionary files into compressed binary FrequencyTrie artifacts.
CompiledNode<V> - Record Class in org.egothor.stemmer.trie
Immutable compiled trie node optimized for read access.
CompiledNode(char[], CompiledNode<V>[], V[], int...) - Constructor for record class org.egothor.stemmer.trie.CompiledNode
Creates one validated compiled node.
copyOf(FrequencyTrie<V>, IntFunction<V[]>, ReductionMode) - Static method in class org.egothor.stemmer.FrequencyTrieBuilders
Reconstructs a new writable builder from a compiled read-only trie using default settings for the supplied reduction mode.
copyOf(FrequencyTrie<V>, IntFunction<V[]>, ReductionSettings) - Static method in class org.egothor.stemmer.FrequencyTrieBuilders
Reconstructs a new writable builder from a compiled read-only trie.
count() - Method in record class org.egothor.stemmer.ValueCount
Returns the value of the count record component.
create(LocalValueSummary<V>, Map<Character, ReducedNode<V>>, ReductionSettings) - Static method in class org.egothor.stemmer.trie.ReductionSignature
Creates a subtree signature according to the selected reduction mode.

D

DA_DK - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Danish.
DE_DE - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
German.
DEFAULT_DOMINANT_WINNER_MIN_PERCENT - Static variable in record class org.egothor.stemmer.ReductionSettings
Default minimum dominant winner percentage.
DEFAULT_DOMINANT_WINNER_OVER_SECOND_RATIO - Static variable in record class org.egothor.stemmer.ReductionSettings
Default minimum winner-over-second ratio.
dominantWinnerMinPercent() - Method in record class org.egothor.stemmer.ReductionSettings
Returns the value of the dominantWinnerMinPercent record component.
dominantWinnerOverSecondRatio() - Method in record class org.egothor.stemmer.ReductionSettings
Returns the value of the dominantWinnerOverSecondRatio record component.

E

edgeLabels() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns the internal edge-label array.
edgeLabels() - Method in record class org.egothor.stemmer.trie.NodeData
Returns the internal edge-label array.
encode(String, String) - Method in class org.egothor.stemmer.PatchCommandEncoder
Produces a compact patch command that transforms source into target.
entryCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns the value of the entryCount record component.
equals(Object) - Method in record class org.egothor.stemmer.ReductionSettings
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in record class org.egothor.stemmer.trie.CompiledNode
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in record class org.egothor.stemmer.trie.NodeData
Indicates whether some other object is "equal to" this one.
equals(Object) - Method in class org.egothor.stemmer.trie.ReductionSignature
 
equals(Object) - Method in record class org.egothor.stemmer.ValueCount
Indicates whether some other object is "equal to" this one.
ES_ES - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Spanish.

F

findChild(char) - Method in record class org.egothor.stemmer.trie.CompiledNode
Finds a child for the supplied edge character.
FR_FR - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
French.
FrequencyTrie<V> - Class in org.egothor.stemmer
Read-only trie mapping String keys to one or more values with frequency tracking.
FrequencyTrie.Builder<V> - Class in org.egothor.stemmer
Builder of FrequencyTrie.
FrequencyTrie.ValueStreamCodec<V> - Interface in org.egothor.stemmer
Codec used to persist values stored in the trie.
FrequencyTrieBuilders - Class in org.egothor.stemmer
Factory utilities related to FrequencyTrie.Builder.

G

get(String) - Method in class org.egothor.stemmer.FrequencyTrie
Returns the most frequent value stored at the node addressed by the supplied key.
getAll(String) - Method in class org.egothor.stemmer.FrequencyTrie
Returns all values stored at the node addressed by the supplied key, ordered by descending frequency.
getEntries(String) - Method in class org.egothor.stemmer.FrequencyTrie
Returns all values stored at the node addressed by the supplied key together with their occurrence counts, ordered by the same rules as FrequencyTrie.getAll(String).
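
The three lookup entries above describe frequency-ranked results: get(String) returns the most frequent value, and getAll(String) returns every value in descending frequency. The following is a minimal, self-contained sketch of that ranking idea only. It is not the library's trie; the class name FrequencyLookupSketch, the alphabetical tie-break, and the null result for an unknown key are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Toy stand-in for frequency-ranked lookups; NOT the library's FrequencyTrie. */
final class FrequencyLookupSketch {
    // key -> (value -> occurrence count)
    private final Map<String, Map<String, Integer>> counts = new HashMap<>();

    /** Records one more occurrence of the value under the key. */
    void put(String key, String value) {
        counts.computeIfAbsent(key, k -> new HashMap<>()).merge(value, 1, Integer::sum);
    }

    /** All values for the key, most frequent first. */
    List<String> getAll(String key) {
        Map<String, Integer> local = counts.get(key);
        if (local == null) {
            return new ArrayList<>();
        }
        List<String> values = new ArrayList<>(local.keySet());
        values.sort((a, b) -> {
            int byCount = Integer.compare(local.get(b), local.get(a)); // descending count
            return byCount != 0 ? byCount : a.compareTo(b); // assumed tie-break: alphabetical
        });
        return values;
    }

    /** Most frequent value for the key, or null when the key is unknown (assumed behavior). */
    String get(String key) {
        List<String> all = getAll(key);
        return all.isEmpty() ? null : all.get(0);
    }

    public static void main(String[] args) {
        FrequencyLookupSketch sketch = new FrequencyLookupSketch();
        sketch.put("key", "b");
        sketch.put("key", "a");
        sketch.put("key", "a");
        System.out.println(sketch.get("key"));
        System.out.println(sketch.getAll("key"));
    }
}
```
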

H

hashCode() - Method in record class org.egothor.stemmer.ReductionSettings
Returns a hash code value for this object.
hashCode() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns a hash code value for this object.
hashCode() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns a hash code value for this object.
hashCode() - Method in record class org.egothor.stemmer.trie.NodeData
Returns a hash code value for this object.
hashCode() - Method in class org.egothor.stemmer.trie.ReductionSignature
 
hashCode() - Method in record class org.egothor.stemmer.ValueCount
Returns a hash code value for this object.

I

ignoredLineCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns the value of the ignoredLineCount record component.
IT_IT - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Italian.

L

lineCount() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns the value of the lineCount record component.
load(String, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path string using default settings for the supplied reduction mode.
load(String, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path string using explicit reduction settings.
load(Path, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path using default settings for the supplied reduction mode.
load(Path, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a dictionary from a filesystem path using explicit reduction settings.
load(StemmerPatchTrieLoader.Language, boolean, ReductionMode) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a bundled dictionary using default settings for the supplied reduction mode.
load(StemmerPatchTrieLoader.Language, boolean, ReductionSettings) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a bundled dictionary using explicit reduction settings.
loadBinary(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a GZip-compressed binary patch-command trie from an input stream.
loadBinary(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a GZip-compressed binary patch-command trie from a filesystem path string.
loadBinary(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Loads a GZip-compressed binary patch-command trie from a filesystem path.
localCounts() - Method in class org.egothor.stemmer.trie.ReducedNode
Returns the internal aggregated local value-count map.
LocalValueSummary<V> - Class in org.egothor.stemmer.trie
Local terminal value summary of a node.
LocalValueSummary(V[], int[], int, V, int, int) - Constructor for class org.egothor.stemmer.trie.LocalValueSummary
Creates a summary.
lookup(ReductionSignature<V>) - Method in class org.egothor.stemmer.trie.ReductionContext
Looks up a canonical node.

M

main(String[]) - Static method in class org.egothor.stemmer.Compile
CLI entry point.
MERGE_SUBTREES_WITH_EQUIVALENT_DOMINANT_GET_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
Merges subtrees whose preferred get() results are equivalent for every reachable key suffix, provided that the locally dominant winner satisfies the configured dominance constraints.
MERGE_SUBTREES_WITH_EQUIVALENT_RANKED_GET_ALL_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
Merges subtrees whose getAll() results are equivalent for every reachable key suffix and whose local result ordering is the same.
MERGE_SUBTREES_WITH_EQUIVALENT_UNORDERED_GET_ALL_RESULTS - Enum constant in enum class org.egothor.stemmer.ReductionMode
Merges subtrees whose getAll() results are equivalent for every reachable key suffix, regardless of the local ordering of values.
mergeChildren(Map<Character, ReducedNode<V>>) - Method in class org.egothor.stemmer.trie.ReducedNode
Merges child references into this node.
mergeLocalCounts(Map<V, Integer>) - Method in class org.egothor.stemmer.trie.ReducedNode
Merges additional local counts into this node.
MutableNode<V> - Class in org.egothor.stemmer.trie
Mutable build-time node.
MutableNode() - Constructor for class org.egothor.stemmer.trie.MutableNode
Creates an empty node.
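
The ReductionMode constants above all describe merging subtrees that are equivalent under some criterion. The sketch below illustrates only the general idea of canonicalizing identical subtrees through a signature table. It uses strict structural equality, which is an assumption and is simpler than any of the three listed modes; the class SubtreeMergeSketch and its members are hypothetical and do not mirror ReductionContext or ReductionSignature.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

/** Toy subtree canonicalization by structural signature; NOT the library's reducer. */
final class SubtreeMergeSketch {
    static final class Node {
        final TreeMap<Character, Node> children = new TreeMap<>();
        String value; // null when no value is stored at this node
    }

    private final Node root = new Node();
    private final Map<String, Integer> canonicalIds = new HashMap<>();
    private int nodeCount = 1; // counts the root

    void put(String key, String value) {
        Node node = root;
        for (char c : key.toCharArray()) {
            Node next = node.children.get(c);
            if (next == null) {
                next = new Node();
                node.children.put(c, next);
                nodeCount++;
            }
            node = next;
        }
        node.value = value;
    }

    int nodeCount() {
        return nodeCount;
    }

    /** Returns how many distinct subtrees remain after merging identical ones. */
    int canonicalNodeCount() {
        canonicalIds.clear();
        canonicalId(root);
        return canonicalIds.size();
    }

    // Bottom-up: a node's signature is its value plus (edge, canonical child id) pairs.
    private int canonicalId(Node node) {
        StringBuilder sig = new StringBuilder();
        sig.append(node.value == null ? "-" : "v" + node.value.length() + ":" + node.value);
        for (Map.Entry<Character, Node> e : node.children.entrySet()) {
            sig.append('|').append(e.getKey()).append('>').append(canonicalId(e.getValue()));
        }
        String key = sig.toString();
        Integer id = canonicalIds.get(key);
        if (id == null) {
            id = canonicalIds.size();
            canonicalIds.put(key, id);
        }
        return id;
    }

    public static void main(String[] args) {
        SubtreeMergeSketch sketch = new SubtreeMergeSketch();
        sketch.put("ab", "x");
        sketch.put("cb", "x");
        System.out.println(sketch.nodeCount() + " nodes before merging");
        System.out.println(sketch.canonicalNodeCount() + " canonical nodes after merging");
    }
}
```

In this example the subtrees under 'a' and 'c' are identical, so they share one canonical entry, and so do their two leaf nodes.
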

N

NL_NL - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Dutch.
NO_NO - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Norwegian.
NodeData<V> - Record Class in org.egothor.stemmer.trie
Intermediate node data used during deserialization before child references are resolved.
NodeData(char[], int[], V[], int...) - Constructor for record class org.egothor.stemmer.trie.NodeData
Creates one validated node-data holder.

O

of(Map<V, Integer>, IntFunction<V[]>) - Static method in class org.egothor.stemmer.trie.LocalValueSummary
Builds a summary from local counts.
onEntry(String, String[], int) - Method in interface org.egothor.stemmer.StemmerDictionaryParser.EntryHandler
Accepts one parsed dictionary entry.
orderedCounts() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns the internal ordered-counts array.
orderedCounts() - Method in class org.egothor.stemmer.trie.LocalValueSummary
Returns ordered counts.
orderedCounts() - Method in record class org.egothor.stemmer.trie.NodeData
Returns the internal ordered-counts array.
orderedValues() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns the internal ordered-values array.
orderedValues() - Method in class org.egothor.stemmer.trie.LocalValueSummary
Returns ordered values.
orderedValues() - Method in record class org.egothor.stemmer.trie.NodeData
Returns the internal ordered-values array.
org.egothor.stemmer - package org.egothor.stemmer
Provides the core Egothor-style stemming infrastructure based on compact patch-command tries.
org.egothor.stemmer.trie - package org.egothor.stemmer.trie
Provides internal trie infrastructure used by FrequencyTrie compilation, reduction, canonicalization, and binary reconstruction.

P

parse(Reader, String, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
Parses a dictionary from a reader.
parse(String, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
Parses a dictionary file from a path string.
parse(Path, StemmerDictionaryParser.EntryHandler) - Static method in class org.egothor.stemmer.StemmerDictionaryParser
Parses a dictionary file from a filesystem path.
ParseStatistics(String, int, int, int) - Constructor for record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Creates parsing statistics.
PatchCommandEncoder - Class in org.egothor.stemmer
Encodes a compact patch command that transforms one word form into another and applies such commands back to source words.
PatchCommandEncoder() - Constructor for class org.egothor.stemmer.PatchCommandEncoder
Creates an encoder with the traditional Egothor cost model: insert = 1, delete = 1, replace = 1, match = 0.
PatchCommandEncoder(int, int, int, int) - Constructor for class org.egothor.stemmer.PatchCommandEncoder
Creates an encoder with explicit operation costs.
PT_PT - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Portuguese.
put(String, V) - Method in class org.egothor.stemmer.FrequencyTrie.Builder
Stores a value for the supplied key and increments its local frequency.
put(String, V, int) - Method in class org.egothor.stemmer.FrequencyTrie.Builder
Stores a value for the supplied key and increments its local frequency by the specified positive count.
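
The PatchCommandEncoder() entry above names a cost model of insert = 1, delete = 1, replace = 1, match = 0. With those weights, the total cost of the cheapest transformation is the classic unit-cost edit distance. The sketch below computes only that cost. It does not produce or apply patch commands, does not reflect the library's command format, and the class name UnitCostEditDistanceSketch is hypothetical.

```java
/** Unit-cost edit distance; illustrates the cost model only, NOT the library's encoder. */
final class UnitCostEditDistanceSketch {
    static final int INSERT = 1;
    static final int DELETE = 1;
    static final int REPLACE = 1;
    static final int MATCH = 0;

    /** Returns the minimal total cost of turning source into target. */
    static int distance(String source, String target) {
        int[][] cost = new int[source.length() + 1][target.length() + 1];
        // Turning a prefix of source into the empty string takes only deletions.
        for (int i = 1; i <= source.length(); i++) {
            cost[i][0] = cost[i - 1][0] + DELETE;
        }
        // Turning the empty string into a prefix of target takes only insertions.
        for (int j = 1; j <= target.length(); j++) {
            cost[0][j] = cost[0][j - 1] + INSERT;
        }
        for (int i = 1; i <= source.length(); i++) {
            for (int j = 1; j <= target.length(); j++) {
                boolean same = source.charAt(i - 1) == target.charAt(j - 1);
                int diagonal = cost[i - 1][j - 1] + (same ? MATCH : REPLACE);
                int deletion = cost[i - 1][j] + DELETE;
                int insertion = cost[i][j - 1] + INSERT;
                cost[i][j] = Math.min(diagonal, Math.min(deletion, insertion));
            }
        }
        return cost[source.length()][target.length()];
    }

    public static void main(String[] args) {
        System.out.println(distance("kitten", "sitting"));
    }
}
```
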

R

read(DataInputStream) - Method in interface org.egothor.stemmer.FrequencyTrie.ValueStreamCodec
Reads one value from the supplied data input.
read(InputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Reads a GZip-compressed binary patch-command trie from an input stream.
read(String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Reads a GZip-compressed binary patch-command trie from a filesystem path string.
read(Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Reads a GZip-compressed binary patch-command trie from a filesystem path.
readFrom(InputStream, IntFunction<V[]>, FrequencyTrie.ValueStreamCodec<V>) - Static method in class org.egothor.stemmer.FrequencyTrie
Reads a compiled trie from the supplied input stream.
ReducedNode<V> - Class in org.egothor.stemmer.trie
Canonical reduced node used during subtree merging.
ReducedNode(ReductionSignature<V>, Map<V, Integer>, Map<Character, ReducedNode<V>>) - Constructor for class org.egothor.stemmer.trie.ReducedNode
Creates a new reduced node.
ReductionContext<V> - Class in org.egothor.stemmer.trie
Reduction context used while canonicalizing mutable nodes.
ReductionContext(ReductionSettings) - Constructor for class org.egothor.stemmer.trie.ReductionContext
Creates a new context.
reductionMode() - Method in record class org.egothor.stemmer.ReductionSettings
Returns the value of the reductionMode record component.
ReductionMode - Enum Class in org.egothor.stemmer
Defines the subtree reduction strategy applied during trie compilation.
ReductionSettings - Record Class in org.egothor.stemmer
Immutable reduction configuration used by FrequencyTrie.Builder.
ReductionSettings(ReductionMode, int, int) - Constructor for record class org.egothor.stemmer.ReductionSettings
Creates a new instance.
ReductionSignature<V> - Class in org.egothor.stemmer.trie
Immutable reduction signature of a full subtree.
register(ReductionSignature<V>, ReducedNode<V>) - Method in class org.egothor.stemmer.trie.ReductionContext
Registers a canonical node.
resourceDirectory() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Returns the resource directory name.
resourcePath() - Method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Returns the classpath resource path of the stemmer dictionary.
RU_RU - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Russian.

S

saveBinary(FrequencyTrie<String>, String) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Saves a compiled patch-command trie as a GZip-compressed binary file.
saveBinary(FrequencyTrie<String>, Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieLoader
Saves a compiled patch-command trie as a GZip-compressed binary file.
settings() - Method in class org.egothor.stemmer.trie.ReductionContext
Returns the settings.
signature() - Method in class org.egothor.stemmer.trie.ReducedNode
Returns the reduction signature of this canonical node.
size() - Method in class org.egothor.stemmer.FrequencyTrie
Returns the number of canonical compiled nodes reachable from the root.
sourceDescription() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns the value of the sourceDescription record component.
StemmerDictionaryParser - Class in org.egothor.stemmer
Parser of line-oriented stemmer dictionary files.
StemmerDictionaryParser.EntryHandler - Interface in org.egothor.stemmer
Callback receiving one parsed dictionary line.
StemmerDictionaryParser.ParseStatistics - Record Class in org.egothor.stemmer
Immutable parsing statistics.
StemmerPatchTrieBinaryIO - Class in org.egothor.stemmer
Binary persistence helper for patch-command stemmer tries.
StemmerPatchTrieLoader - Class in org.egothor.stemmer
Loader of patch-command tries from bundled stemmer dictionaries.
StemmerPatchTrieLoader.Language - Enum Class in org.egothor.stemmer
Supported bundled stemmer dictionaries.
SV_SE - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Swedish.

T

toString() - Method in record class org.egothor.stemmer.ReductionSettings
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.StemmerDictionaryParser.ParseStatistics
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.trie.CompiledNode
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.trie.NodeData
Returns a string representation of this record class.
toString() - Method in record class org.egothor.stemmer.ValueCount
Returns a string representation of this record class.

U

US_UK - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
English.
US_UK_PROFI - Enum constant in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
English professional dictionary.

V

value() - Method in record class org.egothor.stemmer.ValueCount
Returns the value of the value record component.
ValueCount<V> - Record Class in org.egothor.stemmer
Immutable value-count pair returned by read-only trie queries.
ValueCount(V, int) - Constructor for record class org.egothor.stemmer.ValueCount
Creates a new value-count pair.
valueCounts() - Method in class org.egothor.stemmer.trie.MutableNode
Returns the internal local terminal value-count map.
valueOf(String) - Static method in enum class org.egothor.stemmer.ReductionMode
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Returns the enum constant of this class with the specified name.
values() - Static method in enum class org.egothor.stemmer.ReductionMode
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class org.egothor.stemmer.StemmerPatchTrieLoader.Language
Returns an array containing the constants of this enum class, in the order they are declared.

W

withDefaults(ReductionMode) - Static method in record class org.egothor.stemmer.ReductionSettings
Creates settings with default dominance thresholds.
write(DataOutputStream, V) - Method in interface org.egothor.stemmer.FrequencyTrie.ValueStreamCodec
Writes one value to the supplied data output.
write(FrequencyTrie<String>, OutputStream) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Writes a GZip-compressed binary patch-command trie to an output stream.
write(FrequencyTrie<String>, String) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Writes a GZip-compressed binary patch-command trie to a filesystem path string.
write(FrequencyTrie<String>, Path) - Static method in class org.egothor.stemmer.StemmerPatchTrieBinaryIO
Writes a GZip-compressed binary patch-command trie to a filesystem path.
writeTo(OutputStream, FrequencyTrie.ValueStreamCodec<V>) - Method in class org.egothor.stemmer.FrequencyTrie
Writes this compiled trie to the supplied output stream.
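
The read(DataInputStream) and write(DataOutputStream, V) entries describe a value codec, and several entries mention GZip-compressed binary output. The sketch below shows that general pattern with JDK classes only: a String codec round-tripped through GZIP streams. The local ValueCodec interface is a hypothetical mirror of the two listed signatures, with IOException assumed; it is not the library's FrequencyTrie.ValueStreamCodec, and the byte layout here says nothing about the library's file format.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

/** Codec round trip through GZIP; a pattern sketch, NOT the library's binary format. */
final class CodecRoundTripSketch {
    /** Hypothetical local mirror of a read/write value codec. */
    interface ValueCodec<V> {
        void write(DataOutputStream out, V value) throws IOException;

        V read(DataInputStream in) throws IOException;
    }

    /** String codec backed by the JDK's modified UTF-8 helpers. */
    static final ValueCodec<String> UTF_CODEC = new ValueCodec<>() {
        @Override
        public void write(DataOutputStream out, String value) throws IOException {
            out.writeUTF(value);
        }

        @Override
        public String read(DataInputStream in) throws IOException {
            return in.readUTF();
        }
    };

    /** Writes the value into a GZIP-compressed buffer and reads it back. */
    static String roundTrip(String value) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            // Closing the outer stream finishes the GZIP trailer.
            try (DataOutputStream out = new DataOutputStream(new GZIPOutputStream(bytes))) {
                UTF_CODEC.write(out, value);
            }
            try (DataInputStream in = new DataInputStream(
                    new GZIPInputStream(new ByteArrayInputStream(bytes.toByteArray())))) {
                return UTF_CODEC.read(in);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(roundTrip("example value"));
    }
}
```
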