Class HyphenationCompoundWordTokenFilter
A TokenFilter that decomposes compound words found in many Germanic languages.
"Donaudampfschiff" becomes Donau, dampf, schiff so that you can find "Donaudampfschiff" even when you only enter "schiff". It uses a hyphenation grammar and a word dictionary to achieve this.
You must specify the required LuceneVersion compatibility when creating CompoundWordTokenFilterBase:
- As of 3.1, CompoundWordTokenFilterBase correctly handles Unicode 4.0 supplementary characters in strings and char arrays provided as compound word dictionaries.
Inheritance
System.Object
HyphenationCompoundWordTokenFilter
Inherited Members
System.Object.Equals(System.Object, System.Object)
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.GetType()
System.Object.MemberwiseClone()
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
[Serializable]
public class HyphenationCompoundWordTokenFilter : CompoundWordTokenFilterBase, IDisposable
Constructors
Methods
Name | Description |
---|---|
Decompose() | |
GetHyphenationTree(FileInfo) | Create a hyphenator tree |
GetHyphenationTree(FileInfo, Encoding) | Create a hyphenator tree |
GetHyphenationTree(Stream) | Create a hyphenator tree |
GetHyphenationTree(Stream, Encoding) | Create a hyphenator tree |
GetHyphenationTree(String) | Create a hyphenator tree |
GetHyphenationTree(String, Encoding) | Create a hyphenator tree |