Class ThaiAnalyzer
Analyzer for Thai language. It uses BreakIterator to break words.
You must specify the required LuceneVersion compatibility when creating ThaiAnalyzer:
- As of 3.6, a set of Thai stopwords is used by default
Inherited Members
Lucene.Net.Analysis.Analyzer.NewAnonymous(System.Func<System.String, System.IO.TextReader, Lucene.Net.Analysis.TokenStreamComponents>)
Lucene.Net.Analysis.Analyzer.NewAnonymous(System.Func<System.String, System.IO.TextReader, Lucene.Net.Analysis.TokenStreamComponents>, Lucene.Net.Analysis.ReuseStrategy)
Lucene.Net.Analysis.Analyzer.NewAnonymous(System.Func<System.String, System.IO.TextReader, Lucene.Net.Analysis.TokenStreamComponents>, System.Func<System.String, System.IO.TextReader, System.IO.TextReader>)
Lucene.Net.Analysis.Analyzer.NewAnonymous(System.Func<System.String, System.IO.TextReader, Lucene.Net.Analysis.TokenStreamComponents>, System.Func<System.String, System.IO.TextReader, System.IO.TextReader>, Lucene.Net.Analysis.ReuseStrategy)
Lucene.Net.Analysis.Analyzer.CreateComponents(System.String, System.IO.TextReader)
Lucene.Net.Analysis.Analyzer.GetTokenStream(System.String, System.IO.TextReader)
Lucene.Net.Analysis.Analyzer.InitReader(System.String, System.IO.TextReader)
Lucene.Net.Analysis.Analyzer.GetObjectData(System.Runtime.Serialization.SerializationInfo, System.Runtime.Serialization.StreamingContext)
Assembly: Lucene.Net.ICU.dll
Syntax
public sealed class ThaiAnalyzer : StopwordAnalyzerBase, IDisposable
Constructors
Name | Description |
---|---|
ThaiAnalyzer(LuceneVersion) | Builds an analyzer with the default stop words. |
ThaiAnalyzer(LuceneVersion, CharArraySet) | Builds an analyzer with the given stop words. |
Fields
Name | Description |
---|---|
DEFAULT_STOPWORD_FILE | File containing default Thai stopwords. |
Properties
Name | Description |
---|---|
DefaultStopSet | Returns an unmodifiable instance of the default stop words set. |
Methods
Name | Description |
---|---|
CreateComponents(String, TextReader) | Creates
TokenStreamComponents
used to tokenize all the text in the provided |