Class CJKAnalyzer
An Analyzer that tokenizes text with StandardTokenizer, normalizes content with CJKWidthFilter, folds case with LowerCaseFilter, forms bigrams of CJK with CJKBigramFilter, and filters stopwords with StopFilter
Inherited Members
Lucene.Net.Analysis.Analyzer.NewAnonymous(System.Func<System.String, System.IO.TextReader, Lucene.Net.Analysis.TokenStreamComponents>)
Lucene.Net.Analysis.Analyzer.NewAnonymous(System.Func<System.String, System.IO.TextReader, Lucene.Net.Analysis.TokenStreamComponents>, Lucene.Net.Analysis.ReuseStrategy)
Lucene.Net.Analysis.Analyzer.NewAnonymous(System.Func<System.String, System.IO.TextReader, Lucene.Net.Analysis.TokenStreamComponents>, System.Func<System.String, System.IO.TextReader, System.IO.TextReader>)
Lucene.Net.Analysis.Analyzer.NewAnonymous(System.Func<System.String, System.IO.TextReader, Lucene.Net.Analysis.TokenStreamComponents>, System.Func<System.String, System.IO.TextReader, System.IO.TextReader>, Lucene.Net.Analysis.ReuseStrategy)
Lucene.Net.Analysis.Analyzer.GetTokenStream(System.String, System.IO.TextReader)
Lucene.Net.Analysis.Analyzer.InitReader(System.String, System.IO.TextReader)
Lucene.Net.Analysis.Analyzer.GetObjectData(System.Runtime.Serialization.SerializationInfo, System.Runtime.Serialization.StreamingContext)
System.Object.ToString()
System.Object.Equals(System.Object)
System.Object.Equals(System.Object, System.Object)
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.GetHashCode()
System.Object.GetType()
System.Object.MemberwiseClone()
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
[Serializable]
public sealed class CJKAnalyzer : StopwordAnalyzerBase, IDisposable
Constructors
Name | Description |
---|---|
CJKAnalyzer(LuceneVersion) | Builds an analyzer which removes words in DefaultStopSet. |
CJKAnalyzer(LuceneVersion, CharArraySet) | Builds an analyzer with the given stop words |
Fields
Name | Description |
---|---|
DEFAULT_STOPWORD_FILE | File containing default CJK stopwords. Currently it contains some common English words that are not usually useful for searching and some double-byte interpunctions. |
Properties
Name | Description |
---|---|
DefaultStopSet | Returns an unmodifiable instance of the default stop-words set. |
Methods
Name | Description |
---|---|
CreateComponents(String, TextReader) |