Class EdgeNGramTokenizer
Tokenizes the input from an edge into n-grams of given size(s).
This Tokenizer creates n-grams from the beginning edge or ending edge of an input token.
As of Lucene 4.4, this tokenizer
- can handle maxGram larger than 1024 chars, but beware that this will result in increased memory usage,
- doesn't trim the input,
- sets all position increments to 1 (instead of 1 for the first token and 0 for all other ones),
- doesn't support backward n-grams anymore,
- supports IsTokenChar(Int32) pre-tokenization,
- correctly handles supplementary characters.
Although highly discouraged, it is still possible to use the old behavior through Lucene43EdgeNGramTokenizer.
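A minimal sketch of front-edge n-gram tokenization with this class. It assumes Lucene.NET 4.8 and the `Lucene.Net.Analysis.NGram`, `Lucene.Net.Analysis.TokenAttributes`, and `Lucene.Net.Util` namespaces; the standard TokenStream workflow (`Reset`, `IncrementToken`, `End`, `Dispose`) applies:

```csharp
using System;
using System.IO;
using Lucene.Net.Analysis.NGram;
using Lucene.Net.Analysis.TokenAttributes;
using Lucene.Net.Util;

// Emit n-grams of length 1..3 anchored at the front edge of the input.
using (var tokenizer = new EdgeNGramTokenizer(
    LuceneVersion.LUCENE_48, new StringReader("Quick"), 1, 3))
{
    // The term attribute exposes the text of the current token.
    var termAtt = tokenizer.GetAttribute<ICharTermAttribute>();
    tokenizer.Reset();
    while (tokenizer.IncrementToken())
    {
        Console.WriteLine(termAtt.ToString()); // "Q", "Qu", "Qui"
    }
    tokenizer.End();
}
```

Each emitted gram starts at the beginning of the input, so a maxGram of 3 yields at most three tokens per input regardless of its length.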
Inherited Members
Lucene.Net.Analysis.Tokenizer.SetReader(System.IO.TextReader)
System.Object.Equals(System.Object, System.Object)
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.GetType()
System.Object.MemberwiseClone()
Assembly: Lucene.Net.Analysis.Common.dll
Syntax
[Serializable]
public class EdgeNGramTokenizer : NGramTokenizer, IDisposable
Constructors
| Name | Description |
|---|---|
| EdgeNGramTokenizer(LuceneVersion, AttributeSource.AttributeFactory, TextReader, Int32, Int32) | Creates an EdgeNGramTokenizer that can generate n-grams in the sizes of the given range |
| EdgeNGramTokenizer(LuceneVersion, TextReader, Int32, Int32) | Creates an EdgeNGramTokenizer that can generate n-grams in the sizes of the given range |
Fields
| Name | Description |
|---|---|
| DEFAULT_MAX_GRAM_SIZE | |
| DEFAULT_MIN_GRAM_SIZE | |