Class JapaneseIterationMarkCharFilter

Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.

Inheritance

System.Object

CharFilter

JapaneseIterationMarkCharFilter

Inherited Members

CharFilter.m_input

CharFilter.Dispose(Boolean)

CharFilter.CorrectOffset(Int32)

CharFilter.Skip(Int32)

CharFilter.Reset()

CharFilter.IsReady

CharFilter.IsMarkSupported

CharFilter.Mark(Int32)

Assembly: Lucene.Net.Analysis.Kuromoji.dll

Syntax

public class JapaneseIterationMarkCharFilter : CharFilter

Remarks

Sequences of iteration marks are supported. In case an illegal sequence of iteration marks is encountered, the implementation emits the illegal source character as-is without considering its script. For example, with input "??", we get "??" even though "?" isn't hiragana.

Note that a full stop punctuation character "?" (U+3002) can not be iterated (see below). Iteration marks themselves can be emitted in case they are illegal, i.e. if they go back past the beginning of the character stream.

The implementation buffers input until a full stop punctuation character (U+3002) or EOF is reached in order to not keep a copy of the character stream in memory. Vertical iteration marks, which are even rarer than horizontal iteration marks in contemporary Japanese, are unsupported.

Constructors

Name	Description
JapaneseIterationMarkCharFilter(TextReader)	Constructor. Normalizes both kanji and kana iteration marks by default.
JapaneseIterationMarkCharFilter(TextReader, Boolean, Boolean)	Constructor

Fields

Name	Description
NORMALIZE_KANA_DEFAULT	Normalize kana iteration marks by default
NORMALIZE_KANJI_DEFAULT	Normalize kanji iteration marks by default

Methods

Name	Description
Correct(Int32)
Read()	Reads the next character from the text reader and advances the character position by one character.
Read(Char[], Int32, Int32)	Reads a specified maximum number of characters from the current reader and writes the data to a buffer, beginning at the specified index.

Extension Methods

Number.IsNumber(Object)

SystemTypesHelpers.toString(Object)

SystemTypesHelpers.equals(Object, Object)