Class IcuBreakIterator
A BreakIterator implementation that encapsulates the functionality
of icu.net's
Assembly: Lucene.Net.ICU.dll
Syntax
public class IcuBreakIterator : BreakIterator
Constructors
Name | Description |
---|---|
IcuBreakIterator(Icu.BreakIterator.UBreakIteratorType) | |
IcuBreakIterator(Icu.BreakIterator.UBreakIteratorType, CultureInfo) | |
Fields
Name | Description |
---|---|
m_end | The end offset for the string, if supplied by a CharacterIterator |
m_start | The start offset for the string, if supplied by a CharacterIterator |
Properties
Name | Description |
---|---|
Current | Returns the current iteration position. |
EnableHacks | |
Text | Gets the text being analyzed. |
Methods
Name | Description |
---|---|
CapitalizeFirst(String) | LUCENENET TODO: Temporary workaround for an icu-dotnet issue in which sentences are not broken correctly unless they begin with a capital letter. If and when ICU is fixed, this method should be deleted and the calls to it removed from the IcuBreakIterator code. |
First() | Sets the current iteration position to the beginning of the text. |
Following(Int32) | Sets the iterator to refer to the first boundary position following the specified position. |
IsBoundary(Int32) | Returns true if the specified character offset is a text boundary. |
Last() | Sets the current iteration position to the end of the text. |
Next() | Advances the iterator to the next boundary position. |
Next(Int32) | Advances the iterator either forward or backward the specified number of steps. Negative values move backward, and positive values move forward. This is equivalent to repeatedly calling Next() or Previous(). |
Preceding(Int32) | Sets the iterator to refer to the last boundary position before the specified position. |
Previous() | Moves the iterator backward to the boundary immediately preceding the current one. |
SetText(CharacterIterator) | |
SetText(String) | Sets the iterator to analyze a new piece of text. This method resets the current iteration position to the beginning of the text. |
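
The members above follow the usual break-iterator pattern: set the text, move to the first boundary, then advance until the iterator reports that no boundaries remain. The following is a minimal sketch of that loop, not taken from the library's own documentation. It assumes that `IcuBreakIterator` lives in the `Lucene.Net.Support` namespace, that `First()` and `Next()` return the boundary offset as an `int`, and that the end of iteration is signalled by `-1` (the value of `DONE` in the Java `BreakIterator` this API mirrors). The `BoundaryDemo` class and its method names are hypothetical.

```csharp
using System;
using System.Collections.Generic;
using Lucene.Net.Support; // assumed namespace; adjust to your Lucene.NET version

public static class BoundaryDemo
{
    // Assumption: iteration ends with -1, as with DONE in the Java BreakIterator.
    private const int Done = -1;

    // Collects every boundary offset the iterator reports for the given text.
    public static List<int> CollectBoundaries(IcuBreakIterator iterator, string text)
    {
        var boundaries = new List<int>();

        // SetText resets the current position to the beginning of the text.
        iterator.SetText(text);

        // First() moves to the start; Next() advances one boundary at a time.
        for (int pos = iterator.First(); pos != Done; pos = iterator.Next())
        {
            boundaries.Add(pos);
        }
        return boundaries;
    }

    // Prints the word-boundary offsets of a sample string.
    public static void Demo()
    {
        var iterator = new IcuBreakIterator(Icu.BreakIterator.UBreakIteratorType.WORD);
        foreach (int offset in CollectBoundaries(iterator, "Hello world. How are you?"))
        {
            Console.WriteLine(offset);
        }
    }
}
```

Passing the iterator in as a parameter keeps the choice of break type (word, sentence, line, and so on) and culture with the caller, so the same loop serves either constructor overload.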