Class HTMLScanner
This class implements a table-driven scanner for HTML, allowing for lots of defects. It implements the Scanner interface, which accepts a Reader object to fetch characters from and a ScanHandler object to report lexical events to.
Inheritance
System.Object
HTMLScanner
Assembly: Lucene.Net.Benchmark.dll
Syntax
public class HTMLScanner : object, IScanner, ILocator
Constructors
Name | Description |
---|---|
HTMLScanner() |
Properties
Name | Description |
---|---|
ColumnNumber | |
LineNumber | |
PublicId | |
SystemId |
Methods
Name | Description |
---|---|
ResetDocumentLocator(String, String) | Reset document locator, supplying systemid and publicid. |
Scan(TextReader, IScanHandler) | Scan HTML source, reporting lexical events. |
StartCDATA() | A callback for the ScanHandler that allows it to force the lexer state to CDATA content (no markup is recognized except the end of element. |