Parser

Parser class

Represents the main class that controls text, images, container extraction and parsing functionality.

public sealed class Parser : IDisposable

Constructors

Name Description
Parser(DbConnection) Initializes a new instance of the Parser class to extract data from a database.
Parser(EmailConnection) Initializes a new instance of the Parser class to extract data from a remote email server.
Parser(Stream) Initializes a new instance of the Parser class.
Parser(string) Initializes a new instance of the Parser class.
Parser(DbConnection, ParserSettings) Initializes a new instance of the Parser class to extract data from a database.
Parser(EmailConnection, ParserSettings) Initializes a new instance of the Parser class to extract data from a remote email server.
Parser(Stream, LoadOptions) Initializes a new instance of the Parser class with LoadOptions.
Parser(string, LoadOptions) Initializes a new instance of the Parser class with LoadOptions.
Parser(Stream, LoadOptions, ParserSettings) Initializes a new instance of the Parser class with LoadOptions and ParserSettings.
Parser(string, LoadOptions, ParserSettings) Initializes a new instance of the Parser class with LoadOptions and ParserSettings.

Properties

Name Description
Features { get; } Gets the supported features.

Methods

Name Description
Dispose() Performs application-defined tasks associated with freeing, releasing, or resetting unmanaged resources.
GeneratePreview(PreviewOptions) Get pages preview.
GetBarcodes() Extracts barcodes from the document.
GetBarcodes(int) Extracts barcodes from the document page.
GetBarcodes(PageAreaOptions) Extracts barcodes from the document using customization options (to set the rectangular area that contains barcodes).
GetBarcodes(int, PageAreaOptions) Extracts barcodes from the document page using customization options (to set the rectangular area that contains barcodes).
GetContainer() Extracts a container object from the document to work with formats that contain attachments, ZIP archives etc.
GetDocumentInfo() Returns the general information about the document.
GetFormattedText(FormattedTextOptions) Extracts a formatted text from the document.
GetFormattedText(int, FormattedTextOptions) Extracts a formatted text from the document page.
GetHighlight(int, bool, HighlightOptions) Extracts a highlight from the document.
GetHyperlinks() Extracts hyperlinks from the document.
GetHyperlinks(int) Extracts hyperlinks from the document page.
GetHyperlinks(PageAreaOptions) Extracts hyperlinks from the document using customization options (to set the rectangular area that contains hyperlinks).
GetHyperlinks(int, PageAreaOptions) Extracts hyperlinks from the document page using customization options (to set the rectangular area that contains hyperlinks).
GetImages() Extracts images from the document.
GetImages(int) Extracts images from the document page.
GetImages(PageAreaOptions) Extracts images from the document using customization options (to set the rectangular area that contains images).
GetImages(int, PageAreaOptions) Extracts images from the document page using customization options (to set the rectangular area that contains images).
GetMetadata() Extracts metadata from the document.
GetStructure() Extracts a structured text from the document.
GetTables(PageTableAreaOptions) Extracts tables from the document.
GetTables(int, PageTableAreaOptions) Extracts tables from the document page.
GetText() Extracts a text from the document.
GetText(int) Extracts a text from the document page.
GetText(TextOptions) Extracts a text page from the document using text options (to enable raw fast text extraction mode).
GetText(int, TextOptions) Extracts a text from the document page using text options (to enable raw fast text extraction mode).
GetTextAreas() Extracts text areas from the document.
GetTextAreas(int) Extracts text areas from the document page.
GetTextAreas(PageTextAreaOptions) Extracts text areas from the document using customization options (regular expression, match case, etc.).
GetTextAreas(int, PageTextAreaOptions) Extracts text areas from the document page using customization options (regular expression, match case, etc.).
GetToc() Extracts a table of contents from the document.
ParseByTemplate(Template) Parses the document by the user-generated template.
ParseForm() Parses the document form.
Search(string) Searches a keyword in the document.
Search(string, SearchOptions) Searches a keyword in the document using search options (regular expression, match case, etc.).
static GetFileInfo(Stream) Returns the general information about a file.
static GetFileInfo(string) Returns the general information about a file.
static GetFileInfo(Stream, LoadOptions) Returns the general information about a file.
static GetFileInfo(string, LoadOptions) Returns the general information about a file.

See Also