ExtractionOptions

ExtractionOptions class

Provides options for extracting data from documents.

public class ExtractionOptions

Constructors

Name Description
ExtractionOptions() Initializes a new instance of the ExtractionOptions class.

Properties

Name Description
AutoDetectEncoding { get; set; } Gets or sets a value indicating whether to detect encoding automatically or not. The default value is false.
CustomExtractor { get; set; } Gets or sets the custom text extractor. The default value is null.
Encoding { get; set; } Gets or sets the encoding used to extract text from text documents. The default value is null, which means that the default encoding UTF-8 is used. If AutoDetectEncoding is true then this value is used as the default encoding.
ImageIndexingOptions { get; } Gets the image indexing options for reverse image search.
MetadataIndexingOptions { get; } Gets the options for indexing metadata fields.
OcrIndexingOptions { get; } Gets the options for OCR processing and indexing recognized text.
UseRawTextExtraction { get; set; } Gets or sets a value indicating whether the raw mode is used for text extraction if possible. The default value is true. The raw mode can significantly increase the indexing speed, but normal mode improves the formatting of the extracted text.

See Also