Extractor
Contents
[
Hide
]
Inheritance: java.lang.Object, com.groupdocs.search.events.EventHubBase
public class Extractor extends EventHubBase
Represents a tool for preliminary data extraction from documents for separating the stage of subsequent fast indexing.
Constructors
| Constructor | Description |
|---|---|
| Extractor() | Initializes a new instance of the Extractor class. |
Fields
| Field | Description |
|---|---|
| ErrorOccurred | Occurs when an error happens during an extractor operation. |
| ImagePreparing | Occurs when an image is going to be prepared for indexing. |
| PasswordRequired | Occurs when a document requires password for opening. |
Methods
| Method | Description |
|---|---|
| getSettings() | Gets the extractor settings. |
| extract(Document document, ExtractionOptions extractionOptions) | Extracts data from a document. |
| raiseErrorOccurredPublic(String message, boolean isCritical) | |
| raiseImagePreparingPublic(String documentKey, String[] innerPath, int imageIndex, ImageFrame[] frames, InputStream stream) | |
| raisePasswordRequiredPublic(String filePath) |
Extractor()
public Extractor()
Initializes a new instance of the Extractor class.
ErrorOccurred
public final Event<EventHandler<IndexErrorEventArgs>> ErrorOccurred
Occurs when an error happens during an extractor operation.
ImagePreparing
public final Event<EventHandler<ImagePreparingEventArgs>> ImagePreparing
Occurs when an image is going to be prepared for indexing.
PasswordRequired
public final Event<EventHandler<PasswordRequiredEventArgs>> PasswordRequired
Occurs when a document requires password for opening.
getSettings()
public final ExtractorSettings getSettings()
Gets the extractor settings.
Returns: ExtractorSettings - The extractor settings.
extract(Document document, ExtractionOptions extractionOptions)
public final ExtractedData extract(Document document, ExtractionOptions extractionOptions)
Extracts data from a document.
Parameters:
| Parameter | Type | Description |
|---|---|---|
| document | Document | The document from file system, stream or structure. |
| extractionOptions | ExtractionOptions | The extraction options. |
Returns: ExtractedData - The extracted data of the document.
raiseErrorOccurredPublic(String message, boolean isCritical)
public final void raiseErrorOccurredPublic(String message, boolean isCritical)
Parameters:
| Parameter | Type | Description |
|---|---|---|
| message | java.lang.String | |
| isCritical | boolean |
raiseImagePreparingPublic(String documentKey, String[] innerPath, int imageIndex, ImageFrame[] frames, InputStream stream)
public final void raiseImagePreparingPublic(String documentKey, String[] innerPath, int imageIndex, ImageFrame[] frames, InputStream stream)
Parameters:
| Parameter | Type | Description |
|---|---|---|
| documentKey | java.lang.String | |
| innerPath | java.lang.String[] | |
| imageIndex | int | |
| frames | ImageFrame[] | |
| stream | java.io.InputStream |
raisePasswordRequiredPublic(String filePath)
public final String raisePasswordRequiredPublic(String filePath)
Parameters:
| Parameter | Type | Description |
|---|---|---|
| filePath | java.lang.String |
Returns: java.lang.String