Gets a value indicating whether the encoding is detected automatically. The default value is false.
Returns:
boolean - A value indicating whether to detect encoding automatically or not.
setAutoDetectEncoding(boolean value)
public void setAutoDetectEncoding(boolean value)
Sets a value indicating whether the encoding is detected automatically. The default value is false.
Parameters:
| Parameter | Type | Description |
| --- | --- | --- |
| value | boolean | A value indicating whether to detect encoding automatically or not. |
getEncoding()
public String getEncoding()
Gets the encoding used to extract text from text documents. The default value is null, which means that the default encoding, UTF-8, is used. If AutoDetectEncoding is true, this value is used as the default encoding.
Returns:
java.lang.String - The encoding used to extract text from text documents.
setEncoding(String value)
public void setEncoding(String value)
Sets the encoding used to extract text from text documents. The default value is null, which means that the default encoding, UTF-8, is used. If AutoDetectEncoding is true, this value is used as the default encoding.
Parameters:
| Parameter | Type | Description |
| --- | --- | --- |
| value | java.lang.String | The encoding used to extract text from text documents. |
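The interaction between the encoding value and AutoDetectEncoding can be sketched as follows. This is an illustration only, not the library's implementation: the class `EncodingResolutionSketch` and its methods are hypothetical, automatic detection is modelled as a simple byte-order-mark check, and "used as the default encoding" is read here as "used when detection finds nothing", which is an assumption.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of the documented API.
final class EncodingResolutionSketch {

    // Picks the charset to decode with, given the two documented settings.
    static Charset resolve(String encoding, boolean autoDetectEncoding, byte[] content) {
        // Documented rule: a null encoding means UTF-8.
        Charset configured = (encoding == null)
                ? StandardCharsets.UTF_8
                : Charset.forName(encoding);

        if (autoDetectEncoding) {
            Charset detected = detectByBom(content);
            if (detected != null) {
                return detected;
            }
        }
        // Assumption: the configured value acts as the fallback.
        return configured;
    }

    // Assumption: detection is modelled as a BOM check only.
    // A real detector would also separate UTF-32 from UTF-16 and inspect the content.
    static Charset detectByBom(byte[] b) {
        if (b.length >= 3 && (b[0] & 0xFF) == 0xEF && (b[1] & 0xFF) == 0xBB && (b[2] & 0xFF) == 0xBF) {
            return StandardCharsets.UTF_8;
        }
        if (b.length >= 2 && (b[0] & 0xFF) == 0xFE && (b[1] & 0xFF) == 0xFF) {
            return StandardCharsets.UTF_16BE;
        }
        if (b.length >= 2 && (b[0] & 0xFF) == 0xFF && (b[1] & 0xFF) == 0xFE) {
            return StandardCharsets.UTF_16LE;
        }
        return null; // no BOM found
    }
}
```

In this sketch, a file without a byte-order mark is decoded with the configured encoding whether or not detection is enabled, so setting an explicit encoding remains useful even when AutoDetectEncoding is true.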
getUseRawTextExtraction()
public boolean getUseRawTextExtraction()
Gets a value indicating whether raw mode is used for text extraction when possible. The default value is true. Raw mode can significantly increase indexing speed, whereas normal mode improves the formatting of the extracted text.
Returns:
boolean - A value indicating whether the raw mode is used for text extraction if possible.
setUseRawTextExtraction(boolean value)
public void setUseRawTextExtraction(boolean value)
Sets a value indicating whether raw mode is used for text extraction when possible. The default value is true. Raw mode can significantly increase indexing speed, whereas normal mode improves the formatting of the extracted text.
Parameters:
| Parameter | Type | Description |
| --- | --- | --- |
| value | boolean | A value indicating whether the raw mode is used for text extraction if possible. |
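The accessors and default values documented above can be summarised in a small stand-in class. This is a sketch only: `TextOptionsSketch` and `TextOptionsDemo` are hypothetical names that mirror the documented getters, setters, and defaults so the example compiles on its own. It is not the library's class, and the choice of ISO-8859-1 is purely illustrative.

```java
// Hypothetical stand-in mirroring only the accessors documented above.
final class TextOptionsSketch {
    private boolean autoDetectEncoding = false;  // documented default: false
    private String encoding = null;              // documented default: null (UTF-8 is used)
    private boolean useRawTextExtraction = true; // documented default: true

    public boolean getAutoDetectEncoding() { return autoDetectEncoding; }
    public void setAutoDetectEncoding(boolean value) { autoDetectEncoding = value; }

    public String getEncoding() { return encoding; }
    public void setEncoding(String value) { encoding = value; }

    public boolean getUseRawTextExtraction() { return useRawTextExtraction; }
    public void setUseRawTextExtraction(boolean value) { useRawTextExtraction = value; }
}

// Hypothetical usage: favour formatting quality over indexing speed.
final class TextOptionsDemo {
    static TextOptionsSketch formattingFirst() {
        TextOptionsSketch options = new TextOptionsSketch();
        options.setAutoDetectEncoding(true);     // detect the encoding per document
        options.setEncoding("ISO-8859-1");       // illustrative default encoding
        options.setUseRawTextExtraction(false);  // normal mode: better formatting, slower
        return options;
    }
}
```

Leaving all three settings untouched keeps the documented defaults: no automatic detection, UTF-8 decoding, and raw mode enabled for faster indexing.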
getMetadataIndexingOptions()
public MetadataIndexingOptions getMetadataIndexingOptions()