| Author | Boris Capitanu |
| Creation date | 06/06/2011 |
| Firing policy | all |
| Package | org.seasr.meandre.components.tools.text.io |
DESCRIPTION
Extracts text from the document at the specified input location. Supported document types include PDF files, HTML/XML files, and plain text files.
INPUTS
| Name | Description | Example |
|---|---|---|
| location | The document location. TYPE: java.net.URI, java.net.URL, java.lang.String, org.seasr.datatypes.BasicDataTypes.Strings | |
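
The `location` port accepts several Java types. As a rough illustration of how such inputs could be reduced to a single `java.net.URI` before reading, the sketch below uses a hypothetical helper, `LocationNormalizer.toUri`. It is not the component's actual code: treating a scheme-less string as a local file path is an assumption, and the `org.seasr.datatypes.BasicDataTypes.Strings` case is left out because it requires the SEASR datatypes library.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URL;
import java.nio.file.Paths;

// Hypothetical helper for illustration only; not part of the SEASR API.
final class LocationNormalizer {
    private LocationNormalizer() {}

    // Converts a URI, URL, or String location into a java.net.URI.
    static URI toUri(Object location) {
        if (location instanceof URI) {
            return (URI) location;
        }
        if (location instanceof URL) {
            try {
                return ((URL) location).toURI();
            } catch (URISyntaxException e) {
                throw new IllegalArgumentException("Not a valid URI: " + location, e);
            }
        }
        if (location instanceof String) {
            String text = ((String) location).trim();
            // URI.create throws IllegalArgumentException on malformed input.
            URI uri = URI.create(text);
            // Assumption: a string without a scheme names a local file.
            return uri.isAbsolute() ? uri : Paths.get(text).toUri();
        }
        throw new IllegalArgumentException("Unsupported location type: "
                + (location == null ? "null" : location.getClass().getName()));
    }
}
```

Under these assumptions, `LocationNormalizer.toUri("http://example.org/doc.pdf")` keeps the `http` scheme, while `LocationNormalizer.toUri("notes.txt")` resolves to a `file:` URI relative to the working directory.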
OUTPUTS
| Name | Description | Example |
|---|---|---|
| error | This port is used to output any unhandled errors encountered during the execution of this component | |
| text | The text extracted from the given document. TYPE: org.seasr.datatypes.BasicDataTypes.Strings | |
| location | The document location | |
PROPERTIES
| Name | Description | Default value |
|---|---|---|
| _debug_level | Controls the verbosity of debug messages printed by the component during execution. Possible values are: off, severe, warning, info, config, fine, finer, finest, all. Append ',mirror' to any of these values to mirror that output to the server logs. | info |
| read_timeout | The read timeout in milliseconds (the amount of time to wait for a read operation to complete before giving up; 0 = wait forever) | 0 |
| connection_timeout | The connection timeout in milliseconds (the amount of time to wait for a connection to be established before giving up; 0 = wait forever) | 0 |
| _ignore_errors | Set to 'true' to ignore all unhandled exceptions and prevent the flow from being terminated. Setting this property to 'false' terminates the flow if an unhandled exception is thrown during the execution of this component. | false |
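
The semantics of `read_timeout` and `connection_timeout` (milliseconds, with 0 meaning wait forever) match those of the standard `java.net.URLConnection` timeout setters. The sketch below shows how the two values could be applied when opening a remote location. The class and method names (`TimeoutSketch`, `open`) are hypothetical, and whether the component uses `URLConnection` internally is an assumption.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.URI;
import java.net.URLConnection;

// Hypothetical illustration; not the component's actual implementation.
final class TimeoutSketch {
    private TimeoutSketch() {}

    // Prepares a connection with both timeouts applied. No network traffic
    // occurs until the caller connects or reads from the connection.
    static URLConnection open(URI location, int connectionTimeoutMs, int readTimeoutMs) {
        try {
            URLConnection connection = location.toURL().openConnection();
            // In URLConnection, a value of 0 means an infinite timeout.
            connection.setConnectTimeout(connectionTimeoutMs);
            connection.setReadTimeout(readTimeoutMs);
            return connection;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

With both properties left at their default of 0, a read from an unresponsive server can block indefinitely, so setting finite values is usually worth considering for remote documents.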