Skip to end of metadata
Go to start of metadata

Author

Boris Capitanu

Creation date

06/06/2011

Firing policy

all

Package

org.seasr.meandre.components.tools.text.io

DESCRIPTION

Extracts text from the specified input location. Supported location references include: PDF files, HTML/XML files, text files.

INPUTS

Name

Description

Example

location
The document location
TYPE: java.net.URI
TYPE: java.net.URL
TYPE: java.lang.String
TYPE: org.seasr.datatypes.BasicDataTypes.Strings

 

OUTPUTS

Name

Description

Example

error
This port is used to output any unhandled errors encountered during the execution of this component

 

text
The text extracted from the given document
TYPE: org.seasr.datatypes.BasicDataTypes.Strings

 

location
The document location

 

PROPERTIES

Name

Description

Default value

_debug_level
Controls the verbosity of debug messages printed by the component during execution.
Possible values are: off, severe, warning, info, config, fine, finer, finest, all
Append ',mirror' to any of the values above to mirror that output to the server logs.
info
read_timeout
The read timeout in milliseconds (amount of time to wait for a read operation to complete before giving up; 0 = wait forever)
0
connection_timeout
The connection timeout in milliseconds (amount of time to wait for a connection to be established before giving up; 0 = wait forever)
0
_ignore_errors
Set to 'true' to ignore all unhandled exceptions and prevent the flow from being terminated. Setting this property to 'false' will result in the flow being terminated in the event an unhandled exception is thrown during the execution of this component
false
Write a comment…