
Author: Xavier Llora

Creation date: 06/06/2011

Firing policy: all

Package: org.seasr.meandre.components.nlp.opennlp

DESCRIPTION

This component splits the text received on its input into sentences using the OpenNLP tokenizing facilities.
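
To illustrate the kind of transformation this component performs, here is a minimal sketch of sentence splitting. It is not the component's implementation: it uses the JDK's `java.text.BreakIterator` as a stand-in for OpenNLP so that it runs without any model files, and the class and method names (`SentenceSplitSketch`, `split`) are hypothetical.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Illustrative only: BreakIterator stands in for OpenNLP here.
public class SentenceSplitSketch {

    // Returns the sentences found in the text, trimmed, in document order.
    public static List<String> split(String text, Locale locale) {
        BreakIterator it = BreakIterator.getSentenceInstance(locale);
        it.setText(text);

        List<String> sentences = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String sentence = text.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                sentences.add(sentence);
            }
        }
        return sentences;
    }

    public static void main(String[] args) {
        String text = "This is one sentence. Here is another one.";
        for (String sentence : split(text, Locale.ENGLISH)) {
            System.out.println(sentence);
        }
    }
}
```

A statistical sentence detector such as the one in OpenNLP may place boundaries differently from this rule-based stand-in, particularly around abbreviations.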

INPUTS

Name: text
Description: The text to be split into sentences
TYPE: java.lang.String
TYPE: org.seasr.datatypes.BasicDataTypes.Strings
TYPE: byte[]
TYPE: org.seasr.datatypes.BasicDataTypes.Bytes
TYPE: java.lang.Object
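
Because the text port accepts several types, the incoming object has to be reduced to a plain string before it can be split. The sketch below shows one way this could look for three of the listed types. It is an assumption, not the component's code: the helper name `toText` is hypothetical, the UTF-8 decoding of `byte[]` is a guess at the encoding, and the `BasicDataTypes.Strings` and `BasicDataTypes.Bytes` wrappers are left out to keep the example free of SEASR dependencies.

```java
import java.nio.charset.StandardCharsets;

// Illustrative only: a hypothetical helper, not part of the SEASR API.
public class InputTextSketch {

    // Converts a String, byte[] or arbitrary Object into text.
    public static String toText(Object input) {
        if (input == null) {
            throw new IllegalArgumentException("input must not be null");
        }
        if (input instanceof String) {
            return (String) input;
        }
        if (input instanceof byte[]) {
            // Assumption: bytes are UTF-8 encoded text.
            return new String((byte[]) input, StandardCharsets.UTF_8);
        }
        // Fallback for java.lang.Object: use its string representation.
        return input.toString();
    }
}
```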

 

OUTPUTS

Name: error
Description: This port is used to output any unhandled errors encountered during the execution of this component.

Name: sentences
Description: The sequence of sentences
TYPE: org.seasr.datatypes.BasicDataTypes.Strings

 

PROPERTIES

Name: openNLPdir
Description: The OpenNLP directory. If non-empty, the install step is skipped.

Name: _debug_level
Description: Controls the verbosity of debug messages printed by the component during execution.
Possible values are: off, severe, warning, info, config, fine, finer, finest, all
Append ',mirror' to any of the values above to mirror that output to the server logs.
Default value: info

Name: language
Description: The language to use in the tokenizer.
Default value: english

Name: _ignore_errors
Description: Set to 'true' to ignore all unhandled exceptions and prevent the flow from being terminated. Setting this property to 'false' will result in the flow being terminated in the event an unhandled exception is thrown during the execution of this component.
Default value: false
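
The _debug_level value combines a level name with an optional ',mirror' suffix. As a rough sketch of how such a value could be interpreted, the example below maps the level name onto `java.util.logging.Level`, whose standard levels carry the same names as the values listed above. The class and method names (`DebugLevelSketch`, `parse`) are hypothetical, and this is not the framework's own parsing code.

```java
import java.util.Locale;
import java.util.logging.Level;

// Illustrative only: a hypothetical parser for values such as "info" or "fine,mirror".
public class DebugLevelSketch {

    // Holds the parsed level and whether output is mirrored to the server logs.
    public static final class Setting {
        public final Level level;
        public final boolean mirror;

        Setting(Level level, boolean mirror) {
            this.level = level;
            this.mirror = mirror;
        }
    }

    public static Setting parse(String value) {
        String[] parts = value.trim().split(",", 2);
        // Level.parse expects the upper-case level name, e.g. "INFO";
        // it throws IllegalArgumentException for unknown names.
        Level level = Level.parse(parts[0].trim().toUpperCase(Locale.ROOT));
        boolean mirror = parts.length > 1 && parts[1].trim().equalsIgnoreCase("mirror");
        return new Setting(level, mirror);
    }
}
```

For instance, `parse("finest,mirror")` yields `Level.FINEST` with mirroring enabled, while `parse("info")` yields `Level.INFO` without it.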