| Author | Mike Haberman |
| Creation date | 06/06/2011 |
| Firing policy | all |
| Package | org.seasr.meandre.components.nlp.opennlp |
DESCRIPTION
This component assigns a part-of-speech (POS) tag to each token of the incoming tokenized sentences, using the OpenNLP POS tagging facilities.
INPUTS
| Name | Description | Example |
|---|---|---|
| tokenized_sentences | The sequence of tokenized sentences. TYPE: org.seasr.datatypes.BasicDataTypes.StringsMap | |
OUTPUTS
| Name | Description | Example |
|---|---|---|
| meta_tuple | Metadata for the tuples: (pos,sentenceId,offset,token). TYPE: org.seasr.datatypes.BasicDataTypes.Strings | |
| tuples | Set of tuples: (pos,sentenceId,offset,token). TYPE: org.seasr.datatypes.BasicDataTypes.StringsArray | |
| error | This port is used to output any unhandled errors encountered during the execution of this component. | |
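
The relationship between meta_tuple and tuples can be sketched as follows. This is only an illustration, not the component's code: it assumes each tuple is an array of strings whose positions follow the field order listed in meta_tuple, and that offset is the token's index within its sentence. The class name `TupleLayoutSketch`, the helper `field`, and the sample rows are hypothetical.

```java
import java.util.Arrays;
import java.util.List;

public class TupleLayoutSketch {
    // Field order as listed for the meta_tuple port.
    static final List<String> META = Arrays.asList("pos", "sentenceId", "offset", "token");

    // Returns the value of the named field from one tuple row (hypothetical helper).
    static String field(String[] tuple, String name) {
        int i = META.indexOf(name);
        if (i < 0) {
            throw new IllegalArgumentException("unknown field: " + name);
        }
        return tuple[i];
    }

    public static void main(String[] args) {
        // Hypothetical rows; real values are produced by the component at run time.
        String[][] tuples = {
            {"DT", "0", "0", "The"},
            {"NN", "0", "1", "cat"},
        };
        for (String[] t : tuples) {
            System.out.println(field(t, "token") + "/" + field(t, "pos"));
        }
    }
}
```

Looking fields up by name through the metadata, rather than by a hard-coded index, keeps downstream code independent of the column order.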
PROPERTIES
| Name | Description | Default value |
|---|---|---|
| openNLPdir | OpenNLP directory; if non-empty, the install step is skipped. | |
| _debug_level | Controls the verbosity of debug messages printed by the component during execution. Possible values are: off, severe, warning, info, config, fine, finer, finest, all. Append ',mirror' to any of the values above to mirror that output to the server logs. | info |
| filter_regex | Optional regular expression used to filter the POS tags inline (e.g. JJ\|RB). | |
| _ignore_errors | Set to 'true' to ignore all unhandled exceptions and prevent the flow from being terminated. Setting this property to 'false' will result in the flow being terminated if an unhandled exception is thrown during the execution of this component. | false |
| language | The language to use in the tokenizer. | english |
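
As a rough illustration of how a filter_regex value such as JJ|RB might behave, the sketch below keeps only the tags that match the expression. The matching semantics are an assumption: this page does not say whether the component keeps or drops matching tags, or whether it requires a full match, so the sketch assumes "keep on full match" and treats an empty expression as "no filtering". The class `PosFilterSketch` and the method `keep` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

public class PosFilterSketch {
    // Keeps only the tags that fully match the regex (assumed semantics);
    // a null or empty regex keeps every tag.
    static List<String> keep(String filterRegex, List<String> tags) {
        if (filterRegex == null || filterRegex.isEmpty()) {
            return new ArrayList<>(tags);
        }
        Pattern p = Pattern.compile(filterRegex);
        List<String> out = new ArrayList<>();
        for (String tag : tags) {
            if (p.matcher(tag).matches()) {
                out.add(tag);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> tags = Arrays.asList("DT", "JJ", "NN", "RB", "JJR");
        System.out.println(keep("JJ|RB", tags)); // prints [JJ, RB]
    }
}
```

Under full-match semantics, JJ|RB does not select JJR; an expression such as JJ.*|RB.* would be needed to include comparative and superlative forms as well.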