Foundry-Components

Name Package Description Creation date
Token Counter org.seasr.meandre.components.analytics.text.statistics
Given a collection of tokens from a document, this component counts the occurrences of each distinct token. If the document contains multiple token sequences, the component aggregates all the sequences and provides a cumulative count.
06/06/2011
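A minimal Python sketch of the cumulative counting described for Token Counter. The function name `count_tokens` and the list-of-lists input shape are assumptions made for illustration; they are not the component's actual API.

```python
from collections import Counter

def count_tokens(sequences):
    """Return cumulative counts over every token sequence of one document."""
    totals = Counter()
    for tokens in sequences:
        totals.update(tokens)  # add this sequence's counts to the running total
    return totals

counts = count_tokens([["the", "cat", "sat"], ["the", "dog"]])
```

Here `counts["the"]` is 2 because the token appears once in each of the two sequences.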
Configurable Service Head Request org.seasr.meandre.components.tools.webservice
Configurable service head for a service that gets data via POSTs
06/06/2011
Link Creation to Protovis org.seasr.meandre.components.analytics.socialnetworking

Overview:
This component creates links between tuples that occur within a specified sentence distance from each other. The resulting graph is output as a Protovis document.

06/06/2011
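The linking rule described for Link Creation to Protovis could be sketched as follows. This assumes each tuple is a hypothetical `(sentence_index, entity)` pair and that the distance limit is inclusive; the source does not state either detail, and the Protovis output step is omitted.

```python
from itertools import combinations

def create_links(tuples, max_distance):
    """Link every pair of distinct entities whose sentence indices
    differ by at most max_distance (assumed inclusive)."""
    links = set()
    for (s1, e1), (s2, e2) in combinations(tuples, 2):
        if e1 != e2 and abs(s1 - s2) <= max_distance:
            links.add(tuple(sorted((e1, e2))))  # store as an undirected edge
    return links

links = create_links([(0, "Alice"), (1, "Bob"), (5, "Carol")], max_distance=1)
```

With a distance of 1, only "Alice" and "Bob" are linked; "Carol" is four sentences away from the nearest other entity.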
Retrieve From File org.seasr.meandre.components.tools.io
Reconstructs an object based on its persisted (serialized) state.
06/06/2011
Persist To File org.seasr.meandre.components.tools.io
This component persists (serializes) the input into a file.
06/06/2011
Tuple Logger org.seasr.meandre.components.tools.tuples
This component prints the incoming set of tuples to the console (level info)
06/06/2011
Extract Request Parameters org.seasr.meandre.components.tools.webservice
Extract the given field in the property from the request, pushing the values one by one to the output.
Note: If used in streaming mode, the component outputs the initiator and terminator markers only on the raw data output port.
06/06/2011
Velocity Template To HTML org.seasr.meandre.components.vis.html
Generates output (html) via Velocity using a template
06/06/2011
Read Semantic Model org.seasr.meandre.components.tools.semantic.io
This component reads an RDF model. The model name is specified in the input. It can also read from URLs and local files using URL of file syntax. The component outputs the semantic model read. A property controls the behavior of the component in the event of an IO error, allowing it either to continue by pushing an empty model or to throw an exception, forcing the finalization of the flow execution.
06/06/2011
Page Text Extractor org.seasr.meandre.components.tools.text.io
Extract the text of the pages in an XML JSTOR document.
06/06/2011
OpenNLP Chunker org.seasr.meandre.components.nlp.opennlp
This component performs treebank chunking on the incoming set of tokenized sentences using OpenNLP facilities.
06/06/2011
Parallel Coordinates org.seasr.meandre.components.vis.protovis
Protovis Parallel Coordinates
06/06/2011
Tuple To XML org.seasr.meandre.components.transform.text

Overview:
This component extracts information from tuples and outputs it as an XML document. Only those entity types specified in this component's properties will be included in the output XML document.

06/06/2011
Write Evernote org.seasr.meandre.components.evernote
Demonstrates how to implement an interface that writes text conforming to Evernote Markup Language (ENML) to the owner's default notebook on sandbox.evernote.com. Note that sandbox.evernote.com does not cooperate with the API call that sets the default notebook, so designate the default notebook through sandbox.evernote.com before writing.
06/06/2011
NE from Gazetteer org.seasr.meandre.components.nlp.gazetteer
This component performs named entity tagging using GATE's Stand Alone Gazetteer facilities
06/06/2011
Zotero Author Extractor org.seasr.meandre.components.rdf.zotero
Extract the authors for each entry of a Zotero RDF
06/06/2011
Simile Timeline Generator org.seasr.meandre.components.vis.temporal
Generates the HTML and XML files needed to view a timeline and stores them on the local machine. The two files are stored under public/resources/timeline/. For fast browsing, dates are grouped into different time slices.
06/06/2011
Simple Google Map Viewer org.seasr.meandre.components.vis.geographic
Presents a simple google map based on location inputs
06/06/2011
Update Ignore List org.seasr.meandre.components.tools.basic
Adds entries to an ignore list
06/06/2011
Tuple To SQL org.seasr.meandre.components.tools.tuples
This component writes tuples to a db table
06/06/2011
Stacked Area Viewer org.seasr.meandre.components.vis.flash
flash vis for stacked areas
06/06/2011
XML To XML With XSL 2 org.seasr.meandre.components.transform.xml
This component inputs two XML documents, one the XML data and the other the XSL. It transforms the XML data based on the XSL template and outputs the transformed XML.
06/06/2011
Concordance org.seasr.meandre.components.tools.tapor.restservice
Demonstrates how to construct an interface that consumes the TAPoR concordance REST service at http://tada.mcmaster.ca/Main/TAPoRwareHTMLConcordance.
06/06/2011
Text Cleaner org.seasr.meandre.components.transform.text
Performs find and replace on text using regular expressions
06/06/2011
Read Content With Cookie org.seasr.meandre.components.tools.text.io
This component reads content as a byte array from a remote location with cookie support. The text location is specified in the input and the cookie is also an input. The component outputs the byte array of the content. A property controls the behavior of the component in the event of an IO error, allowing it to ignore the error and continue, or throw an exception, forcing the finalization of the flow execution.
06/06/2011
SCP org.seasr.meandre.components.tools.ssh
Securely copies a file or directory from a local to a remote server or from a remote server to local. One of the source or destination inputs needs to specify the remote resource to be copied in the following format: [host]:[path]
Example: source=my.server.com:/tmp/file.txt destination=/tmp
06/06/2011
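The `[host]:[path]` format used by SCP can be told apart from a local path with a small parser. The sketch below is illustrative only: `parse_resource` is a hypothetical helper, not part of the component, and it would misread a Windows drive path such as `C:\tmp` as a remote resource.

```python
def parse_resource(spec):
    """Split 'host:/path' into (host, path).
    A spec without a 'host:' prefix is treated as local and returns (None, spec)."""
    host, sep, path = spec.partition(":")
    if sep and host and path:
        return host, path
    return None, spec

source = parse_resource("my.server.com:/tmp/file.txt")
destination = parse_resource("/tmp")
```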
Java String To Strings org.seasr.meandre.components.transform
Converts a Java string into an equivalent string protocol buffer wrapper.
06/06/2011
Text Format org.seasr.meandre.components.tools.basic
This component can be used to create text from the inputs based on a specified format
06/06/2011
Select JSON attribute org.seasr.meandre.components.transform.text
Select a JSON attribute and return its value.
06/06/2011
Dictionary Merger org.seasr.meandre.components.tools.text.normalize.porter
Merges the incoming dictionary.
06/06/2011
Retrive Persisted Data org.seasr.meandre.components.tools.db
This component retrieves persisted data from a database
06/06/2011
Push Text org.seasr.meandre.components.tools.basic
Pushes the value of the text message property to the output.
06/06/2011
DFR Author Extractor org.seasr.meandre.components.jstor.xml.extractors
This component extracts the set of authors from an XML result returned from a JSTOR DFR query
06/06/2011
OpenMary Client org.seasr.meandre.components.nlp.openmary
OpenMary client
06/06/2011
Stanford Parser org.seasr.meandre.components.nlp.stanford
This component is NOT for general Use. DO NOT USE.
06/06/2011
Table To JSON org.seasr.meandre.components.transform.table
Converts table to JSON format.
06/06/2011
Create Empty Model org.seasr.meandre.components.tools.semantic
Pushes an empty model. It provides a couple of properties to control how many times it needs to be pushed, and if it needs to be wrapped with terminators
06/06/2011
GC org.seasr.meandre.components.tools
Asks the JVM to perform garbage collection
06/06/2011
Tuple To CSV org.seasr.meandre.components.tools.tuples
This component writes the incoming set of tuples to CSV String
06/06/2011
PDF Text Extractor org.seasr.meandre.components.tools.text.io
This component extracts the text from a PDF document. The input is a String or URL specifying the location of the PDF document. The output is the extracted text.
06/06/2011
Token To NGrams org.seasr.meandre.components.transform.text
Transforms token counts to ngram objects.
06/06/2011
Tokens To Text org.seasr.meandre.components.transform.totext
Given a collection of tokens, this component converts it into text.
06/06/2011
Error Aggregator org.seasr.meandre.components.tools
This component aggregates all errors for a stream
06/06/2011
Tag Cloud Image Maker org.seasr.meandre.components.vis.text
Creates a tag cloud image from a word count table. If there are many tags to be displayed, reduce the maximum size of the font or increase the size of the canvas to accommodate all of the tags.
06/06/2011
Text Replacement org.seasr.meandre.components.transform.text
Performs simple text replacement, based on input configuration
06/06/2011
OpenNLP Sentence Tokenizer org.seasr.meandre.components.nlp.opennlp
This component breaks the text contained in the input sentences using OpenNLP tokenizing facilities.
06/06/2011
Top N Filter org.seasr.meandre.components.transform.filters
Inputs token counts or the Map structure and filters tokens based on their counts. A property can be set to filter for the highest counts or the lowest counts.
06/06/2011
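A sketch of the filtering described for Top N Filter, assuming the token counts arrive as a plain mapping from token to count. The `highest` flag stands in for the property mentioned in the description; its real name is not given in the source.

```python
def top_n(counts, n, highest=True):
    """Keep the n tokens with the highest (or lowest) counts."""
    ranked = sorted(counts.items(), key=lambda item: item[1], reverse=highest)
    return dict(ranked[:n])

counts = {"a": 5, "b": 1, "c": 3}
most_frequent = top_n(counts, 2)
least_frequent = top_n(counts, 2, highest=False)
```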
Twitter Search org.seasr.meandre.apps.twitter
This component searches Twitter for stuff
06/06/2011
Read Evernote org.seasr.meandre.components.evernote
Demonstrates how to write an interface that reads all of the notes under the owner's account at sandbox.evernote.com and outputs each note's title and content as XML text. Note that sandbox.evernote.com is a test-only server.
06/06/2011
Custom VAM Query org.seasr.meandre.components.transform.text
Custom VAM component that queries and requeries until all results are received, then selects a JSON attribute of the records and returns its value.
06/06/2011
Tuple Grouper org.seasr.meandre.components.tools.tuples
This component groups (frequency counts) consecutive tuples based on window size
06/06/2011
Push Text Streaming org.seasr.meandre.components.tools.basic
Pushes the value of the text message property to the output. It provides a couple of properties to control how many times it needs to be pushed, and if it needs to be wrapped with delimiters.
06/06/2011
Dunning Log Likelihood org.seasr.meandre.components.analytics.statistics
This component calculates Dunning's log-likelihood based on the input of two counts. The major functionality was developed as part of the Monk Project (http://monkproject.org)

References: For more information on Dunning's log-likelihood statistic, see Dunning, T. 1993. Accurate methods for the statistics of surprise and coincidence. Computational Linguistics 19.1 (Mar. 1993), 61-74. or Griffiths, D. Head First Statistics. 2009. Second edition. O'Reilly.

06/06/2011
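For orientation, a commonly used two-term form of the log-likelihood statistic for comparing a token's frequency across two corpora can be sketched as below. Whether the Dunning Log Likelihood component uses exactly this form is not stated in the source; the function name and the four-argument signature are assumptions.

```python
import math

def dunning_log_likelihood(a, b, c, d):
    """G2 for a token seen a times in a corpus of c tokens
    and b times in a second corpus of d tokens."""
    e1 = c * (a + b) / (c + d)  # expected count in the first corpus
    e2 = d * (a + b) / (c + d)  # expected count in the second corpus
    g2 = 0.0
    if a > 0:
        g2 += a * math.log(a / e1)
    if b > 0:
        g2 += b * math.log(b / e2)
    return 2 * g2

same_rate = dunning_log_likelihood(10, 10, 100, 100)
different_rate = dunning_log_likelihood(20, 5, 100, 100)
```

When the token occurs at the same relative rate in both corpora the statistic is zero; it grows as the two rates diverge.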
Wrapped Models Reducer org.seasr.meandre.components.tools.semantic
This component is intended to work on wrapped model streams. Given a sequence of wrapped models, it will create a new model that accumulates/reduces all the information and then push the resulting model. If no wrapped model is provided it will act as a simple pass through.
06/06/2011
CSV Text To Token Counts org.seasr.meandre.components.transform.text
Converts CSV text to token counts.
06/06/2011
Read Zip org.seasr.meandre.components.tools.io
This component reads a zip file and passes each file as output.
06/06/2011
Token Counter Reducer org.seasr.meandre.components.analytics.text.statistics
This component is intended to work on wrapped model streams. Given a sequence of wrapped models, it will create a new model that accumulates/reduces the token counts and then pushes the resulting model. If no wrapped model is provided it will act as a simple pass through. This component is based on Wrapped models reducer.
06/06/2011
JSON To Tuple org.seasr.meandre.components.tools.tuples
This component converts a simple JSON array-of-simple-objects structure into a Tuple structure
06/06/2011
JSTOR DFR Query org.seasr.meandre.components.jstor.io
This component performs a query against the JSTOR 'Data For Research' data. The format of the query conforms to CQL (Contextual Query Language), a standard syntax for representing queries. A formal definition of CQL can be found on the Library of Congress web site.

Searchable fields:

  • dc.creator - Article author(s)
  • dc.date - Searches the article's date of publication, format='YYYY-MM-DDT00:00:00Z'
  • dc.description - Searches article abstract, if available
  • dc.identifier - Article UID
  • dc.language - Searches language field using an ISO 639-2 three-letter language code
  • dc.publisher - Refer to Publisher facet on DfR Explore page for publisher names
  • dc.subject - Searches auto-extracted keywords associated with article
  • dc.title - Article title
  • jstor.articletype - JSTOR article type, recognized values are 'research-article', 'book-review', 'misc', 'news', and 'editorial'
  • jstor.discipline - Refer to Discipline facet on DfR Explore page for discipline names
  • jstor.journaltitle - Refer to Journal facet on DfR Explore page for journal titles
  • jstor.text - Searches everything

Sample queries:

  • Search for documents in 'Education' discipline: jstor.discipline='Education'
  • Search for documents with keyterm 'space': dc.subject=space
  • Search for documents with publisher 'Springer': dc.publisher=springer
  • Search for documents with term 'jefferson' anywhere: jstor.text=jefferson

06/06/2011
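The sample queries for JSTOR DFR Query all follow a `field=term` shape, which can be assembled and checked against the searchable fields listed above. The helper below is a hypothetical illustration; it does not contact JSTOR and is not the component's API.

```python
SEARCHABLE_FIELDS = {
    "dc.creator", "dc.date", "dc.description", "dc.identifier",
    "dc.language", "dc.publisher", "dc.subject", "dc.title",
    "jstor.articletype", "jstor.discipline", "jstor.journaltitle", "jstor.text",
}

def build_query(field, term, quote=False):
    """Build a single 'field=term' clause, rejecting unknown field names."""
    if field not in SEARCHABLE_FIELDS:
        raise ValueError(f"unknown searchable field: {field}")
    if quote:
        term = f"'{term}'"  # single quotes, as in the 'Education' sample
    return f"{field}={term}"

by_discipline = build_query("jstor.discipline", "Education", quote=True)
by_subject = build_query("dc.subject", "space")
```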
Trigger Message Count org.seasr.meandre.components.tools.control
This component receives a message and a count. The message is sent out as many times as indicated by the count.
06/06/2011
Tuple Value Filter org.seasr.meandre.components.tools.tuples
This component filters the incoming set of tuples based on a regular expression
06/06/2011
Generic Template org.seasr.meandre.components.tools.text.io
Generates and displays a webpage via a Velocity Template
06/06/2011
R Serve Wrapper org.seasr.meandre.components.analytics.statistics.r
This component sends an R script that it receives as input to the R Server at the connection it receives as input.
06/06/2011
Model To RDF Text org.seasr.meandre.components.tools.semantic
This component takes the input semantic model and converts it into a text form. Properties specify the dialect to use.
06/06/2011
Universal Text Extractor org.seasr.meandre.components.tools.text.io
Extracts text from the specified input location. Supported location references include: PDF files, HTML/XML files, text files.
06/06/2011
Trigger Message org.seasr.meandre.components.tools.control
This component receives a message and a trigger. The message is saved so that it can be output for every trigger received. If a new message is received, it replaces the previous message.
06/06/2011
Twitter Tuple Web Server org.seasr.meandre.apps.twitter
Service head for a twitter data service that gets data via posts
06/06/2011
Twitter to Tuple org.seasr.meandre.apps.twitter
This component reads a twitter stream and pushes out tuples (id,text,userid, followers, location)
06/06/2011
Token Count To JSON org.seasr.meandre.components.transform.text
Converts token count to JSON.
06/06/2011
Google Map Viewer org.seasr.meandre.components.vis.geographic
Generates a web page containing a Google map marked with locations from the original XML document.
06/06/2011
OpenNLP Named Entity org.seasr.meandre.components.nlp.opennlp
This component tags entities from the incoming set of tokenized sentences using OpenNLP named entity facilities.
06/06/2011
XML Tag Filter org.seasr.meandre.components.transform.xml
This component generates an xsl template for extracting the structured content for a specific tag. The tag is set up through a property.
06/06/2011
Input Text org.seasr.meandre.components.tools.text.io
Presents a simple text area for the user to enter a string.
06/06/2011
Text Splitter org.seasr.meandre.components.tools.basic
This component takes a Strings object with multiple elements and creates a stream containing each individual element separately
06/06/2011
Wrap Text Into Model org.seasr.meandre.components.tools.semantic
Given the text provided in the input, this component generates a new semantic wrapper model containing this text.
06/06/2011
Flesch Kincaid Readability Measure org.seasr.meandre.components.analytics.text.readability
Computes the Flesch Kincaid readability measure as explained at http://en.wikipedia.org/wiki/Flesch-Kincaid_Readability_Test. The code is based on the work done by Daniel Shiffman at http://www.shiffman.net/teaching/a2z/week1/
06/06/2011
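The two standard Flesch-Kincaid formulas can be computed directly from word, sentence, and syllable totals. The sketch below takes those totals as given; how the Flesch Kincaid Readability Measure component counts syllables is not described in the source, so that step is left out.

```python
def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease score (higher means easier to read)."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)

def flesch_kincaid_grade(words, sentences, syllables):
    """Flesch-Kincaid Grade Level (approximate US school grade)."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59

ease = flesch_reading_ease(words=100, sentences=5, syllables=150)
grade = flesch_kincaid_grade(words=100, sentences=5, syllables=150)
```

For 100 words in 5 sentences with 150 syllables, this gives a reading ease of about 59.6 and a grade level of about 9.9.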
Upload File org.seasr.meandre.components.tools.io
Uploads and submits files from user's own machine. This component works with Fluid Infusion 1.2.
06/06/2011
Dictionary Of Stemming org.seasr.meandre.components.tools.text.normalize.porter
Constructs a dictionary mapping the stemmed words back to the actual words in the original document, so for the output map, the stemmed words are keys and the actual words are values. If several words have the same stem, the shortest word is chosen as the representative.
06/06/2011
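The shortest-word rule described for Dictionary Of Stemming could look like the sketch below. The stemmer is passed in as a function; `toy_stem` is a deliberately crude, hypothetical stand-in and is NOT the Porter algorithm the package name refers to.

```python
def stem_dictionary(words, stem):
    """Map each stem to the shortest original word that produces it."""
    mapping = {}
    for word in words:
        key = stem(word)
        if key not in mapping or len(word) < len(mapping[key]):
            mapping[key] = word
    return mapping

def toy_stem(word):
    """Hypothetical suffix stripper used only to make the example runnable."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word

result = stem_dictionary(["walking", "walked", "walks"], toy_stem)
```

All three words reduce to the stem "walk", and "walks" is kept as the representative because it is the shortest.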
Attribute Cache Lookup org.seasr.meandre.components.tools.tuples
This component looks up the key for a tuple in the ignore file, if one is specified. If the key is found there, the tuple is ignored (passed on as is). Otherwise the component looks the key up in the cache file: if an attribute is found in the cache, it is associated with the tuple; on a cache miss, the tuple is pushed out on the 'tuple_not_cached' port for processing by other means.
06/06/2011
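The three-way routing described for Attribute Cache Lookup can be sketched with in-memory collections standing in for the ignore and cache files. Only the 'tuple_not_cached' port name comes from the description; the other two labels, the function name, and the tuple layout are assumptions.

```python
def route_tuple(key, tup, cache, ignore=frozenset()):
    """Return (port, tuple) according to the ignore list and the cache."""
    if key in ignore:
        return "ignored", tup                   # passed on as is (label assumed)
    if key in cache:
        return "cached", tup + (cache[key],)    # attribute attached (label assumed)
    return "tuple_not_cached", tup              # cache miss

cache = {"Paris": "48.85,2.35"}
ignore = {"N/A"}
hit = route_tuple("Paris", ("Paris",), cache, ignore)
miss = route_tuple("Lyon", ("Lyon",), cache, ignore)
skipped = route_tuple("N/A", ("N/A",), cache, ignore)
```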
Count Merger org.seasr.meandre.components.tools.text.normalize.porter
Merges the incoming count.
06/06/2011
JPlayer org.seasr.meandre.components.vis.multimedia
Audio / video player
06/06/2011
Stanford POS Tagger org.seasr.meandre.components.nlp.stanford
This component performs part-of-speech (POS) tagging using Stanford's NLP facilities
06/06/2011
Text Accumulator org.seasr.meandre.components.tools.basic
This component accumulates multiple text values and pushes them out as a single concatenated value.
06/06/2011
Frame Maker org.seasr.meandre.components.vis.html
Arranges into frames multiple HTML documents that are part of a stream
06/06/2011
Tuple Value Frequency Counter org.seasr.meandre.components.tools.tuples
This component counts the incoming set of tuples, based on a unique field value
06/06/2011
Entity XML To Simile XML org.seasr.meandre.components.tools.xml.transform
Conforms the input XML document, such as, ... ,to the standard which Simile Timeline requires, such as, ...
06/06/2011
Stanford NE Tuple Tagger org.seasr.meandre.components.nlp.stanford
This component performs named entity tagging using Stanford's NLP facilities
06/06/2011
Tuple Labeler org.seasr.meandre.components.tools.tuples
This component takes in two different sets of tuples. The first set is used to build a key-value map; the key field and the value field are both specified. The second set of tuples is then labelled using the map built from the first set. A value from each tuple is used as the key into the map to get the label (the value returned from the map). This label is appended to the incoming tuple, and the labelled tuples are returned as a new tuple set.
06/06/2011
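The two-pass behavior described for Tuple Labeler can be sketched as follows, with each tuple modelled as a Python dict. All field names and the `label` output attribute are hypothetical; the source does not say what happens when a key is missing from the map, so the sketch falls back to a caller-supplied default.

```python
def label_tuples(reference, data, key_field, value_field, lookup_field, default=None):
    """Build a key->label map from `reference`, then append a label to each row of `data`."""
    labels = {row[key_field]: row[value_field] for row in reference}
    return [dict(row, label=labels.get(row[lookup_field], default)) for row in data]

reference = [{"word": "happy", "mood": "positive"}, {"word": "sad", "mood": "negative"}]
data = [{"token": "happy"}, {"token": "table"}]
labelled = label_tuples(reference, data, "word", "mood", "token")
```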
Update Tuple Cache org.seasr.meandre.components.tools.tuples
Adds entries to the cache
06/06/2011
Spell Check org.seasr.meandre.components.transform.text
Performs spell checking on the input and optionally replaces misspelled words with the top ranked suggestion. The component also produces a list of the misspellings in the document.
06/06/2011
HTML Text Extractor org.seasr.meandre.components.tools.text.io
Converts an HTML doc to plain text. All nodes from the dom tree that are plain text nodes are appended together and returned as a string.
06/06/2011
Write Semantic Model org.seasr.meandre.components.tools.semantic.io
This component writes an RDF model. The model name is specified in the input. Also, it is able to read from URLs and local files using URL of file syntax. The component outputs the semantic model read. A property controls the behavior of the component in the event of an IO error, allowing it either to continue by pushing an empty model or to throw an exception, forcing the finalization of the flow execution.
06/06/2011
Link Creation to GraphML org.seasr.meandre.components.analytics.socialnetworking

Overview:
This component creates links between tuples that occur within a specified sentence distance from each other. The resulting graph is output as a GraphML document.

06/06/2011
Tuple To JSON org.seasr.meandre.components.tools.tuples
This component converts the rows of a tuple into an array of JSON objects that are keyed on the tuple column labels
06/06/2011
Tuple Value To String org.seasr.meandre.components.tools.tuples
This component extracts the value(s) of an attribute from the tuple(s) and pushes it out.
06/06/2011
Author Centrality Analysis org.seasr.meandre.components.analytics.socialnetworking
Given a collection of authors, grouped by publication, this component generates a report based on the social network analysis. This analysis uses the JUNG network importance algorithms to rank the authors. This component uses Betweenness Centrality, which ranks each author in the author-citation graph derived from the number of shortest paths that pass through them.
06/06/2011
Input Data org.seasr.meandre.components.tools.text.io
This component allows the user to specify the dataset(s) to be processed. The user can use URL(s), file(s), or input the raw text to be processed.
06/06/2011
Data Type Decoder org.seasr.meandre.components.tools.control
This component decodes the input data type and pushes out the decoded types on different outputs
06/06/2011
Print To Console org.seasr.meandre.components.tools.basic
This component takes the input and prints it to the console.
06/06/2011
Token Counts To Map org.seasr.meandre.components.transform
Given a collection of token counts, this component converts them to a Java map.
06/06/2011
Tuple Splitter org.seasr.meandre.components.tools.tuples
This component splits an aggregate tuple into individual tuples
06/06/2011
Google Docs Downloader org.seasr.meandre.components.GoogleDocs
This component extracts data from a Google Docs directory and creates a local copy of the directory
06/06/2011
Write To File org.seasr.meandre.components.tools.io
This component writes the given data to a file.
06/06/2011
SSH Public Key org.seasr.meandre.components.tools.ssh
Executes a command based on SSH and returns result.
06/06/2011
HITS Summarizer org.seasr.meandre.components.analytics.text.summarization
This component ranks and sorts the tokenized input sentences, providing a simple summarization by sentence selection.
06/06/2011
Strings To Java String org.seasr.meandre.components.transform
Converts a Google string into an equivalent Java string
06/06/2011
Fork x2 org.seasr.meandre.components.tools.control

Description: This component takes in any Java object and distributes it (or copies of it) across multiple outputs. The user can choose from five object replication methods -- by reference, shallow copy by clone, deep copy via serialization, copy via constructor, or copy via custom method.

06/06/2011
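The practical difference between three of the replication methods listed for Fork x2 (by reference, shallow copy, deep copy) shows up as soon as the original object is mutated. The component itself works on Java objects; the sketch below is only a Python analogy using the standard `copy` module, and `copy.deepcopy` is not serialization-based.

```python
import copy

original = {"tokens": ["a", "b"]}

by_reference = original           # same object on both outputs
shallow = copy.copy(original)     # new dict, but the inner list is shared
deep = copy.deepcopy(original)    # fully independent copy

original["tokens"].append("c")    # mutate after forking
```

After the mutation, `by_reference` and `shallow` both see the new token, while `deep` still holds only "a" and "b".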
URL Decode org.seasr.meandre.components.transform.text
This component takes a URL encoded input and decodes it as a regular string.
06/06/2011
GeoLocation Cleaner org.seasr.meandre.components.tools.geo
This component uses the Yahoo GeoService to attempt to fully qualify location entities within a single sentence.
06/06/2011
Tuple Value Cleaner org.seasr.meandre.components.sentiment
DO NOT USE in production code :) testing only. This component cleans a value of a tuple based on a regular expression
06/06/2011
Select Nodes via XPath org.seasr.meandre.components.transform.xml
This component extracts XML nodes matching an XPath expression from an XML document and outputs them one by one.
06/06/2011
Twitter Circle Packing Graph org.seasr.meandre.apps.twitter
Twitter Circle Packing Graph
06/06/2011
Force Directed Graph org.seasr.meandre.components.vis.protovis
This component creates a Protovis link-node graph that uses a force-directed layout. It is based on JSON data that specifies the nodes and links.
06/06/2011
Sentences To Text org.seasr.meandre.components.transform.totext
Given a collection of sentences, this component converts it into text.
06/06/2011
HTML Viewer org.seasr.meandre.components.vis.html
Generates a webpage from the HTML text that it receives as input.
06/06/2011
Tuple Aggregator org.seasr.meandre.components.tools.tuples
This component aggregates a set of tuples
06/06/2011
Unique Tuple Filter org.seasr.meandre.components.tools.tuples
This component pushes unique tuples (uniqueness based on the value of an attribute) on the 'unique_tuples' port and the duplicate tuples on the 'duplicate_tuples' port
06/06/2011
Cross Domain Policy Server org.seasr.meandre.components.vis.flash
flash xml cross domain policy requests
06/06/2011
Tuple To HTML org.seasr.meandre.components.tools.tuples
Creates HTML for displaying a set of tuples in a table
06/06/2011
Read Text org.seasr.meandre.components.tools.text.io
This component reads text from a local or remote location. The text location is specified in the input. The component outputs the text read. A property controls the behavior of the component in the event of an IO error, allowing it to ignore the error and continue, or throw an exception, forcing the finalization of the flow execution.
06/06/2011
XML To Text org.seasr.meandre.components.tools.xml
This component takes an XML document and generates its text form. The XML document to convert is received on its input. The component outputs the text generated. A property controls the behavior of the component in the event of an IO error, allowing it either to continue by pushing an empty model or to throw an exception, forcing the finalization of the flow execution.
06/06/2011
Token Filter org.seasr.meandre.components.transform.filters
This component filters the tokens of the input based on the list of tokens provided. The component has 3 inputs for the type of data to be filtered (tokens, token counts, or tokenized sentences) and one input for the list of tokens to filter. It will output the same data type it received. If new tokens to filter are provided, they either replace the current ones or are added to the blacklist. The component waits for a blacklist and then begins processing the data it receives. The component outputs the filtered tokens, token counts, or tokenized sentences. The comparison of blacklisted tokens to the data ignores case by default. Set ignore_case=false to work in case-sensitive mode.
06/06/2011
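The blacklist comparison described for Token Filter, including the `ignore_case` switch, can be sketched for the plain-token input as follows. The function name and list-based input are assumptions; the token-count and tokenized-sentence inputs are not covered.

```python
def filter_tokens(tokens, blacklist, ignore_case=True):
    """Drop every token that appears in the blacklist."""
    if ignore_case:
        blocked = {entry.lower() for entry in blacklist}
        return [token for token in tokens if token.lower() not in blocked]
    blocked = set(blacklist)
    return [token for token in tokens if token not in blocked]

tokens = ["The", "quick", "fox", "the"]
insensitive = filter_tokens(tokens, ["the"])
sensitive = filter_tokens(tokens, ["the"], ignore_case=False)
```

In the default mode both "The" and "the" are removed; in case-sensitive mode only the lowercase "the" is.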
Input User Password org.seasr.meandre.components.tools.text.io
This component allows a user to enter credentials that can be passed to other components
06/06/2011
HelloWorld org.seasr.meandre.components.vis.gwt
Example component that uses GWT
06/06/2011
Text To XML org.seasr.meandre.components.tools.xml
This component reads XML in text form and builds a manipulable document object. The text to convert is received on its input. The component outputs the XML object read. A property controls the behavior of the component in the event of an IO error, allowing it either to continue by pushing an empty XML document or to throw an exception, forcing the finalization of the flow execution.
06/06/2011
Concept Finder org.seasr.meandre.components.sentiment
This component connects to the SynNet service and computes the best concept for the input token based on the defined concept rules
06/06/2011
Zotero URL Extractor org.seasr.meandre.components.rdf.zotero
Extract the urls for each of the entry of a Zotero RDF
06/06/2011
Read XML org.seasr.meandre.components.tools.xml.io
This component reads an XML document. The XML location is specified in the input. Also, it is able to read from URLs and local files using URL of file syntax. The component outputs the XML document read. A property controls the behavior of the component in the event of an IO error, allowing it either to continue by pushing an empty XML document or to throw an exception, forcing the finalization of the flow execution.
06/06/2011
Service Tail Text org.seasr.meandre.components.tools.webservice
Service tail for a web service
06/06/2011
Input Counter org.seasr.meandre.components.tools.control
This component counts the number of data objects passing through it and outputs the current count. If the component is set to act on a particular stream (by setting 'streamId' appropriately) then at the end of the stream the component will output a total count of all elements in that stream.
06/06/2011
Date Filter org.seasr.meandre.components.transform.xml
This component generates an xsl template to filter an xml file and include only the dates between the minimum and maximum year. This is used to filter the Simile xml file that is generated.
06/06/2011
Add Tuple Attribute org.seasr.meandre.components.tools.tuples
This component adds an extra attribute to existing tuple(s).
06/06/2011
CSV To Tuple org.seasr.meandre.components.tools.tuples
This component converts a csv string into tuples. Each line of the incoming text is a new tuple. It does not handle missing values
06/06/2011
Aggregate Mallet Instances org.seasr.meandre.components.analytics.mallet
This component accumulates the instances belonging to the specified stream and produces a list of machine learning instances, typically used for training or testing of a machine learning algorithm.
06/06/2011
Tuple To Mallet Feature Sequence org.seasr.meandre.components.analytics.mallet
This component converts a specific attribute into a Mallet feature sequence
06/06/2011
Normalize Token Counts org.seasr.meandre.components.transform.text
This component sets the count of tokens with a count greater than zero to 1.
06/06/2011
HTML Fragment Maker org.seasr.meandre.components.vis.html
Generates an HTML fragment based on the input data. The encoding of the data is specified via the encoding property. Supported MIME types: 'text/plain', 'image/<type>' (where <type> is one of the standard image types; ex: jpg, png...)
06/06/2011
XML Aggregator org.seasr.meandre.components.transform.xml
This component extracts information from input XML files and constructs a new XML file to output. It is for TEI only.
06/06/2011
Arc Diagram org.seasr.meandre.components.vis.protovis
This components creates a Protovis arc diagram based on JSON data that specifies the nodes and links.
06/06/2011
Concept Counter org.seasr.meandre.components.sentiment
This component tags tuples representing documents with an incremental 'docId' and a count of the frequencies of the concepts present in the document.
06/06/2011
OpenNLP Tokenizer org.seasr.meandre.components.nlp.opennlp
This component breaks the text contained in the input using OpenNLP tokenizing facilities.
06/06/2011
Token Count To Table org.seasr.meandre.components.transform.text
Converts token count to table.
06/06/2011
Stream Delimiter Filter org.seasr.meandre.components.tools.control
This component filters out some or all stream delimiters
06/06/2011
URL To Tuple org.seasr.meandre.components.tools.tuples
This component reads a text resource via file or http (assumes no authentication required)
06/06/2011
Service Head Request org.seasr.meandre.components.tools.webservice
Service head for a service that gets data via posts
06/06/2011
SSH org.seasr.meandre.components.tools.ssh
Executes a command based on SSH and returns result.
06/06/2011
Google Search To Tuple org.seasr.meandre.components.tools.tuples
This component performs a Google search by accessing the Google search API. The search term is an input and the search results are output as tuples.
06/06/2011
Generic Viewer org.seasr.meandre.components.tools.text.io
Generates and displays a webpage via a Velocity Template
06/06/2011
Stem org.seasr.meandre.components.tools.text.normalize.porter

Overview:
This component transforms terms into their word stems. In this way, different forms of the same word (plurals etc...) will be recognized as the same term. The algorithm used is the Porter stemming method.

References:
See: http://www.tartarus.org/~martin/PorterStemmer/

Data Type Restrictions:
The input document must have been tokenized.

Data Handling:
This component will modify (as described above) the document object that is input.

Scalability:
This component makes one pass over the token list, resulting in time complexity that is linear in the number of tokens. Memory usage is proportional to the number of tokens.

Trigger Criteria:
All.

06/06/2011
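As a rough illustration of the single-pass, in-place behavior described for Stem, the sketch below applies only step 1a of the Porter algorithm (the plural-stripping rules) to a token list. It is not the component's implementation and not a complete Porter stemmer; the function names `porter_step_1a` and `stem_in_place` are hypothetical.

```python
def porter_step_1a(token):
    """Apply only step 1a of the Porter algorithm (plural endings)."""
    if token.endswith("sses"):
        return token[:-2]          # caresses -> caress
    if token.endswith("ies"):
        return token[:-2]          # ponies -> poni
    if token.endswith("ss"):
        return token               # caress -> caress
    if token.endswith("s"):
        return token[:-1]          # cats -> cat
    return token


def stem_in_place(tokens):
    """Make one pass over the token list and overwrite each token.

    The input list itself is modified, mirroring the 'Data Handling'
    note above; it is also returned for convenience.
    """
    for i, token in enumerate(tokens):
        tokens[i] = porter_step_1a(token)
    return tokens
```

The loop visits each token once, so the running time grows linearly with the number of tokens.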
Spell Check with Counts org.seasr.meandre.components.transform.text
Performs spell checking on the input and optionally replaces misspelled words with the top ranked suggestion based on the supplied token counts. The component also produces a list of the misspellings in the document.
06/06/2011
Token Count Filter org.seasr.meandre.components.transform.filters
This component filters (removes) the tokens with token counts satisfying the given constraints
06/06/2011
Google Map Generator org.seasr.meandre.components.vis.geographic
Calculates latitude and longitude for an address contained in the input XML document.
06/06/2011
Concatenate Text org.seasr.meandre.components.tools.basic
Concatenates text from multiple inputs
06/06/2011
Search Text org.seasr.meandre.components.tools.basic
Searches the text input for the regular expression pattern. If the pattern is found and the regular expression specifies a capturing group, then the matching text for the capturing group is pushed out on port Text_Found. If the pattern is found and there are no capturing groups defined, the matching text for the pattern is pushed out on port Text_Found. In both cases multiple outputs are possible. If the pattern is not found, it outputs the original text on port Text.
06/06/2011
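The routing rules described for Search Text can be sketched as follows. This is a minimal illustration, not the component's code: the function name `search_text` and the returned `(port, values)` pair are hypothetical, and using only the first capturing group when several are defined is an assumption.

```python
import re


def search_text(text, pattern):
    """Return (port_name, values) following the rules described above."""
    regex = re.compile(pattern)
    found = []
    for match in regex.finditer(text):
        if regex.groups > 0:
            # Pattern defines a capturing group: emit the group's text.
            # Assumption: only the first group is used.
            found.append(match.group(1))
        else:
            # No capturing groups: emit the whole matched text.
            found.append(match.group(0))
    if found:
        return "Text_Found", found   # one output per match
    return "Text", [text]            # no match: pass the original text through
```

For example, `search_text("a=1, b=2", r"(\w)=\d")` yields two values on `Text_Found`, one per match, while a pattern that never matches returns the original text on `Text`.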
Write To Zip org.seasr.meandre.components.tools.io
This component writes a zip file containing all the data passed in the stream
06/06/2011
R Serve Connection org.seasr.meandre.components.analytics.statistics.r
This component establishes the connection to the R Server for using the R statistics package.
06/06/2011
Stanford NE Tagger org.seasr.meandre.components.nlp.stanford
This component performs named entity tagging using Stanford's NLP facilities
06/06/2011
Transform Token From Dictionary org.seasr.meandre.components.tools.text.normalize.porter
Replaces tokens with their entries from the dictionary.
06/06/2011
XML To XML With XSL org.seasr.meandre.components.transform.xml
This component inputs two XML documents, one the XML data and the other the XSL. It transforms the XML data based on the XSL template and outputs the transformed XML.
06/06/2011
SQL To Tuple org.seasr.meandre.components.tools.tuples
This component reads a MySQL database.
06/06/2011
To Lowercase org.seasr.meandre.components.transform.text
Converts a text to lowercase.
06/06/2011
Summarizer Viewer org.seasr.meandre.components.vis.text
Visualizes tokens and sentences from HitsSummarizer. Its inputs should be connected directly to HitsSummarizer.
06/06/2011
Tokenized Sentences Reducer org.seasr.meandre.components.analytics.text.statistics
This component is intended to work on wrapped model streams. Given a sequence of wrapped models, it will create a new model that accumulates/reduces the tokenized sentences and then pushes the resulting model. If no wrapped model is provided it will act as a simple pass through. This component is based on Wrapped models reducer.
06/06/2011
RDF Text To Model org.seasr.meandre.components.tools.semantic
This component reads an RDF model in text form and builds the model. The text to convert is received on its input. The component outputs the semantic model read. A property controls the behavior of the component in the event of an IO error, allowing it either to continue by pushing an empty model or to throw an exception, forcing the finalization of the flow execution.
06/06/2011
Text Segmentation org.seasr.meandre.components.transform.text
The component breaks a document into chunks (segments) for further processing. It transforms the document of tokenized sentences into segments of size that approximates the number of tuples specified in the property. Segments always end at sentence boundaries.
06/06/2011
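One way to picture the Text Segmentation behavior is a greedy pass that closes a segment at the first sentence boundary where the running size reaches the target. The sketch below assumes sentences are lists of tokens and that size is measured in tokens; both are assumptions, as is the function name `segment`. The actual component may choose boundaries differently.

```python
def segment(sentences, target_size):
    """Group tokenized sentences into segments of roughly target_size tokens.

    A segment is closed only after a whole sentence has been added,
    so segments always end at sentence boundaries.
    """
    segments, current, count = [], [], 0
    for sentence in sentences:
        current.append(sentence)
        count += len(sentence)
        if count >= target_size:
            segments.append(current)
            current, count = [], 0
    if current:
        # Leftover sentences form a final, possibly shorter, segment.
        segments.append(current)
    return segments
```

Because a sentence is never split, segment sizes only approximate the target: a segment can overshoot by up to one sentence, and the last one can fall short.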
SCP Public Key org.seasr.meandre.components.tools.ssh
Securely copies a file or directory from a local to a remote server or from a remote server to local. One of the source or destination inputs needs to specify the remote resource to be copied in the following format: [host]:[path]
Example: source=my.server.com:/tmp/file.txt destination=/tmp
06/06/2011
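The `[host]:[path]` convention for SCP Public Key can be illustrated with a small parser that separates a remote specification from a plain local path. This is a sketch under assumptions: the helper name `parse_endpoint` is hypothetical, and treating a value as remote only when a colon appears before any slash is a heuristic chosen for the example, not documented behavior.

```python
def parse_endpoint(spec):
    """Split 'host:path' into (host, path); local paths give (None, path)."""
    host, sep, path = spec.partition(":")
    # Assumption: a remote spec has a non-empty host with no '/' in it.
    if sep and host and "/" not in host:
        return host, path
    return None, spec
```

With the example above, `source=my.server.com:/tmp/file.txt` parses as a remote resource on `my.server.com`, while `destination=/tmp` is treated as local.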
URL Encode org.seasr.meandre.components.transform.text
This component takes the input and encodes it as a URL string.
06/06/2011
OpenNLP Sentence Detector org.seasr.meandre.components.nlp.opennlp
This component splits sentences of the text contained in the input using OpenNLP tokenizing facilities.
06/06/2011
OpenNLP POS Tagger org.seasr.meandre.components.nlp.opennlp
This component tags the tokens of the incoming set of tokenized sentences using OpenNLP part-of-speech (POS) tagging facilities.
06/06/2011
List Directory Files org.seasr.meandre.components.tools.basic
Given a [server-relative] directory, this component pushes all the file names available that match a certain regular expression given in the properties.
06/06/2011
Read Text Set Cookie org.seasr.meandre.components.tools.text.io
This component reads text from a remote location with cookie support. The text location is specified in the input. The component outputs the text read and the cookie obtained from the request. A property controls the behavior of the component in the event of an IO error, allowing it to ignore the error and continue, or throw an exception, forcing the finalization of the flow execution.
06/06/2011
Table Viewer org.seasr.meandre.components.vis.table
This component provides a table viewer for a data set. The input can come from a Vector or a Table. If a Vector is used, then each element of the Vector is mapped into a row in the table and each element of the Object array is mapped into a cell in the table. This component can be used with the 'CSV Reader' or Map2Table components. Rows are fetched automatically as needed and column rendering is done only for the visible columns to improve performance for large datasets.
06/06/2011
Token Counts To Text org.seasr.meandre.components.transform.totext
Given a collection of token counts, this component converts it into text. The default separator is a comma, so make sure tokens do not have commas or change the separator.
06/06/2011
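The separator caveat for Token Counts To Text is easy to see in a small sketch. The one-pair-per-line output layout, the function name `token_counts_to_text`, and the guard that rejects tokens containing the separator are all assumptions made for illustration; the component itself may simply emit ambiguous output.

```python
def token_counts_to_text(counts, separator=","):
    """Render {token: count} as one 'token<separator>count' line per token."""
    lines = []
    for token, count in counts.items():
        if separator in token:
            # A token such as "a,b" would make "a,b,3" impossible to parse back.
            raise ValueError(
                f"token {token!r} contains the separator {separator!r}"
            )
        lines.append(f"{token}{separator}{count}")
    return "\n".join(lines)
```

Switching the separator, for instance to a tab, lets tokens that contain commas pass through unchanged.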
Fork x5 org.seasr.meandre.components.tools.control

Description: This component takes in any Java object and distributes it (or copies of it) across multiple outputs. The user can choose from five object replication methods -- by reference, shallow copy by clone, deep copy via serialization, copy via constructor, or copy via custom method.

06/06/2011
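The five replication methods listed for Fork x5 refer to Java mechanisms. The sketch below shows rough Python analogues to make the differences concrete; the function names `replicate` and `fork` and the method labels are hypothetical, and `pickle` stands in for Java serialization.

```python
import copy
import pickle


def replicate(obj, method="reference", custom=None):
    """Return obj or a copy of it, depending on the replication method."""
    if method == "reference":
        return obj                              # same object, no copy
    if method == "shallow":
        return copy.copy(obj)                   # new container, shared contents
    if method == "serialization":
        return pickle.loads(pickle.dumps(obj))  # fully independent deep copy
    if method == "constructor":
        return type(obj)(obj)                   # copy via the type's constructor
    if method == "custom":
        return custom(obj)                      # caller-supplied copy function
    raise ValueError(f"unknown replication method: {method}")


def fork(obj, outputs=5, method="reference", custom=None):
    """Distribute obj (or copies of it) across the given number of outputs."""
    return [replicate(obj, method, custom) for _ in range(outputs)]
```

The choice matters when downstream components modify their input: with replication by reference every branch sees the same object, while a deep copy gives each branch its own.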
Persist To DB org.seasr.meandre.components.tools.db
This component persists (serializes) the input into a database
06/06/2011
Create Mallet Instance org.seasr.meandre.components.analytics.mallet
This component creates a Mallet instance object.

This represents a machine learning 'example' to be used in training, testing or performance of various machine learning algorithms.

An instance contains four generic fields of predefined name: 'data', 'target', 'name', and 'source'. 'Data' holds the data represented by the instance, 'target' is often a label associated with the instance, 'name' is a short identifying name for the instance (such as a filename), and 'source' is human-readable source information (such as the original text).

06/06/2011
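The four fields of an instance can be pictured as a simple record. The class below is only a Python stand-in for illustration, not Mallet's own Java class; the example values are made up.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class Instance:
    """Illustrative stand-in for the four generic fields described above."""
    data: Any            # the data represented by the instance
    target: Any = None   # often a label associated with the instance
    name: Any = None     # short identifying name, such as a filename
    source: Any = None   # human-readable source, such as the original text


# Hypothetical example: a labeled document for a classifier.
example = Instance(
    data=["good", "movie"],
    target="positive",
    name="review1.txt",
    source="Good movie.",
)
```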
Image To ASCII Text org.seasr.meandre.components.vis.text
Converts an image to ASCII text
06/06/2011
Tuple Text Replacement org.seasr.meandre.components.transform.text
Performs simple tuple text replacement, based on input configuration
06/06/2011
List Words org.seasr.meandre.components.tools.tapor.restservice
Demonstrates how to construct an interface that consumes the TAPoR List Words REST service at http://tada.mcmaster.ca/Main/TAPoRwareHTMLListWords.
06/06/2011
Mallet Topic Modeling org.seasr.meandre.components.analytics.mallet
This component performs topic analysis in the style of LDA and its variants using Mallet.
06/06/2011
