| Name | Package | Description | Creation date |
|---|---|---|---|
| Token Counter | org.seasr.meandre.components.analytics.text.statistics | Given a collection of tokens from a document, this component counts all the different occurrences of the tokens. If the document contains multiple token sequences, the component aggregate all the sequences providing a cummulative count. |
06/06/2011 |
| Configurable Service Head Request | org.seasr.meandre.components.tools.webservice | Configurable service head for a service that gets data via POSTs |
06/06/2011 |
| Link Creation to Protovis | org.seasr.meandre.components.analytics.socialnetworking | Overview: |
06/06/2011 |
| Retrieve From File | org.seasr.meandre.components.tools.io | Reconstructs an object based on its persisted (serialized) state. |
06/06/2011 |
| Persist To File | org.seasr.meandre.components.tools.io | This component persists (serializes) the input into a file. |
06/06/2011 |
| Tuple Logger | org.seasr.meandre.components.tools.tuples | This component prints the incoming set of tuples to the console (level info) |
06/06/2011 |
| Extract Request Parameters | org.seasr.meandre.components.tools.webservice | Extract the given field in the property from the request, pushing the values one by one to the output. Note: If used in streaming mode, the component outputs the initiator and terminator markers only on the raw data output port. |
06/06/2011 |
| Velocity Template To HTML | org.seasr.meandre.components.vis.html | Generates output (html) via Velocity using a template |
06/06/2011 |
| Read Semantic Model | org.seasr.meandre.components.tools.semantic.io | This component reads a RDF model. The model name is specified in the input. Also, it is able to read from URLs and local files using URL of file syntax. The component outputs the semantic model read. A property allows to control the behaviour of the component in front of an IO error, allowing to continue pushing and empty model or throwing and exception forcing the finalization of the flow execution. |
06/06/2011 |
| Page Text Extractor | org.seasr.meandre.components.tools.text.io | Extract the text of the pages in an XML JSTOR document. |
06/06/2011 |
| OpenNLP Chunker | org.seasr.meandre.components.nlp.opennlp | This component performs treebank chunking on the incoming set of tokenized sentences unsing OpenNLP facilities. |
06/06/2011 |
| Parallel Coordinates | org.seasr.meandre.components.vis.protovis | Protovis Parallel Coordinates |
06/06/2011 |
| Tuple To XML | org.seasr.meandre.components.transform.text | Overview: |
06/06/2011 |
| Write Evernote | org.seasr.meandre.components.evernote | Demonstrates how to implement a interface to write text conforming to Evernote Markup Language(ENML) to the owner's default note book under sandbox.evernote.com. It should be noted that sandbox.evernote.com does not corporate with the api setting the default note book. ,so designating the default note book through sandbox.evernote.com before writting. |
06/06/2011 |
| NE from Gazetteer | org.seasr.meandre.components.nlp.gazetteer | This component performs named entity tagging using GATE's Stand Alone Gazetteer facilities |
06/06/2011 |
| Zotero Author Extractor | org.seasr.meandre.components.rdf.zotero | Extract the authors for each entry of a Zotero RDF |
06/06/2011 |
| Simile Timeline Generator | org.seasr.meandre.components.vis.temporal | Generates the necessary HTML and XML files for viewing timeline and store them on the local machine. The two files will be stored under public/resources/timeline/. For fast browse, dates are grouped into different time slices. |
06/06/2011 |
| Simple Google Map Viewer | org.seasr.meandre.components.vis.geographic | Presents a simple google map based on location inputs |
06/06/2011 |
| Update Ignore List | org.seasr.meandre.components.tools.basic | Adds entries to an ignore list |
06/06/2011 |
| Tuple To SQL | org.seasr.meandre.components.tools.tuples | This component writes tuples to a db table |
06/06/2011 |
| Stacked Area Viewer | org.seasr.meandre.components.vis.flash | flash vis for stacked areas |
06/06/2011 |
| XML To XML With XSL 2 | org.seasr.meandre.components.transform.xml | This component inputs two XML documents, one the XML data and the other the XSL. It transforms the XML data based on the XSL template and outputs the transformed XML. |
06/06/2011 |
| Concordance | org.seasr.meandre.components.tools.tapor.restservice | Demonstrates how to construct a interface to consume rest service of concordance of Tapor at http://tada.mcmaster.ca/Main/TAPoRwareHTMLConcordance. |
06/06/2011 |
| Text Cleaner | org.seasr.meandre.components.transform.text | Performs find and replace on text using regular expressions |
06/06/2011 |
| Read Content With Cookie | org.seasr.meandre.components.tools.text.io | This component reads content as byte array from a remote location with cookie support. The text location is specified in the input andthe cookie is also an input. The component outputs the byte array of the content. A property controls the behavior of the component in the event of an IO error, allowing it to ignore the error and continue, or throw an exception, forcing the finalization of the flow execution. |
06/06/2011 |
| SCP | org.seasr.meandre.components.tools.ssh | Securely copies a file or directory from a local to a remote server or from a remote server to local. One of the source or destination inputs needs to specify the remote resource to be copied in the following format: [host]:[path] Example: source=my.server.com:/tmp/file.txt destination=/tmp |
06/06/2011 |
| Java String To Strings | org.seasr.meandre.components.transform | Converts a Java string into an equivalent string protocol buffer wrapper. |
06/06/2011 |
| Text Format | org.seasr.meandre.components.tools.basic | This component can be used to create text from the inputs based on a specified format |
06/06/2011 |
| Select JSON attribute | org.seasr.meandre.components.transform.text | Select a JSON attribute and return its value. |
06/06/2011 |
| Dictionary Merger | org.seasr.meandre.components.tools.text.normalize.porter | Merges the incoming dictionary. |
06/06/2011 |
| Retrive Persisted Data | org.seasr.meandre.components.tools.db | This component retrieves persisted data from a database |
06/06/2011 |
| Push Text | org.seasr.meandre.components.tools.basic | Pushes the value of the text message property to the output. |
06/06/2011 |
| DFR Author Extractor | org.seasr.meandre.components.jstor.xml.extractors | This component extracts the set of authors from an XML result returned from a JSTOR DFR query |
06/06/2011 |
| OpenMary Client | org.seasr.meandre.components.nlp.openmary | OpenMary client |
06/06/2011 |
| Stanford Parser | org.seasr.meandre.components.nlp.stanford | This component is NOT for general Use. DO NOT USE. |
06/06/2011 |
| Table To JSON | org.seasr.meandre.components.transform.table | Converts table to JSON format. |
06/06/2011 |
| Create Empty Model | org.seasr.meandre.components.tools.semantic | Pushes an empty model. It provides a couple of properties to control how many times it needs to be pushed, and if it needs to be wrapped with terminators |
06/06/2011 |
| GC | org.seasr.meandre.components.tools | Asks the JVM to perform garbage collection |
06/06/2011 |
| Tuple To CSV | org.seasr.meandre.components.tools.tuples | This component writes the incoming set of tuples to CSV String |
06/06/2011 |
| PDF Text Extractor | org.seasr.meandre.components.tools.text.io | This component extracts the text from a pdf document. The input is a String or URL specifiying the url of the pdf document. The output is the extracted text. |
06/06/2011 |
| Token To NGrams | org.seasr.meandre.components.transform.text | Transforms token counts to ngram objects. |
06/06/2011 |
| Tokens To Text | org.seasr.meandre.components.transform.totext | Given a collection of tokens, this component converts it into text. |
06/06/2011 |
| Error Aggregator | org.seasr.meandre.components.tools | This component aggregates all errors for a stream |
06/06/2011 |
| Tag Cloud Image Maker | org.seasr.meandre.components.vis.text | Creates a tag cloud image from a word count table. If there are many tags to be displayed, reduce the maximum size of the font or increase the size of the canvas to accommodate all of tags. |
06/06/2011 |
| Text Replacement | org.seasr.meandre.components.transform.text | Performs simple text replacement, based on input configuration |
06/06/2011 |
| OpenNLP Sentence Tokenizer | org.seasr.meandre.components.nlp.opennlp | This component breaks the text contained in the input sentences using OpenNLP tokenizing facilities. |
06/06/2011 |
| Top N Filter | org.seasr.meandre.components.transform.filters | Inputs token counts or the Map |
06/06/2011 |
| Twitter Search | org.seasr.meandre.apps.twitter | This component searches Twitter for stuff |
06/06/2011 |
| Read Evernote | org.seasr.meandre.components.evernote | Demomstrates how to write a interface to read all of the notes under owner's account at sandbox.evernote.com, and outputs note's title and note's content as XML text. It should be pointed out that sandbox.evernote.com is only a test-only server. |
06/06/2011 |
| Custom VAM Query | org.seasr.meandre.components.transform.text | Custom VAM component to query and requery until all results are received. And to select JSON attribute of records and return its value. |
06/06/2011 |
| Tuple Grouper | org.seasr.meandre.components.tools.tuples | This component groups (frequency counts) consecutive tuples based on window size |
06/06/2011 |
| Push Text Streaming | org.seasr.meandre.components.tools.basic | Pushes the value of the text message property to the output. It provides a couple of properties to control how many times it needs to be pushed, and if it needs to be wrapped with delimiters. |
06/06/2011 |
| Dunning Log Likelihood | org.seasr.meandre.components.analytics.statistics | This component calculates DunningLogLikelihood based on the input of two counts. This major functionality was developed as part of the Monk Project (http://monkproject.org) References: For more information on Dunning's log-likelihood statistic, see Dunning, T. 1993. Accurate methods for the statistics of surprise and coincidence. Computational Linguistics 19.1 (Mar. 1993), 61-74. or Griffiths, D. Head First Statistics. 2009. Second edition. O'Reilly. |
06/06/2011 |
| Wrapped Models Reducer | org.seasr.meandre.components.tools.semantic | This component is intended to work on wrapped model streams. Given a sequence of wrapped models, it will create a new model that accumulates/reduces all the information and then push them the resulting model. If no wrapped model is provided it will act as a simple pass through. |
06/06/2011 |
| CSV Text To Token Counts | org.seasr.meandre.components.transform.text | Converts CSV text to tount counts. |
06/06/2011 |
| Read Zip | org.seasr.meandre.components.tools.io | This component reads a zip file and passes each file as output. |
06/06/2011 |
| Token Counter Reducer | org.seasr.meandre.components.analytics.text.statistics | This component is intended to work on wrapped model streams. Given a sequence of wrapped models, it will create a new model that accumulates/reduces the token counts and then pushes the resulting model. If no wrapped model is provided it will act as a simple pass through. This component is based on Wrapped models reducer. |
06/06/2011 |
| JSON To Tuple | org.seasr.meandre.components.tools.tuples | This component converts a simple JSON array-of-simple-objects structure into a Tuple structure |
06/06/2011 |
| JSTOR DFR Query | org.seasr.meandre.components.jstor.io | This component performs a query against the JSTOR 'Data For Research' data. The format of the query conforms to CQL (Contextual Query Language), a standard syntax for representing queries. A formal definition of CQL can be found on the Library of Congress web site. Searchable fields:
Sample queries:
|
06/06/2011 |
| Trigger Message Count | org.seasr.meandre.components.tools.control | This component will receive a message, and a count.The message is sent out as many times as indicated by the count. |
06/06/2011 |
| Tuple Value Filter | org.seasr.meandre.components.tools.tuples | This component filters the incoming set of tuples based on a regular expression |
06/06/2011 |
| Generic Template | org.seasr.meandre.components.tools.text.io | Generates and displays a webpage via a Velocity Template |
06/06/2011 |
| R Serve Wrapper | org.seasr.meandre.components.analytics.statistics.r | This component sends an R script that it receives as input to the R Server at the connection it receives as input. |
06/06/2011 |
| Model To RDF Text | org.seasr.meandre.components.tools.semantic | This component takes the input semantic model and converts it into a text form. Properties allow to specify the dialect to use |
06/06/2011 |
| Universal Text Extractor | org.seasr.meandre.components.tools.text.io | Extracts text from the specified input location. Supported location references include: PDF files, HTML/XML files, text files. |
06/06/2011 |
| Trigger Message | org.seasr.meandre.components.tools.control | This component will receive a message and a trigger.The message is saved so that it can be output for every trigger received.If a new message is received, then it replaces the previous message. |
06/06/2011 |
| Twitter Tuple Web Server | org.seasr.meandre.apps.twitter | Service head for a twitter data service that gets data via posts |
06/06/2011 |
| Twitter to Tuple | org.seasr.meandre.apps.twitter | This component reads a twitter stream and pushes out tuples (id,text,userid, followers, location) |
06/06/2011 |
| Token Count To JSON | org.seasr.meandre.components.transform.text | Converts token count to JSON. |
06/06/2011 |
| Google Map Viewer | org.seasr.meandre.components.vis.geographic | Generates a web page containing google map marked with locations from the original XML document. |
06/06/2011 |
| OpenNLP Named Entity | org.seasr.meandre.components.nlp.opennlp | This component tags entities from the incoming set of tokenized sentences using OpenNLP named entity facilities. |
06/06/2011 |
| XML Tag Filter | org.seasr.meandre.components.transform.xml | This component generates an xsl template for extracting the structured content for a specific tag. The tag is set up through a property. |
06/06/2011 |
| Input Text | org.seasr.meandre.components.tools.text.io | Presents a simple text area for user to input string. |
06/06/2011 |
| Text Splitter | org.seasr.meandre.components.tools.basic | This component takes a Strings object with multiple elements and creates a stream containing each individual element separately |
06/06/2011 |
| Wrap Text Into Model | org.seasr.meandre.components.tools.semantic | Given the text provided in the input, this component generates a new semantic wrapper model containing the this text. |
06/06/2011 |
| Flesch Kincaid Readability Measure | org.seasr.meandre.components.analytics.text.readability | Computes the Flesch Kincaid readability measure as explained at http://en.wikipedia.org/wiki/Flesch-Kincaid_Readability_Test. The code is based on the work done by Daniel Shiffman at http://www.shiffman.net/teaching/a2z/week1/ |
06/06/2011 |
| Upload File | org.seasr.meandre.components.tools.io | Uploads and submits files from user's own machine. This component works with Fluid Infusion 1.2. |
06/06/2011 |
| Dictionary Of Stemming | org.seasr.meandre.components.tools.text.normalize.porter | Constructs a dictionary mapping the stemmed words back to the actual words in the original document, so for the output map, the stemmed words are keys and the actual words are values. If several words have the same stem, the shortest word is choosen as the representative. |
06/06/2011 |
| Attribute Cache Lookup | org.seasr.meandre.components.tools.tuples | This component looks up the key for a tuple in the ignore file, if specified, and ignores the tuple (passes it on as is) if it's found there, otherwise it looks it up in the cache file and if an attribute is found in cache it is associated with the tuple, otherwise, if a cache miss occurred the tuple is pushed out on the 'tuple_not_cached' port for processing by other means. |
06/06/2011 |
| Count Merger | org.seasr.meandre.components.tools.text.normalize.porter | Merges the incoming count. |
06/06/2011 |
| JPlayer | org.seasr.meandre.components.vis.multimedia | Audio / video player |
06/06/2011 |
| Stanford POS Tagger | org.seasr.meandre.components.nlp.stanford | This component performs pos tagging using Stanford's NLP facilities |
06/06/2011 |
| Text Accumulator | org.seasr.meandre.components.tools.basic | This component accumulates multiple text values and pushes them out as a single concatenated value. |
06/06/2011 |
| Frame Maker | org.seasr.meandre.components.vis.html | Arranges into frames multiple HTML documents that are part of a stream |
06/06/2011 |
| Tuple Value Frequency Counter | org.seasr.meandre.components.tools.tuples | This component counts the incoming set of tuples, based on a unique field value |
06/06/2011 |
| Entity XML To Simile XML | org.seasr.meandre.components.tools.xml.transform | Conforms the input XML document, such as, |
06/06/2011 |
| Stanford NE Tuple Tagger | org.seasr.meandre.components.nlp.stanford | This component performs named entity tagging using Stanford's NLP facilities |
06/06/2011 |
| Tuple Labeler | org.seasr.meandre.components.tools.tuples | This component takes in two different sets of tuples. The first set is used to build a key-value map. The key field is specified as well as the value field. The second set of tuples is then labelled using the map built from the first set. A value from the tuple is used (as the key into the map) to get the label (the value returned from the map) This label is appended to the incoming tuple are returned as a new tuple set |
06/06/2011 |
| Update Tuple Cache | org.seasr.meandre.components.tools.tuples | Adds entries to the cache |
06/06/2011 |
| Spell Check | org.seasr.meandre.components.transform.text | Performs spell checking on the input and optionally replaces misspelled words with the top ranked suggestion. The component also produces a list of the misspellings in the document. |
06/06/2011 |
| HTML Text Extractor | org.seasr.meandre.components.tools.text.io | Converts an HTML doc to plain text. All nodes from the dom tree that are plain text nodes are appended together and returned as a string. |
06/06/2011 |
| Write Semantic Model | org.seasr.meandre.components.tools.semantic.io | This component writes a RDF model. The model name is specified in the input. Also, it is able to read from URLs and local files using URL of file syntax. The component outputs the semantic model read. A property allows to control the behaviour of the component in front of an IO error, allowing to continue pushing and empty model or throwing and exception forcing the finalization of the flow execution. |
06/06/2011 |
| Link Creation to GraphML | org.seasr.meandre.components.analytics.socialnetworking | Overview: |
06/06/2011 |
| Tuple To JSON | org.seasr.meandre.components.tools.tuples | This component converts the rows of a tuple into an array of JSON objects that are keyed on the tuple column labels |
06/06/2011 |
| Tuple Value To String | org.seasr.meandre.components.tools.tuples | This component extracts the value(s) of an attribute from the tuple(s) and pushes it out. |
06/06/2011 |
| Author Centrality Analysis | org.seasr.meandre.components.analytics.socialnetworking | Given a collection of authors, grouped by publication, this component generates a report based on the social network analysis. This analysis uses the JUNG network importance algorithms to rank the authors. This component uses Betweenness Centrality, which ranks each author in the author-citation graph derived from the number of shortest paths that pass through them. |
06/06/2011 |
| Input Data | org.seasr.meandre.components.tools.text.io | This component allows the user to specify the dataset(s) to be processed. The user can use URL(s), file(s), or input the raw text to be processed. |
06/06/2011 |
| Data Type Decoder | org.seasr.meandre.components.tools.control | This component decodes the input data type and pushes out the decoded types on different outputs |
06/06/2011 |
| Print To Console | org.seasr.meandre.components.tools.basic | This component takes the input and prints it to the console. |
06/06/2011 |
| Token Counts To Map | org.seasr.meandre.components.transform | Given a collection of token counts, this component converts them to a Java map. |
06/06/2011 |
| Tuple Splitter | org.seasr.meandre.components.tools.tuples | This component splits an aggregate tuple into individual tuples |
06/06/2011 |
| Google Docs Downloader | org.seasr.meandre.components.GoogleDocs | This component extracts data from a Google Docs directory and creates a local copy of the directory |
06/06/2011 |
| Write To File | org.seasr.meandre.components.tools.io | This component writes the given data to a file. |
06/06/2011 |
| SSH Public Key | org.seasr.meandre.components.tools.ssh | Executes a command based on SSH and returns result. |
06/06/2011 |
| HITS Summarizer | org.seasr.meandre.components.analytics.text.summarization | This component ranks and sorts the tokenized input sentences providing a simple summarization by sentence seletion. |
06/06/2011 |
| Strings To Java String | org.seasr.meandre.components.transform | Converts a Google string into an equivalent Java string |
06/06/2011 |
| Fork x2 | org.seasr.meandre.components.tools.control | Description: This component takes in any Java objectand distributes it (or copies of it) across multiple outputs. The user can choose from five object replication methods -- by reference, shallow copy by clone, deep copy via serialization, copy via constructor, or copy via custom method. |
06/06/2011 |
| URL Decode | org.seasr.meandre.components.transform.text | This component takes a URL encoded input and decodes it as a regular string. |
06/06/2011 |
| GeoLocation Cleaner | org.seasr.meandre.components.tools.geo | This component uses the Yahoo GeoService to attempt to fully qualify location entities within a single sentence. |
06/06/2011 |
| Tuple Value Cleaner | org.seasr.meandre.components.sentiment | DO NOT USE in production code :) testing only. This component cleans a value of a tuple based on a regular expression |
06/06/2011 |
| Select Nodes via XPath | org.seasr.meandre.components.transform.xml | This component extracts XML nodes matching an XPath expression from an XML document and outputs them one by one. |
06/06/2011 |
| Twitter Circle Packing Graph | org.seasr.meandre.apps.twitter | Twitter Circle Packing Graph |
06/06/2011 |
| Force Directed Graph | org.seasr.meandre.components.vis.protovis | This components creates a Protovis link node graph that is a force directed graph to layout the graph. It is based on JSON data that specifies the nodes and links. |
06/06/2011 |
| Sentences To Text | org.seasr.meandre.components.transform.totext | Given a collection of sentences, this component converts it into text. |
06/06/2011 |
| HTML Viewer | org.seasr.meandre.components.vis.html | Generates a webpage from the HTML text that it receives as input. |
06/06/2011 |
| Tuple Aggregator | org.seasr.meandre.components.tools.tuples | This component aggregates a set of tuples |
06/06/2011 |
| Unique Tuple Filter | org.seasr.meandre.components.tools.tuples | This component pushes unique tuples (uniqueness based on the value of an attribute) on the 'unique_tuples' port and the duplicate tuples on the 'duplicate_tuples' port |
06/06/2011 |
| Cross Domain Policy Server | org.seasr.meandre.components.vis.flash | flash xml cross domain policy requests |
06/06/2011 |
| Tuple To HTML | org.seasr.meandre.components.tools.tuples | Creates HTML for displaying a set of tuples in a table |
06/06/2011 |
| Read Text | org.seasr.meandre.components.tools.text.io | This component reads text from a local or remote location. The text location is specified in the input. The component outputs the text read. A property controls the behavior of the component in the event of an IO error, allowing it to ignore the error and continue, or throw an exception, forcing the finalization of the flow execution. |
06/06/2011 |
| XML To Text | org.seasr.meandre.components.tools.xml | This component write a XML in text form and generates it text form. The XML document to convert is received in its input. The component outputs the text generated. A property allows to control the behaviour of the component in front of an IO error, allowing to continue pushing and empty model or throwing and exception forcing the finalization of the flow execution. |
06/06/2011 |
| Token Filter | org.seasr.meandre.components.transform.filters | This component filters the tokens of the input based on the list of tokens provided. The component has 3 inputs for the type of data to be filtered (tokens, token counts or tokenized sentencesand one input for the list of tokens to filter. It will output the same data type it received. If new tokens to filter are provide they either replace the current ones or add them to the black list. The component waits for a black list and then begins processing the data it receives. The component outputs the filtered tokens, token counts or tokenized sentences. The comparison of blacklisted tokens to the data will ignore case by default. Set ignore_case=false to work in case sensitive mode. |
06/06/2011 |
| Input User Password | org.seasr.meandre.components.tools.text.io | This component allows a user to enter credentials that can be passed to other components |
06/06/2011 |
| HelloWorld | org.seasr.meandre.components.vis.gwt | Example component that uses GWT |
06/06/2011 |
| Text To XML | org.seasr.meandre.components.tools.xml | This component reads a XML in text form and buids a manipulatable document object. The text to convert is received in its input. The component outputs the XML object read. A property allows to control the behaviour of the component in front of an IO error, allowing to continue pushing and empty XML or throwing and exception forcing the finalization of the flow execution. |
06/06/2011 |
| Concept Finder | org.seasr.meandre.components.sentiment | This component connects to the SynNet service and computes the best concept for the input token based on the defined concept rules |
06/06/2011 |
| Zotero URL Extractor | org.seasr.meandre.components.rdf.zotero | Extract the urls for each of the entry of a Zotero RDF |
06/06/2011 |
| Read XML | org.seasr.meandre.components.tools.xml.io | This component reads a XML. The XML location is specified in the input. Also, it is able to read from URLs and local files using URL of file syntax. The component outputs the semantic model read. A property allows to control the behaviour of the component in front of an IO error, allowing to continue pushing and empty XML or throwing and exception forcing the finalization of the flow execution. |
06/06/2011 |
| Service Tail Text | org.seasr.meandre.components.tools.webservice | Service tail for a web service |
06/06/2011 |
| Input Counter | org.seasr.meandre.components.tools.control | This component counts the number of data objects passing through it and outputs the current count. If the component is set to act on a particular stream (by setting 'streamId' appropriately) then at the end of the stream the component will output a total count of all elements in that stream. |
06/06/2011 |
| Date Filter | org.seasr.meandre.components.transform.xml | This component generates an xsl template to filter an xml file and include only the dates between the minimum and maximum year. This is used to filter the Simile xml file that is generated. |
06/06/2011 |
| Add Tuple Attribute | org.seasr.meandre.components.tools.tuples | This component adds an extra attribute to existing tuple(s). |
06/06/2011 |
| CSV To Tuple | org.seasr.meandre.components.tools.tuples | This component converts a csv string into tuples. Each line of the incoming text is a new tuple. It does not handle missing values |
06/06/2011 |
| Aggregate Mallet Instances | org.seasr.meandre.components.analytics.mallet | This component accumulates the instances belonging to the specified stream and produces a list of machine learning instances, typically used for training or testing of a machine learning algorithm. |
06/06/2011 |
| Tuple To Mallet Feature Sequence | org.seasr.meandre.components.analytics.mallet | This component converts a specific attribute into a Mallet feature sequence |
06/06/2011 |
| Normalize Token Counts | org.seasr.meandre.components.transform.text | This component sets the count of tokens with a count greater than zero to 1. |
06/06/2011 |
| HTML Fragment Maker | org.seasr.meandre.components.vis.html | Generates an HTML fragment based on the input data.The encoding of the data is specified via the encoding property. Supported MIME types: 'text/plain', 'image/ |
06/06/2011 |
| XML Aggregator | org.seasr.meandre.components.transform.xml | This component extracts information from input xml files and, constructs a new xml file to output. It is for TEI only. |
06/06/2011 |
| Arc Diagram | org.seasr.meandre.components.vis.protovis | This components creates a Protovis arc diagram based on JSON data that specifies the nodes and links. |
06/06/2011 |
| Concept Counter | org.seasr.meandre.components.sentiment | This component tags tuples representing documents with an incremental 'docId' and a count of the frequencies of the concepts present in the document. |
06/06/2011 |
| OpenNLP Tokenizer | org.seasr.meandre.components.nlp.opennlp | This component breaks the text contained in the input using OpenNLP tokenizing facilities. |
06/06/2011 |
| Token Count To Table | org.seasr.meandre.components.transform.text | Converts token count to table. |
06/06/2011 |
| Stream Delimiter Filter | org.seasr.meandre.components.tools.control | This component filters out some or all stream delimiters |
06/06/2011 |
| URL To Tuple | org.seasr.meandre.components.tools.tuples | This component reads a text resources via file or http (assumes no authentication required) |
06/06/2011 |
| Service Head Request | org.seasr.meandre.components.tools.webservice | Service head for a service that gets data via posts |
06/06/2011 |
| SSH | org.seasr.meandre.components.tools.ssh | Executes a command based on SSH and returns result. |
06/06/2011 |
| Google Search To Tuple | org.seasr.meandre.components.tools.tuples | This component performs a Google search by accessing the Google search API.The search term is an input and the search results are output as tuples. |
06/06/2011 |
| Generic Viewer | org.seasr.meandre.components.tools.text.io | Generates and displays a webpage via a Velocity Template |
06/06/2011 |
| Stem | org.seasr.meandre.components.tools.text.normalize.porter | Overview: References: Data Type Restrictions: Data Handling: Scalability: Trigger Criteria: |
06/06/2011 |
| Spell Check with Counts | org.seasr.meandre.components.transform.text | Performs spell checking on the input and optionally replaces misspelled words with the top ranked suggestion based on the supplied token counts. The component also produces a list of the misspellings in the document. |
06/06/2011 |
| Token Count Filter | org.seasr.meandre.components.transform.filters | This component filters (removes) the tokens with token counts satisfying the given constraints |
06/06/2011 |
| Google Map Generator | org.seasr.meandre.components.vis.geographic | Calculates latitude and longitude for an address contained in the input XML document. |
06/06/2011 |
| Concatenate Text | org.seasr.meandre.components.tools.basic | Concatenates text from multiple inputs |
06/06/2011 |
| Search Text | org.seasr.meandre.components.tools.basic | Searches the text input for the regular expression pattern. If the pattern is found and the regular expression specifies a capturing group, then the matching text for the capturing group is pushed out on port Text_Found. If the pattern is found and there are no capturing groups defined, the maching text for the pattern is pushed out on port Text_Found. In both cases multiple outputs are possible.If the pattern is not found, it outputs the original text on port Text. |
06/06/2011 |
| Write To Zip | org.seasr.meandre.components.tools.io | This component writes a zip file containing all the data passed in the stream |
06/06/2011 |
| R Serve Connection | org.seasr.meandre.components.analytics.statistics.r | This component establishes the connection to the R Server for using the R statistics package. |
06/06/2011 |
| Stanford NE Tagger | org.seasr.meandre.components.nlp.stanford | This component performs named entity tagging using Stanford's NLP facilities |
06/06/2011 |
| Transform Token From Dictionary | org.seasr.meandre.components.tools.text.normalize.porter | Replaces tokens with their entries from the dictionary. |
06/06/2011 |
| XML To XML With XSL | org.seasr.meandre.components.transform.xml | This component inputs two XML documents, one the XML data and the other the XSL. It transforms the XML data based on the XSL template and outputs the transformed XML. |
06/06/2011 |
| SQL To Tuple | org.seasr.meandre.components.tools.tuples | This component reads a mysql database |
06/06/2011 |
| To Lowercase | org.seasr.meandre.components.transform.text | Converts a text to lowercase. |
06/06/2011 |
| Summarizer Viewer | org.seasr.meandre.components.vis.text | Visualizes tokens and sentences from HitsSummarizer. Its inputs should be connected directly to HitsSummarizer. |
06/06/2011 |
| Tokenized Sentences Reducer | org.seasr.meandre.components.analytics.text.statistics | This component is intended to work on wrapped model streams. Given a sequence of wrapped models, it will create a new model that accumulates/reduces the tokenized sentences and then pushes the resulting model. If no wrapped model is provided it will act as a simple pass through. This component is based on Wrapped models reducer. |
06/06/2011 |
| RDF Text To Model | org.seasr.meandre.components.tools.semantic | This component reads an RDF model in text form and buids the model. The text to convert is received in its input. The component outputs the semantic model read. A property allows to control the behaviour of the component in front of an IO error, allowing to continue pushing and empty model or throwing and exception forcing the finalization of the flow execution. |
06/06/2011 |
| Text Segmentation | org.seasr.meandre.components.transform.text | The component breaks a document into chunks (segments) for further processing. It transforms the document of tokenized sentences into segments of size that approximates the number of tuples specified in the property. Segments always end at sentence boundaries. |
06/06/2011 |
| SCP Public Key | org.seasr.meandre.components.tools.ssh | Securely copies a file or directory from a local to a remote server or from a remote server to local. One of the source or destination inputs needs to specify the remote resource to be copied in the following format: [host]:[path] Example: source=my.server.com:/tmp/file.txt destination=/tmp |
06/06/2011 |
| URL Encode | org.seasr.meandre.components.transform.text | This component takes the input and encodes it as a URL string. |
06/06/2011 |
| OpenNLP Sentence Detector | org.seasr.meandre.components.nlp.opennlp | This component splits sentences of the text contained in the input using OpenNLP tokenizing facilities. |
06/06/2011 |
| OpenNLP POS Tagger | org.seasr.meandre.components.nlp.opennlp | This component tags the tokens of the incoming set of tokenized sentences using OpenNLP pos facilities. |
06/06/2011 |
| List Directory Files | org.seasr.meandre.components.tools.basic | Given a [server-relative] directory, this component pushes all the file names available that match a certain regular expression given in the properties. |
06/06/2011 |
| Read Text Set Cookie | org.seasr.meandre.components.tools.text.io | This component reads text from a remote location with cookie support. The text location is specified in the input. The component outputs the text read and the cookie obtained from the request. A property controls the behavior of the component in the event of an IO error, allowing it to ignore the error and continue, or throw an exception, forcing the finalization of the flow execution. |
06/06/2011 |
| Table Viewer | org.seasr.meandre.components.vis.table | This component provides a table viewer for a data set. The input can come from a Vector |
06/06/2011 |
| Token Counts To Text | org.seasr.meandre.components.transform.totext | Given a collection of token counts, this component converts it into text. The default separator is a comma, so make sure tokens do not have commas or change the separator. |
06/06/2011 |
| Fork x5 | org.seasr.meandre.components.tools.control | Description: This component takes in any Java object and distributes it (or copies of it) across multiple outputs. The user can choose from five object replication methods -- by reference, shallow copy by clone, deep copy via serialization, copy via constructor, or copy via custom method. |
06/06/2011 |
| Persist To DB | org.seasr.meandre.components.tools.db | This component persists (serializes) the input into a database |
06/06/2011 |
| Create Mallet Instance | org.seasr.meandre.components.analytics.mallet | This component creates a Mallet instance object. This represents a machine learning 'example' to be used in training, testing or performance of various machine learning algorithms. An instance contains four generic fields of predefined name: 'data', 'target', 'name', and 'source'. 'Data' holds the data represented by the instance, 'target' is often a label associated with the instance, 'name' is a short identifying name for the instance a(such as a filename), and 'source' is human-readable source information, (such as the original text). |
06/06/2011 |
| Image To ASCII Text | org.seasr.meandre.components.vis.text | Converts an image to ASCII text |
06/06/2011 |
| Tuple Text Replacement | org.seasr.meandre.components.transform.text | Performs simple tuple text replacement, based on input configuration |
06/06/2011 |
| List Words | org.seasr.meandre.components.tools.tapor.restservice | Demonstrates how to construct a interface to consume rest service of list words of Tapor at http://tada.mcmaster.ca/Main/TAPoRwareHTMLListWords. |
06/06/2011 |
| Mallet Topic Modeling | org.seasr.meandre.components.analytics.mallet | This component perform topic analysis in the style of LDA and its variants using Mallet |
06/06/2011 |