| Name | Package | Description | Creation date |
|---|---|---|---|
| FeatureLensWord | org.seasr.meandre.components.io.datasource.featurelens | Adds the individual words to the Pattern database table of FeatureLens. |
06/03/2010 |
| XML Document To String | org.seasr.meandre.components.transform.conversion | Converts XML doucment to string. This component can be used to write PMML output to a string. |
06/03/2010 |
| CLOSET | org.seasr.meandre.components.discovery.ruleassociation.fptree | Overview: This module implements the CLOSET algorithm to generate closed frequent itemsets consisting of items that occur in a sufficient number of examples to satisfy the minimum support criteria. Detailed Description: This module takes an FPProb object that has been generated by a Large Item Table Generator module and uses the CLOSET algorithm to find the combinations of items that satisfy a minimum support criteria. An item is an [attribute,value] pair that occurs in the set of examples being mined. The user controls the support criteria via the References: For more information on the CLOSET frequent pattern mining algorithm, see " CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets ", Jian Pei, Jiawei Han, Runying Mao, 2000. Data Type Restrictions: While this module can operate on attributes of any datatype, in practice it is usually infeasible to use it with continuous-valued attributes. The module considers each [attribute,value] pair that occurs in the examples individually when building the frequent itemsets. Continuous attributes (and categorical attributes with a large number of values) are less likely to meet the Minimum Support requirements and can result in unacceptably long execution time. |
06/03/2010 |
| SearchBox | org.seasr.meandre.javascript | Present a search box for user to input query. |
06/03/2010 |
| FP Growth | org.seasr.meandre.components.discovery.ruleassociation.fpgrowth | This component implements the FPGrowth algorithm to generate frequent itemsets consisting of items that occur in a sufficient number of examples to satisfy the minimum support criteria. Detailed Description: This component takes an Item Sets object that has been generated by a Table To Item Sets component and uses the FPGrowth algorithm to find the combinations of items that satisfy a minimum support criteria. An item is an [attribute,value] pair that occurs in the set of examples being mined. The user controls the support criteria via the Minimum Support % property that specifies the percentage of all examples that must contain a given combination of items before that combination is included in the generated output. Each combination of items that satisfies the Minimum Support % is called a Frequent Itemset. The user can restrict the maximum number of items included in any frequent itemset with the Maximum Items Per Rule property. The generation of sets with large number of items can be computationally expensive, so setting this property in conjunction with the Minimum Support % property helps keep the component runtime reasonable. In a typical flow the Frequent Item Sets output port from this component is connected to a Compute Confidence component which forms association rules that satisfy a minimum confidence value. References: For more information on the FPGrowth frequent pattern mining algorithm, see "Mining Frequent Patterns without Candidate Generation"Jiawei Han, Jian Pei, and Yiwen Yin, 2000. Limitations: The FPGrowth and Compute Confidence components currently build rules with a single item in the consequent. Data Type Restrictions: While this component can operate on attributes of any datatype, in practice it is usually infeasible to use it with continuous-valued attributes. The component considers each [attribute,value] pair that occurs in the examples individually when building the frequent itemsets. Continuous attributes (and categorical attributes with a large number of values) are less likely to meet the Minimum Support requirements and can result in unacceptably long execution time. Typically Choose Attributes and Binning components should appear in the itinerary prior to the Table to Item Sets component, whose output produces the Item Sets object used as input by this component. The Choosing/Binning components can reduce the number of distinct [attribute,value] pairs that must be considered in this component to a reasonable number. Data Handling: This component does not modify the input Item Sets in any way. Scalability: This component creates an array of integers to hold the indices of the items in each frequent itemset. The component may be computationally intensive, and scales with the number of Item Sets entries to search. The user can limit the size of the frequent itemsets although this will have little effect on performance for this algorithm. Choosing/Binning components can be included in the itinerary prior to this components to reduce the number of Item Sets entries. |
06/03/2010 |
| Large Item Table Generator | org.seasr.meandre.components.discovery.ruleassociation.fptree | Overview: This module transforms a SparseExampleTable containing term frequency values into a data structure, FPProb, that efficiently represents item occurrences within user supplied support constraints. NOTE: All non-zero values for term frequency are treated alike -- they are counted as a positive occurrence for that row. References: N/A. Data Type Restrictions: The input table must be a SparseExampleTable containing term frequency information. Data Handling: This module does not modify the input SparseExampleTable Scalability: This module makes a constant number of passes over the table data. Memory usage is proportional to the size of the input SparseExampleTable |
06/03/2010 |
| FeatureFilterLite | org.seasr.meandre.components.transform.table | Overview: This module scans the input SparseTable and any term (column) whose support does not fall within the range specified has its column removed from the table. This can greatly reduce the total number features used for learning -- improving accuracy and performance. Data Type Restrictions: The input Table must be an instance of a SparseTable. Data Handling: A new SparseTable instance is created and only columns that will be kept are copied into it. Scalability: Creates a second table on the same order of size as the original. Columns from the first table are inserted into the second table; no copies are made. Algorithm makes one pass over the table columns and one pass over the table data. |
06/03/2010 |
| Get URLs | org.seasr.meandre.components.io | Overview: |
06/03/2010 |
| HAC Model Builder | org.seasr.meandre.components.discovery.cluster.hac | Takes a d2k table object and builds a full bottom up cluster tree. |
06/03/2010 |
| Basic Table Factory Injector | org.seasr.meandre.components.io.table | Overview: Outputs a TableFactory suitable for creating D2K Tables from the ncsa.d2k.modules.core.datatype.table.basic package. This is a standard Table suitable for most applications. |
06/03/2010 |
| Write Rule Assoc PMML | org.seasr.meandre.components.discovery.ruleassociation | Write a RuleAssociationModel out in PMML(Predictive Model Markup Language) format complying with the PMML 2.0 DTD. |
06/03/2010 |
| CreateDTModel | org.seasr.meandre.components.prediction.decisiontree.c45 | Overview: Given a DecisionTreeNode that is the root of a decision tree, creates a new DecisionTreeModel.Detailed Description: Creates a DecisionTreeModel from Decision Tree Root. The Example Tablemust be the same set of examples used to construct the decision tree.Data Type Restrictions: Output feature must be nominal.Data Handling: This module will create a PredictionTable from Example Table and proceed to make a prediction for each example in Example Table.Scalability: This module will make a prediction for each example in Example Table. There must be sufficient memory to hold these predictions. |
06/03/2010 |
| Input URL Or Path | org.seasr.meandre.components.io.file.input | This module is used to enter the url to a local or remote resource. Detailed Description: Collect a URL or local path, and create a WebdavClient to access it. The module provides a properties editor that can be used to enter a url to a local or remote resource. If the url points to a local file, the user can enter the name directly into a text area.If the url points to a remote file, the user has to type in the host url, which include protocol, path and port in the text area for host url, and the relative path of the resource to the server in the text area for file name.This module does not perform any checks to verify that the url exists and is accessible with the username and password given by the user. A check is performed to make sure that a file name has been entered and an exception is thrown if the editor text area is blank. The WebdavClient is made available on the WebdavClient output port. For local url, a path may or may not be included in the file name string. |
06/03/2010 |
| XML To Triples | org.seasr.meandre.components.transform.metadatastore | Converts TEI-encoded XML data to RDF triples. The TEI-XML format contains two sections ('teiHeader' and 'text') which this component processes separately to generate triples for each section. Note: The end-of-section processing is indicated by pushing out the string: 'EOF' |
06/03/2010 |
| Initialize JNDI with WebUI | org.seasr.meandre.components.io.datasource | Overview: |
06/03/2010 |
| WebRuleVis | org.seasr.meandre.applet.ruleassociation | This module provides a visual representation of the association rules encapsulated in the input Rule Table. Detailed Description: This module presents a visual representation of association rules identified by a discovery algorithm. Seasr includes several modules that implement association rule discovery algorithms, all of which save their results in a Rule Table structure that can be used as input to this module. The main region of the display contains a matrix that visually depicts the rules. Each numbered column in the matrix corresponds to an association rule that met the minimum support and confidence requirements specified by the user in the rule discovery modules. Items used in the rules, that is attribute-value pairs, are listed along the left side of the matrix. Note that some items in the original data set may not be included in any rule because there was insufficient support and/or confidence to consider the item significant. An icon in the matrix cell corresponding to ( row = item i, column = rule r) indicates that item i is included in rule r. If the matrix cell icon is a box, then the item is part of the rule antecedent. If the icon is a check mark, then the item is part of the rule consequent. For example, if the rules being displayed indicate whether or not a mushroom is edible, a rule might be odor=none and ring_number=one then edibility=edible. This rule would be displayed in a column with a box in the row for the item odor=none and a box in the row for ring_number=one, and there would be a check in the row for edibility=edible. Above the main matrix are two rows of bars labeled Confidence and Support. These bars align with the corresponding rule columns in the main matrix. For any given rule, the confidence and support values for that rule are represented by the degree to which the bars above the rule column are filled in. Brushing the mouse on a confidence or support bar displays the exact value that is graphically represented by the bar height. The rules can be ordered by confidence or by support. To sort the rules, click either the support or the confidence label -- these labels are clickable radio buttons. If support is selected, rules will be sorted using support as the primary key and confidence as the secondary key. Conversely, if the confidence button is chosen, confidence is the primary sort key and support is the secondary key. Directly above the confidence and support display is a toolbar that provides additional functionality. On the left side of the toolbar are two buttons that allow the rows of the table to be displayed according to different sorting schemes. One of the buttons is active at all times. The Alphabetize button sorts the attribute-value pairs alphabetically. The Rank button sorts the rows based on the current Confidence/Support selection, moving the consequents and antecedents of the highest ranking rules to the top of the attribute-value list. On the right side of the toolbar are four additional buttons: Restore Original reverts back to the original table that was displayed before any sorting was done. Filter provides an interface that allows the user to display a subset of the generated rules. The user can scroll to different part of the matrix to get the full picture of a large matrix. Help displays information describing the visualization. Scalability: While the visualization can display a large number of items and rules, there can be a noticeable delay in opening the visualization when a large number of cells are involved. Also, as the number of cells increases beyond a certain point, it is difficult to gain insights from the display. Advanced features to help in these cases are being discussed for a future release. |
06/03/2010 |
| C45TreeBuilder | org.seasr.meandre.components.prediction.decisiontree.c45 | Build a C4.5 decision tree. The tree is build recursively, always choosing the attribute with the highest information gain as the root. The gain ratio is used, whereby the information gain is divided by the information given by the size of the subsets that each branch creates. This prevents highly branching attributes from always becoming the root. The minimum number of records per leaf can be specified. If a leaf is created that has less than the minimum number of records per leaf, the parent will be turned into a leaf itself. |
06/03/2010 |
| Database Resultset to Table | org.seasr.meandre.components.io.conversion | Overview: |
06/03/2010 |
| GraphML To Prefuse From URL | org.seasr.meandre.components.io.graph | Overview: |
06/03/2010 |
| SupportBoundaryFinder | org.seasr.meandre.components.discovery.ruleassociation.fptree | Overview: This module will scan the item sets in an FPProb object and determine the minimum and maximum supports. It adds this information to the FPProb object before writing it to output. References: For more information on the FPGrowth frequent pattern mining algorithm, see "Mining Frequent Patterns without Candidate Generation"Jiawei Han, Jian Pei, and Yiwen Yin, 2000. Data Handling: This module modifies the support boundary values in the FPProb object. |
06/03/2010 |
| PredictionTableReport | org.seasr.meandre.applet | Provides a visualization to evaluate the performance of a predictive model.Detailed Description: Given a PredictionTable with both the predictions and the actual values, this module will provide a simple visualization to evaluate the performance of a predictive model on a data set. The accuracy, defined as the number of correct predictions, is displayed. A pie chart depicting the accuracy is also shown. A confusion matrix for the PredictionTable is created. The confusion matrix displays the precision and recall of the predictive model. Precision is defined as the number of correct predictions within a class divided by the number of predictions within a class. Recall is the number of relevant predictions within a class divided by the number that actually exist in a class. The confusion matrix also displays the Type I and Type II errors. Type I error is defined as accepting an item as a member of a class when it is actually false, known as a false positive. Type II error is defined as rejecting an item as a member of class when it is actually true, known as a false negative. The confusion matrix is shown with the unique predictions along the top. These are labeled Prediction. The actual values are displayed along the side. These are labeled Ground Truth.Data Type Restrictions: A PredictionTable with both the predictions and actual values is required. This module only supports classification predictions. Continuous predictions are not supported.Data Handling: This module does not modify the input data.Scalability: This module makes one pass over the data to count the number of correct and incorrect predictions. |
06/03/2010 |
| Create Delimited File Parser | org.seasr.meandre.components.io.file.input | This module creates a parser for the specified WevdavClient. The file is expected to have a consistent delimiter character. Detailed Description: Data Handling: The module does not destroy or modify the input data. |
06/03/2010 |
| InputStreamToItemSets | org.seasr.meandre.components.discovery.ruleassociation | Creates Item sets from .is formatted files |
06/03/2010 |
| Jung To Prefuse | org.seasr.meandre.components.io.graph | Overview: Converts a JUNG graph (directed or not) to a Prefuse graph. Acknowledgement: This module uses functionality from the JUNG project. See http://jung.sourceforge.net. Acknowledgement: This module uses functionality from the Prefuse project. See http://prefuse.org. |
06/03/2010 |
| Initialize JNDI Datsources | org.seasr.meandre.components.io.datasource | Overview: |
06/03/2010 |
| C45TreePruner | org.seasr.meandre.components.prediction.decisiontree.c45 | This module prunes a decision tree built by the C4.5 Tree Builder. Detailed Description: This module prunes a decision tree using a reduced-error pruning technique. Error estimates for the leaves and subtrees are computed by classifying all the examples of the Example Table. Both subtree replacement and subtree raising are used. Subtree replacement will replace a node by one of its leaves if the induced error of the replacement is less than the sum of the errors for the leaves of the node. Subtree raising will lift a subtree if the error for the raised subtree is less than the original. The complexity of pruning the tree is O(n (log n)2).References: C4.5: Programs for Machine Learning by J. Ross QuinlanData Type Restrictions: The Unpruned Root must be a DecisionTreeNode built by the C4.5 Tree Builder.Data Handling: This module will attempt to classify the examples in the Example Table N times, where N is the number of nodes in the tree.Scalability: This module will classify the examples in the Example Table at least once for each node of the tree. This module will need enough memory to hold those predictions. |
06/03/2010 |
| Parse File To Table | org.seasr.meandre.components.io.file.input | Given a FlatFileParser, this module creates a Table initialized with the contents of a flat file from disk. |
06/03/2010 |
| Table To Item Sets | org.seasr.meandre.components.discovery.ruleassociation | This module reads a Table and extracts from it items for use in mining association rules with the Apriori algorithm. Detailed Description: This module takes as input a Table or an Example Table, and extracts items that are used by the Apriori rule association algorithm. An item is an [attribute,value] pair that occurs in the input table. The module uses information from the original table to determine which attributes should be used to form items being considered as possible rule antecedents and rule consequents. A compact representation is created indicating which items are contained in rows in the original table. The items and other information used by the Apriori algorithm are written to the Item Sets output port. If a Table or an Example Table with no specified input or output attributes is loaded, all attributes (columns) will be used to form items being considered as possible antecedents and consequents for the association rules. If an Example Table with only input attributes or only output attributes is loaded, the chosen attributes will be used to form items considered as possible rule antecedents and possible rule consequents. If an Example Table with both input and output attributes is loaded, the inputs will be used to form items considered as possible rule antecedents, and the outputs used to form items considered as possible rule consequents. The computational complexity of the Apriori algorithm depends on the number of possible antecedents and consequents, so narrowing the search prior to this step is highly recommended. Use the module Choose Attributes to specify the subset of table attributes that are of interest. If the table has continuous attributes as possible rule antecedents or targets, a Binning module should be used prior to this module to reduce the number of possible values for those continuous attributes. In a typical itinerary the Item Sets output port from this module is connected to a Generate Multiple Outputs module and then to an Apriori module which forms frequent itemsets based on a minimum support value, and to a Compute Confidence module which forms association rules that satisfy a minimum confidence value. Limitations: The Apriori and Compute Confidence modules currently build rules with a single item in the consequent. Data Handling: This module does not modify the input Table in any way. Scalability: A representation of each row of the table is stored in memory. The representation is usually smaller than the original data. |
06/03/2010 |
| FeatureLensChunk | org.seasr.meandre.components.io.datasource.featurelens | Writes the Chunk table of FeatureLens to a database. |
06/03/2010 |
| Clustering Viz | org.seasr.meandre.components.vis.clustering | Clustering visualization. |
06/03/2010 |
| List Database ResultSet | org.seasr.meandre.components.io.datasource | Overview: |
06/03/2010 |
| Close Database Connection | org.seasr.meandre.components.io.datasource | Overview: |
06/03/2010 |
| ScalarizeNominals | org.seasr.meandre.components.transform.attribute | This module examines columns in a MutableTable and, for appropriate columns which contain nominal values, converts these single columns into multiple columns -- one for each possible value of the attribute.Detailed Description: If the input MutableTable implements the ExampleTable interface, only columns marked as inputs and outputs will be converted. Otherwise, all columns containing nominal values will be converted. Through a property of the module, the user can select whether the generated columns are double or boolean.Data Handling: This module modifies its input data; each relevant nominal column may be replaced with an arbitrary number of new ones. In addition, columns with blank labels are assigned default ones. |
06/03/2010 |
| Webdav Browser | org.seasr.meandre.components.io | Outputs the list of files in a WebDAV tree matching the specified criteria. |
06/03/2010 |
| WriteClusterDisplayPMML | org.seasr.meandre.components.discovery.cluster | Write a ClusterModel out in PMML(Predictive Model Markup Language) format, optimized for Processing Display Algorithm. |
06/03/2010 |
| XML To Graph From URL | org.seasr.meandre.components.io.graph | Overview: This module creates a Jung Graph object from a GraphML file. The input may be a local file or remote. Note: This module is the same as deprecated module XMLToGraphFromURL, extended to access the data through aDataObjectProxy. Acknowledgement: This module uses functionality from the JUNG project. See http://jung.sourceforge.net. |
06/03/2010 |
| PrepareForVis | org.seasr.meandre.components.prediction.naivebayes | Overview: This module prepares a NaiveBayesModel for visualization. Many calculations that are needed by the visualization are done here, before the model is visualized.Detailed Description: This module determines which of the input features of the training data are the best predictors of the output. This is done by performing predictions on the training data and leaving out one input feature each time. Data structures to hold values for the pie charts in NaiveBayesVis are also created here.Data Type Restrictions: noneData Handling: The data structures used to display pie charts in NaiveBayesVis are created by this module. These structures are stored in Naive Bayes Model. The number of pie charts is proportional to the number of discrete values in the inputs of the training data set.Scalability: This module will perform NxM predictions, where N is the number of inputs and M is the number of training examples. |
06/03/2010 |
| Dendrogram Vis | org.seasr.meandre.components.vis.dendrogram.gwt | Dendrogram visualization of SEASR cluster models. Properties: |
06/03/2010 |
| Table Viewer Old | org.seasr.meandre.components.vis.table | This component provides a table viewer for a data set. The input can come from a Vector |
06/03/2010 |
| ModelPredict | org.seasr.meandre.components.prediction | This module applies a prediction model to a table of examples and makes predictions for each output attribute based on the values of the input attributes.Detailed Description: This module applies a previously built model to a new set of examples that have the same attributes as those used to train/build the model. The module creates a new table that contains columns for each of the values the model predicts, in addition to the columns found in the original table. The new columns are filled in with values predicted by the model based on the values of the input attributes. |
06/03/2010 |
| AutoBin | org.seasr.meandre.components.transform.binning | Overview: Automatically discretize scalar data for the Naive Bayesian classification model.Detailed Description: Given a table of Examples, define the bins for each scalar input column. When binning Uniformly, the number of bins is determined by Number of Bins property, and the boundaries of the bins are set so that they divide evenly over the range of the binned column.When binning by weight, Number of Items per Bin sets the size of each bin. The values are then binned so that in each bin there is the same number of items. For more details see description of property Number of Items per Bin'.Data Handling: This module does not modify the input data.Scalability: The module requires enough memory to make copies of each of the scalar input columns. |
06/03/2010 |
| Write Table to Database | org.seasr.meandre.components.io.datasource | Overview: |
06/03/2010 |
| WebDecisionTreeVis | org.seasr.meandre.applet.prediction.decisiontree | Given a ViewableDTModel, displays the structure and contents of the nodes of the decision tree. The Navigator on the left shows a small view of the entire tree. The main area shows an expanded view of the tree. For more information look up the help provided in the UI of the module |
06/03/2010 |
| SOAPClient | org.seasr.meandre.components.io.webservice | Consume web service written in MS .NET. |
06/03/2010 |
| Choose Attributes | org.seasr.meandre.components.vis.transform.attribute | This module allows the user to choose which columns of a table are inputs and outputs. Detailed Description: This module outputs an Example Table with the input and output features assigned. Inputs and outputs do not have to be selected, nor do they have to be mutually exclusive. Data Handling: This module does not modify the data in the table. It only sets the input and output features. |
06/03/2010 |
| SimpleTrainTest | org.seasr.meandre.components.transform.table | This module generates a training table and a testing table from the original table. Detailed Description: This module presents the user with property setting which allows them to specify the percentages of the original table examples that should be used to build train and test tables. The user can specify whether the train and test examples are selected at random or sequentially from the beginning (train data) and the end (test data) of the original examples. If the examples are selected randomly, the user can specify the seed used by the random number generator. If the train and test percentages sum to more than 100 percent, some examples will appear in both the train and test tables. The train and test percentages can be designated through the property setting. Data Type Restrictions: Although this module works with tables containing any type of data, many supervised learning algorithms will work only on doubles. If one of these algorithms is to be used, the conversion to floating point data should take place prior to this module. Data Handling: This module does not change the original data. It creates an instance of an example table that manages the data data differently. Scalability: This module should scale linearly with the number of rows in the table. The module needs to be able to allocate arrays of integers to hold the indices of the test and train examples. |
06/03/2010 |
| FeatureLensCollection | org.seasr.meandre.components.io.datasource.featurelens | Writes the Collection table of FeatureLens to a database. Specifically setup to take a directory name and create an row in the database for the top level and then create a row for each subdirectory with a link to the 1st row as parent. |
06/03/2010 |
| Triples To File | org.seasr.meandre.components.transform.metadatastore | Saves RDF triples to file |
06/03/2010 |
| ObjectSerializer | org.seasr.meandre.components.io.file.output | Serializes an object to a string using xstream |
06/03/2010 |
| AxisClient | org.seasr.meandre.components.io.webservice | Consume web service written in Java. |
06/03/2010 |
| FeatureLensPattern | org.seasr.meandre.components.io.datasource.featurelens | Writes the Pattern table of FeatureLens to a database. |
06/03/2010 |
| CreateNBModel | org.seasr.meandre.components.prediction.naivebayes | Overview: Generates a NaiveBayesModel from the given BinTree. The Naive Bayes Model performs all necessary calculations.Detailed Description: Given a BinTree object that contains counts for each discrete item in the training data set, this module creates a Naive Bayesian learning model. This method is based on Bayes's rule for conditional probability. It "naively" assumes independence of the input features.Data Type Restrictions: This model can only use nominal data as the inputs and can only classify one nominal output. The binning procedure will discretize any scalar inputs in the training data, but the output data is not binned and should be nominal. If the output data is binned, visualizations and prediction generated by the created model might be wrong and/or too corrupted to be displayed.Data Handling: The input data is neither modified nor destroyed.Missing Values Handling: Output data should be clean of missing values.Scalability: The module utilizes the counts in the BinTree, and as such does not perform any significant computations. |
06/03/2010 |
| List Database Tables | org.seasr.meandre.components.io.datasource | Overview: |
06/03/2010 |
| Connect to Database | org.seasr.meandre.components.io.datasource | Overview: |
06/03/2010 |
| CreateBinTree | org.seasr.meandre.components.transform.binning | Given a Binning Transformation containing the definition of the bins, and an Example Table that has the input/ output attribute labels and types, this module builds a Bin Tree that can be later used to classify data. A Bin Tree holds information about the number of examples that fall into each bin for each class. The Bin Tree can use only one output feature as a class. If more are selected in the Example Table, only the first one will be used.Scalability: a large enough number of features will result in an OutOfMemory error. Use feature selection to reduce the number of features. |
06/03/2010 |
| Patterns To Table | org.seasr.meandre.components.vis.ruleassociation | Generates a table containing two columns: pattern and support |
06/03/2010 |
| Compute Confidence | org.seasr.meandre.components.discovery.ruleassociation | This module works in conjunction with other modules implementing the Apriori rule association algorithm to generate association rules satisfying a minimum confidence threshold. Detailed Description: This module takes as input an Item Sets object generated by the Table To Item Sets module, and Frequent Itemsets generated by the Apriori or FPGrowth module. From these inputs, it develops a set of possible association rules, each with a single target item, where an item consists of an [attribute,value] pair. For each possible rule, this module computes the Confidence in the prediction, and accepts those rules that meet a minimum confidence threshold specified via the property editor. For a rule of the form Antecedent A implies Consequent C, the Confidence is the percentage of examples in the original data that contain A that also contain C. The formula to compute the confidence of the rule A->C is: Limitations: The Apriori, FPGrowthand Compute Confidence modules currently build rules with a single item in the consequent. Scalability: This module searches all the Items Sets to compute the confidence for each Frequent Itemset. The module allocated memory for the resulting Rule Table. |
06/03/2010 |
| Sparse Table Factory Injector | org.seasr.meandre.components.io.table.sparse | Overview: Outputs a TableFactory suitable for creating D2K Tables from the org.seasr.datatypes.datamining.table.sparse package. |
06/03/2010 |
| Query Database | org.seasr.meandre.components.io.datasource | Overview: |
06/03/2010 |
| WebNaiveBayesVis | org.seasr.meandre.applet.prediction.naivebayes | Overview: An evidence visualization for a NaiveBayesModel. Detailed Description: This evidence visualization shows pie charts that represent the different bins used. The slices of the pie chart represent the ratios of the output classes for that particular bin. Selecting a chart in the evidence section will update the conclusion pie chart. The conclusion shows the probability that each output has for classification given all the selected evidence charts.Data Type Restrictions: noneData Handling: This module does not destroy or modify the input data.Scalability: This module keeps data structures to represent the evidence charts. The amount of memory required is proportional to the number of bins used in the discretization process. |
06/03/2010 |
| Apply Transformation | org.seasr.meandre.components.transform | This module applies a Transformation to a Table. Detailed Description: This module applies a Transformation to a MutableTable and outputs the transformed table as a MutableTable. Data Handling: This module modifies the input Table |
06/03/2010 |
Add Comment