|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||
See:
Description
| Interface Summary | |
| Classifier | |
| ClassifierB | An interface for a classifier. |
| Filter | A Filter takes a String object and returns the String after having applied the proper filter to it. |
| Smoother | Interface for classes which smooth a matrix |
| SmootherB | The Smoother should return a smoothed ProbabilitySet. |
| TokenReader | Title: Naive Bayes Classifier Description: An interface for converting something into a continuous string Copyright: Copyright (c) 2001 Company: |
| Class Summary | |
| AbstractClassifier | |
| AddEpsilonSmoother | Smooths a ProbabilitySet using a HashMap vocabulary. |
| AddOneSmoother | |
| ButFilter | This (very primitive) filter removes everything from the beginning of a sentence until the word "but". |
| CategoryMatrix | Wrapper for a Matrix whose columns are P(w|c). |
| Classification | This class holds the classification assigned to a FeatureSet |
| Classify | Takes two arguments: the first is a directory where the training sets are held. |
| ClassProbability | This class holds the counts and probabilities for the features of a single class (classification category). |
| ClassProbBuilder | Accepts a feature vector and a class id, and returns a ClassProbability |
| CrossValidator | This class takes a probability set and does cross-validation for as many folds as desired. |
| Feature | A wrapper for a String which allows you to add other attributes, such as weights, to a Feature. |
| FeatureAdder | This class takes a ProbabilitySet and the number of classes, and returns a new ProbabilitySet where each ClassProbability represents the probabilities for the entire class. |
| FeatureMaker | This class accepts a string and returns a HashMap that holds the counts for each token in the string. |
| FeatureSet | This class stores pairs of features and counts. |
| FeatureSetBuilder | Given a HashMap of Features, and an ID (true classification), this class can create a FeatureSet. |
| FileSorter | Takes a file that contains a set of lists of double/key pairs, one to a line, and sorts them. |
| FirstNFilter | This filter is used if you only want to use the first n sentences in a document. |
| GTPreparer | This class prepares the ProbabilitySet to_be_smooted for smoothing, and calls the Good Turing smoother |
| ID | |
| InfomapSmoother | |
| KNN | K nearest-neighbors classifier. |
| LinkedList | A linked list class. |
| MassVerifier | Tracks the statistics for a set of test documents. |
| MedlineHandler | |
| NaiveBayesClassifier | Naive Bayes Classifier. |
| NBClassifier | The classifier takes a set of (smoothed) probability vectors and a test vector, and classifies the test vector according to the probability vectors |
| NegativeFilter | This filter does some very primitive negative scoping, using the same method as the Almaden code. |
| Node | For the LinkedList? |
| POSFilter | A filter that takes a of tagged text string and a list of parts of speech to be kept, and returns the string that includes only those part of speech. |
| POSPrepFilter | Title: Description: Copyright: Copyright (c) 2001 Company: |
| POSTokenReader | This class reads in Epinions data that has been POS tagged using the Qtag system. |
| ProbabilitySet | This class manipulates an array of ClassProbabilities. |
| RemoveSpacesFilter | This filter removes the spaces in the middle of the POS tags, so that tokenization will happen properly. |
| ScoreHandler | |
| SimpleGoodTuring | |
| SortedArray | Maintains a sorted array of ints. |
| Tester | Builds a FeatureSet for the files in the directory "parent_dir" and attempts to classify them using the ProbabilitySet my_ps that was created during training. |
| TokenBreaker | Title: Naive Bayes Classifier Description: Puts a space between all tokens Copyright: Copyright (c) 2001 Company: |
| Trainer | Builds a ProbabilitySet for the classes in String[] my_dirnames and smooths that ProbabilitySet. |
| Verifier | Compares the classification of a FeatureSet (true value) to the classification of a Classification (predicted value) |
| VocabBuilder | Takes a ProbabilitySet and builds a hash table that contains the entire vocabulary. |
Data Structures and Algorithms for Classification. This package is currently being modified. Currently implements KNN and a passable but not good version of Naive Bayes. Right now it also has a lot of stuff in there that doesn't belong (POSFilter, Smoothers, LinkedList, NegativeFilter, etc.).
|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||