edu.stanford.nlp.dbm
Class Ohsumed
java.lang.Object
|
+--java.util.AbstractCollection
|
+--java.util.AbstractList
|
+--edu.stanford.nlp.dbm.AbstractDataCollection
|
+--edu.stanford.nlp.dbm.FileDataCollection
|
+--edu.stanford.nlp.dbm.Ohsumed
- All Implemented Interfaces:
- Collection, DataCollection, List
- public class Ohsumed
- extends FileDataCollection
Contains methods dealing with populating a DBM given a file containing
all Ohsumed documents.
Constructor Summary |
Ohsumed(List db,
String filename)
|
Ohsumed(String filename)
Creates a Ohsumed document collection given a filename which contains a collection of Ohsumed articles in XML representation. |
Ohsumed(String filename,
String UIDfile,
String outfile)
Creates an Ohsumed Collection with only the documents with UIDs in UIDfile. |
Method Summary |
Iterator |
fileIterator()
returns an iterator over documents in the document collection |
Methods inherited from class java.util.AbstractList |
add, add, addAll, clear, equals, hashCode, indexOf, iterator, lastIndexOf, listIterator, listIterator, remove, removeRange, set, subList |
Methods inherited from interface java.util.List |
add, add, addAll, addAll, clear, contains, containsAll, equals, hashCode, indexOf, isEmpty, iterator, lastIndexOf, listIterator, listIterator, remove, remove, removeAll, retainAll, set, subList, toArray, toArray |
Ohsumed
public Ohsumed(String filename,
String UIDfile,
String outfile)
- Creates an Ohsumed Collection with only the documents with UIDs in UIDfile. Prints new Ohsumed Collection to
outfile
Ohsumed
public Ohsumed(String filename)
throws IOException
- Creates a Ohsumed document collection given a filename which contains a collection of Ohsumed articles in XML representation. Stores data, features, and their respective indices in memory. Uses standard Bag-of-Features Matrix, and uses Lowercaser.
Ohsumed
public Ohsumed(List db,
String filename)
throws IOException
fileIterator
public Iterator fileIterator()
- returns an iterator over documents in the document collection
- Specified by:
fileIterator
in class FileDataCollection
Stanford NLP Group