edu.stanford.nlp.dbm
Class Ohsumed

java.lang.Object
  |
  +--java.util.AbstractCollection
        |
        +--java.util.AbstractList
              |
              +--edu.stanford.nlp.dbm.AbstractDataCollection
                    |
                    +--edu.stanford.nlp.dbm.FileDataCollection
                          |
                          +--edu.stanford.nlp.dbm.Ohsumed
All Implemented Interfaces:
Collection, DataCollection, List

public class Ohsumed
extends FileDataCollection

Provides methods for populating a DBM from a file containing all Ohsumed documents.


Field Summary
 
Fields inherited from class edu.stanford.nlp.dbm.FileDataCollection
datamatrix, numdocs
 
Fields inherited from class edu.stanford.nlp.dbm.AbstractDataCollection
data, features, name
 
Fields inherited from class java.util.AbstractList
modCount
 
Constructor Summary
Ohsumed(List db, String filename)
           
Ohsumed(String filename)
          Creates an Ohsumed document collection from a file that contains a collection of Ohsumed articles in XML representation.
Ohsumed(String filename, String UIDfile, String outfile)
          Creates an Ohsumed collection containing only the documents whose UIDs are listed in UIDfile.
 
Method Summary
 Iterator fileIterator()
          Returns an iterator over the documents in the document collection.
 
Methods inherited from class edu.stanford.nlp.dbm.FileDataCollection
add, dataMatrix, features, populate, set, set, set, setDefaults, setDefaults
 
Methods inherited from class edu.stanford.nlp.dbm.AbstractDataCollection
get, name, size, toString, toXMLString
 
Methods inherited from class java.util.AbstractList
add, add, addAll, clear, equals, hashCode, indexOf, iterator, lastIndexOf, listIterator, listIterator, remove, removeRange, set, subList
 
Methods inherited from class java.util.AbstractCollection
addAll, contains, containsAll, isEmpty, remove, removeAll, retainAll, toArray, toArray
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
 
Methods inherited from interface java.util.List
add, add, addAll, addAll, clear, contains, containsAll, equals, hashCode, indexOf, isEmpty, iterator, lastIndexOf, listIterator, listIterator, remove, remove, removeAll, retainAll, set, subList, toArray, toArray
 

Constructor Detail

Ohsumed

public Ohsumed(String filename,
               String UIDfile,
               String outfile)
Creates an Ohsumed collection containing only the documents whose UIDs are listed in UIDfile. Writes the new collection to outfile.
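
The UID-based selection described above can be sketched in plain Java. This is only an illustration of the filtering idea, not the class's actual implementation: the class name UidFilterSketch, the method filterByUid, and the in-memory map standing in for the parsed input file are all hypothetical, and the formats of UIDfile and outfile are not specified on this page.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: none of these names come from edu.stanford.nlp.dbm.
final class UidFilterSketch {

    // Keeps only the entries whose key (the document UID) is in wantedUids,
    // preserving the order of the input collection.
    static Map<String, String> filterByUid(Map<String, String> docsByUid,
                                           Set<String> wantedUids) {
        Map<String, String> kept = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : docsByUid.entrySet()) {
            if (wantedUids.contains(entry.getKey())) {
                kept.put(entry.getKey(), entry.getValue());
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Assumption: documents are already parsed into UID -> text pairs.
        Map<String, String> docs = new LinkedHashMap<>();
        docs.put("1001", "first abstract");
        docs.put("1002", "second abstract");
        docs.put("1003", "third abstract");

        // Assumption: these UIDs stand in for the contents of UIDfile.
        Set<String> wanted = new HashSet<>(Arrays.asList("1001", "1003"));

        // The real constructor would write the result to outfile instead.
        System.out.println(filterByUid(docs, wanted).keySet());
    }
}
```

A hash set is used for the UID lookup so that each document is checked in constant time, which matters when the input file holds the full collection.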


Ohsumed

public Ohsumed(String filename)
        throws IOException
Creates an Ohsumed document collection from a file that contains a collection of Ohsumed articles in XML representation. Stores the data, the features, and their respective indices in memory. Uses the standard bag-of-features matrix and the Lowercaser.
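
As a rough illustration of what a bag-of-features matrix over lowercased text might look like, here is a minimal sketch. It assumes that features are whitespace-separated tokens and that matrix cells hold raw counts; neither assumption is confirmed by this page. The class BagOfFeaturesSketch and its fields are hypothetical and are not part of edu.stanford.nlp.dbm.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: not the library's matrix or Lowercaser implementation.
final class BagOfFeaturesSketch {

    // Maps each distinct lowercased token to a column index,
    // in order of first appearance.
    final Map<String, Integer> featureIndex = new LinkedHashMap<>();

    // counts[d][f] is the number of times feature f occurs in document d.
    final int[][] counts;

    BagOfFeaturesSketch(List<String> documents) {
        List<String[]> tokenized = new ArrayList<>();
        for (String doc : documents) {
            // Assumption: lowercase first, then split on whitespace.
            String normalized = doc.trim().toLowerCase(Locale.ROOT);
            String[] tokens = normalized.isEmpty()
                    ? new String[0]
                    : normalized.split("\\s+");
            for (String token : tokens) {
                featureIndex.putIfAbsent(token, featureIndex.size());
            }
            tokenized.add(tokens);
        }

        // Second pass: fill the document-by-feature count matrix.
        counts = new int[documents.size()][featureIndex.size()];
        for (int d = 0; d < tokenized.size(); d++) {
            for (String token : tokenized.get(d)) {
                counts[d][featureIndex.get(token)]++;
            }
        }
    }

    public static void main(String[] args) {
        BagOfFeaturesSketch matrix = new BagOfFeaturesSketch(
                Arrays.asList("Heart heart disease", "Disease treatment"));
        System.out.println(matrix.featureIndex);
        System.out.println(Arrays.deepToString(matrix.counts));
    }
}
```

Lowercasing before indexing means "Heart" and "heart" share one column, which is the effect the description attributes to the Lowercaser.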


Ohsumed

public Ohsumed(List db,
               String filename)
        throws IOException


Method Detail

fileIterator

public Iterator fileIterator()
Returns an iterator over the documents in the document collection.

Specified by:
fileIterator in class FileDataCollection


Stanford NLP Group