edu.stanford.nlp.dbm
Class DataSet

java.lang.Object
  |
  +--java.util.AbstractCollection
        |
        +--java.util.AbstractList
              |
              +--edu.stanford.nlp.dbm.AbstractDataCollection
                    |
                    +--edu.stanford.nlp.dbm.FileDataCollection
                          |
                          +--edu.stanford.nlp.dbm.DataSet
All Implemented Interfaces:
Collection, DataCollection, List
Direct Known Subclasses:
ContextSet

public abstract class DataSet
extends FileDataCollection

A Data Collection that does not allow duplicate data. If a datum occurs multiple times, its feature vectors are additively combined.
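
The additive combination described above can be illustrated with a minimal, self-contained sketch. It does not use the edu.stanford.nlp.dbm classes: the class name AdditiveMergeSketch, the use of a String key to identify a datum, and the map-based sparse feature vector are all assumptions made for illustration, and the actual DataSet implementation may differ.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical stand-in for a duplicate-free collection; not the edu.stanford.nlp.dbm API. */
public class AdditiveMergeSketch {
    // Maps a datum key to its sparse feature vector (feature name -> value).
    private final Map<String, Map<String, Double>> vectors = new LinkedHashMap<>();

    /** Adds a datum; if the key is already present, feature values are summed. */
    public void add(String key, Map<String, Double> features) {
        Map<String, Double> stored = vectors.computeIfAbsent(key, k -> new HashMap<>());
        for (Map.Entry<String, Double> e : features.entrySet()) {
            stored.merge(e.getKey(), e.getValue(), Double::sum);
        }
    }

    /** Number of distinct data held. */
    public int size() {
        return vectors.size();
    }

    /** Value of one feature for one datum, or 0.0 if either is absent. */
    public double value(String key, String feature) {
        Map<String, Double> v = vectors.get(key);
        return v == null ? 0.0 : v.getOrDefault(feature, 0.0);
    }

    public static void main(String[] args) {
        AdditiveMergeSketch set = new AdditiveMergeSketch();
        set.add("doc1", Map.of("cat", 1.0, "dog", 2.0));
        set.add("doc1", Map.of("dog", 3.0, "fish", 1.0)); // same datum again
        System.out.println(set.size());               // prints 1
        System.out.println(set.value("doc1", "dog")); // prints 5.0
    }
}
```

Adding the same key twice leaves one entry whose feature values are the sums of the two vectors, which is the behavior the class description suggests.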


Field Summary
 
Fields inherited from class edu.stanford.nlp.dbm.FileDataCollection
datamatrix, numdocs
 
Fields inherited from class edu.stanford.nlp.dbm.AbstractDataCollection
data, features, name
 
Fields inherited from class java.util.AbstractList
modCount
 
Constructor Summary
DataSet(String filename)
          Creates a DataSet from the named file, which contains a collection of DataSet articles in XML representation.
 
Method Summary
 int add(Datum d)
          Inserts a Datum into the Data Collection, assigns it to the lowest unassigned index in FileDataCollection, and returns this index. Note: this allows duplicate objects to be stored with different indices.
protected  void setDefaults(String filename)
          Sets DataMatrix, List, and Processors to default values, where features and data are stored in memory.
protected  void setDefaults(String filename, String dir)
          Sets DataMatrix, List, and Processors to default values, where features and data are stored on disk in directory dir.
 
Methods inherited from class edu.stanford.nlp.dbm.FileDataCollection
dataMatrix, features, fileIterator, populate, set, set, set
 
Methods inherited from class edu.stanford.nlp.dbm.AbstractDataCollection
get, name, size, toString, toXMLString
 
Methods inherited from class java.util.AbstractList
add, add, addAll, clear, equals, hashCode, indexOf, iterator, lastIndexOf, listIterator, listIterator, remove, removeRange, set, subList
 
Methods inherited from class java.util.AbstractCollection
addAll, contains, containsAll, isEmpty, remove, removeAll, retainAll, toArray, toArray
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
 
Methods inherited from interface java.util.List
add, add, addAll, addAll, clear, contains, containsAll, equals, hashCode, indexOf, isEmpty, iterator, lastIndexOf, listIterator, listIterator, remove, remove, removeAll, retainAll, set, subList, toArray, toArray
 

Constructor Detail

DataSet

public DataSet(String filename)
        throws IOException
Creates a DataSet from the named file, which contains a collection of DataSet articles in XML representation. Stores data, features, and their respective indices in memory.

Method Detail

setDefaults

protected void setDefaults(String filename)
Sets DataMatrix, List, and Processors to default values, where features and data are stored in memory.

Overrides:
setDefaults in class FileDataCollection

setDefaults

protected void setDefaults(String filename,
                           String dir)
                    throws IOException
Sets DataMatrix, List, and Processors to default values, where features and data are stored on disk in directory dir.

Overrides:
setDefaults in class FileDataCollection
Throws:
IOException

add

public int add(Datum d)
Inserts a Datum into the Data Collection, assigns it to the lowest unassigned index in FileDataCollection, and returns this index. Note: this allows duplicate objects to be stored with different indices.

Specified by:
add in interface DataCollection
Overrides:
add in class FileDataCollection
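
The index assignment described for add can be sketched as follows. This is an illustration under stated assumptions, not the FileDataCollection implementation: the class LowestIndexSketch, the clearSlot helper, the use of String in place of Datum, and the idea that a null slot marks an unassigned index are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration of "lowest unassigned index" assignment; not the edu.stanford.nlp.dbm API. */
public class LowestIndexSketch {
    // A null entry marks an index that is currently unassigned (assumption).
    private final List<String> slots = new ArrayList<>();

    /** Stores the datum at the lowest free index and returns that index. */
    public int add(String datum) {
        for (int i = 0; i < slots.size(); i++) {
            if (slots.get(i) == null) {
                slots.set(i, datum);
                return i;
            }
        }
        slots.add(datum); // no gap found, so append at the end
        return slots.size() - 1;
    }

    /** Frees an index so that a later add can reuse it (hypothetical helper). */
    public void clearSlot(int index) {
        slots.set(index, null);
    }

    public String get(int index) {
        return slots.get(index);
    }

    public static void main(String[] args) {
        LowestIndexSketch c = new LowestIndexSketch();
        System.out.println(c.add("a")); // prints 0
        System.out.println(c.add("b")); // prints 1
        c.clearSlot(0);
        System.out.println(c.add("b")); // prints 0: equal object, different index
    }
}
```

Because the sketch never compares a new datum with stored ones, equal objects end up at different indices, matching the note in the method description.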


Stanford NLP Group