edu.stanford.nlp.trees
Class PennSentenceMrgNormalizer

java.lang.Object
  |
  +--edu.stanford.nlp.trees.SentenceNormalizer
        |
        +--edu.stanford.nlp.trees.PennSentenceNormalizer
              |
              +--edu.stanford.nlp.trees.PennSentenceMrgNormalizer

public class PennSentenceMrgNormalizer
extends PennSentenceNormalizer

A class for sentence normalization. This one knows about the funny things in Penn Treebank pos files -- like lots of equals signs and square brackets. Additionally, it recodes brackets as strings like -LRB- so as to make them the same as the encoding used in the parsed files. A Singleton.


Constructor Summary
PennSentenceMrgNormalizer()
           
 
Method Summary
 String normalizeString(String word)
          Normalizes a read string word (and maybe intern it).
 
Methods inherited from class edu.stanford.nlp.trees.PennSentenceNormalizer
endSentenceToken, normalizeSentence
 
Methods inherited from class edu.stanford.nlp.trees.SentenceNormalizer
eolIsSentenceEnd
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PennSentenceMrgNormalizer

public PennSentenceMrgNormalizer()
Method Detail

normalizeString

public String normalizeString(String word)
Normalizes a read string word (and maybe intern it).

Overrides:
normalizeString in class PennSentenceNormalizer
Parameters:
word - The word to normalize
Returns:
The normalized form


Stanford NLP Group