edu.mit.csail.brill
Class Sentence

java.lang.Object
  |
  +--edu.mit.csail.brill.Sentence

public class Sentence
extends Object

Object representing a tagged sentence. It is essentially a collection of TaggedTokens.

Author:
Jimmy Lin

Constructor Summary
Sentence(String original, String tokenized, String[][] tokens)
          Creates a sentence object.
Sentence(String original, String tokenized, edu.mit.csail.brill.TaggedToken[] tokens)
          Creates a sentence object.
 
Method Summary
 boolean equals(Object obj)
           
 boolean equals(edu.mit.csail.brill.Sentence s)
           
 String getOriginalSentence()
          Returns the original untokenized sentence.
 edu.mit.csail.brill.TaggedToken getToken(int i)
          Returns the ith token.
 String getTokenizedSentence()
          Returns the tokenized sentence.
 int getTokenOffset(int i)
          Returns the character offset of the ith token.
 int getTokenOffset(edu.mit.csail.brill.TaggedToken tok)
          Returns the character offset of a token.
 Iterator getTokens()
          Returns an interator over the TaggedTokens.
 int hashCode()
           
 int numTokens()
          Returns the number of tokens.
 String toString()
           
 
Methods inherited from class java.lang.Object
getClass, notify, notifyAll, wait, wait, wait
 

Constructor Detail

Sentence

public Sentence(String original,
                String tokenized,
                edu.mit.csail.brill.TaggedToken[] tokens)
Creates a sentence object.

Parameters:
original - original untokenized sentence
tokenized - tokenized sentence
tokens - array of tokens

Sentence

public Sentence(String original,
                String tokenized,
                String[][] tokens)
Creates a sentence object.

Parameters:
original - original untokenized sentence
tokenized - tokenized sentence
tokens - array of (tokens, tag) pairs
Method Detail

getOriginalSentence

public String getOriginalSentence()
Returns the original untokenized sentence.


getTokenizedSentence

public String getTokenizedSentence()
Returns the tokenized sentence.


getTokens

public Iterator getTokens()
Returns an interator over the TaggedTokens.


getTokenOffset

public int getTokenOffset(edu.mit.csail.brill.TaggedToken tok)
Returns the character offset of a token.


getTokenOffset

public int getTokenOffset(int i)
Returns the character offset of the ith token.


getToken

public edu.mit.csail.brill.TaggedToken getToken(int i)
Returns the ith token.


numTokens

public int numTokens()
Returns the number of tokens.


toString

public String toString()
Overrides:
toString in class Object

hashCode

public int hashCode()
Overrides:
hashCode in class Object

equals

public boolean equals(Object obj)
Overrides:
equals in class Object

equals

public boolean equals(edu.mit.csail.brill.Sentence s)