java.lang.Object
  edu.stanford.nlp.trees.AbstractTreebankLanguagePack
    edu.stanford.nlp.trees.international.pennchinese.ChineseTreebankLanguagePack
public class ChineseTreebankLanguagePack
Language pack for the Chinese treebank. (Consider using native2ascii to edit this file as a GB-encoded file.)
| Field Summary | |
|---|---|
| static String | ENCODING |
| Fields inherited from class edu.stanford.nlp.trees.AbstractTreebankLanguagePack |
|---|
| DEFAULT_ENCODING |
| Constructor Summary | |
|---|---|
| ChineseTreebankLanguagePack() | |
| Method Summary | | |
|---|---|---|
| static Filter | chineseColonAcceptFilter() | |
| static Filter | chineseCommaAcceptFilter() | |
| static Filter | chineseDashAcceptFilter() | |
| static Filter | chineseDouHaoAcceptFilter() | |
| static Filter | chineseEndSentenceAcceptFilter() | |
| static Filter | chineseLeftParenthesisAcceptFilter() | |
| static Filter | chineseLeftQuoteMarkAcceptFilter() | |
| static Filter | chineseOtherAcceptFilter() | |
| static Filter | chineseParenthesisAcceptFilter() | |
| static Filter | chineseQuoteMarkAcceptFilter() | |
| static Filter | chineseRightParenthesisAcceptFilter() | |
| static Filter | chineseRightQuoteMarkAcceptFilter() | |
| String | getEncoding() | Returns the input Charset encoding for the Treebank. |
| TokenizerFactory | getTokenizerFactory() | Returns a tokenizer factory that may be suitable for tokenizing text used with this Treebank/Language pair, without tokenizing carriage returns (i.e., treating them as white space). |
| GrammaticalStructureFactory | grammaticalStructureFactory() | Returns a GrammaticalStructureFactory suitable for this language/treebank. |
| boolean | isEvalBIgnoredPunctuationTag(String str) | Accepts a String that is a punctuation tag that should be ignored by EVALB-style evaluation, and rejects everything else. |
| boolean | isPunctuationTag(String str) | Accepts a String that is a punctuation tag name, and rejects everything else. |
| boolean | isPunctuationWord(String str) | Accepts a String that is a punctuation word, and rejects everything else. |
| boolean | isSentenceFinalPunctuationTag(String str) | Accepts a String that is a sentence-final punctuation tag, and rejects everything else. |
| char[] | labelAnnotationIntroducingCharacters() | Returns an array of characters at which a String should be truncated to give the basic syntactic category of a label. |
| String[] | punctuationTags() | Returns a String array of punctuation tags for this treebank/language. |
| String[] | punctuationWords() | Returns a String array of punctuation words for this treebank/language. |
| String[] | sentenceFinalPunctuationTags() | Returns a String array of sentence-final punctuation tags for this treebank/language. |
| String[] | sentenceFinalPunctuationWords() | Returns a String array of sentence-final punctuation words for this treebank/language. |
| static void | setTokenizerFactory(TokenizerFactory tf) | |
| String[] | startSymbols() | Returns a String array of treebank start symbols. |
| String | treebankFileExtension() | Returns the extension of treebank files for this treebank. |
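
The description of labelAnnotationIntroducingCharacters() above says a label is truncated at one of the returned characters to give its basic syntactic category. The following is a minimal, self-contained sketch of that idea. It does not call the library: the class name `LabelTruncationSketch`, the helper `basicCategory`, and the character set `{'-', '='}` are all assumptions made for illustration, and the array actually returned by ChineseTreebankLanguagePack may differ.

```java
// Sketch only: none of these names or values come from the library itself.
final class LabelTruncationSketch {
    // Hypothetical annotation-introducing characters (an assumption).
    static final char[] ANNOTATION_CHARS = {'-', '='};

    /** Returns the label cut off just before the first annotation-introducing character. */
    static String basicCategory(String label, char[] annotationChars) {
        // Start at index 1 so that a label beginning with such a character
        // is not reduced to the empty string (a design choice of this sketch).
        for (int i = 1; i < label.length(); i++) {
            for (char c : annotationChars) {
                if (label.charAt(i) == c) {
                    return label.substring(0, i);
                }
            }
        }
        return label; // no annotation found: the label is already basic
    }

    public static void main(String[] args) {
        System.out.println(basicCategory("NP-SBJ", ANNOTATION_CHARS)); // prints "NP"
        System.out.println(basicCategory("VP", ANNOTATION_CHARS));     // prints "VP"
    }
}
```

With the real class, the character array would presumably come from labelAnnotationIntroducingCharacters() rather than from a hard-coded constant.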
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
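
The summary pairs array-returning methods such as punctuationTags() and punctuationWords() with boolean tests such as isPunctuationTag(String) and isPunctuationWord(String) that accept a matching String and reject everything else. One plausible way to relate the two is a simple membership test, sketched below. This is not the library's implementation: the class `PunctuationSketch` is hypothetical, and the tag `"PU"` and the two punctuation words are assumed values chosen only to make the example run.

```java
import java.util.Arrays;

// Sketch only: a stand-in showing "accepts members, rejects everything else".
final class PunctuationSketch {
    // Assumed values for illustration; the real arrays may differ.
    private static final String[] PUNCTUATION_TAGS = {"PU"};
    // "\uFF0C" is the fullwidth comma, "\u3002" the ideographic full stop.
    private static final String[] PUNCTUATION_WORDS = {"\uFF0C", "\u3002"};

    static String[] punctuationTags() {
        return PUNCTUATION_TAGS.clone(); // defensive copy
    }

    static String[] punctuationWords() {
        return PUNCTUATION_WORDS.clone();
    }

    static boolean isPunctuationTag(String str) {
        return Arrays.asList(PUNCTUATION_TAGS).contains(str);
    }

    static boolean isPunctuationWord(String str) {
        return Arrays.asList(PUNCTUATION_WORDS).contains(str);
    }

    public static void main(String[] args) {
        System.out.println(isPunctuationTag("PU"));      // prints "true"
        System.out.println(isPunctuationTag("NN"));      // prints "false"
        System.out.println(isPunctuationWord("\u3002")); // prints "true"
    }
}
```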
| Field Detail |
|---|
public static final String ENCODING
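
ENCODING is a String, and getEncoding() is described as returning the input Charset encoding for the Treebank. A caller would typically turn such a name into a `java.nio.charset.Charset` before decoding treebank bytes. The sketch below shows that step using only the JDK. The class `EncodingSketch` and the value `"UTF-8"` are assumptions for illustration; this page does not state the actual value of ENCODING.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Sketch only: shows how an encoding name can drive decoding of raw bytes.
final class EncodingSketch {
    // Assumed encoding name; NOT taken from ChineseTreebankLanguagePack.ENCODING.
    static final String ASSUMED_ENCODING = "UTF-8";

    /** Decodes raw bytes using the charset with the given name. */
    static String decode(byte[] raw, String encodingName) {
        // Charset.forName throws an unchecked exception for unknown or illegal names.
        Charset charset = Charset.forName(encodingName);
        return new String(raw, charset);
    }

    public static void main(String[] args) {
        // "\u4E2D\u6587" is the two-character word for "Chinese (language)".
        byte[] raw = "\u4E2D\u6587".getBytes(StandardCharsets.UTF_8);
        String text = decode(raw, ASSUMED_ENCODING);
        System.out.println(text.length()); // prints "2"
    }
}
```

With the real class, the name passed to `decode` would presumably be the result of getEncoding() on a language pack instance.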
| Constructor Detail |
|---|
public ChineseTreebankLanguagePack()
| Method Detail |
|---|
public static void setTokenizerFactory(TokenizerFactory tf)

public TokenizerFactory getTokenizerFactory()
See also: AbstractTreebankLanguagePack, WhitespaceTokenizer.
getTokenizerFactory in interface TreebankLanguagePack
getTokenizerFactory in class AbstractTreebankLanguagePack

public String getEncoding()
See also: the Charset class.
getEncoding in interface TreebankLanguagePack
getEncoding in class AbstractTreebankLanguagePack

public boolean isPunctuationTag(String str)
isPunctuationTag in interface TreebankLanguagePack
isPunctuationTag in class AbstractTreebankLanguagePack

public boolean isPunctuationWord(String str)
isPunctuationWord in interface TreebankLanguagePack
isPunctuationWord in class AbstractTreebankLanguagePack

public boolean isSentenceFinalPunctuationTag(String str)
isSentenceFinalPunctuationTag in interface TreebankLanguagePack
isSentenceFinalPunctuationTag in class AbstractTreebankLanguagePack

public String[] punctuationTags()
punctuationTags in interface TreebankLanguagePack
punctuationTags in class AbstractTreebankLanguagePack

public String[] punctuationWords()
punctuationWords in interface TreebankLanguagePack
punctuationWords in class AbstractTreebankLanguagePack

public String[] sentenceFinalPunctuationTags()
sentenceFinalPunctuationTags in interface TreebankLanguagePack
sentenceFinalPunctuationTags in class AbstractTreebankLanguagePack

public String[] sentenceFinalPunctuationWords()
sentenceFinalPunctuationWords in interface TreebankLanguagePack

public boolean isEvalBIgnoredPunctuationTag(String str)
isEvalBIgnoredPunctuationTag in interface TreebankLanguagePack
isEvalBIgnoredPunctuationTag in class AbstractTreebankLanguagePack

public char[] labelAnnotationIntroducingCharacters()
labelAnnotationIntroducingCharacters in interface TreebankLanguagePack
labelAnnotationIntroducingCharacters in class AbstractTreebankLanguagePack

public String[] startSymbols()
startSymbols in interface TreebankLanguagePack
startSymbols in class AbstractTreebankLanguagePack

public static Filter chineseCommaAcceptFilter()
public static Filter chineseEndSentenceAcceptFilter()
public static Filter chineseDouHaoAcceptFilter()
public static Filter chineseQuoteMarkAcceptFilter()
public static Filter chineseParenthesisAcceptFilter()
public static Filter chineseColonAcceptFilter()
public static Filter chineseDashAcceptFilter()
public static Filter chineseOtherAcceptFilter()
public static Filter chineseLeftParenthesisAcceptFilter()
public static Filter chineseRightParenthesisAcceptFilter()
public static Filter chineseLeftQuoteMarkAcceptFilter()
public static Filter chineseRightQuoteMarkAcceptFilter()
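
Each of the static ...AcceptFilter() methods listed above returns a Filter, but this page does not say which strings each filter accepts. As a rough illustration of the accept-filter pattern only, the sketch below uses `java.util.function.Predicate<String>` as a stand-in for the library's Filter type. The class `AcceptFilterSketch`, both helper methods, and the accepted comma characters are assumptions, not the library's behavior.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;
import java.util.function.Predicate;

// Sketch only: Predicate<String> stands in for the library's Filter type.
final class AcceptFilterSketch {
    /** Builds a predicate that accepts exactly the given strings. */
    static Predicate<String> acceptFilter(String... accepted) {
        Set<String> allowed = new HashSet<>(Arrays.asList(accepted));
        return allowed::contains;
    }

    /** Hypothetical comma filter: ASCII comma and fullwidth comma (assumed set). */
    static Predicate<String> commaAcceptFilterSketch() {
        return acceptFilter(",", "\uFF0C");
    }

    public static void main(String[] args) {
        Predicate<String> commas = commaAcceptFilterSketch();
        System.out.println(commas.test("\uFF0C")); // prints "true"
        System.out.println(commas.test("\u3002")); // prints "false"
    }
}
```

Backing the predicate with a set keeps each test a constant-time lookup, which matters if a filter is applied to every token in a treebank.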
public String treebankFileExtension()
treebankFileExtension in interface TreebankLanguagePack

public GrammaticalStructureFactory grammaticalStructureFactory()
See also: AbstractTreebankLanguagePack.
grammaticalStructureFactory in interface TreebankLanguagePack
grammaticalStructureFactory in class AbstractTreebankLanguagePack