java.lang.Object
  edu.stanford.nlp.parser.lexparser.AbstractTreebankParserParams
    edu.stanford.nlp.parser.lexparser.ChineseTreebankParserParams
public class ChineseTreebankParserParams
Parameter file for parsing the Penn Chinese Treebank. Includes category enrichments specific to the Penn Chinese Treebank.
Nested Class Summary |
---|
Nested classes/interfaces inherited from class edu.stanford.nlp.parser.lexparser.AbstractTreebankParserParams |
---|
AbstractTreebankParserParams.SubcategoryStripper |
Field Summary | |
---|---|
boolean |
bikelHeadFinder
|
boolean |
charTags
|
static boolean |
chineseSelectiveTagPA
|
static boolean |
chineseSplitDouHao
Chinese: Split the dou hao (a punctuation mark separating members of a list) from other punctuation. |
static boolean |
chineseSplitPunct
Chinese: split Chinese punctuation several ways, along the lines of English punctuation plus another category for the dou hao. |
static boolean |
chineseSplitPunctLR
Chinese: split left and right parentheses and quotes (if chineseSplitPunct is also true). |
static boolean |
chineseSplitVP3
Chinese: split VPs into VP-COMP, VP-CRD, VP-ADJ. |
static boolean |
chineseVerySelectiveTagPA
|
boolean |
discardFrags
|
static boolean |
gpaAD
Grandparent annotate all AD. |
static boolean |
markADgrandchildOfIP
Chinese: mark ADs that are grandchild of IP. |
static boolean |
markCC
Mark phrases which are conjunctions. |
static boolean |
markIPadjsubj
|
static boolean |
markIPconj
Chinese: mark IPs that are conjuncts. |
static boolean |
markIPsisDEC
Chinese: mark IPs that are part of prenominal modifiers. |
static boolean |
markIPsisterBA
Chinese: mark IPs that are sister of BA. |
static boolean |
markIPsisterVVorP
Chinese: mark IPs that are sister of VV or P. |
static boolean |
markModifiedNP
Chinese: mark left-modified NPs (rightmost NPs with a left-side mod). |
static boolean |
markMultiNtag
Chinese: mark nominal tags that are part of multi-nominal rewrites. |
static boolean |
markNPconj
Chinese: mark NPs that are conjuncts. |
static boolean |
markNPmodNP
Chinese: mark NP modifiers of NPs. |
static boolean |
markPostverbalP
Chinese: mark P with a left aunt VV, and PP with a left sister VV. |
static boolean |
markPostverbalPP
|
static boolean |
markPsisterIP
Chinese: mark Ps that are sister of IP. |
static boolean |
markVPadjunct
Chinese: mark phrases that are adjuncts of VP (these tend to be locatives/temporals, and have a specific distribution). |
static boolean |
markVVsisterIP
Chinese: mark VVs that are sister of IP (communication & small-clause-taking verbs). |
static boolean |
mergeNNVV
Chinese: merge NN and VV. |
static boolean |
paRootDtr
Chinese: parent annotate daughter of root. |
boolean |
segmentMarkov
|
boolean |
segmentMaxMatch
|
static boolean |
splitBaseNP
Mark base NPs. |
static boolean |
splitNPTMP
Whether to retain the -TMP functional tag on various phrasal categories. |
static boolean |
splitPPTMP
|
static boolean |
splitXPTMP
|
boolean |
sunJurafskyHeadFinder
|
static boolean |
tagWordSize
Annotate tags for number of characters contained. |
static boolean |
unaryCP
|
static boolean |
unaryIP
Chinese: unary category marking |
boolean |
useCharacterBasedLexicon
|
boolean |
useMaxentDepGrammar
|
boolean |
useMaxentLexicon
|
boolean |
useSimilarWordMap
|
Fields inherited from class edu.stanford.nlp.parser.lexparser.AbstractTreebankParserParams |
---|
inputEncoding, outputEncoding, tlp |
Constructor Summary | |
---|---|
ChineseTreebankParserParams()
|
Method Summary | |
---|---|
TreeTransformer |
collinizer()
Returns a ChineseCollinizer |
TreeTransformer |
collinizerEvalb()
Returns a ChineseCollinizer that doesn't delete punctuation |
List |
defaultTestSentence()
Return a default sentence for the language (for testing) |
edu.stanford.nlp.parser.lexparser.Extractor |
dependencyGrammarExtractor(Options op)
|
DiskTreebank |
diskTreebank()
Uses a DiskTreebank with a CHTBTokenizer and a BobChrisTreeNormalizer. |
void |
display()
Display language-specific settings. |
HeadFinder |
headFinder()
Returns a ChineseHeadFinder |
Lexicon |
lex(Options.LexOptions op)
Returns a ChineseLexicon |
static void |
main(String[] args)
For testing: loads a treebank and prints the trees. |
MemoryTreebank |
memoryTreebank()
Uses a MemoryTreebank with a CHTBTokenizer and a BobChrisTreeNormalizer |
double[] |
MLEDependencyGrammarSmoothingParams()
Give the parameters for smoothing in the MLEDependencyGrammar. |
int |
setOptionFlag(String[] args,
int i)
Set language-specific options according to flags. |
String[] |
sisterSplitters()
Returns the splitting strings used for selective splits. |
Tree |
transformTree(Tree t,
Tree root)
transformTree does all language-specific tree transformations. |
TreeReaderFactory |
treeReaderFactory()
Returns a factory for reading in trees from the source you want. |
Methods inherited from class edu.stanford.nlp.parser.lexparser.AbstractTreebankParserParams |
---|
dependencyObjectify, getInputEncoding, getOutputEncoding, lex, parsevalObjectify, parsevalObjectify, pw, pw, setInputEncoding, setOutputEncoding, subcategoryStripper, testMemoryTreebank, treebankLanguagePack, treeTokenizerFactory, typedDependencyClasser, typedDependencyObjectify, untypedDependencyObjectify |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public boolean charTags
public boolean useCharacterBasedLexicon
public boolean useMaxentLexicon
public boolean useMaxentDepGrammar
public boolean segmentMarkov
public boolean segmentMaxMatch
public boolean sunJurafskyHeadFinder
public boolean bikelHeadFinder
public boolean discardFrags
public boolean useSimilarWordMap
public static boolean chineseSplitDouHao
public static boolean chineseSplitPunct
public static boolean chineseSplitPunctLR
public static boolean markVVsisterIP
public static boolean markPsisterIP
public static boolean markIPsisterVVorP
public static boolean markADgrandchildOfIP
public static boolean gpaAD
public static boolean chineseVerySelectiveTagPA
public static boolean chineseSelectiveTagPA
public static boolean markIPsisterBA
public static boolean markVPadjunct
public static boolean markNPmodNP
public static boolean markModifiedNP
public static boolean markNPconj
public static boolean markMultiNtag
public static boolean markIPsisDEC
public static boolean markIPconj
public static boolean markIPadjsubj
public static boolean chineseSplitVP3
public static boolean mergeNNVV
public static boolean unaryIP
public static boolean unaryCP
public static boolean paRootDtr
public static boolean markPostverbalP
public static boolean markPostverbalPP
public static boolean splitBaseNP
public static boolean tagWordSize
public static boolean markCC
public static boolean splitNPTMP
public static boolean splitPPTMP
public static boolean splitXPTMP
Constructor Detail |
---|
public ChineseTreebankParserParams()
Method Detail |
---|
public HeadFinder headFinder()
headFinder in interface TreebankLangParserParams
headFinder in class AbstractTreebankParserParams
public Lexicon lex(Options.LexOptions op)
lex in interface TreebankLangParserParams
lex in class AbstractTreebankParserParams
public double[] MLEDependencyGrammarSmoothingParams()
Description copied from class: AbstractTreebankParserParams
Give the parameters for smoothing in the MLEDependencyGrammar.
MLEDependencyGrammarSmoothingParams in interface TreebankLangParserParams
MLEDependencyGrammarSmoothingParams in class AbstractTreebankParserParams
public TreeReaderFactory treeReaderFactory()
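Description copied from interface: TreebankLangParserParams
Returns a factory for reading in trees from the source you want.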
public DiskTreebank diskTreebank()
public MemoryTreebank memoryTreebank()
memoryTreebank in interface TreebankLangParserParams
memoryTreebank in class AbstractTreebankParserParams
public TreeTransformer collinizer()
collinizer in interface TreebankLangParserParams
collinizer in class AbstractTreebankParserParams
public TreeTransformer collinizerEvalb()
collinizerEvalb in interface TreebankLangParserParams
collinizerEvalb in class AbstractTreebankParserParams
public String[] sisterSplitters()
Description copied from class: AbstractTreebankParserParams
Returns the splitting strings used for selective splits.
sisterSplitters in interface TreebankLangParserParams
sisterSplitters in class AbstractTreebankParserParams
public Tree transformTree(Tree t, Tree root)
transformTree in interface TreebankLangParserParams
transformTree in class AbstractTreebankParserParams
Parameters:
t - The input tree (with non-language-specific annotation already done, so you need to strip back to basic categories)
root - The root of the current tree (can be null for words)
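The note on parameter t says the incoming labels already carry annotation and must be stripped back to basic categories before language-specific marking. The following is a minimal, self-contained sketch of that stripping step on plain label strings. The class name BasicCategorySketch, the method basicCategory, and the marker characters '-', '^' and '=' are assumptions made for illustration; they are not taken from this page and are not the library's implementation.

```java
/** Hypothetical helper; illustrates stripping a label back to its basic category. */
final class BasicCategorySketch {

    // Assumed annotation-introducing characters (not confirmed by this page).
    private static final char[] MARKERS = {'-', '^', '='};

    /** Returns the part of the label before the first marker character. */
    static String basicCategory(String label) {
        if (label == null || label.isEmpty()) {
            return label;
        }
        int cut = label.length();
        for (char m : MARKERS) {
            // Search from index 1 so a label that begins with a marker
            // is never reduced to the empty string.
            int idx = label.indexOf(m, 1);
            if (idx > 0 && idx < cut) {
                cut = idx;
            }
        }
        return label.substring(0, cut);
    }

    public static void main(String[] args) {
        // An NP carrying a functional tag and a parent annotation.
        System.out.println(basicCategory("NP-TMP^IP"));
    }
}
```

A transformTree implementation would typically apply a step like this to each node label first, then append its own marks depending on which of the boolean fields above are enabled.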
public void display()
Description copied from class: AbstractTreebankParserParams
Display language-specific settings.
display in interface TreebankLangParserParams
display in class AbstractTreebankParserParams
public int setOptionFlag(String[] args, int i)
setOptionFlag in interface TreebankLangParserParams
setOptionFlag in class AbstractTreebankParserParams
Parameters:
args - Array of command line arguments
i - Index in command line arguments to try to process as an option
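The signature above takes the full argument array plus an index and returns an int. The sketch below shows one way such a method can be written and driven, assuming the return value is the index of the next unprocessed argument and that an unchanged index means the flag was not recognized; this page does not state that contract. The class OptionFlagSketch, the driver parseAll, and every flag spelling (including the value-taking -inputEncoding) are hypothetical. Only the field names are borrowed from the summaries above.

```java
/** Hypothetical stand-in for a params class; not the library's code. */
final class OptionFlagSketch {

    // Field names borrowed from this page for illustration only.
    boolean chineseSplitDouHao = false;
    boolean markIPconj = false;
    String inputEncoding = "UTF-8";

    /**
     * Tries to consume args[i] as an option.
     * Assumed contract: returns the index of the next unprocessed argument,
     * or i unchanged when args[i] is not a recognized flag.
     */
    int setOptionFlag(String[] args, int i) {
        if (i < 0 || i >= args.length) {
            return i;
        }
        switch (args[i]) {
            case "-chineseSplitDouHao":   // hypothetical flag spelling
                chineseSplitDouHao = true;
                return i + 1;
            case "-markIPconj":           // hypothetical flag spelling
                markIPconj = true;
                return i + 1;
            case "-inputEncoding":        // hypothetical flag that takes a value
                if (i + 1 >= args.length) {
                    throw new IllegalArgumentException("-inputEncoding needs a value");
                }
                inputEncoding = args[i + 1];
                return i + 2;
            default:
                return i;                 // not recognized: index unchanged
        }
    }

    /** Hypothetical driver loop that feeds every argument through setOptionFlag. */
    static OptionFlagSketch parseAll(String[] args) {
        OptionFlagSketch params = new OptionFlagSketch();
        int i = 0;
        while (i < args.length) {
            int next = params.setOptionFlag(args, i);
            if (next == i) {
                throw new IllegalArgumentException("Unknown option: " + args[i]);
            }
            i = next;
        }
        return params;
    }

    public static void main(String[] args) {
        OptionFlagSketch p = parseAll(new String[] {"-markIPconj", "-inputEncoding", "GB18030"});
        System.out.println(p.markIPconj + " " + p.inputEncoding);
    }
}
```

Returning the new index rather than a boolean lets a single call consume a flag together with its value, which is why the driver compares the returned index against the one it passed in.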
public edu.stanford.nlp.parser.lexparser.Extractor dependencyGrammarExtractor(Options op)
dependencyGrammarExtractor in interface TreebankLangParserParams
dependencyGrammarExtractor in class AbstractTreebankParserParams
public List defaultTestSentence()
public static void main(String[] args)