public class NLPSeg extends ComplexSeg
behindLatin, config, ctrlMask, dic, ialist, idx, isb, reader, wordPoolCHECK_CE_MASk, CHECK_CF_MASK, CHECK_EC_MASK, START_SS_MASK| Constructor and Description |
|---|
NLPSeg(JcsegTaskConfig config,
ADictionary dic) |
NLPSeg(Reader input,
JcsegTaskConfig config,
ADictionary dic) |
| Modifier and Type | Method and Description |
|---|---|
protected IWord |
getNextCJKWord(int c,
int pos)
get the next CJK word from the current position of the input stream
|
protected IWord |
getNextDatetimeWord(IWord word)
get and return the next date-time word
|
protected IWord |
getNextTimeMergedWord(IWord word)
get and return the next time merged date-time word
|
IWord |
next()
Override the next method to add the date-time entity recognition
And we also invoke the parent.next method to get the next token
|
protected IWord |
nextLatinWord(int c,
int pos)
find the letter or digit word from the current position
count until the char is whitespace or not letter_digit
|
getBestCJKChunkappendLatinSyn, appendWordFeatures, enSecondSeg, findCHName, findCHName, getConfig, getDict, getNextLatinWord, getNextMatch, getNextMixedWord, getNextPunctuationPairWord, getPairPunctuationText, getStreamPosition, nextCJKSentence, nextCNNumeric, nextLatinString, nextLetterNumber, nextOtherNumber, pushBack, pushBack, readNext, reset, setConfig, setDictpublic NLPSeg(Reader input, JcsegTaskConfig config, ADictionary dic) throws IOException
IOExceptionpublic NLPSeg(JcsegTaskConfig config, ADictionary dic) throws IOException
IOExceptionpublic IWord next() throws IOException
next in interface ISegmentnext in class ASegmentIOExceptionASegment.next()protected IWord getNextTimeMergedWord(IWord word) throws IOException
word - IOExceptionprotected IWord getNextDatetimeWord(IWord word) throws IOException
word - IOExceptionprotected IWord getNextCJKWord(int c, int pos) throws IOException
ASegmentgetNextCJKWord in class ASegmentIOExceptionASegment.getNextCJKWord(int, int)protected IWord nextLatinWord(int c, int pos) throws IOException
nextLatinWord in class ASegmentc - pos - IOExceptionCopyright © 2017. All Rights Reserved.