| Package | Description |
|---|---|
| org.lionsoul.jcseg.tokenizer | |
| org.lionsoul.jcseg.tokenizer.core |
| Modifier and Type | Class and Description |
|---|---|
class |
Chunk
chunk concept for the mmseg chinese word segment algorithm has implemented IChunk interface
|
| Modifier and Type | Method and Description |
|---|---|
protected abstract IChunk |
ASegment.getBestCJKChunk(char[] chars,
int index)
an abstract method to gain a CJK word from the
current position.
|
IChunk |
ComplexSeg.getBestCJKChunk(char[] chars,
int index) |
IChunk |
SimpleSeg.getBestCJKChunk(char[] chars,
int index) |
protected IChunk |
SearchSeg.getBestCJKChunk(char[] chars,
int index)
here we don't have to do anything
|
| Modifier and Type | Method and Description |
|---|---|
protected String |
ASegment.findCHName(char[] chars,
int index,
IChunk chunk)
find an Chinese name from the current position of the input chars
|
boolean |
ASegment.findCHName(IWord w,
IChunk chunk)
Deprecated.
|
| Modifier and Type | Method and Description |
|---|---|
static IChunk[] |
MMSegFilter.getLargestAverageWordLengthChunks(IChunk[] chunks)
2.
|
static IChunk[] |
MMSegFilter.getLargestSingleMorphemicFreedomChunks(IChunk[] chunks)
the largest sum of degree of morphemic freedom of one-character words
this rule will return the chunks that own the largest sum of degree of morphemic freedom
of one-character
|
static IChunk[] |
MMSegFilter.getMaximumMatchChunks(IChunk[] chunks)
1.
|
static IChunk[] |
MMSegFilter.getSmallestVarianceWordLengthChunks(IChunk[] chunks)
the smallest variance word length
this rule will the chunks that one the smallest variance word length
|
| Modifier and Type | Method and Description |
|---|---|
static IChunk[] |
MMSegFilter.getLargestAverageWordLengthChunks(IChunk[] chunks)
2.
|
static IChunk[] |
MMSegFilter.getLargestSingleMorphemicFreedomChunks(IChunk[] chunks)
the largest sum of degree of morphemic freedom of one-character words
this rule will return the chunks that own the largest sum of degree of morphemic freedom
of one-character
|
static IChunk[] |
MMSegFilter.getMaximumMatchChunks(IChunk[] chunks)
1.
|
static IChunk[] |
MMSegFilter.getSmallestVarianceWordLengthChunks(IChunk[] chunks)
the smallest variance word length
this rule will the chunks that one the smallest variance word length
|
Copyright © 2017. All Rights Reserved.