public final class TibSyllableTokenizer
extends org.apache.lucene.analysis.util.CharTokenizer
Adjacent sequences of Tibetan Letter and/or Digit characters form tokens.
Derived from Lucene 6.4.1 analysis.core.WhitespaceTokenizer.java
| Constructor and Description |
|---|
TibSyllableTokenizer()
Construct a new TibSyllableTokenizer.
|
| Modifier and Type | Method and Description |
|---|---|
protected boolean |
isTibLetterOrDigit(int c) |
protected boolean |
isTokenChar(int c)
Collects only characters which satisfy isTibetanLetterOrDigit()
|
end, fromSeparatorCharPredicate, fromSeparatorCharPredicate, fromSeparatorCharPredicate, fromSeparatorCharPredicate, fromTokenCharPredicate, fromTokenCharPredicate, fromTokenCharPredicate, fromTokenCharPredicate, incrementToken, normalize, resetaddAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toStringCopyright © 2019. All rights reserved.