public class TibAffixedFilter
extends org.apache.lucene.analysis.TokenFilter
The འི is an affixed particle that can be usefully ignored in search and indexing so that "པོ" and "པོའི" will match. This should help searches to be more lenient.
Derived from Lucene 6.4.1 analysis.standard.ClassicFilter
| Constructor and Description |
|---|
TibAffixedFilter(org.apache.lucene.analysis.TokenStream input) |
| Modifier and Type | Method and Description |
|---|---|
boolean |
incrementToken()
Returns the next token in the stream, or null at EOS.
|
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, endAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, removeAllAttributes, restoreState, toStringpublic TibAffixedFilter(org.apache.lucene.analysis.TokenStream input)
public final boolean incrementToken()
throws IOException
Removes འི, འོ, འིའོ, འམ, འང and འིས from the end of words.
incrementToken in class org.apache.lucene.analysis.TokenStreamIOExceptionCopyright © 2019. All rights reserved.