- TibAffixedFilter - Class in io.bdrc.lucene.bo
-
Removes འི, འོ, འིའོ, འམ, འང and འིས characters at end of token.
- TibAffixedFilter(TokenStream) - Constructor for class io.bdrc.lucene.bo.TibAffixedFilter
-
- TibCharFilter - Class in io.bdrc.lucene.bo
-
- TibCharFilter(Reader) - Constructor for class io.bdrc.lucene.bo.TibCharFilter
-
- TibetanAnalyzer - Class in io.bdrc.lucene.bo
-
An Analyzer that uses
TibSyllableTokenizer and filters with StopFilter
Derived from Lucene 6.4.1 analysis.core.WhitespaceAnalyzer.java
- TibetanAnalyzer(boolean, boolean, boolean, String, String) - Constructor for class io.bdrc.lucene.bo.TibetanAnalyzer
-
- TibetanAnalyzer() - Constructor for class io.bdrc.lucene.bo.TibetanAnalyzer
-
- TibEwtsFilter - Class in io.bdrc.lucene.bo
-
A filter that converts EWTS input into Tibetan Unicode
Partially inpired from Lucene 6 org.apache.lucene.analysis.charfilterMappingCharFilter
- TibEwtsFilter(Reader) - Constructor for class io.bdrc.lucene.bo.TibEwtsFilter
-
- TibEwtsFilter(Reader, String) - Constructor for class io.bdrc.lucene.bo.TibEwtsFilter
-
- TibSyllableTokenizer - Class in io.bdrc.lucene.bo
-
A TibSyllableTokenizer divides text between sequences of Tibetan Letter and/or Digit
characters and sequences of all other characters - typically some sort of white space
but other punctuation and characters from other language code-pages are not considered
as constituents of tokens for the purpose of search and indexing.
- TibSyllableTokenizer() - Constructor for class io.bdrc.lucene.bo.TibSyllableTokenizer
-
Construct a new TibSyllableTokenizer.
- TibWordTokenizer - Class in io.bdrc.lucene.bo
-
A maximal-matching word tokenizer for Tibetan that uses a Trie.
- TibWordTokenizer(String) - Constructor for class io.bdrc.lucene.bo.TibWordTokenizer
-
Constructs a TibWordTokenizer using the file designed by filename
- TibWordTokenizer() - Constructor for class io.bdrc.lucene.bo.TibWordTokenizer
-
Constructs a TibWordTokenizer using a default lexicon file (here "resource/output/total_lexicon.txt")