public final class SimpleAnalyzer extends Analyzer
Analyzer
that filters LetterTokenizer
with LowerCaseFilter
You may specify the Version
compatibility
when creating SimpleAnalyzer
:
LowerCaseTokenizer
uses an int based API to normalize and
detect token codepoints. See CharTokenizer.isTokenChar(int)
and
CharTokenizer.normalize(int)
for details.Analyzer.GlobalReuseStrategy, Analyzer.PerFieldReuseStrategy, Analyzer.ReuseStrategy, Analyzer.TokenStreamComponents
GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY
Constructor and Description |
---|
SimpleAnalyzer()
Creates a new
SimpleAnalyzer |
SimpleAnalyzer(Version matchVersion)
Deprecated.
Use
SimpleAnalyzer() |
Modifier and Type | Method and Description |
---|---|
protected Analyzer.TokenStreamComponents |
createComponents(String fieldName,
Reader reader)
Creates a new
Analyzer.TokenStreamComponents instance for this analyzer. |
close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, setVersion, tokenStream, tokenStream
public SimpleAnalyzer()
SimpleAnalyzer
@Deprecated public SimpleAnalyzer(Version matchVersion)
SimpleAnalyzer()
protected Analyzer.TokenStreamComponents createComponents(String fieldName, Reader reader)
Analyzer
Analyzer.TokenStreamComponents
instance for this analyzer.createComponents
in class Analyzer
fieldName
- the name of the fields content passed to the
Analyzer.TokenStreamComponents
sink as a readerreader
- the reader passed to the Tokenizer
constructorAnalyzer.TokenStreamComponents
for this analyzer.Copyright © 2000–2016 The Apache Software Foundation. All rights reserved.