back to listing index

elasticsearch - guide - Custom Analyzer

[web search]
Original source (www.elasticsearch.org)
Tags: elasticsearch search custom-analyzer
Clipped on: 2012-11-01

An analyzer of type custom that allows to combine a Tokenizer with zero or more Token Filters, and zero or more Char Filters. The custom analyzer accepts a logical/registered name of the tokenizer to use, and a list of logical/registered names of token filters.

The following are settings that can be set for a custom analyzer type:

Setting Description
tokenizerThe logical / registered name of the tokenizer to use.
filterAn optional list of logical / registered name of token filters.
char_filterAn optional list of logical / registered name of char filters.

Here is an example:

index :
    analysis
:
        analyzer
:
            myAnalyzer2
:
                type
: custom
                tokenizer
: myTokenizer1
                filter
: [myTokenFilter1, myTokenFilter2]
                char_filter
: [my_html]
        tokenizer
:
            myTokenizer1
:
                type
: standard
                max_token_length
: 900
        filter
:
            myTokenFilter1
:
                type
: stop
                stopwords
: [stop1, stop2, stop3, stop4]
            myTokenFilter2
:
                type
: length
                min
: 0
                max
: 2000
        char_filter
:
              my_html
:
                type
: html_strip
                escaped_tags
: [xxx, yyy]
                read_ahead
: 1024