Skip to content

Latest commit

 

History

History
68 lines (44 loc) · 2.38 KB

analysis.asciidoc

File metadata and controls

68 lines (44 loc) · 2.38 KB

Analysis Plugins

Analysis plugins extend Elasticsearch by adding new analyzers, tokenizers, token filters, or character filters to Elasticsearch.

Core analysis plugins

The core analysis plugins are:

ICU

Adds extended Unicode support using the ICU libraries, including better analysis of Asian languages, Unicode normalization, Unicode-aware case folding, collation support, and transliteration.

Kuromoji

Advanced analysis of Japanese using the Kuromoji analyzer.

Phonetic

Analyzes tokens into their phonetic equivalent using Soundex, Metaphone, Caverphone, and other codecs.

SmartCN

An analyzer for Chinese or mixed Chinese-English text. This analyzer uses probabilistic knowledge to find the optimal word segmentation for Simplified Chinese text. The text is first broken into sentences, then each sentence is segmented into words.

Stempel

Provides high quality stemming for Polish.

Ukrainian

Provides stemming for Ukrainian.

Community contributed analysis plugins

A number of analysis plugins have been contributed by our community: