This chapter discusses the relationship between Lucene and Elasticsearch, and explains some of the Lucene internals that the advanced user should be aware of.
-
A conceptual overview
-
Documents, Fields
-
4-Dimension API (Fields → Terms → Docs → Positions)
-
-
Lifetime of a Document
-
Lifetime of a Query
-
Query Rewriting
-
Weight & Scorer
-
Query Types
-
MultiTermQueries vs. “common” queries
-
-
Data structures explained
-
File Formats at a high level (i.e., the basic data structures)
-
-
Document-at-a-time retrieval
-
Scoring Models
-
TF-IDF, BM25, etc.
-
Similarity & Per-Field Scoring
-
-
Proximity Scoring
-
PhraseQueries and their impact
-
-
Custom Scoring