- coordinator
- overlord
- broker
- historical
- middleManager
- peon
- zk (view of the cluster: data topology)
- mysql
- hdfs
- TimeSeries
- TopN
- GroupBy
- Time Boundary
- Search
- Select
- query type
- granularity
- filter
- aggregation
- post-aggregation
timestamp与metric都是array of int of floating point (LZ4压缩)
dimension要稍微复杂一点,由以下三种数据结构构成
- A dictionary that maps values (which are always treated as strings) to integer IDs,
- A list of the column’s values, encoded using the dictionary in 1, and
- For each distinct value in the column, a bitmap that indicates which rows contain that value. (也被称为倒排)
字符串类型的倒排索引