Skip to content

Commit 716702a

Browse files
authored
HBASE-24455 Correct the doc of "On the number of column families" (#1799)
Signed-off-by: Wellington Ramos Chevreuil <wchevreuil@apache.org> Signed-off-by: Viraj Jasani <vjasani@apache.org>
1 parent f5b90fc commit 716702a

File tree

1 file changed

+3
-2
lines changed

1 file changed

+3
-2
lines changed

src/main/asciidoc/_chapters/schema_design.adoc

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -127,8 +127,9 @@ ____
127127
== On the number of column families
128128
129129
HBase currently does not do well with anything above two or three column families so keep the number of column families in your schema low.
130-
Currently, flushing and compactions are done on a per Region basis so if one column family is carrying the bulk of the data bringing on flushes, the adjacent families will also be flushed even though the amount of data they carry is small.
131-
When many column families exist the flushing and compaction interaction can make for a bunch of needless i/o (To be addressed by changing flushing and compaction to work on a per column family basis). For more information on compactions, see <<compaction>>.
130+
Currently, flushing is done on a per Region basis so if one column family is carrying the bulk of the data bringing on flushes, the adjacent families will also be flushed even though the amount of data they carry is small.
131+
When many column families exist the flushing interaction can make for a bunch of needless i/o (To be addressed by changing flushing to work on a per column family basis).
132+
In addition, compactions triggered at table/region level will happen per store too.
132133
133134
Try to make do with one column family if you can in your schemas.
134135
Only introduce a second and third column family in the case where data access is usually column scoped; i.e.

0 commit comments

Comments
 (0)