[DOC-9267] Guidance for Search Service SIzing#4078
[DOC-9267] Guidance for Search Service SIzing#4078sarahlwelton wants to merge 6 commits intorelease/8.0from
Conversation
|
|
||
| Based on these variables, the required vCPUs could be either: | ||
|
|
||
| * stem:[290], using a value of 300 in the vCPU calculation. |
There was a problem hiding this comment.
We should have two cases for 150 & 200 respectively instead of 200 & 300 as 150 & 200 are the values of QPS/vCPU that we have documented earlier
|
|
||
| |==== | ||
|
|
||
| Based on these variables, the required vCPUs would be stem:[40], based on the more complex queries needing a higher QPS per vCPU and using a value of 300 in the calculation. |
There was a problem hiding this comment.
Same here, the value should be 200
There was a problem hiding this comment.
@sarahlwelton I've some comments around search index nomenclature and requirements for sizing right.
I think the estimation itself @TusharMadaan04 's done nice work on and you've captured it quite well here.
| Search Service nodes manage Search indexes and serve your Search queries. | ||
|
|
||
| Basic Search indexes are lists of all the unique terms that appear in the documents on your cluster. | ||
| For each term, the Search index also contains a list of the documents where that term appears. |
There was a problem hiding this comment.
It'd be good to use the nomenclature - inverted index here .. which is the primary data structure within the search index that indicates the list of the documents where the term appears.
There was a problem hiding this comment.
Sure, that's easy enough to add
|
|
||
| Basic Search indexes are lists of all the unique terms that appear in the documents on your cluster. | ||
| For each term, the Search index also contains a list of the documents where that term appears. | ||
| These lists inside a Search index can cause the Search index to be larger than your original dataset. |
There was a problem hiding this comment.
This is a possibility, it depends very much on the data though .. in many a situation it could be lesser too because we de-dup terms/prefixes/sub-strings/suffixes in the index and the inverted-index/postings-lists are compressed bitmaps alongside a map for doc keys.
Will defer to your judgement on how to frame all of this :)
There was a problem hiding this comment.
Perhaps this framing works better, just to keep things accurate?
| To size the Search Service nodes in your cluster, you need the following information: | ||
|
|
||
| * The number of documents you need to include in your Search index or indexes. | ||
| * The average size of the documents that need to be included in your Search index, in KB. |
There was a problem hiding this comment.
Not accurate enough - it's not the average size of the docs that we need, but the number of the fields and their sizes that'll drive footprint.
The long-awaited guidance for sizing the Search Service in a Couchbase Server deployment.
Preview URL:
https://preview.docs-test.couchbase.com/docs-server-DOC-9267-fts-sizing/server/current/install/sizing-general.html#sizing-search-service-nodes
You will need the Docs Team credentials on Confluence.