Only one query node used in searching if 3 querynode replicas deployed #6415

Closed
3 tasks
del-zhenwu opened this issue Jul 9, 2021 · 8 comments
Labels: do-not-merge/hold; kind/bug (issues or changes related to a bug); needs-triage (indicates an issue or PR lacks a `triage/foo` label and requires one); stale (indicates no updates for 30 days)

Comments

@del-zhenwu
Contributor

del-zhenwu commented Jul 9, 2021

Please state your issue using the following template and, most importantly, in English.

Describe the bug
Only one query node is used for searching when 3 querynode replicas are deployed. During this test, the test client generated concurrent search requests from 100 users.
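
The actual test client is not shown in this issue. As a rough illustration of the load pattern only, a 100-user concurrent driver could be sketched as follows; `run_concurrent_searches` and `fake_search` are hypothetical names, and `fake_search` stands in for a real search call against the cluster.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


def run_concurrent_searches(
    search: Callable[[int], object],
    users: int = 100,
    requests_per_user: int = 10,
) -> List[object]:
    """Run `search` from `users` worker threads and collect every result."""

    def one_user(user_id: int) -> List[object]:
        # Each simulated user issues its requests one after another.
        return [search(user_id) for _ in range(requests_per_user)]

    # One thread per simulated user, so all users are active at the same time.
    with ThreadPoolExecutor(max_workers=users) as pool:
        per_user = list(pool.map(one_user, range(users)))
    return [result for results in per_user for result in results]


def fake_search(user_id: int) -> int:
    # Placeholder (assumption): a real client would send a search request here.
    return user_id


results = run_concurrent_searches(fake_search, users=100, requests_per_user=2)
print(len(results))
```

With a driver of this shape, all three query nodes would be expected to show CPU load if the search work were spread across them.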

[Three screenshots: Kubernetes-Compute-Resources-Pod-Grafana]

pod names:

NAME                                                   READY   STATUS    RESTARTS   AGE
benchmark-dm9h7-1-etcd-0                               1/1     Running   0          38s
benchmark-dm9h7-1-milvus-datacoord-7fccffb97d-g8zk5    1/1     Running   0          39s
benchmark-dm9h7-1-milvus-datanode-744b859975-2sjcq     1/1     Running   0          38s
benchmark-dm9h7-1-milvus-indexcoord-5744599f97-kcv4n   1/1     Running   0          38s
benchmark-dm9h7-1-milvus-indexnode-549d67b968-q4mqd    1/1     Running   0          38s
benchmark-dm9h7-1-milvus-proxy-858844cb57-7cmsz        1/1     Running   0          39s
benchmark-dm9h7-1-milvus-pulsar-758d7895db-t6j8v       1/1     Running   0          38s
benchmark-dm9h7-1-milvus-querycoord-59487bfb6-298xg    1/1     Running   0          39s
benchmark-dm9h7-1-milvus-querynode-d6ff86794-6t9qb     1/1     Running   0          39s
benchmark-dm9h7-1-milvus-querynode-d6ff86794-jfbcq     1/1     Running   0          39s
benchmark-dm9h7-1-milvus-querynode-d6ff86794-pxxsn     1/1     Running   0          39s
benchmark-dm9h7-1-milvus-rootcoord-5bff5c9fc8-rfj7s    1/1     Running   0          39s

logs:
https://kibana-dev.zilliz.cc/app/discover#/?_g=(filters:!(),refreshInterval:(pause:!t,value:0),time:(from:now-2h,to:now))&_a=(columns:!(log),filters:!(('$state':(store:appState),meta:(alias:!n,disabled:!f,index:ad1e1c50-c9d9-11eb-9e1a-b3dcad9693fd,key:kubernetes.pod_name,negate:!f,params:!(benchmark-dm9h7-1-milvus-querynode-d6ff86794-6t9qb),type:phrases,value:benchmark-dm9h7-1-milvus-querynode-d6ff86794-6t9qb),query:(bool:(minimum_should_match:1,should:!((match_phrase:(kubernetes.pod_name:benchmark-dm9h7-1-milvus-querynode-d6ff86794-6t9qb))))))),hideChart:!t,index:ad1e1c50-c9d9-11eb-9e1a-b3dcad9693fd,interval:auto,query:(language:kuery,query:''),sort:!(!('@timestamp',desc)))

Steps/Code to reproduce behavior
Install the cluster with Helm, setting querynode replicas to 3.

Dataset: sift-10m, using the default segment configuration.

Expected behavior
A clear and concise description of what you expected to happen.

Method of installation

  • Docker/cpu
  • Docker/gpu
  • Build from source

Environment details

  • Hardware/Software conditions (OS, CPU, GPU, Memory)

  • Milvus version (master or released version)

cluster version with 3 query nodes

Configuration file
Settings you made in server_config.yaml or milvus.yaml

paste-file-content-here

Screenshots
If applicable, add screenshots to help explain your problem.

Additional context
Add any other context about the problem here.

@del-zhenwu del-zhenwu added the kind/bug Issues or changes related to a bug label Jul 9, 2021
@del-zhenwu del-zhenwu changed the title Only one query node used in searching if deployed 3 querynode replicas Only one query node used in searching if 3 querynode replicas deployed Jul 9, 2021
@xige-16
Contributor

xige-16 commented Jul 9, 2021

Could you provide the log files of all three query nodes? @del-zhenwu

@del-zhenwu
Contributor Author

> Could you provide the log files of all three query nodes? @del-zhenwu

Log URI uploaded.

@xige-16
Contributor

xige-16 commented Jul 9, 2021

querynode7.log

@xige-16
Contributor

xige-16 commented Jul 9, 2021

The nodeIDs are 4, 7, and 8. There were only two segments at loadCollection time, and they were assigned to node4 and node7; the two dmChannels were also assigned to node4 and node7. node7 failed to connect to etcd but stayed alive, so only one querynode (node4) was serving search.
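
The effect can be illustrated with a small simulation. This is only a sketch: the round-robin policy and the names `assign_round_robin` and `serving_nodes` are assumptions made for illustration, not the actual querycoord assignment logic, and the segment names are placeholders (only the count of two comes from the logs).

```python
from typing import Dict, List, Set


def assign_round_robin(items: List[str], node_ids: List[int]) -> Dict[int, List[str]]:
    """Hypothetical policy: hand items to nodes in order, one at a time."""
    assignment: Dict[int, List[str]] = {node_id: [] for node_id in node_ids}
    for index, item in enumerate(items):
        assignment[node_ids[index % len(node_ids)]].append(item)
    return assignment


def serving_nodes(assignment: Dict[int, List[str]], unhealthy: Set[int]) -> List[int]:
    """Nodes that hold at least one segment and are able to serve it."""
    return [
        node_id
        for node_id, segments in assignment.items()
        if segments and node_id not in unhealthy
    ]


node_ids = [4, 7, 8]                    # nodeIDs reported in this thread
segments = ["segment-a", "segment-b"]   # placeholder names for the two segments

assignment = assign_round_robin(segments, node_ids)
# node8 receives nothing because there are fewer segments than nodes.
# node7 holds a segment but cannot serve it (its etcd connection failed).
active = serving_nodes(assignment, unhealthy={7})
print(assignment)
print(active)
```

Under these assumptions only node4 ends up in `active`, which matches the single busy query node seen in the dashboards.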

@xige-16
Contributor

xige-16 commented Jul 9, 2021

@bigsheeper querynode shouldn't stay alive after failing to connect to etcd.
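
One common way to get that behavior is to fail fast: retry the connection a bounded number of times and terminate the process if it never succeeds, so that the supervisor (a Kubernetes Deployment in this setup) restarts the pod. The sketch below shows the general pattern only; `connect_or_exit` and its parameters are hypothetical and are not taken from the Milvus code base.

```python
import sys
import time
from typing import Callable


def connect_or_exit(
    connect: Callable[[], None],
    retries: int = 3,
    delay_s: float = 1.0,
) -> None:
    """Try to connect; terminate the process if every attempt fails."""
    for attempt in range(1, retries + 1):
        try:
            connect()
            return  # Connected: the node may start serving.
        except ConnectionError as err:
            print(
                f"etcd connect attempt {attempt}/{retries} failed: {err}",
                file=sys.stderr,
            )
            if attempt < retries:
                time.sleep(delay_s)
    # Exit with a non-zero code instead of lingering as a node that
    # looks alive but cannot take part in the cluster.
    sys.exit(1)
```

A node started this way either registers successfully or disappears, so segments and channels are never left assigned to a process that cannot serve them.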

@xige-16
Contributor

xige-16 commented Jul 9, 2021

/kind bug

@del-zhenwu del-zhenwu modified the milestones: 2.0-Backlog, 2.0.0-RC2 Jul 9, 2021
@xige-16 xige-16 modified the milestones: 2.0.0-RC2, 2.0-Backlog Jul 9, 2021
@wxyucs wxyucs added the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Jul 13, 2021
@stale

stale bot commented Aug 25, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Rotten issues close after 30d of inactivity. Reopen the issue with /reopen.

@stale

stale bot commented Sep 24, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Rotten issues close after 30d of inactivity. Reopen the issue with /reopen.

@stale stale bot added the stale indicates no updates for 30 days label Sep 24, 2021
@stale stale bot closed this as completed Oct 1, 2021

4 participants