-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: Insert timeout when running test_e2e.py
after Milvus recovered from datanode pod kill chaos
#17537
Comments
similar insert failure in issue #17524. /assign @bigsheeper |
in version failed job: https://github.com/milvus-io/milvus/runs/6886689798?check_suite_focus=true |
DataCoord crashed and was not able to come back.
Could be the same in #17335 |
DataCoord keep creating consumer and closing consumer over and over again:
|
version |
See also: milvus-io#17537 Signed-off-by: yangxuan <xuan.yang@zilliz.com>
version |
See also: #17537 Signed-off-by: yangxuan <xuan.yang@zilliz.com>
DataCoord unsubscribe the same channel over and over again: There should be many duplicated channels in Guess DataCoord cost much time to unsubscribe these plenty of duplicatehed channels, which caused DataCoord to failed to start or evict old DataNode. |
@bigsheeper why there is duplicated channels? |
pr has merged, could you please make a verify? @zhuwenxing |
/assign @zhuwenxing |
Not reproduced recently yet. Remove critical label. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Is there an existing issue for this?
Environment
Current Behavior
Insert timeout when running
test_e2e.py
after Milvus recovered from datanode pod kill chaosActually, create collection also cost a lot time, 23s.
Expected Behavior
all test cases passed
Steps To Reproduce
see https://github.com/milvus-io/milvus/runs/6867486184?check_suite_focus=true
Milvus Log
failed job: https://github.com/milvus-io/milvus/runs/6867486184?check_suite_focus=true
log: https://github.com/milvus-io/milvus/suites/6913763113/artifacts/268667204
Anything else?
some other issues caused by datanode pod kill
#17335
#17366
The text was updated successfully, but these errors were encountered: