Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

job not running告警优化 #697

Open
RolfHeG opened this issue Jun 16, 2020 · 0 comments
Open

job not running告警优化 #697

RolfHeG opened this issue Jun 16, 2020 · 0 comments

Comments

@RolfHeG
Copy link
Contributor

RolfHeG commented Jun 16, 2020

job not running增加判断,是否有其他分片在nextFireTime之前就已经开始运行到现在
假如有,说明可能处于以下两种情况,作业正常无需告警:
1.有重新分片任务下发到/necessary节点,当前分片机器正在block等待running的分片运行结束
2.当前分片被failover,但是其他executor都有该job的分片任务并处于running状态,failover无法立即运行

RolfHeG added a commit that referenced this issue Jun 16, 2020
RolfHeG added a commit that referenced this issue Jun 28, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant