-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add workqueue latency observation tool #4897
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is a useful tool. Sometimes during production we do wonder whether workqueue latency is too long which amplifies the chance for race condition, etc.
Two things:
First, could you add the tool with brief description into README.md (CPU and Scheduler Tools)
Second, sometimes we want to identify which original task is enqueued which caused long latency. So it might be useful to add another distribution factor (tid (together with its comm)).
What do you think?
thanks for your suggestions, and I don't fully understand your second suggestion, do you mean adding a '-T TID' option to filter the workqueue's source thread ID for this tool? |
Currently we have '-W' option to print histogram based on different workqueues. So we have
What I means is to add '-P' option to print histogram based on PIDs. (I think PID (process id) granularity is good enough).
If both -W and -P are specified, we can have
Similar to '-W', '-p ' is also supported, so we only care a particular pid. So we have support of the following combinations:
What do you think? |
-P option you mentioned in some cases does not make senses since kernel may commit the work to workqueue in interrupt context; and in the case histogram base on PID is not the real process which commit the work, right? |
Good point. I guess let us not do this PID thing now. I will do some investigation about different percentages from process context, softirq context, or others. I agree if majority is not from process context, then PID histogram probably not useful. Even if quite some workqueue works from process context, we might still want to filter out those not from process context. |
523ebfc
to
0dbc94a
Compare
Add workqueue latency observation tool.
Add workqueue latency observation tool.
add tool to observe work's waiting latency on kernel's workqueue