HWP PCU agent minor modification #599

17-sugiyama · 2023-12-19T12:19:02Z

I tested the recently added hwp_pcu agent, and found that 3 seconds of timeout is sometimes too short to communicate with the HWP PCU module.
This is a request to extend timeout for sending commands/receiving messages.

Description

I extended the timeout for send_command from 3 sec to 10 sec; and for get_status to 10 sec.

Motivation and Context

Improving the operation stability

How Has This Been Tested?

I tested this agent in the lab and it worked.
The recently added agent worked fine except for the timeout issue.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)

Checklist:

My code follows the code style of this project.
My change requires a change to the documentation.
I have updated the documentation accordingly.

for more information, see https://pre-commit.ci

BrianJKoopman

This seems a bit unusual to me. I believe it could be due to the timeout in the 'acq' process being set to a non-zero value. Try changing the timeout on line 162 to 0 and leaving these timeouts modified here as they were and see if that helps:

socs/socs/agents/hwp_pcu/agent.py

Line 162 in aaf066e

with self.lock.acquire_timeout(timeout=3, job='acq') as acquired:

jlashner · 2023-12-19T13:51:38Z

@BrianJKoopman a bit confused about your comment (I think it was me who suggested non-zero timeouts). Why would a timeout of zero help here?

jlashner · 2023-12-19T13:53:44Z

I think the problem is the long acq loop time:

socs/socs/agents/hwp_pcu/agent.py

Line 193 in aaf066e

time.sleep(5)

While this is sleeping, the lock cannot be acquired by other processes. I think to fix we should make this a fast loop (.1 sec or so) and only take and publish data on iterations where 5 sec have elapsed.

Or we could also restructure the agent to avoid TimeoutLocks.

BrianJKoopman · 2023-12-19T14:06:16Z

I think the problem is the long acq loop time:

socs/socs/agents/hwp_pcu/agent.py

Line 193 in aaf066e

time.sleep(5)

While this is sleeping, the lock cannot be acquired by other processes. I think to fix we should make this a fast loop (.1 sec or so) and only take and publish data on iterations where 5 sec have elapsed.

Ah, yup, agreed this 5 second sleep is the issue, the task timeouts would need to be longer than that. Sorry for the noise. I've just seen non-zero acq timeouts cause issues in the past, and the example in the docs has a 0 timeout in long running processes.

jlashner · 2023-12-19T15:06:54Z

I realize it might not be clear what I'm talking about when I say lockless agent restructure, so I threw together what I have in mind in this draft PR for reference which should fix the locking issues. Feel free to use that fix if you would like, but may need some testing:

#600

17-sugiyama · 2023-12-20T14:20:34Z

Thank you so much for your comments and suggestions.
I prefer to use Jack's lockless agent. I tested Jack's agent with the PCU instrument and it worked.
I left some comments to #600.
I'd like to close this pull request.

17-sugiyama and others added 5 commits December 6, 2023 05:11

hwp_pcu is added.

c383c83

The name of the operation mode changed & timeout extended

e468a28

timeout extended

2f84db7

Update __init__.py

c5d1dbe

minor modification

34706b9

17-sugiyama requested review from jlashner and ykyohei December 19, 2023 12:19

[pre-commit.ci] auto fixes from pre-commit.com hooks

aaf066e

for more information, see https://pre-commit.ci

BrianJKoopman reviewed Dec 19, 2023

View reviewed changes

17-sugiyama closed this Dec 20, 2023

17-sugiyama deleted the hwp_pcu-agent branch December 20, 2023 14:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HWP PCU agent minor modification #599

HWP PCU agent minor modification #599

Uh oh!

17-sugiyama commented Dec 19, 2023

Uh oh!

BrianJKoopman left a comment

Uh oh!

jlashner commented Dec 19, 2023

Uh oh!

jlashner commented Dec 19, 2023

Uh oh!

BrianJKoopman commented Dec 19, 2023

Uh oh!

jlashner commented Dec 19, 2023

Uh oh!

17-sugiyama commented Dec 20, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

HWP PCU agent minor modification #599

HWP PCU agent minor modification #599

Uh oh!

Conversation

17-sugiyama commented Dec 19, 2023

Description

Motivation and Context

How Has This Been Tested?

Types of changes

Checklist:

Uh oh!

BrianJKoopman left a comment

Choose a reason for hiding this comment

Uh oh!

jlashner commented Dec 19, 2023

Uh oh!

jlashner commented Dec 19, 2023

Uh oh!

BrianJKoopman commented Dec 19, 2023

Uh oh!

jlashner commented Dec 19, 2023

Uh oh!

17-sugiyama commented Dec 20, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants