PBS spawns unlimited amount of goroutine #3754

linux019 · 2024-06-14T12:05:38Z

On high RPS to /openrtb2/auction endpoint PBS spawns more and more goroutines. To handle the huge amount of traffic I had to put an RPS limit on the load balancer.
During normal work of PBS amount of goroutines is ~5K

PBS spaws a goroutine:

to make call to bidder adapter (each bid request)
call module hooks
I propose to limit the amount of currently working goroutines and switch to goroutines pools. If we run out of workers PBS will return HTTP 503 instead of taking more and more traffic.
On a high amount of goroutines golang scheduler spends a lot of CPU time to pick up the next goroutine.
I can do this in fork but it’s a significant change to PBS core and will cause many merge conflicts

The text was updated successfully, but these errors were encountered:

bretg · 2024-06-28T14:59:08Z

@linux019 - what library for goroutines pools are you proposing?

@zhongshixi has offered to provide a pointer to a potential solution.

@bsardo will coordinate a decision.

SyntaxNode · 2024-06-28T20:48:11Z

I had to put an RPS limit on the load balancer.

IMHO its good practice to use backpressure limiting layers in front of Prebid Server. This is the approach we use to avoid the situation described here.

I have no issue adding a goroutine limiting feature to PBS provided it doesn't add latency when unused due to either disabled (if we want to provide that option) or using a high limit value.

Slind14 · 2024-06-28T21:34:44Z

I think this is more about reducing the compute spent on mcall at normal usage.

Being able to deal better with traffic spikes would be a side effect.

There should be no need to add a library for this.

linux019 · 2024-07-02T05:22:58Z

@bretg we don't need third party library, many of them are over complicated. There is a good implementation https://github.com/panjf2000/ants it can be taken as example

zhongshixi · 2024-07-03T14:05:09Z

we use https://github.com/panjf2000/ants

it works very well in our system since it preallocate the resources for go routines you need. Some improvement we did

we have different ants pool in different parts of the system to make sure not all concurrent execution compete on the same pool.
you do not want to shoot your own foot by having strict limit on the number of go routines, you need to have a soft limit and have a capacity to allow it to grow otherwise your execution can be stuck waiting for a go routine to be available.

prebid-server-prioritization bot added this to Prebid Server Prioritization Jun 14, 2024

github-project-automation bot moved this to Triage in Prebid Server Prioritization Jun 14, 2024

bretg added the PBS-Go label Jun 28, 2024

bretg moved this from Triage to Research in Prebid Server Prioritization Jul 1, 2024

linux019 mentioned this issue Sep 10, 2024

Fix goroutine leak in hooks execution group #3911

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PBS spawns unlimited amount of goroutine #3754

PBS spawns unlimited amount of goroutine #3754

linux019 commented Jun 14, 2024 •

edited

Loading

bretg commented Jun 28, 2024 •

edited

Loading

SyntaxNode commented Jun 28, 2024

Slind14 commented Jun 28, 2024

linux019 commented Jul 2, 2024

zhongshixi commented Jul 3, 2024 •

edited

Loading

PBS spawns unlimited amount of goroutine #3754

PBS spawns unlimited amount of goroutine #3754

Comments

linux019 commented Jun 14, 2024 • edited Loading

bretg commented Jun 28, 2024 • edited Loading

SyntaxNode commented Jun 28, 2024

Slind14 commented Jun 28, 2024

linux019 commented Jul 2, 2024

zhongshixi commented Jul 3, 2024 • edited Loading

linux019 commented Jun 14, 2024 •

edited

Loading

bretg commented Jun 28, 2024 •

edited

Loading

zhongshixi commented Jul 3, 2024 •

edited

Loading