Introducing monitoring #1960

BentiGorlich · 2026-01-20T17:20:09Z

- Add monitoring capabilities for troubleshooting purposes. Monitoring cannot be enabled or disabled from the UI; you have to change the `.env` files for this.
- Add monitoring for the biggest factors: curl requests (AP), twig rendering (frontend) and, of course, database querying.
- Two sources can start an execution context: an incoming request and a started message.
- Add an admin UI for inspecting requests and their response times, with an overview grouped by route name.

The main goal of this PR is to add monitoring capabilities, so we can see what our messengers are doing in the background. Additionally I think the graph in the overview (even though it is from a dev instance) shows pretty well that we have to tackle our twig rendering. My guess is that that is a big problem on larger instances, like fedia.io.

Some screenshots:

BentiGorlich · 2026-01-23T10:54:43Z

Just a warning: after about 2 days, the gathered stats take up about 16 GB on my relatively small server, which mainly gets a lot of AP requests but not that many user requests.

| table_name | table_size | indexes_size | total_size |
| --- | --- | --- | --- |
| "public"."monitoring_query" | 14 GB | 599 MB | 14 GB |
| "public"."monitoring_twig_render" | 1338 MB | 294 MB | 1633 MB |
| "public"."monitoring_execution_context" | 345 MB | 52 MB | 396 MB |
| "public"."monitoring_curl_request" | 14 MB | 4792 kB | 18 MB |

blued-gear · 2026-01-23T11:39:38Z

Maybe the stats should then get their own DB connection configuration. This way admins can store the data somewhere where filling up the disk is not much of a problem, or even choose a DB that supports compression.

melroy89 · 2026-01-23T15:58:45Z

This is really exciting!

But yeah, you definitely want this in a separate DB, or maybe even in a time-series-optimized DB, especially given the sizes.

BentiGorlich · 2026-01-23T16:02:01Z

I think I can bring down the size a lot if I separate the query string out to another table and only save a reference to that in the "instances" of the query. Also I want to make some things optional. At the moment it does record the parameters of each query as well, which might be overkill or just unnecessary, so I want it to be optional

BentiGorlich · 2026-01-23T16:02:41Z

And it is quite a lot of data, so you might not want to monitor it for that long anyway. Otherwise you just don't get through it :D

melroy89 · 2026-01-25T00:28:21Z

> And it is quite a lot of data, so you might not want to monitor it for that long anyway. Otherwise you just don't get through it :D

We need monitoring of the monitoring, that is monitoring. 😅

BentiGorlich · 2026-01-28T13:57:00Z

Ok, so I pushed a few changes:

- To save space in the DB:
  - make parameter storing optional via an env var
  - similar queries/statements no longer take up space for each call; they are saved once to a separate table and referenced by hash
- Make the overview chart also display aggregated stats based on the current filter
- Add a dropdown to switch between the total time and the mean time in the overview chart
- Add a gradient to the twig renders, based either on the percentage of the total duration or on the parent duration

I have it live on gehirneimer, without saving the parameters to the DB. Let's see how much space it takes up in 2 days :)
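
For readers who want a feel for the space-saving idea (store each distinct statement once, reference it by hash from every recorded call), here is a minimal sketch in Python. It is not the PR's code: the class `QueryStore`, the field names, and the choice of SHA-256 are all assumptions made for illustration, and two dictionaries stand in for the two database tables.

```python
import hashlib


class QueryStore:
    """Hypothetical in-memory stand-in for the two tables."""

    def __init__(self) -> None:
        # hash -> SQL text; each distinct statement is stored only once
        self.query_strings: dict = {}
        # one small row per executed query, pointing at the text by hash
        self.query_instances: list = []

    def record(self, sql: str, duration_ms: float) -> str:
        # SHA-256 is an assumption; the PR does not say which hash is used
        digest = hashlib.sha256(sql.encode("utf-8")).hexdigest()
        self.query_strings.setdefault(digest, sql)
        self.query_instances.append(
            {"query_hash": digest, "duration_ms": duration_ms}
        )
        return digest


store = QueryStore()
for duration in (1.2, 0.9, 1.1):
    store.record("SELECT * FROM entry WHERE id = ?", duration)
store.record("SELECT * FROM magazine WHERE name = ?", 2.5)
```

With four recorded calls, only two statement texts are kept; every additional call of a known statement costs one short row instead of the full SQL string.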

- Add filtering form and DTO to the monitoring overview
- Fix `Could not convert PHP type 'array' to 'json', as an 'Malformed UTF-8 characters, possibly incorrectly encoded' error was triggered by the serialization` and add a test for it
- Move chart data generation to the controller

BentiGorlich self-assigned this Jan 20, 2026

BentiGorlich added the backend and performance labels Jan 20, 2026

BentiGorlich requested review from blued-gear and melroy89 and removed request for blued-gear January 20, 2026 17:20

BentiGorlich force-pushed the new/monitoring branch from c5908b0 to 1564aa8 Compare January 20, 2026 17:22

BentiGorlich marked this pull request as draft January 20, 2026 17:38

BentiGorlich force-pushed the new/monitoring branch 3 times, most recently from 499ca01 to 8b4a4d4 Compare January 20, 2026 17:58

BentiGorlich force-pushed the new/monitoring branch from 1382885 to 7f1c063 Compare January 28, 2026 13:50

BentiGorlich added 10 commits January 30, 2026 13:25

Move to First-In-Last-Out for the twig renders

7085f4d

Because the twig render is like a flame graph, there can be nested templates of the same type, which the previous code did not support. Move to a stack-like first-in-last-out model.
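
To illustrate why a stack handles nested renders of the same template, here is a small Python sketch of the first-in-last-out idea. It is only an illustration under assumptions, not the code from this commit: `RenderNode`, `RenderStack`, the injectable `clock`, and the template names are all hypothetical.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class RenderNode:
    template: str
    started_at: float
    duration: Optional[float] = None
    children: List["RenderNode"] = field(default_factory=list)


class RenderStack:
    """Tracks nested renders; the most recently started render ends first."""

    def __init__(self, clock: Callable[[], float] = time.perf_counter) -> None:
        self._clock = clock
        self._stack: List[RenderNode] = []
        self.roots: List[RenderNode] = []

    def start(self, template: str) -> None:
        node = RenderNode(template, self._clock())
        if self._stack:
            # nested render: attach to whatever is currently rendering
            self._stack[-1].children.append(node)
        else:
            self.roots.append(node)
        self._stack.append(node)

    def end(self) -> RenderNode:
        # pop the innermost open render, regardless of its template name
        node = self._stack.pop()
        node.duration = self._clock() - node.started_at
        return node


# Fake clock that advances by 1 on every call, to keep the example deterministic.
ticks = iter(range(100))
stack = RenderStack(clock=lambda: float(next(ticks)))
stack.start("page.html.twig")      # hypothetical template names
stack.start("comment.html.twig")
stack.start("comment.html.twig")   # same template nested inside itself
stack.end()
stack.end()
stack.end()
```

Because `end()` never looks up a render by template name, two open renders of `comment.html.twig` cannot be confused with each other, which is the failure mode a name-keyed lookup would have.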

Remove control characters from the query parameters

bdd8833
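
One way such a cleanup could look, sketched in Python. This is an assumption about the general technique, not a description of what the commit does: the function name `strip_control_chars` is hypothetical, and so is the decision to keep tab, newline and carriage return while dropping the other ASCII control characters.

```python
import re

# ASCII control characters except \t, \n and \r, plus DEL (an assumed policy)
_CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\x7f]")


def strip_control_chars(value):
    """Recursively remove control characters from strings in nested parameters."""
    if isinstance(value, str):
        return _CONTROL_CHARS.sub("", value)
    if isinstance(value, dict):
        return {key: strip_control_chars(item) for key, item in value.items()}
    if isinstance(value, (list, tuple)):
        return [strip_control_chars(item) for item in value]
    return value  # numbers, booleans, None are left untouched


cleaned = strip_control_chars({"name": "ab\x00c", "ids": [1, "x\x1fy"]})
```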

Swap the property in which the message class is back

532f085

Reason: we want to group performance by route name and message class. Before this we grouped performance by route name and transport, which does not make that much sense
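
A rough Python sketch of this kind of grouping, including the total and mean time that the overview chart can switch between. It reflects one reading of the commit message (requests are keyed by route name, messages by message class); the field names, the route name `route_a` and the class `App\Message\ExampleMessage` are made up for the example.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def aggregate(contexts: Iterable[dict]) -> Dict[Tuple[str, str], dict]:
    """Group execution contexts and compute count, total and mean duration."""
    buckets = defaultdict(lambda: {"count": 0, "total_ms": 0.0})
    for ctx in contexts:
        if ctx["kind"] == "request":
            key = ("request", ctx["route_name"])
        else:
            # messages are grouped by their class, not by their transport
            key = ("message", ctx["message_class"])
        bucket = buckets[key]
        bucket["count"] += 1
        bucket["total_ms"] += ctx["duration_ms"]
    return {
        key: {**bucket, "mean_ms": bucket["total_ms"] / bucket["count"]}
        for key, bucket in buckets.items()
    }


stats = aggregate([
    {"kind": "request", "route_name": "route_a", "duration_ms": 10.0},
    {"kind": "request", "route_name": "route_a", "duration_ms": 30.0},
    {"kind": "message", "message_class": "App\\Message\\ExampleMessage",
     "duration_ms": 5.0},
])
```

Grouping by transport would lump unrelated message types together, so the mean per group would say little about any single handler; grouping by class keeps each bucket comparable.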

End execution context on kernel termination

5b14a3f

This will end the execution context **after** the response has been sent to keep the monitoring overhead to a minimum from a user's perspective, as quite a few entities will be created on this event
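
The ordering this commit relies on can be shown with a toy Python example: the response goes out first, and the expensive bookkeeping runs in a terminate hook afterwards. `MiniKernel` and its methods are hypothetical stand-ins and do not mirror the Symfony API.

```python
from typing import Callable, List


class MiniKernel:
    """Hypothetical stand-in for a web kernel with a terminate hook."""

    def __init__(self) -> None:
        self._terminate_listeners: List[Callable[[], None]] = []

    def on_terminate(self, listener: Callable[[], None]) -> None:
        self._terminate_listeners.append(listener)

    def handle(self, send_response: Callable[[str], None]) -> None:
        # The user has their answer as soon as this call returns.
        send_response("200 OK")
        # Anything registered for termination runs only afterwards.
        for listener in self._terminate_listeners:
            listener()


events: List[str] = []
kernel = MiniKernel()
kernel.on_terminate(lambda: events.append("execution context persisted"))
kernel.handle(lambda body: events.append(f"response sent: {body}"))
```

The cost of persisting the monitoring data is still paid by the worker process, but it no longer adds to the response time the user experiences.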

Add response sending time to requests

9f24959

Add response sending time to requests

2eea5fe

Fix exception due to routing failing

7a8aa44

- `router->matchRequest` throws an exception when it cannot match the route -> catch that
- actually pass the whole request to the router, otherwise it could not match headers, which makes basically all AP requests fail

BentiGorlich force-pushed the new/monitoring branch from 445b767 to 7a8aa44 Compare January 30, 2026 12:25
