Description
Expected behavior
We have an application that is logging sequential progresson of a work package comprising 18819 items.
For example, we log the following at INFO level:
2022-02-17 07:47:24.257 INFO 1 — [-StreamThread-6] c.s.g.i.b.p.k.t.p.PackageDoneTransformer : {"RUN_ID": 1071282}: Status update (17202/18819) received [{"PACKAGE_ID": "1071282|Close|Close|IFA|20220701|300532312|", "RUN_ID": 1071282, "CREINS_ID": null, "TREATY": null, "REPUN_1": null, "REPUN_2": null, "CoverageId": null, "BusinessDate": null, "CB_TO_DATE": null, "PackageCount": 18819, "RecordCountTotal": 83, "RecordCountProcessed": 0, "RecordCountError": 0, "OutputCountAms": 0, "OutputCountStp": 0, "OutputCountTav": 0, "LastUpdated": null, "Errors": []}]
This was scraped from the pod log output in AKS.
Actual behavior
The above example never makes it to the Log Analytics Workspace.
We do see log statements from before and after.
Item 17201 is logged, but we don't see any items logged again until 17217.
What we've checked
The loss of log messages does not coincide with data cap resets on either the LAW or App insights agent.
Sampling is 100% on the App Insights instance.
We don't see any App Insights network/connection errors reported in the pod log.
I have previously seen that there can be extra latency with ingestion that causes the TimeGenerated to be later than expected, but I don't see any evidence of this.
Goal
What else can we check that might contribute to the missing logging?
Is this approach to data capture with the App Insights agent not recommended as we cannot guarantee 100% log data capture?
System information
Please provide the following information:
- SDK Version: 3.2.3
- OS type and version: AKS with Debian 11-based containers (Google Distroless)
- Using spring-boot? Yes