@ambarve ambarve commented Aug 25, 2025

The unit test `TestGcsWaitProcessBridgeTerminated` and the functional test `TestHostProcess_whoami` keep failing intermittently in our GitHub CI runs, which blocks us from merging PRs.

This commit fixes the race condition in `TestGcsWaitProcessBridgeTerminated` by adding a small sleep. The `TestHostProcess_whoami` failure is probably related to the test environment rather than to the test itself. Fixing it will require further investigation, so that test is disabled for now.

Signed-off-by: Amit Barve <ambarve@microsoft.com>
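
The diff itself is not shown in this thread, so the following is only a generic sketch of the kind of ordering race a short sleep can mitigate. Every name in it (`fakeBridge`, `waitProcess`, `terminate`, `waitAfterSettle`, both error values) is hypothetical and not taken from hcsshim. The test wants to exercise the path where termination wakes a waiter that is already blocked. Without the pause, `terminate` can run before the goroutine has registered, and the wait takes the "already closed" path instead.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
	"time"
)

var (
	errTerminated    = errors.New("bridge terminated while waiting")
	errAlreadyClosed = errors.New("bridge already closed")
)

// fakeBridge is a hypothetical stand-in, not the real bridge type.
type fakeBridge struct {
	mu      sync.Mutex
	closed  bool
	waiters []chan error
}

// waitProcess blocks until the bridge is terminated. If the bridge is
// already closed when it is called, it returns errAlreadyClosed at once.
func (b *fakeBridge) waitProcess() error {
	b.mu.Lock()
	if b.closed {
		b.mu.Unlock()
		return errAlreadyClosed
	}
	ch := make(chan error, 1)
	b.waiters = append(b.waiters, ch)
	b.mu.Unlock()
	return <-ch
}

// terminate closes the bridge and wakes every registered waiter.
func (b *fakeBridge) terminate() {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.closed = true
	for _, ch := range b.waiters {
		ch <- errTerminated
	}
	b.waiters = nil
}

// waitAfterSettle starts a waiter, pauses briefly so the waiter can
// register, and only then terminates the bridge.
func waitAfterSettle() error {
	b := &fakeBridge{}
	result := make(chan error, 1)
	go func() { result <- b.waitProcess() }()

	// The small sleep: a heuristic that makes the intended ordering very
	// likely, but does not strictly guarantee it.
	time.Sleep(50 * time.Millisecond)

	b.terminate()
	return <-result
}

func main() {
	fmt.Println(waitAfterSettle())
}
```

A sleep keeps the change small, at the cost of relying on timing. An explicit "waiter registered" signal would make the ordering deterministic, but it usually means touching the code under test.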
@ambarve ambarve requested a review from a team as a code owner August 25, 2025 19:15
@ambarve ambarve merged commit cb6213a into microsoft:main Aug 25, 2025
17 checks passed
helsaawy added a commit that referenced this pull request Nov 14, 2025
* [ms/release/0.1]Backport annotation, vNUMA, and GPU device changes (#2493)

* Organize annotations; change annotation expansions. (#2449)

Break out `pkg\annotations\annotations.go` constants into sections for
easier searching and readability.

Deprecate `AnnotationExpansions` and instead provide
`AnnotationExpansionMap()`, which returns the same value but as a
new copy on every call, so callers cannot modify the shared `map`.
`AnnotationExpansions` itself cannot be deleted or have its type changed,
since it is technically public.

Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>
(cherry picked from commit ffcf48b)
Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>
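
As a rough illustration of the copy-on-read accessor described in this commit message, here is a minimal sketch. The names `expansions` and `ExpansionMap`, the sample keys, and the `map[string][]string` type are assumptions made for the example. The real declaration is not shown in this thread.

```go
package main

import "fmt"

// expansions is a hypothetical package-level table; the keys and values
// are made up for this example.
var expansions = map[string][]string{
	"example/parent": {"example/child-a", "example/child-b"},
}

// ExpansionMap returns a deep copy of the table, so callers can mutate
// the result without affecting the package-level data.
func ExpansionMap() map[string][]string {
	out := make(map[string][]string, len(expansions))
	for k, v := range expansions {
		// Copy each slice too; sharing the backing array would still
		// let callers change the original values.
		out[k] = append([]string(nil), v...)
	}
	return out
}

func main() {
	m := ExpansionMap()
	m["example/parent"][0] = "tampered"
	delete(m, "example/parent")

	// A fresh call is unaffected by the edits above.
	fmt.Println(ExpansionMap()["example/parent"]) // [example/child-a example/child-b]
}
```

If the values were not slices, a shallow copy such as `maps.Clone` would be enough.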

* Warn on incomplete vNUMA setting, clarify field names (#2466)

Warn if vNUMA is not completely specified in uVM creation options, as
this is likely a user error.

Rename `"uvm".Opts.MaxSizePerNode` to `MaxMemorySizePerNumaNode` and
clarify that it is measured in MiB. Similarly, rename
`"annotations".NumaMaximumSizePerNode` to
`NumaMaximumMemorySizePerNode`.

Format `prepareVNumaTopology` doc comment to display appropriately.

Related: switch to using `"logrus".IsLevelEnabled` rather than explicit
logging level comparison, and fix bug where `--debug` flag was not added
to runc if logging level is greater than `Debug` (i.e., `Trace`).

Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>
(cherry picked from commit 0842153)
Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>
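
The `--debug` bug mentioned above can be sketched without pulling in logrus. The `Level` type below is a local stand-in that mirrors logrus's ordering, in which more verbose levels have larger values, and `isLevelEnabled` uses the same "current level >= requested level" comparison. The helpers `runcArgsExact` and `runcArgs` are hypothetical. The original comparison is not shown in this thread, and an equality check is only one way such a bug can arise.

```go
package main

import "fmt"

// Level is a local stand-in that mirrors logrus's level ordering.
type Level uint32

const (
	PanicLevel Level = iota
	FatalLevel
	ErrorLevel
	WarnLevel
	InfoLevel
	DebugLevel
	TraceLevel
)

// isLevelEnabled reports whether messages at lvl are emitted when the
// logger is set to current.
func isLevelEnabled(current, lvl Level) bool {
	return current >= lvl
}

// runcArgsExact shows the buggy shape: an exact match misses Trace.
func runcArgsExact(current Level) []string {
	args := []string{"runc"}
	if current == DebugLevel {
		args = append(args, "--debug")
	}
	return args
}

// runcArgs adds --debug whenever debug output is enabled, which includes
// the more verbose Trace level.
func runcArgs(current Level) []string {
	args := []string{"runc"}
	if isLevelEnabled(current, DebugLevel) {
		args = append(args, "--debug")
	}
	return args
}

func main() {
	fmt.Println(runcArgsExact(TraceLevel)) // [runc]
	fmt.Println(runcArgs(TraceLevel))      // [runc --debug]
	fmt.Println(runcArgs(InfoLevel))       // [runc]
}
```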

* Fix CUDA for non-privileged containers (#2492)

CUDA initialization for GPUs fails for non-privileged containers.
Experimenting shows that adding `rw` for all character devices fixes the
error, so expand the
[default `c *:* m` permissions](https://github.com/opencontainers/runc/blob/6bae6cad4759a5b3537d550f43ea37d51c6b518a/libcontainer/specconv/spec_linux.go#L205-L222)
to `c *:* rwm`.

Add `"gpu"` string constant and streamline device assignment logic.

Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>
(cherry picked from commit 144c633)
Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>
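
To make the permission change concrete, the sketch below widens a wildcard character-device rule from `m` to `rwm`. The `deviceRule` struct and the `widenCharWildcard` helper are hypothetical stand-ins written for this example. They are not the runc or hcsshim types, and the actual change may be structured differently.

```go
package main

import "fmt"

// deviceRule is a local stand-in for a cgroup device rule.
// A negative Major or Minor means "any" and is printed as "*".
type deviceRule struct {
	Allow  bool
	Type   string // "c" for character devices, "b" for block devices
	Major  int64
	Minor  int64
	Access string // some combination of r (read), w (write), m (mknod)
}

func num(n int64) string {
	if n < 0 {
		return "*"
	}
	return fmt.Sprint(n)
}

// String renders the rule in the "c *:* rwm" notation used above.
func (r deviceRule) String() string {
	return fmt.Sprintf("%s %s:%s %s", r.Type, num(r.Major), num(r.Minor), r.Access)
}

// widenCharWildcard returns a copy of rules in which the allow rule for
// all character devices grants rwm instead of whatever it granted before.
func widenCharWildcard(rules []deviceRule) []deviceRule {
	out := make([]deviceRule, len(rules))
	copy(out, rules)
	for i := range out {
		r := &out[i]
		if r.Allow && r.Type == "c" && r.Major < 0 && r.Minor < 0 {
			r.Access = "rwm"
		}
	}
	return out
}

func main() {
	defaults := []deviceRule{
		{Allow: true, Type: "c", Major: -1, Minor: -1, Access: "m"},
		{Allow: true, Type: "b", Major: -1, Minor: -1, Access: "m"},
	}
	for _, r := range widenCharWildcard(defaults) {
		fmt.Println(r)
	}
	// Prints:
	// c *:* rwm
	// b *:* m
}
```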

---------

Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>

* [ms/release/0.1]Only `Reset` non-nil fields; Fix/Disable failing CI tests

Backport PR: #2558
Only `Reset` non-nil fields

A protobuf message's `Reset` assumes a non-nil receiver, so check each
field before resetting it to avoid a panic when the cgroup stats call
did not initialize it.

Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>
(cherry picked from commit d3ffbb6)
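
The nil guard can be sketched as follows. The types `cpuStats`, `memStats`, and `metrics`, and the helper `resetPopulated`, are hypothetical stand-ins for the generated messages and are not the actual hcsshim code. What they share with generated code is the shape of `Reset`, which assigns through the receiver and therefore panics when the receiver is nil.

```go
package main

import "fmt"

// cpuStats and memStats stand in for generated protobuf messages.
type cpuStats struct{ UsageNs uint64 }
type memStats struct{ UsageBytes uint64 }

// Reset assigns through the receiver, as generated Reset methods do.
// Calling it on a nil pointer dereferences nil and panics.
func (s *cpuStats) Reset() { *s = cpuStats{} }
func (s *memStats) Reset() { *s = memStats{} }

// metrics holds optional sub-messages; a field stays nil when the stats
// call did not populate it.
type metrics struct {
	CPU    *cpuStats
	Memory *memStats
}

// resetPopulated resets only the sub-messages that are present.
func resetPopulated(m *metrics) {
	if m.CPU != nil {
		m.CPU.Reset()
	}
	if m.Memory != nil {
		m.Memory.Reset()
	}
}

func main() {
	m := &metrics{CPU: &cpuStats{UsageNs: 42}} // Memory is left nil
	resetPopulated(m)
	fmt.Println(m.CPU.UsageNs, m.Memory == nil) // 0 true
}
```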

Backport PR #2506
Fix/Disable failing CI tests

The unit test `TestGcsWaitProcessBridgeTerminated` and the functional test
`TestHostProcess_whoami` keep failing intermittently in our GitHub CI runs, which
blocks us from merging PRs.

This commit fixes the race condition in `TestGcsWaitProcessBridgeTerminated` by
adding a small sleep. The `TestHostProcess_whoami` failure is probably related to the
test environment rather than to the test itself. Fixing it will require further
investigation, so that test is disabled for now.

Signed-off-by: Amit Barve <ambarve@microsoft.com>
(cherry picked from commit cb6213a)

Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>

---------

Signed-off-by: Hamza El-Saawy <hamzaelsaawy@microsoft.com>