Skip to content

Conversation

@michaelhixson
Copy link
Contributor

This is happening for me locally with mofuw. Its CMD line fails:

/bin/sh: 1: ./start-servers.sh: not found

and then the toolset waits around for a long time for it to be running.
With the change, it detects that it has failed quickly.

This is happening for me locally with mofuw.  Its CMD line fails:

  /bin/sh: 1: ./start-servers.sh: not found

and then the toolset waits around for a long time for it to be running.
With the change, it detects that it has failed quickly.
@michaelhixson
Copy link
Contributor Author

By the way, the build failure with mofuw was a local-only problem related to line endings. Nothing that needs to be fixed in the repo.

@NateBrady23
Copy link
Member

There's a potential issue here that I found last week which was causing all of those start/stop errors on citrine. We are sometimes losing a race where we tell the docker daemon to start a container in detached mode and immediately check its availability.

@msmith-techempower msmith-techempower merged commit 013ad2b into TechEmpower:master Apr 23, 2018
@michaelhixson
Copy link
Contributor Author

@msmith-techempower Did you see Nate's comment? Do you remember what that original fix for that race was?

@msmith-techempower
Copy link
Member

@michaelhixson Yeah, it was #3574

Basically, we found rare instances where the call to start the container daemonized would return control to the toolset and we would query the docker sdk for the running container before it was actually running. This check is unnecessary now and it was removed.

@michaelhixson
Copy link
Contributor Author

Ok, so just to be clear, I'm not reintroducing the race that Nate fixed. That's because our original check (which Nate removed) used client.containers.list(), which only returns running containers. My new check uses client.containers.get(id_or_name), which will return the container even if it isn't running yet.

@msmith-techempower
Copy link
Member

Ah yes, this does appear to reintroduce a similar mechanism, but with your safeguard.

NateBrady23 added a commit to NateBrady23/FrameworkBenchmarks that referenced this pull request Apr 24, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants