Improvement: Speed up public organization enumeration #49

AdnaneKhan · 2023-11-19T18:37:17Z

This PR addresses the core problems highlighted in #44

For large public organization enum, Gato will now only perform run log analysis if repositories pass a heuristic based on workflow analysis. Additionally, Gato will use the GraphQL API to download all workflow ymls and cache them before enumerating individual repositories.

Additionally, Gato will be more selective with the run logs it downloads to avoid downloading duplicate logs for the same workflow file and trigger. Furthermore, I've added a new heuristic that will determine whether a self-hosted runner is ephemeral. The heuristic works by looking for the clean repository step in the output for actions/checkout. Note, this is a heuristic, and it may be subject to false positives (rare, but possible in limited scenarios with caching), or false negatives (more likely).

With these changes, you can run Gato against a large organization like Microsoft within a reasonable time. This supports continuous testing and monitoring use cases as more organizations become aware of the dangers of self-hosted runner misconfigurations.

…ublic repos.

… enumeration.

AdnaneKhan added 25 commits July 21, 2023 23:00

Experimental deeper checks for SH runner.

3d43b49

fix non-ephemeral check logic

2d0cf21

remove extra print

ab58bd7

Make sure jobs key exists.

0f54e0b

Updates.

7f2a94f

update

f8b6dad

Fix unit tests to handle non-ephem checks.

c3617f3

Short circuit so we don't take forever. 30 run logs default.

9ae9ef3

Initial attempts to make gato fast for large orgs using graphql

75a1b1c

Add missing file

f6aabff

Fix minor error.

3323592

Add parameters to filter run log query.

ef58bd8

Fix a logic error, always dl run logs unless it is disabled for non-p…

a1c070f

…ublic repos.

Fix false positive case with larger runners.

cdd887d

Fix again.

3fd8da1

Merge branch 'dev' into update/checker_and_faster

4584482

fix runlog skip if small org or public repo.

8e4c304

Merge branch 'dev' into update/checker_and_faster

b9dab40

fix unit test.

3ccc6e5

Remove extra method

c98ebf9

Merge branch 'dev' into update/checker_and_faster

93d12e7

Handle non-200 from GraphQL API.

dcd0b51

Merge branch 'dev' into update/checker_and_faster

c59d888

Merge branch 'dev' into update/checker_and_faster

6deab78

Update wording now that we are tolerating some false positives during…

6f6a3be

… enumeration.

AdnaneKhan merged commit 3ca6e79 into dev Dec 8, 2023
21 checks passed

AdnaneKhan deleted the update/checker_and_faster branch December 19, 2023 02:56

mas0nd restored the update/checker_and_faster branch April 16, 2024 22:38

mas0nd deleted the update/checker_and_faster branch April 16, 2024 22:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvement: Speed up public organization enumeration #49

Improvement: Speed up public organization enumeration #49

AdnaneKhan commented Nov 19, 2023 •

edited

Loading

Improvement: Speed up public organization enumeration #49

Improvement: Speed up public organization enumeration #49

Conversation

AdnaneKhan commented Nov 19, 2023 • edited Loading

AdnaneKhan commented Nov 19, 2023 •

edited

Loading