gclear

a league of legends jungle bot that uses yolo for object detection and grok for decision-making.

sub 3 min clear

original whiteboarding

overview

gclear automates jungle clearing by combining two systems: a yolo model trained to detect camps and player position, and grok as a vision-language model that interprets game state and outputs actions.

the bot captures frames, runs detection, and passes the annotated frame to grok. grok then determines state transitions (attacking, walking, buying), target selection, and pathing. this replaces hardcoded logic with a model that can reason about what it sees.

system architecture

%%{init: {'theme': 'dark', 'themeVariables': { 'primaryColor': '#1e1e2e', 'primaryTextColor': '#cdd6f4', 'primaryBorderColor': '#45475a', 'lineColor': '#6c7086', 'secondaryColor': '#313244', 'tertiaryColor': '#45475a'}}}%%
flowchart LR
    %% styles
    classDef input fill:#45475a,stroke:#585b70,color:#cdd6f4
    classDef yolo fill:#363a4f,stroke:#494d64,color:#b8c0e0
    classDef grok fill:#5b3d3d,stroke:#6e4c4c,color:#f0b0b0
    classDef state fill:#4a4336,stroke:#5c5344,color:#e0cfa8
    classDef cua fill:#3d4a3d,stroke:#4c5c4c,color:#a8d4a8
    classDef macro fill:#3d4455,stroke:#4c5466,color:#a8b8d4

    subgraph input_layer["input"]
        direction TB
        screen["screen capture"]
        minimap["minimap region"]
        health_ocr["health bar ocr"]
        game_timer["game timer"]
    end

    subgraph yolo_layer["yolo detection"]
        direction TB
        yolo_inf["yolov11 inference"]
        camps["camps"]
        player_pos["player position"]
        mm_icon["minimap icon"]
        zone_det["zone detection"]
    end

    subgraph grok_layer["grok strategist"]
        direction TB
        ctx_objects["detected objects"]
        ctx_positions["positions + zones"]
        ctx_timer["game timer"]
        ctx_state["current state"]
        context["context builder"]
        grok_api["grok vision api"]
        decision["decision output"]
    end

    subgraph state_layer["state machine"]
        direction TB
        s_levelup["level up"]
        s_buy["buy items"]
        s_walk["walk to camp"]
        s_wait["wait for spawn"]
        s_engage["engage"]
        s_attack["attack"]
        s_clear["cleared"]
    end

    subgraph cua_layer["cua triggers"]
        direction TB
        c_walk["walk"]
        c_attack["attack"]
        c_buffer["buffer"]
        c_kite["kite"]
        c_ability["ability"]
    end

    subgraph macro_layer["macros"]
        direction TB
        m_spam["spam"]
        m_combo["combo"]
        m_smite["smite"]
        m_level["level up"]
    end

    %% main flow
    input_layer --> yolo_layer --> grok_layer --> state_layer --> cua_layer --> macro_layer

    %% internal connections
    screen --> yolo_inf
    minimap --> yolo_inf
    yolo_inf --> camps & player_pos & mm_icon
    mm_icon --> zone_det

    camps --> ctx_objects
    player_pos --> ctx_positions
    zone_det --> ctx_positions
    health_ocr --> ctx_timer
    game_timer --> ctx_timer
    ctx_objects & ctx_positions & ctx_timer & ctx_state --> context
    context --> grok_api --> decision

    s_levelup --> s_buy --> s_walk --> s_wait --> s_engage --> s_attack
    s_attack -->|"dead?"| s_clear
    s_clear -->|"next"| s_walk

    %% apply styles
    class screen,minimap,health_ocr,game_timer input
    class yolo_inf,camps,player_pos,mm_icon,zone_det yolo
    class context,ctx_objects,ctx_positions,ctx_timer,ctx_state,grok_api,decision grok
    class s_levelup,s_buy,s_walk,s_wait,s_engage,s_attack,s_clear state
    class c_walk,c_attack,c_buffer,c_kite,c_ability cua
    class m_spam,m_combo,m_smite,m_level macro

↑ frame + bounding boxes + confidence scores

color	layer
gray	input - raw game data
slate	detection - yolo inference
red	strategist - grok decisions
amber	state - game loop states
green	triggers - cua actions
steel	macros - automated inputs

how it works

detection layer

yolo runs inference on game frames to identify:

class	detection
`player`	champion position on screen
`mm_player`	minimap icon position
`blue`	blue sentinel
`red`	red brambleback
`gromp`	gromp
`wolves`	wolf camp
`raptors`	raptor camp
`krugs`	krug camp

trained on 83 hand-labeled frames from a hecarim clear, augmented to ~247 images.

model training

trained yolov11 on roboflow with the following results:

metrics
training data	performance

grok as the brain

grok receives:

current game frame
detected objects and positions
zone probabilities from minimap analysis
game timer
current state context

grok decides:

what state to transition to (attacking, walking, buying, kiting)
which target to focus
when a camp is dead and it's time to move
ability usage and timing
pathing decisions

the bot streams grok's chain-of-thought to an overlay so you can watch it reason through decisions in real-time.

state machine

stateDiagram-v2
    [*] --> startup_levelup
    startup_levelup --> startup_buy
    startup_buy --> walking_to_camp
    walking_to_camp --> waiting_for_spawn
    waiting_for_spawn --> engaging
    engaging --> attacking
    attacking --> camp_cleared : grok confirms dead
    camp_cleared --> walking_to_camp : next camp
    camp_cleared --> [*] : clear complete

default clear: blue → gromp → wolves → raptors → red → krugs

action triggers

the cua layer translates grok decisions into game inputs:

trigger	action
`walk`	click relative position to camp
`attack`	right click on detected target
`buffer`	queue W ability before arriving at camp
`kite`	timed movement between auto attacks
`ability`	q/w/e/r on cooldown or condition
`smite`	f key when camp hp threshold met

overlay system

two persistent overlays:

bot status: current state, recent actions
grok thoughts: streaming reasoning as decisions happen

project structure

gclear/
├── src/gclear/
│   ├── bot.py           # state machine + game loop
│   ├── strategist.py    # grok integration
│   ├── overlay.py       # pyqt6 overlays
│   ├── som.py           # minimap positions
│   └── data_collection/ # labeling tools
├── tests/
├── media/
├── logs/
└── docs/

running

uv sync
export XAI_API_KEY="your_key"
uv run python src/gclear/bot.py

requires python 3.12+, macos, tesseract ocr.

ideas / next steps

better training data

the current dataset is 83 hand-labeled frames from a single hecarim clear. this works for basic camp detection but doesn't generalize well. need frames from different champions, different skins, different game times (lighting changes throughout the match), and edge cases like camps mid-death animation or partially obscured by ability effects.

minimap state inference

right now the bot only looks at where the player icon is on the minimap. but there's way more information there - enemy positions, or more importantly, the absence of enemy positions. if mid laner was visible bot side 10 seconds ago and now they're gone, where are they? recalled? roaming through jungle? setting up a gank?

chess engines solve this with evaluation functions - hardcoded weights for piece positions, king safety, pawn structure. but league is too dynamic for static heuristics. champion matchups, power spikes, item breakpoints, objective timers all shift what's "good" moment to moment. the idea is to give grok a rolling window of minimap states and let it reason about probable enemy positions and threat levels.

pre-game planning

jungle pathing isn't solved in isolation. it depends on team comps, lane matchups, and win conditions. a team with strong early lanes wants the jungler to path toward them for dives. a scaling comp wants safe farming. grok could analyze the loading screen, identify the matchups, and generate a clear path before the game even starts. adjust for expected invade routes based on enemy jungler identity.

invade detection and adaptation

current bot follows a fixed clear order. if the enemy jungler shows up at your red buff, it doesn't know what to do. grok needs to recognize when the plan is compromised and generate a new one - vertical jungle, skip the contested camp, or fight for it depending on 1v1 matchup and nearby lane priority.

multi-champion support

hecarim has a specific clear pattern - he can full clear quickly and wants to spam q on cooldown. other junglers play completely differently. karthus wants to kite camps and stay at range. lee sin wants to animation cancel and manage energy. the detection model needs to generalize across champions, and grok needs to understand different kits and their optimal clear patterns.

learning from vods

pro junglers make thousands of micro-decisions per game that aren't documented anywhere. the idea is to feed grok vods with timestamps - "at 3:42, showmaker's jungler paths top side after seeing bot lane push in" - and have it extract the reasoning. build a corpus of decision patterns that inform real-time play.

benchmarking

need concrete metrics. clear time compared to human players, gold efficiency, health remaining after full clear. right now there's no baseline to measure improvement against.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.github/workflows		.github/workflows
_layouts		_layouts
docs		docs
logs		logs
media		media
src/gclear		src/gclear
tests		tests
.editorconfig		.editorconfig
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
Gemfile		Gemfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
_config.yml		_config.yml
index.md		index.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gclear

overview

system architecture

how it works

detection layer

model training

grok as the brain

state machine

action triggers

overlay system

project structure

running

ideas / next steps

better training data

minimap state inference

pre-game planning

invade detection and adaptation

multi-champion support

learning from vods

benchmarking

About

Uh oh!

Releases

Packages

Languages

License

qtzx06/gclear

Folders and files

Latest commit

History

Repository files navigation

gclear

overview

system architecture

how it works

detection layer

model training

grok as the brain

state machine

action triggers

overlay system

project structure

running

ideas / next steps

better training data

minimap state inference

pre-game planning

invade detection and adaptation

multi-champion support

learning from vods

benchmarking

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages