Discrete event simulator for inference platforms.
Go >= 1.21 is required.
git clone git@github.com:inference-sim/inference-sim.git
cd inference-sim
go build -o simulation_worker main.goCollect data from vllm-data-collection and post process results to get vllm aggregate metrics. Remember to use mode=test.
python offline_tokenizer.py --results_path <path to results_new/scenario in vllm-data-collection>Next, modify experiment_constants_test.py as required to specify test configs to BLIS. Finally, run BLIS sweeps.
python run_blis_sweep.py --mode testpython analyze_sim_results.pypip install git+https://github.com/inference-sim/inference-sim.gitspec = importlib.util.find_spec("run_blis_sweep")
SIMULATION_BASE_DIR = Path(spec.origin).parent
CONSTANTS_PATH = SIMULATION_BASE_DIR / "experiment_constants_inference.py"
RESULTS_PATH = SIMULATION_BASE_DIR / "results/sweep_params/simulator_inference_results.csv"os.environ["SIMULATION_BASE_DIR"] = str(SIMULATION_BASE_DIR)
runpy.run_module("run_blis_sweep", run_name="__main__")Modify experiment_constants_inference.py as required to specify inference configs to BLIS. Next, run the simulator through:
python run_blis_sweep.pygo run main.go run --help