pip install -r requirements.txt
Download data from the mind2web project, make sure you have test_task
, test_website
, test_domain
, and train
under the data
directory; download scores_all_data.pkl
for HTML filtering at [link].
To run offline workflow induction with training examples:
python offline_induction.py \
--mode auto --domain Travel --subdomain Airlines --website aa \
--model "gpt-4o" --output_dir "workflow"
You can also switch to --mode input
to dynamically input your desired website(s).
The above command will produce a workflow file workflow/aa.txt
, to augment this workflow in agent memory and run inference on test examples from the aa website:
python run_mind2web.py --website "aa" --workflow_path "workflow/aa.txt"
To run online workflow induction and utilization:
python pipeline.py --setup online \
--benchmark "test_task" --website aa \
--results_dir results/aa/workflow \
--workflow_path workflow/aa.txt
Simply change to --benchmark 'train'
if you want to run online setting on the training (or other) queries, but remember to apply to workflow and run inference on test examples afterwards.
To run the entire pipeline for both online and offline settings, you can use:
python pipeline.py --setup "offline" # or "online"
with other arguments specified as above.