Installation
Getting started with Terminal-Bench.
Terminal-Bench provides a CLI for running the benchmark, creating custom tasks, and running other popular benchmarks we've adapted to our framework.
Install dependencies
Terminal-Bench requires git and Docker to be installed.
Install the CLI
You can then run the CLI using terminal-bench or tb for convenience.
Run tb --help to see the available commands and options.