bashkit
Benches

Latest benchmark snapshot

Static aggregate generated from repository result artifacts. Use the linked files for raw measurements and full eval traces.

Latest reports

Open Markdown reports

Runtime snapshot

Latest benchmark categories

Browse benchmark runs
CategoryCasesLast run
startupSmall commands where interpreter startup dominates runtime.40.053 msbash median: 1.662 ms
stringsString expansion, pattern handling, and text manipulation.80.057 msbash median: 1.791 ms
variablesVariable assignment, lookup, expansion, and environment handling.80.058 msbash median: 1.688 ms
arraysIndexed array reads, writes, expansion, and iteration.60.059 msbash median: 1.713 ms
subshellCommand substitution and nested shell execution paths.60.061 msbash median: 3.143 ms
arithmeticInteger math, substitutions, and expression-heavy shell snippets.60.062 msbash median: 1.703 ms
pipesPipeline construction, streaming, and command chaining.60.065 msbash median: 3.131 ms
controlConditionals, loops, case statements, and branching scripts.90.076 msbash median: 1.711 ms
Eval pressure

Lowest eval categories

Browse eval runs
Latest LLM eval93%54/58 tasks
CategoryPassedPass rate
system_info1/250%tasks passed
file_operations3/466.7%tasks passed
scripting5/768.6%tasks passed
json_processing8/8100%tasks passed
data_transformation6/6100%tasks passed
complex_tasks6/6100%tasks passed
text_processing6/6100%tasks passed
pipelines5/5100%tasks passed