This repository contains benchmark evaluation infrastructure for Strix. It provides standardized evaluation pipelines for testing Strix capabilities across various security tasks.

| Benchmark | Description | Challenges |
|---|---|---|
| XBEN | XBOW web security CTF challenges | 104 |

> **Note:** We are actively adding more benchmarks to our evaluation suite.