Reservoir
Reasoning
Technologies
CLARISSA
An end-to-end reservoir simulation agent.
Clarissa writes simulator decks from scratch: it extracts the assumptions from your case, validates the deck, runs the simulation, and reads the results back to you as engineering insight — not raw output. Under the hood it speaks the simulators’ own languages and checks its own work at every step.
The loop, end to end
- Writes — drafts a complete simulator deck from a plain-language case description.
- Validates — surfaces every assumption and confirms the deck before it ever runs.
- Runs — executes the simulation in the simulator’s own language.
- Analyzes — reads results back as engineering insight, with the work traceable.
RIGOR
A first-of-its-kind, open-source benchmark for reservoir-engineering agents.
As agents arrive in reservoir engineering, the field needs a common, trustworthy way to measure them. RIGOR is that yardstick: a public, open-source benchmark that puts agent harnesses through hand-authored OPM Flow tasks — writing simulator decks from scratch, modifying and debugging them, and analyzing results.
Scored on
- Deck validity — is the generated deck well-formed and runnable?
- Semantic requirements — does it capture what the case actually asked for?
- Successful runs — does the simulation complete?
- Output match — do the results line up with the gold reference?
- Analysis correctness — is the engineering read-back right?
It’s built for integrity. Gold references stay verifier-only, and a “naked” baseline isolates the real lift from guidance, skills, and tools — so the numbers mean something. Open-sourcing soon, so the whole field can measure progress on the same footing.