CLARISSA

An end-to-end reservoir simulation agent.

Clarissa writes simulator decks from scratch: it extracts the assumptions from your case, validates the deck, runs the simulation, and reads the results back to you as engineering insight — not raw output. Under the hood it speaks the simulators’ own languages and checks its own work at every step.

The loop, end to end

Writes — drafts a complete simulator deck from a plain-language case description.
Validates — surfaces every assumption and confirms the deck before it ever runs.
Runs — executes the simulation in the simulator’s own language.
Analyzes — reads results back as engineering insight, with the work traceable.

RIGOR

A first-of-its-kind, open-source benchmark for reservoir-engineering agents.

As agents arrive in reservoir engineering, the field needs a common, trustworthy way to measure them. RIGOR is that yardstick: a public, open-source benchmark that puts agent harnesses through hand-authored OPM Flow tasks — writing simulator decks from scratch, modifying and debugging them, and analyzing results.

Scored on

Deck validity — is the generated deck well-formed and runnable?
Semantic requirements — does it capture what the case actually asked for?
Successful runs — does the simulation complete?
Output match — do the results line up with the gold reference?
Analysis correctness — is the engineering read-back right?

It’s built for integrity. Gold references stay verifier-only, and a “naked” baseline isolates the real lift from guidance, skills, and tools — so the numbers mean something. Open-sourcing soon, so the whole field can measure progress on the same footing.