Keel Labs · The research team behind Keel

The scale

One of the largest, most complex systems in American life.

Benefits sit on top of trillions of dollars, thousands of carriers, and millions of choices. This is the ground every model we build has to stand on. For every person, in every language, every year.

MedicalDentalVisionLifeDisabilityHSA / FSAAccidentCritical illnessHospital indemnityPetLegal+ more

How Keel lowers the cost

Most of what people pay for benefits is not care. It is fees, admin, and avoidable mistakes. We take on all three.

Simplify exposure

Strip each plan down to what actually matters for this person, so no one pays for coverage they will never use.

Guide selection

Recommend the plan that fits a real life, and end the 24% premium most employees overspend by defaulting to the wrong one.

Optimize all year

Catch the overpayment, the unused HSA, the better option, every day. Not just in the two weeks of open enrollment.

Less complexity. Fewer wrong choices. A system that quietly takes cost out of healthcare, one person at a time.

Read the economics →

The ontology

You cannot reason about a network you have not mapped.

Every carrier, plan, rider, employer, broker, and member is a node. Every commission, rule, and relationship is an edge. We build a living ontology of the whole network, resolve it to one source of truth, and reconcile it in real time as the data drifts.

Entity resolution

One truth, not five.

The same plan shows up under five names across five systems. We resolve every record to one canonical entity.

Schema mapping

A thousand formats, one graph.

Every carrier speaks a different language. We normalize each feed into typed nodes and labeled edges.

Hierarchy

Answer at the right level.

Benefits nest: carrier, plan, tier, benefit, rider, rule. We model the whole hierarchy, so answers come from the exact level they live on.

Incentives

See where the cost hides.

Every node is paid differently, and misalignment is where money leaks. We encode the incentive structure into the graph itself.

The graph is never static. We continuously reconcile the drift between what each node believes and what is actually true, with full provenance on every edge.

Our first model

Fathom · v2.1

It reads any plan. It answers with the page.

Fathom is our benefits-native model. It reads any carrier's plan documents, reasons over the living graph, and answers the questions people actually ask. Is this covered? What will it cost me? Where do I go? Every answer points back to the exact line in the document, so people can trust it.

It is safe with private health data, and it always shows its work. Fathom is live inside Keel today, helping real employees enroll.

Read the model card → Try it in Keel →

Governance

A general-purpose model is not allowed to make these decisions. We build one that can prove every move it makes.

Benefits AI lives inside one of the most regulated environments in the country. HIPAA, ERISA, the ACA, and a fast-moving wave of state AI laws all govern how a model may touch a person's coverage. We do not bolt compliance on at the end. We build the agent so that every action it takes passes through governance first.

PHI safeEligibility rulesBias testedHuman reviewAudit logged

HIPAA

PHI never touches a general endpoint.

We run on BAA-covered, zero-retention infrastructure. A model that keeps prompts is a breach waiting to happen.

ERISA

The model advises. People decide.

Fiduciary duty means a black box cannot hold final authority. We keep the human in charge.

ACA §1557

Tested for bias, continuously.

The 2024 rule covers decision-support algorithms directly. We test inputs and outputs for disparate impact, always.

CMS · CA SB1120

A licensed human, by design.

Coverage decisions require individualized human review. Our agent routes to a person. It never denies care on its own.

NAIC AI Bulletin

A governance program, out of the box.

About 25 states expect documented AI governance with vendor oversight. We ship the controls and the exam-ready records.

NIST · ISO 42001 · SOC 2

The frameworks buyers diligence.

Govern, map, measure, manage, plus the security and healthcare certifications expected before anyone integrates.

We have seen what ungrounded automation does in this industry. Algorithms denying care at scale, with no individual review and no way to explain why. That is the opposite of what we build.

How it fits together

One stack. Research at the base, a face people trust on top.

It all runs on one spine: the living graph of benefits. The lab turns that graph into science. The model turns it into answers. Amanda turns it into a conversation a real person can have at 11pm on their phone.

Layer 01

Keel Labs

The research

We map the whole network into one living graph, publish the science behind it, and build the models no general-purpose system is allowed to.

Read the research →

Layer 02

Fathom

The model

Our benefits-native model. It reads any plan, reasons over the live graph, and proves every answer back to the exact line it came from.

Meet Fathom →

Layer 03

Amanda

The counselor

The voice people actually talk to. Fathom, given a face and a warm bedside manner, live inside Keel and answering real questions today.

Talk to Amanda →

Most labs stop at the model. We go one layer further, to the person, because being right only counts if someone actually understands the answer.

Research

Work on AI where being right actually matters.

In most of the field, a wrong answer is a bad demo. Here it is someone's healthcare, money, and family. These are the problems we work on, and would love your help with.

Grounding the answer

Finding the right answer inside messy, contradictory plan documents, and proving it is right. A wrong answer here is not a bad demo. It is a real problem for a real person.

→

Evals for benefits truth

There is no benchmark for whether benefits advice is correct, so we are building one. You cannot ship "mostly right" when it is someone's coverage.

→

Personalization with privacy

Understanding someone's real life without ever exposing their private data. Privacy is not a feature we add later. It is a constraint we design around from day one.

→

Agents that act, accountably

Software that enrolls, files, and follows up for a person, with a full record of everything it did and why it did it.

→

A paper every week. A field note every day.

Today's note: Arbitration was built for 17,000 disputes. It got 1.2 million.

Read today's note → All research →

We build the AI behind better benefits.