Overview
A system that extracts structured nodes (claims, evidence, limitations, etc.) from papers and scores them, supporting literature review and comparison across a collection.
Reading dozens of papers is slow, partly because each paper structures its argument differently. Plato’s Cave uses modern language models to extract a consistent structure (e.g., claims, evidence, limitations) and then runs a scoring pipeline so papers can be compared more systematically.
What I built
- A batch pipeline that processes PDFs, extracts structured ‘nodes’ (e.g., claims/evidence), and stores results in machine-readable formats.
- Scoring and normalization routines so outputs are comparable across papers.
- Run outputs designed for auditing (logs, summaries, and artifacts).
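To make the "structured nodes in machine-readable formats" concrete, here is a minimal sketch of what such a node record and export step could look like. The schema, field names, and `export_nodes` helper are all assumptions for illustration, not the project's actual format or API:

```python
from dataclasses import dataclass, asdict, field
import json

# Hypothetical node schema -- field names are illustrative assumptions.
@dataclass
class Node:
    node_type: str          # e.g., "claim", "evidence", "limitation"
    text: str               # the extracted statement
    paper_id: str           # which paper it came from
    score: float = field(default=0.0)  # filled in by the scoring pass

def export_nodes(nodes, path):
    """Write extracted nodes as JSON, one machine-readable record per node."""
    with open(path, "w") as f:
        json.dump([asdict(n) for n in nodes], f, indent=2)

nodes = [
    Node("claim", "Method X outperforms baseline Y.", "paper-001"),
    Node("evidence", "Table 2 reports a 4-point gain.", "paper-001"),
]
export_nodes(nodes, "nodes.json")
```

Keeping each node flat and JSON-serializable like this is what makes downstream auditing and cross-paper comparison cheap: any tool that reads JSON can consume the run output.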
How it works
- Convert a paper to text.
- Use a language model to label and structure key statements.
- Score the resulting node graph for quality and internal consistency.
- Export summaries so humans can review quickly.
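The steps above can be sketched end to end. In this toy version a keyword heuristic stands in for the language-model labeling pass, and the consistency score is a simple claims-backed-by-evidence ratio; every function name and the scoring rule are assumptions, not the project's actual implementation:

```python
# Minimal sketch of the four pipeline steps. The labeler below is a trivial
# keyword heuristic standing in for the real language-model call.

def label_statements(text):
    """Stand-in for the LLM pass: tag sentences as claim or evidence."""
    nodes = []
    for sentence in text.split(". "):
        if "we show" in sentence.lower():
            nodes.append({"type": "claim", "text": sentence})
        elif "table" in sentence.lower() or "figure" in sentence.lower():
            nodes.append({"type": "evidence", "text": sentence})
    return nodes

def score_nodes(nodes):
    """Toy consistency score: fraction of claims backed by any evidence."""
    claims = sum(1 for n in nodes if n["type"] == "claim")
    evidence = sum(1 for n in nodes if n["type"] == "evidence")
    return min(evidence / claims, 1.0) if claims else 0.0

# A real run would start from PDF-extracted text; this is a stand-in string.
text = "We show that X improves Y. Table 2 reports the gains. Prior work differs."
nodes = label_statements(text)
score = score_nodes(nodes)
summary = {"nodes": len(nodes), "consistency": score}
```

The exported `summary` dict is the kind of compact artifact a human reviewer can scan quickly before deciding whether to read the full node list.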