EntheoGen-Dataset

About / For Researchers

Documentation directory

Research context for the read-only EntheoGen reviewer demo, the live consumer application, and the reviewed dataset snapshot.

Purpose

Research review layer

The EntheoGen Researcher UI is a structured research review workspace for collaborative dataset governance, evidence inspection, and publication preparation. It is the second of four stages in the broader EntheoGen workflow:

  1. Research extraction and collaborator input
  2. EntheoGen-Dataset review layer
  3. Approved handoff artifacts
  4. EntheoGen curation and publication workflow

The interface is intentionally separate from the live EntheoGen application. Reviewers inspect candidate rows, evidence quotes, source metadata, duplicate flags, safety notes, and publication decisions before approved artifacts move downstream.

Companion materials

Public links

What the interface does

Review workflow

Data review

Researchers inspect extracted question-answer rows, evidence quotes, confidence scores, generated candidate text, and source metadata.
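As a rough illustration of what one such row could look like in code, here is a minimal sketch. The class name, field names, and the 0-to-1 confidence range are assumptions made for this example; they are not the dataset's actual schema, and the values are placeholders.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CandidateRow:
    """Hypothetical shape of one extracted question-answer row."""
    row_id: str
    question: str
    candidate_answer: str      # generated candidate text
    evidence_quote: str        # quote extracted from the source
    confidence: float          # assumed to lie in [0, 1]
    source_title: str          # source metadata (assumed field)
    source_year: Optional[int] = None


def review_summary(row: CandidateRow) -> str:
    """Build a one-line summary a reviewer could scan in a list view."""
    year = row.source_year if row.source_year is not None else "n.d."
    return (
        f"[{row.row_id}] {row.question} | "
        f"confidence={row.confidence:.2f} | "
        f"source: {row.source_title} ({year})"
    )


# Placeholder values only; not taken from the dataset.
example = CandidateRow(
    row_id="row-001",
    question="Placeholder question?",
    candidate_answer="Placeholder answer.",
    evidence_quote="Placeholder quote from the source.",
    confidence=0.82,
    source_title="Placeholder source",
)
print(review_summary(example))
```

The record is frozen so that inspecting a row cannot change it, which mirrors the read-only nature of inspection.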

Safety review

Safety review makes potentially risky, unclear, or unsuitable rows visible before publication preparation.

Publication review

Publication review is the final decision layer before update artifacts are generated and the handoff is exported.

Static demo boundaries

Read-only public surface

The Cloudflare Pages deployment is a sanitized, read-only static snapshot. It includes display rows, summary states, duplicate flags, artifact references, and review navigation. It does not include save controls, mutation endpoints, repository write access, PDF/source document access, private files, credentials, or secrets.

Local maintainers can use the mutable review environment to change decisions, save notes, generate artifacts, refresh exports, and prepare approved handoff files. Those local workflows write only to designated review and output directories in this dataset repository.
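One way a local tool might enforce the "designated directories only" rule is a path check before every write. The sketch below is an assumption about how such a guard could work, not a description of the repository's actual code; the directory names `review` and `output` and the function name are hypothetical.

```python
from pathlib import Path

# Hypothetical names for the designated writable directories.
ALLOWED_DIRS = ("review", "output")


def is_allowed_write(repo_root: str, target: str) -> bool:
    """Return True only if target resolves to a file inside an allowed directory."""
    root = Path(repo_root).resolve()
    # Resolving collapses ".." segments, so escapes are detected.
    resolved = (root / target).resolve()
    return any((root / name) in resolved.parents for name in ALLOWED_DIRS)


repo = "/tmp/entheogen-dataset-example"  # placeholder path
print(is_allowed_write(repo, "review/decisions.json"))
print(is_allowed_write(repo, "output/../secrets.txt"))
print(is_allowed_write(repo, "../elsewhere/notes.txt"))
```

The check resolves the path first and only then compares it with the allowed directories, so a target such as `output/../secrets.txt` is rejected even though it begins with an allowed name.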

Governance

Review principles

Separate review from publication

Raw or unreviewed material should not flow directly into public-facing datasets.

Center the evidence

Reviewers compare evidence quotes, source metadata, proposed updates, and generated answers before trusting candidate rows.

Treat suggestions as advisory

Low-confidence warnings, placeholder alerts, and duplicate flags are reviewer aids, not automated decisions.
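The advisory principle can be sketched as a function that computes flags for a row but never touches the reviewer's decision. Everything named here is hypothetical: the threshold, the placeholder markers, the duplicate heuristic, and the field names are assumptions chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Set

LOW_CONFIDENCE_THRESHOLD = 0.5                     # assumed cutoff
PLACEHOLDER_MARKERS = ("TODO", "TBD", "PLACEHOLDER")  # assumed markers


@dataclass
class Row:
    row_id: str
    answer: str
    confidence: float
    decision: str = "pending"   # set only by a human reviewer


def advisory_flags(row: Row, seen_answers: Set[str]) -> List[str]:
    """Return reviewer aids for a row without modifying the row."""
    flags = []
    if row.confidence < LOW_CONFIDENCE_THRESHOLD:
        flags.append("low_confidence")
    if any(marker in row.answer.upper() for marker in PLACEHOLDER_MARKERS):
        flags.append("placeholder")
    # Naive duplicate heuristic: exact match after normalizing case and spaces.
    if row.answer.strip().lower() in seen_answers:
        flags.append("possible_duplicate")
    return flags


row = Row(row_id="row-002", answer="TBD", confidence=0.3)
flags = advisory_flags(row, seen_answers={"tbd"})
print(flags)          # all three flags fire for this placeholder row
print(row.decision)   # still "pending": flags do not decide anything
```

Returning the flags as data, rather than setting `decision`, keeps the final call with the reviewer.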

FAQ

Common questions

Is the online demo connected to the live EntheoGen dataset?

No. The Cloudflare Pages deployment is a read-only static snapshot using sanitized demonstration data.

Can reviewers edit data in the online demo?

No. Only the local mutable review UI supports saving decisions or generating updated artifacts.

Does approving a row publish it directly into EntheoGen?

No. Approval prepares a row for downstream handoff and curation workflows.

What counts as a held row?

A held row needs additional attention before publication preparation. Typical causes are an unresolved safety review, a deferred publication review, duplicate concerns, or incomplete reviewer notes.
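The hold conditions named in this answer can be expressed as a small predicate. This is a sketch under assumptions: the state names, field names, and the idea that these four conditions are checked together are hypothetical, and the real workflow may hold rows for other reasons as well.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ReviewState:
    """Hypothetical per-row review state."""
    safety_review: str = "unresolved"      # assumed values: "unresolved", "cleared"
    publication_review: str = "deferred"   # assumed values: "deferred", "approved", "rejected"
    duplicate_flag: bool = False
    reviewer_notes_complete: bool = False


def hold_reasons(state: ReviewState) -> List[str]:
    """List every reason this row would still be held."""
    reasons = []
    if state.safety_review == "unresolved":
        reasons.append("unresolved safety review")
    if state.publication_review == "deferred":
        reasons.append("deferred publication review")
    if state.duplicate_flag:
        reasons.append("duplicate concerns")
    if not state.reviewer_notes_complete:
        reasons.append("incomplete reviewer notes")
    return reasons


def is_held(state: ReviewState) -> bool:
    """A row is held if at least one hold reason applies."""
    return bool(hold_reasons(state))


fresh = ReviewState()
ready = ReviewState("cleared", "approved", False, True)
print(hold_reasons(fresh))
print(is_held(ready))
```

Reporting the reasons, not just a yes or no, tells the reviewer what still needs attention.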