From seven public databases to a cited answer in your browser — here is the full pipeline.
Large language models can fabricate plausible-sounding biology when asked open-ended questions. GeneE constrains the model in two ways: it only ever sees PubMed abstracts that PubTator has already linked to the gene in question, and it must attach a PMID to each claim. An automated validator then verifies that every cited PMID actually exists and that chromosome and disease claims match the structured data we already hold.
Summaries that fail this check are not displayed. Where an AI summary is unavailable, gene pages fall back to the original NCBI summary text or the structured data alone.
Full database list, license terms, and attribution are on the Data Sources & Attribution page.