← Back to search

About GeneE

An AI-native gene search platform with cited answers from peer-reviewed research.

What is GeneE?

GeneE combines authoritative gene data from public databases with AI-generated plain-language summaries, semantic search, and a modern interface. The goal is to make gene information accessible to everyone — from students encountering genetics for the first time to researchers looking for a quick, cited overview.

The platform currently covers approximately 20,000 human protein-coding genes. Each gene page aggregates identifiers, function annotations, disease associations, drug targets (with clinical phase and FDA status), tissue expression, related genes, pathogenic ClinVar variants, gnomAD constraint metrics, an embedded interactive 3D protein structure (experimental PDB when available, AlphaFold prediction otherwise), pre-computed functional and clinical badges, and an AI-written summary backed by PubMed citations.

How it works

GeneE pulls structured data from seven public databases, indexes it in PostgreSQL with pgvector and Typesense, and synthesizes plain-language summaries with Claude or Gemini — every claim backed by a PubMed citation that is verified before publishing.

See the full architecture →

Trust & accuracy

Every AI-generated summary undergoes automated validation before it is published. The validation pipeline checks that all cited PMIDs exist, cross-references disease and chromosome claims against structured data, and flags any unsupported statements. Summaries that do not pass validation are not displayed — users see the structured data instead.

Source relevance scores are computed from a weighted blend of semantic similarity, journal impact, publication recency, and whether the AI cited the source. Preprints are clearly flagged so users can weigh evidence appropriately.

Explore by dimension

Beyond searching for a specific gene, you can browse the full catalogue along 12 ranking dimensions — most researched, FDA-approved drug targets, most pathogenic variants, most constrained, and more — on the Gene Rankings page. Every sort-and-filter combination is a shareable URL.

A clickable chromosome ideogram on the home page and above the rankings filters lets you narrow the catalogue to a single chromosome in one click.

Limitations

Important
GeneE is an informational tool, not a diagnostic or clinical resource. AI-generated summaries should always be verified against primary sources. The current release covers human protein-coding genes only. Data freshness depends on source database update cycles and the weekly pipeline schedule.

Data sources

GeneE aggregates data from seven public databases. See the Data Sources & Attribution page for full details and license information.