Skip to content

Modern biomedical research is built on a shared foundation: the human reference genome.

The current reference genome (GRCh38) underpins nearly all genomic analysis, including sequence alignment, variant calling, genome annotation, disease association studies, and the training of AI and machine-learning models in biology.

However, this reference is structurally biased.

Approximately 70% of the current human reference genome is derived from a single individual, known as RP11. That individual is male.

As a result, the reference genome contains:

  • One X chromosome

  • No representation of female X-chromosome biology

  • No direct modelling of XX-specific structure or regulation