A graphic with stylised fingerprints. Most fingerprints are blue, but one is yellow, one is pink, and one is green.
Credit:

Jack Cole/Wellcome

Licence: Attribution CC BY

Data and diversity in genomics: landscaping report

This landscaping report outlines the state of diversity in global genomics. It examines the representativeness of human genomic datasets, finding significant variation across countries. It identifies opportunities for funders to make a positive impact.

Credit:

Jack Cole/Wellcome

Licence: Attribution CC BY

Report at a glance 

Strategic programme:
What's inside:
An overview of the current state of human genomic datasets globally; definitions of three levels of ‘genomic maturity’ in different geographical regions; recommendations for enhancing data diversity globally.
Who this is for:
Researchers, funders, policymakers
Creative commons:

Summary 

Diversity is a significant challenge in global genomics. Genomics projects and datasets worldwide are heavily skewed towards populations with European ancestry. Information linked to socio-demographic status is usually missing. 

Improving diversity will benefit global health and scientific progress. This report identifies opportunities for funders to make a positive impact.

Wellcome has a history of funding genomics and genome-related research. This includes large-scale investments in the Human Genome Project and the Wellcome Sanger Institute, and supporting many research teams, resources and projects. This report maps what Wellcome has supported in the genomic data diversity space to date.  

The report goes on to detail the current state of human genomic datasets globally. It defines three genomic diversity archetypes for different geographical regions:

  • High maturity (e.g. United States, European Union): large database size, focusing on chronic disease etiology research and precision medicine
  • Medium maturity (e.g. Japan, Taiwan): medium database size, focusing on regional understanding of genetic factors for disease etiology
  • Low maturity (e.g. Brazil, Uganda): low database size, focusing on expanding databases to understand chronic disease etiology 

Key findings 

Methodology and limitations 

Conclusion 

Maturity of genomics research varies significantly across geographic regions. Each region takes different approaches to diversity and faces different challenges.

High maturity regions are focusing on expanding datasets into minority populations. Medium and low maturity regions are working on building representative databases. 

Improving data collection, analysis and utilisation are the keys to increasing diversity. This is not straightforward. There are political, economic, social, technological, legal and environmental challenges to be overcome. 

There are, however, opportunities for funders to make a significant impact. The report suggests potential solutions, ranked by priority level. 

Recommendations for funders 

Contact us