Open access · CC-BY
via OpenAlex
A map of human genome variation from population-scale sequencing
Min Hu, Yuan Chen, James Stalker, Si Quang Le, Leopold Parts, Allison Coffey, Yujun Zhang, Jeffrey C. Barrett, Aarno Palotie, Matt E. Hurles, Harold P. Swerdlow, Carol Scott, John Burton, Chris Tyler-Smith, Sarah Lindsay
Nature · 2010 · ▲ 8,070 citations
Abstract
The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother–father–child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10−8 per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research. This issue of Nature contains the first publication from The 1000 Genomes Project, an international collaboration that will produce an extensive public catalogue of human genetic variation. The plan, in fact, is to sequence about 2,000 unidentified individuals from 20 populations around the world. This first paper presents the results from the project's pilot phase, testing three different strategies for genome-wide sequencing with high-throughput platforms: low-coverage whole-genome sequencing of 179 individuals in three population groups, high-coverage sequencing of two mother–father–child trios, and exon-targeted sequencing of 697 individuals from seven populations. The goal of the 1000 Genomes Project is to provide in-depth information on variation in human genome sequences. In the pilot phase reported here, different strategies for genome-wide sequencing, using high-throughput sequencing platforms, were developed and compared. The resulting data set includes more than 95% of the currently accessible variants found in any individual, and can be used to inform association and functional studies.
◌ CITATION ONLY
Full text is not openly licensed for redistribution here. Read it at the source:
Provenance
- Source
- OpenAlex
- DOI
- 10.1038/nature09534
- Canonical
- link ↗
- Fetched
- 2026-06-02 MST
Cite this
APA
Hu, M., Chen, Y., Stalker, J., Le, S.Q., Parts, L., Coffey, A., Zhang, Y., Barrett, J.C., Palotie, A., Hurles, M.E., Swerdlow, H.P., Scott, C., Burton, J., Tyler-Smith, C., Lindsay, S., Xue, Y., Turner, D., Walter, K., Conrad, D.F., & Durbin, R.M. (2010). A map of human genome variation from population-scale sequencing. <em>Nature</em>. https://doi.org/10.1038/nature09534
Vancouver
Hu M, Chen Y, Stalker J, Le SQ, Parts L, Coffey A, et al. A map of human genome variation from population-scale sequencing. Nature. 2010. doi:10.1038/nature09534.
BibTeX
@article{min2010Amapof,
title = {A map of human genome variation from population-scale sequencing},
author = {Min Hu and Yuan Chen and James Stalker and Si Quang Le and Leopold Parts and Allison Coffey and Yujun Zhang and Jeffrey C. Barrett and Aarno Palotie and Matt E. Hurles and Harold P. Swerdlow and Carol Scott and John Burton and Chris Tyler-Smith and Sarah Lindsay and Yali Xue and Daniel Turner and Klaudia Walter and Donald F. Conrad and Richard M. Durbin and Quan Long and Thomas M. Keane and Tom Skelly and Daniel G. MacArthur},
journal = {Nature},
year = {2010},
doi = {10.1038/nature09534},
}
Research neighborhood
References, citing works, and semantically nearest findings. Click a node to open it.
Related findings
biorxiv 2024
Preprint · CC-BY
PSAP-genomic-regions: a method leveraging population data to prioritize coding and non-coding variants in whole genome sequencing for rare disease diagnosis
Carcinogenesis 2019
Open access · CC-BY
Genome-wide DNA methylation profile of early-onset endometrial cancer: its correlation with genetic aberrations and comparison with late-onset endometrial cancer
JAMA Oncology 2017
Open access · OA
Association Between Telomere Length and Risk of Cancer and Non-Neoplastic Diseases
University of California, Los Angeles 2013
Open access · US-GOV
Effects of Heavy Adolescent Cannabis Use on Brain Morphology in Aging
PLoS ONE 2013
Open access · CC-BY
Telomere Length and Genetic Anticipation in Lynch Syndrome
Arthritis & Rheumatism 2007
Citation only