Literature DB >> 19997067

Building the sequence map of the human pan-genome.

Ruiqiang Li1, Yingrui Li, Hancheng Zheng, Ruibang Luo, Hongmei Zhu, Qibin Li, Wubin Qian, Yuanyuan Ren, Geng Tian, Jinxiang Li, Guangyu Zhou, Xuan Zhu, Honglong Wu, Junjie Qin, Xin Jin, Dongfang Li, Hongzhi Cao, Xueda Hu, Hélène Blanche, Howard Cann, Xiuqing Zhang, Songgang Li, Lars Bolund, Karsten Kristiansen, Huanming Yang, Jun Wang, Jian Wang.   

Abstract

Here we integrate the de novo assembly of an Asian and an African genome with the NCBI reference human genome, as a step toward constructing the human pan-genome. We identified approximately 5 Mb of novel sequences not present in the reference genome in each of these assemblies. Most novel sequences are individual or population specific, as revealed by their comparison to all available human DNA sequence and by PCR validation using the human genome diversity cell line panel. We found novel sequences present in patterns consistent with known human migration paths. Cross-species conservation analysis of predicted genes indicated that the novel sequences contain potentially functional coding regions. We estimate that a complete human pan-genome would contain approximately 19-40 Mb of novel sequence not present in the extant reference genome. The extensive amount of novel sequence contributing to the genetic variation of the pan-genome indicates the importance of using complete genome sequencing and de novo assembly.

Entities:  

Mesh:

Year:  2009        PMID: 19997067     DOI: 10.1038/nbt.1596

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


  37 in total

1.  A human genome diversity cell line panel.

Authors:  Howard M Cann; Claudia de Toma; Lucien Cazes; Marie-Fernande Legrand; Valerie Morel; Laurence Piouffre; Julia Bodmer; Walter F Bodmer; Batsheva Bonne-Tamir; Anne Cambon-Thomsen; Zhu Chen; J Chu; Carlo Carcassi; Licinio Contu; Ruofu Du; Laurent Excoffier; G B Ferrara; Jonathan S Friedlaender; Helena Groot; David Gurwitz; Trefor Jenkins; Rene J Herrera; Xiaoyi Huang; Judith Kidd; Kenneth K Kidd; Andre Langaney; Alice A Lin; S Qasim Mehdi; Peter Parham; Alberto Piazza; Maria Pia Pistillo; Yaping Qian; Qunfang Shu; Jiujin Xu; S Zhu; James L Weber; Henry T Greely; Marcus W Feldman; Gilles Thomas; Jean Dausset; L Luca Cavalli-Sforza
Journal:  Science       Date:  2002-04-12       Impact factor: 47.728

2.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

3.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.

Authors:  Daniel Falush; Matthew Stephens; Jonathan K Pritchard
Journal:  Genetics       Date:  2003-08       Impact factor: 4.562

4.  Genetic structure of human populations.

Authors:  Noah A Rosenberg; Jonathan K Pritchard; James L Weber; Howard M Cann; Kenneth K Kidd; Lev A Zhivotovsky; Marcus W Feldman
Journal:  Science       Date:  2002-12-20       Impact factor: 47.728

5.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

Review 6.  Genome-wide association studies for common diseases and complex traits.

Authors:  Joel N Hirschhorn; Mark J Daly
Journal:  Nat Rev Genet       Date:  2005-02       Impact factor: 53.242

7.  Extensive copy-number variation of the human olfactory receptor gene family.

Authors:  Janet M Young; Raelynn M Endicott; Sean S Parghi; Megan Walker; Jeffrey M Kidd; Barbara J Trask
Journal:  Am J Hum Genet       Date:  2008-08       Impact factor: 11.025

Review 8.  The Human Genome Diversity Project: past, present and future.

Authors:  L Luca Cavalli-Sforza
Journal:  Nat Rev Genet       Date:  2005-04       Impact factor: 53.242

Review 9.  Human genetic variation and its contribution to complex traits.

Authors:  Kelly A Frazer; Sarah S Murray; Nicholas J Schork; Eric J Topol
Journal:  Nat Rev Genet       Date:  2009-04       Impact factor: 53.242

10.  The diploid genome sequence of an individual human.

Authors:  Samuel Levy; Granger Sutton; Pauline C Ng; Lars Feuk; Aaron L Halpern; Brian P Walenz; Nelson Axelrod; Jiaqi Huang; Ewen F Kirkness; Gennady Denisov; Yuan Lin; Jeffrey R MacDonald; Andy Wing Chun Pang; Mary Shago; Timothy B Stockwell; Alexia Tsiamouri; Vineet Bafna; Vikas Bansal; Saul A Kravitz; Dana A Busam; Karen Y Beeson; Tina C McIntosh; Karin A Remington; Josep F Abril; John Gill; Jon Borman; Yu-Hui Rogers; Marvin E Frazier; Stephen W Scherer; Robert L Strausberg; J Craig Venter
Journal:  PLoS Biol       Date:  2007-09-04       Impact factor: 8.029

View more
  110 in total

Review 1.  A survey of sequence alignment algorithms for next-generation sequencing.

Authors:  Heng Li; Nils Homer
Journal:  Brief Bioinform       Date:  2010-05-11       Impact factor: 11.622

2.  Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing.

Authors:  Akihiro Fujimoto; Hidewaki Nakagawa; Naoya Hosono; Kaoru Nakano; Tetsuo Abe; Keith A Boroevich; Masao Nagasaki; Rui Yamaguchi; Tetsuo Shibuya; Michiaki Kubo; Satoru Miyano; Yusuke Nakamura; Tatsuhiko Tsunoda
Journal:  Nat Genet       Date:  2010-10-24       Impact factor: 38.330

Review 3.  Annotating non-coding regions of the genome.

Authors:  Roger P Alexander; Gang Fang; Joel Rozowsky; Michael Snyder; Mark B Gerstein
Journal:  Nat Rev Genet       Date:  2010-07-13       Impact factor: 53.242

4.  Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection.

Authors:  Hon-Ming Lam; Xun Xu; Xin Liu; Wenbin Chen; Guohua Yang; Fuk-Ling Wong; Man-Wah Li; Weiming He; Nan Qin; Bo Wang; Jun Li; Min Jian; Jian Wang; Guihua Shao; Jun Wang; Samuel Sai-Ming Sun; Gengyun Zhang
Journal:  Nat Genet       Date:  2010-11-14       Impact factor: 38.330

5.  Harnessing the power of genomics and immunoinformatics to produce improved vaccines.

Authors:  Leonard Moise; Leslie Cousens; Joanna Fueyo; Anne S De Groot
Journal:  Expert Opin Drug Discov       Date:  2010-12-01       Impact factor: 6.098

6.  The high polyphenol content of grapevine cultivar tannat berries is conferred primarily by genes that are not shared with the reference genome.

Authors:  Cecilia Da Silva; Gianpiero Zamperin; Alberto Ferrarini; Andrea Minio; Alessandra Dal Molin; Luca Venturini; Genny Buson; Paola Tononi; Carla Avanzato; Elisa Zago; Eduardo Boido; Eduardo Dellacassa; Carina Gaggero; Mario Pezzotti; Francisco Carrau; Massimo Delledonne
Journal:  Plant Cell       Date:  2013-12-06       Impact factor: 11.277

7.  De novo assembly of human genomes with massively parallel short read sequencing.

Authors:  Ruiqiang Li; Hongmei Zhu; Jue Ruan; Wubin Qian; Xiaodong Fang; Zhongbin Shi; Yingrui Li; Shengting Li; Gao Shan; Karsten Kristiansen; Songgang Li; Huanming Yang; Jian Wang; Jun Wang
Journal:  Genome Res       Date:  2009-12-17       Impact factor: 9.043

8.  Population-genetic properties of differentiated human copy-number polymorphisms.

Authors:  Catarina D Campbell; Nick Sampas; Anya Tsalenko; Peter H Sudmant; Jeffrey M Kidd; Maika Malig; Tiffany H Vu; Laura Vives; Peter Tsang; Laurakay Bruhn; Evan E Eichler
Journal:  Am J Hum Genet       Date:  2011-03-11       Impact factor: 11.025

9.  Incorporating the human gene annotations in different databases significantly improved transcriptomic and genetic analyses.

Authors:  Geng Chen; Charles Wang; Leming Shi; Xiongfei Qu; Jiwei Chen; Jianmin Yang; Caiping Shi; Long Chen; Peiying Zhou; Baitang Ning; Weida Tong; Tieliu Shi
Journal:  RNA       Date:  2013-02-19       Impact factor: 4.942

Review 10.  Deconstructing Mus gemischus: advances in understanding ancestry, structure, and variation in the genome of the laboratory mouse.

Authors:  John P Didion; Fernando Pardo-Manuel de Villena
Journal:  Mamm Genome       Date:  2012-12-09       Impact factor: 2.957

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.