Literature DB >> 33431880

Construction and integration of three de novo Japanese human genome assemblies toward a population-specific reference.

Jun Takayama1,2,3, Shu Tadaka2, Kenji Yano2,3, Fumiki Katsuoka1,2, Chinatsu Gocho2, Takamitsu Funayama2, Satoshi Makino2, Yasunobu Okamura1,2, Atsuo Kikuchi4, Sachiyo Sugimoto2, Junko Kawashima2, Akihito Otsuki2, Mika Sakurai-Yageta2, Jun Yasuda2,5, Shigeo Kure2,4, Kengo Kinoshita6,7,8, Masayuki Yamamoto9,10, Gen Tamiya11,12,13,14.   

Abstract

The complete human genome sequence is used as a reference for next-generation sequencing analyses. However, some ethnic ancestries are under-represented in the reference genome (e.g., GRCh37) due to its bias toward European and African ancestries. Here, we perform de novo assembly of three Japanese male genomes using > 100× Pacific Biosciences long reads and Bionano Genomics optical maps per sample. We integrate the genomes using the major allele for consensus and anchor the scaffolds using genetic and radiation hybrid maps to reconstruct each chromosome. The resulting genome sequence, JG1, is contiguous, accurate, and carries the Japanese major allele at most loci. We adopt JG1 as the reference for confirmatory exome re-analyses of seven rare-disease Japanese families and find that re-analysis using JG1 reduces total candidate variant calls versus GRCh37 while retaining disease-causing variants. These results suggest that integrating multiple genomes from a single population can aid genome analyses of that population.

Entities:  

Year:  2021        PMID: 33431880     DOI: 10.1038/s41467-020-20146-8

Source DB:  PubMed          Journal:  Nat Commun        ISSN: 2041-1723            Impact factor:   14.919


  54 in total

1.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

Review 2.  Sequencing technologies - the next generation.

Authors:  Michael L Metzker
Journal:  Nat Rev Genet       Date:  2009-12-08       Impact factor: 53.242

3.  A draft sequence of the Neandertal genome.

Authors:  Johannes Krause; Adrian W Briggs; Tomislav Maricic; Udo Stenzel; Martin Kircher; Nick Patterson; Richard E Green; Heng Li; Weiwei Zhai; Markus Hsi-Yang Fritz; Nancy F Hansen; Eric Y Durand; Anna-Sapfo Malaspinas; Jeffrey D Jensen; Tomas Marques-Bonet; Can Alkan; Kay Prüfer; Matthias Meyer; Hernán A Burbano; Jeffrey M Good; Rigo Schultz; Ayinuer Aximu-Petri; Anne Butthof; Barbara Höber; Barbara Höffner; Madlen Siegemund; Antje Weihmann; Chad Nusbaum; Eric S Lander; Carsten Russ; Nathaniel Novod; Jason Affourtit; Michael Egholm; Christine Verna; Pavao Rudan; Dejana Brajkovic; Željko Kucan; Ivan Gušic; Vladimir B Doronichev; Liubov V Golovanova; Carles Lalueza-Fox; Marco de la Rasilla; Javier Fortea; Antonio Rosas; Ralf W Schmitz; Philip L F Johnson; Evan E Eichler; Daniel Falush; Ewan Birney; James C Mullikin; Montgomery Slatkin; Rasmus Nielsen; Janet Kelso; Michael Lachmann; David Reich; Svante Pääbo
Journal:  Science       Date:  2010-05-07       Impact factor: 47.728

4.  Modernizing reference genome assemblies.

Authors:  Deanna M Church; Valerie A Schneider; Tina Graves; Katherine Auger; Fiona Cunningham; Nathan Bouk; Hsiu-Chuan Chen; Richa Agarwala; William M McLaren; Graham R S Ritchie; Derek Albracht; Milinn Kremitzki; Susan Rock; Holland Kotkiewicz; Colin Kremitzki; Aye Wollam; Lee Trani; Lucinda Fulton; Robert Fulton; Lucy Matthews; Siobhan Whitehead; Will Chow; James Torrance; Matthew Dunn; Glenn Harden; Glen Threadgold; Jonathan Wood; Joanna Collins; Paul Heath; Guy Griffiths; Sarah Pelan; Darren Grafham; Evan E Eichler; George Weinstock; Elaine R Mardis; Richard K Wilson; Kerstin Howe; Paul Flicek; Tim Hubbard
Journal:  PLoS Biol       Date:  2011-07-05       Impact factor: 8.029

5.  Characterization and identification of hidden rare variants in the human genome.

Authors:  Alberto Magi; Romina D'Aurizio; Flavia Palombo; Ingrid Cifola; Lorenzo Tattini; Roberto Semeraro; Tommaso Pippucci; Betti Giusti; Giovanni Romeo; Rosanna Abbate; Gian Franco Gensini
Journal:  BMC Genomics       Date:  2015-04-24       Impact factor: 3.969

6.  Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.

Authors:  Valerie A Schneider; Tina Graves-Lindsay; Kerstin Howe; Nathan Bouk; Hsiu-Chuan Chen; Paul A Kitts; Terence D Murphy; Kim D Pruitt; Françoise Thibaud-Nissen; Derek Albracht; Robert S Fulton; Milinn Kremitzki; Vincent Magrini; Chris Markovic; Sean McGrath; Karyn Meltz Steinberg; Kate Auger; William Chow; Joanna Collins; Glenn Harden; Timothy Hubbard; Sarah Pelan; Jared T Simpson; Glen Threadgold; James Torrance; Jonathan M Wood; Laura Clarke; Sergey Koren; Matthew Boitano; Paul Peluso; Heng Li; Chen-Shan Chin; Adam M Phillippy; Richard Durbin; Richard K Wilson; Paul Flicek; Evan E Eichler; Deanna M Church
Journal:  Genome Res       Date:  2017-04-10       Impact factor: 9.043

7.  Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data.

Authors:  Jacob F Degner; John C Marioni; Athma A Pai; Joseph K Pickrell; Everlyne Nkadori; Yoav Gilad; Jonathan K Pritchard
Journal:  Bioinformatics       Date:  2009-10-06       Impact factor: 6.937

8.  Characterizing the Major Structural Variant Alleles of the Human Genome.

Authors:  Peter A Audano; Arvis Sulovari; Tina A Graves-Lindsay; Stuart Cantsilieris; Melanie Sorensen; AnneMarie E Welch; Max L Dougherty; Bradley J Nelson; Ankeeta Shah; Susan K Dutcher; Wesley C Warren; Vincent Magrini; Sean D McGrath; Yang I Li; Richard K Wilson; Evan E Eichler
Journal:  Cell       Date:  2019-01-17       Impact factor: 41.582

9.  Challenges imposed by minor reference alleles on the identification and reporting of clinical variants from exome data.

Authors:  Mahmoud Koko; Mohammed O E Abdallah; Mutaz Amin; Muntaser Ibrahim
Journal:  BMC Genomics       Date:  2018-01-15       Impact factor: 3.969

10.  Assembly of a pan-genome from deep sequencing of 910 humans of African descent.

Authors:  Rachel M Sherman; Juliet Forman; Valentin Antonescu; Daniela Puiu; Michelle Daya; Nicholas Rafaels; Meher Preethi Boorgula; Sameer Chavan; Candelaria Vergara; Victor E Ortega; Albert M Levin; Celeste Eng; Maria Yazdanbakhsh; James G Wilson; Javier Marrugo; Leslie A Lange; L Keoki Williams; Harold Watson; Lorraine B Ware; Christopher O Olopade; Olufunmilayo Olopade; Ricardo R Oliveira; Carole Ober; Dan L Nicolae; Deborah A Meyers; Alvaro Mayorga; Jennifer Knight-Madden; Tina Hartert; Nadia N Hansel; Marilyn G Foreman; Jean G Ford; Mezbah U Faruque; Georgia M Dunston; Luis Caraballo; Esteban G Burchard; Eugene R Bleecker; Maria I Araujo; Edwin F Herrera-Paz; Monica Campbell; Cassandra Foster; Margaret A Taub; Terri H Beaty; Ingo Ruczinski; Rasika A Mathias; Kathleen C Barnes; Steven L Salzberg
Journal:  Nat Genet       Date:  2018-11-19       Impact factor: 38.330

View more
  10 in total

1.  Identification of Dominant Transcripts in Oxidative Stress Response by a Full-Length Transcriptome Analysis.

Authors:  Akihito Otsuki; Yasunobu Okamura; Yuichi Aoki; Noriko Ishida; Kazuki Kumada; Naoko Minegishi; Fumiki Katsuoka; Kengo Kinoshita; Masayuki Yamamoto
Journal:  Mol Cell Biol       Date:  2021-01-25       Impact factor: 4.272

Review 2.  Existing Data Sources for Clinical Epidemiology: Database of the National Hospital Organization in Japan.

Authors:  Natsuko Kanazawa; Takuaki Tani; Shinobu Imai; Hiromasa Horiguchi; Kiyohide Fushimi; Norihiko Inoue
Journal:  Clin Epidemiol       Date:  2022-05-19       Impact factor: 5.814

3.  Analysis of the genotype-phenotype correlation of MYO15A variants in Chinese non-syndromic hearing loss patients.

Authors:  Ying Fu; Shasha Huang; Xue Gao; Mingyu Han; Guojian Wang; Dongyang Kang; Yongyi Yuan; Pu Dai
Journal:  BMC Med Genomics       Date:  2022-03-26       Impact factor: 3.063

4.  A novel 8.57-kb deletion of the upstream region of PRKAR1A in a family with Carney complex.

Authors:  Shin Ito; Aya Hashimoto; Kazunori Yamaguchi; Sadafumi Kawamura; Shingo Myoen; Maki Ogawa; Ikuro Sato; Takamichi Minato; Shingo Miyabe; Akira Nakazato; Keitaro Fujii; Mai Mochizuki; Haruna Fujimori; Keiichi Tamai; Tetsuya Niihori; Yoko Aoki; Akira Sugawara; Hironobu Sasano; Hiroshi Shima; Jun Yasuda
Journal:  Mol Genet Genomic Med       Date:  2022-02-06       Impact factor: 2.183

5.  Pan-human consensus genome significantly improves the accuracy of RNA-seq analyses.

Authors:  Benjamin Kaminow; Sara Ballouz; Jesse Gillis; Alexander Dobin
Journal:  Genome Res       Date:  2022-03-07       Impact factor: 9.438

6.  Whole Exome Sequencing in South Africa: Stakeholder Views on Return of Individual Research Results and Incidental Findings.

Authors:  Nicole Van Der Merwe; Raj Ramesar; Jantina De Vries
Journal:  Front Genet       Date:  2022-06-08       Impact factor: 4.772

7.  Construction of a trio-based structural variation panel utilizing activated T lymphocytes and long-read sequencing technology.

Authors:  Akihito Otsuki; Yasunobu Okamura; Noriko Ishida; Shu Tadaka; Jun Takayama; Kazuki Kumada; Junko Kawashima; Keiko Taguchi; Naoko Minegishi; Shinichi Kuriyama; Gen Tamiya; Kengo Kinoshita; Fumiki Katsuoka; Masayuki Yamamoto
Journal:  Commun Biol       Date:  2022-09-20

8.  Comparative analysis of draft genome assemblies developed from whole genome sequences of two Hyaloperonospora brassicae isolate samples differing in field virulence on Brassica napus.

Authors:  Ming Pei You; Javed Akhatar; Meenakshi Mittal; Martin J Barbetti; Solomon Maina; Surinder S Banga
Journal:  Biotechnol Rep (Amst)       Date:  2021-06-22

9.  KOREF_S1: phased, parental trio-binned Korean reference genome using long reads and Hi-C sequencing methods.

Authors:  Hui-Su Kim; Sungwon Jeon; Yeonkyung Kim; Changjae Kim; Jihun Bhak; Jong Bhak
Journal:  Gigascience       Date:  2022-03-24       Impact factor: 6.524

10.  Discovery of genomic variation across a generation.

Authors:  Brett Trost; Livia O Loureiro; Stephen W Scherer
Journal:  Hum Mol Genet       Date:  2021-10-01       Impact factor: 6.150

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.