| Literature DB >> 19936061 |
Fusheng Wei1, Jianwei Zhang, Shiguo Zhou, Ruifeng He, Mary Schaeffer, Kristi Collura, David Kudrna, Ben P Faga, Marina Wissotski, Wolfgang Golser, Susan M Rock, Tina A Graves, Robert S Fulton, Ed Coe, Patrick S Schnable, David C Schwartz, Doreen Ware, Sandra W Clifton, Richard K Wilson, Rod A Wing.
Abstract
Maize is a major cereal crop and an important model system for basic biological research. Knowledge gained from maize research can also be used to genetically improve its grass relatives such as sorghum, wheat, and rice. The primary objective of the Maize Genome Sequencing Consortium (MGSC) was to generate a reference genome sequence that was integrated with both the physical and genetic maps. Using a previously published integrated genetic and physical map, combined with in-coming maize genomic sequence, new sequence-based genetic markers, and an optical map, we dynamically picked a minimum tiling path (MTP) of 16,910 bacterial artificial chromosome (BAC) and fosmid clones that were used by the MGSC to sequence the maize genome. The final MTP resulted in a significantly improved physical map that reduced the number of contigs from 721 to 435, incorporated a total of 8,315 mapped markers, and ordered and oriented the majority of FPC contigs. The new integrated physical and genetic map covered 2,120 Mb (93%) of the 2,300-Mb genome, of which 405 contigs were anchored to the genetic map, totaling 2,103.4 Mb (99.2% of the 2,120 Mb physical map). More importantly, 336 contigs, comprising 94.0% of the physical map ( approximately 1,993 Mb), were ordered and oriented. Finally we used all available physical, sequence, genetic, and optical data to generate a golden path (AGP) of chromosome-based pseudomolecules, herein referred to as the B73 Reference Genome Sequence version 1 (B73 RefGen_v1).Entities:
Mesh:
Substances:
Year: 2009 PMID: 19936061 PMCID: PMC2774505 DOI: 10.1371/journal.pgen.1000715
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Sequencing pipeline for MTP clone selection and gap analysis.
(A) An example of STC-based clone walking. Candidate walking clone list for seed BAC c0245B14. The list showed clones in which BES shared >95% sequence identity with the seed BAC; (B) Gbrowse view of sequence and trace alignment of candidate clone b0566J07 to seed BAC c0245B14. (C) Gap analysis pipeline to check gaps between adjoining clones.
Position of bin markers in the B73 physical map and RefGen_v1a.
| Marker | Chr | Bin | Genetic | Original Map | Type | Seq. Source | Start | End | clone | FPC Ctg |
| tub1 | 1 | 1.01 | 2.5 | IBM2 | F | X52878 | 2022607 | 2024984 | c0363D20 | 1 |
| umc157a(chn) | 1 | 1.02 | 114.4 | IBM2 | P | G10823 | 12357364 | 12357663 | c0140E02 | 5 |
| umc76a | 1 | 1.03 | 198.4 | IBM2 | F | G10866 | 29364559 | 29364266 | c0380M20 | 9 |
| asg45(ptk) | 1 | 1.04 | 294.3 | Gnp2004 | P | AY771210 | 52239800 | 52240131 | b0109M14 | 12 |
| csu3 | 1 | 1.05 | 405 | IBM2 | F | DQ123891 | 81360132 | 81360551 | c0122B13 | 20 |
| umc67a | 1 | 1.06 | 496.6 | IBM2 | P | G13173 | 175505327 | 175505029 | c0152A14 | 36 |
| asg62 | 1 | 1.07 | 607.3 | IBM2 | F | DQ001865 | 198707401 | 198707865 | c0479A09 | 41 |
| umc128a | 1 | 1.08 | 722.4 | IBM2 | F | umc128 | 227601774 | 227602233 | b0310F15 | 46 |
| cdj2 | 1 | 1.09 | 812.3 | IBM2 | F | AY109456 | 252192856 | 252193562 | b0611E16 | 52 |
| umc107a(croc) | 1 | 1.1 | 886.9 | IBM2 | P | G10803 | 266927146 | 266927488 | c0293G16 | 56 |
| umc161a | 1 | 1.11 | 963.6 | IBM2 | F | AY771212 | 282140672 | 282141394 | c0086K08 | 61 |
| bnl6.32 | 1 | 1.12 | 1113 | IBM2 | F | bnl6.32 | 296840063 | 296840574 | c0455B14 | 63 |
| bnl8.45a | 2 | 2.01 | 3.3 | Gnp2004 | P | G10776 | 1546872 | 1547084 | b0252P05 | 68 |
| lox6 | 2 | 2.02 | 50.9 | IBM2 | F | AY771214 | 4175012 | 4174428 | c0468P22 | 69 |
| umc6a | 2 | 2.03 | 164.8 | IBM2 | F | G10856 | 14920255 | 14920433 | c0530G21 | 72 |
| umc34 | 2 | 2.04 | 243.3 | IBM2 | F | DQ001866 | 28063927 | 28064503 | c0030B11 | 74 |
| umc131 | 2 | 2.05 | 342.4 | IBM2 | F | umc131 | 71031565 | 71031939 | c0244C01 | 82 |
| umc255a | 2 | 2.06 | 364.5 | IBM2 | P | umc255 | 149697523 | 149697768 | b0120F07 | 90 |
| umc5a | 7 | 2.07 | 405.8 | Gnp2004 | P | umc5 | 116650892 | 116650654 | b0022A14 | 315 |
| asg20 | 2 | 2.08 | 478.7 | IBM2 | F | DQ123894 | 201356132 | 201355954 | c0158O02 | 103 |
| umc49a | 2 | 2.09 | 591.5 | IBM2 | F | DQ123895 | 219604574 | 219604915 | c0184K09 | 108 |
| php20581b(tb) | 2 | 2.1 | 692.7 | Gnp2004 | P | G10795 | 231788583 | 231788397 | b0109B01 | 109 |
| umc32a | 3 | 3.01 | 11.3 | UMC98 | P | umc32 | 1726276 | 1725856 | c0286H14 | 111 |
| csu32a | 3 | 3.02 | 60 | IBM2 | F | DQ123896 | 3837012 | 3837402 | c0299P11 | 111 |
| asg24a(gts) | 3 | 3.03 | 109 | IBM2 | P | AY771217 | 8405715 | 8482306 | b0166B24 | 112 |
| asg48a | 3 | 3.04 | 152.7 | IBM2 | F | G13184 | 12862813 | 12862593 | c0385I07 | 113 |
| umc102a | 3 | 3.05 | 297.9 | IBM2 | F | DQ005498 | 122406867 | 122407553 | c0072M24 | 124 |
| im30p1 | 3 | 3.06 | 391.4 | IBM2 | P | G10766 | 166733121 | 166732779 | b0583P10 | 131 |
| bnl6.16a | 3 | 3.07 | 520.7 | IBM2 | F | G10768 | 189303505 | 189303133 | c0328L01 | 138 |
| umc17a | 3 | 3.08 | 585.5 | IBM2 | F | AY771218 | 203506017 | 203506852 | b0460H12 | 145 |
| umc63a | 3 | 3.09 | 697.2 | IBM2 | F | G10857 | 214210836 | 214210676 | b0347M11 | 147 |
| cyp1 | 3 | 3.1 | 845.2 | Gnp2004 | P | DQ005499 | 230486027 | 230486291 | b0147G12 | 153 |
This is a partial list. The full list is in Table S4.
genetic position.
marker type, P: Placement, not as accurate as Framework (F).
Sequence Source, marker names with no GenBank accession number indicated that the sequences are available at http://www2.genome.arizona.edu/genomes/maize.
positions in B73 RefGen_v1.
Figure 2Use of the maize optical map for FPC contig anchoring.
In each panel, the top blue fragments represent a maize optical SwaI restriction map, and the bottom orange fragments represent the in silico optical SwaI restriction map from contig-based pseudomolecules. Red fragments in (B) and (C) indicate a mis-sassmbly in the pseudomolecule that required manual editing. (A) Well-anchored Ctg36 helped to orient Ctg33, which was previously only ordered, but not oriented. (B) Anchored Ctg407 aided order and orientation of Ctg470, which was neither ordered nor oriented. (C). The newly anchored Ctg470 facilitated ordering and orienting of Ctg459.
Contig anchoring quality and contig positions in B73 RefGen_v1a.
| Contig | Genetic Position | Order/Orien | Number of Clones | Number of Markers | Physical Length (Kb) | Chr | Start | End |
| 1 | 2.5 | 2 | 276 | 98 | 2523 | 1 | 1 | 2299274 |
| 2 | 13.5 | 2 | 132 | 53 | 1092 | 1 | 2300275 | 3419854 |
| 3 | 26.1 | 2 | 334 | 160 | 2126 | 1 | 3420855 | 5929995 |
| 4 | 82.8 | 2 | 616 | 262 | 4165 | 1 | 5930996 | 10045647 |
| 5 | 103 | 2 | 350 | 152 | 3224 | 1 | 10046648 | 13079531 |
| 6 | 124.7 | 2 | 459 | 172 | 2851 | 1 | 13080532 | 16193432 |
| 7 | 145 | 1 | 109 | 44 | 1156 | 1 | 16194433 | 17299506 |
| 8 | 160.6 | 2 | 793 | 280 | 5600 | 1 | 17300507 | 23505871 |
| 9 | 170 | 2 | 801 | 249 | 5933 | 1 | 23506872 | 29869500 |
| 10 | 205 | 2 | 2427 | 745 | 18541 | 1 | 29870501 | 48303993 |
| 12 | 290.1 | 2 | 506 | 179 | 4277 | 1 | 48304994 | 52419976 |
| 13 | 292.4 | 2 | 72 | 35 | 715 | 1 | 52420977 | 53153490 |
| 14 | 325.7 | 2 | 1672 | 434 | 12700 | 1 | 53154491 | 65772465 |
| 16 | 360.9 | 3 | 281 | 77 | 1920 | 1 | 65773466 | 67717626 |
| 474 | 385 | 0 | 79 | 39 | 705 | 1 | 67718627 | 68401065 |
| 17 | 386.4 | 2 | 247 | 63 | 2361 | 1 | 68402066 | 70727174 |
| 18 | 391.8 | 3 | 379 | 119 | 3189 | 1 | 70728175 | 73586599 |
| 19 | 392.95 | 3 | 626 | 136 | 4640 | 1 | 73587600 | 78384554 |
| 20 | 398.2 | 2 | 798 | 198 | 6350 | 1 | 78385555 | 84665058 |
| 22 | 406 | 3 | 253 | 68 | 1832 | 1 | 84666059 | 86598526 |
| 24 | 415 | 3 | 482 | 99 | 3973 | 1 | 86599527 | 90264916 |
| 23 | 417 | 2 | 578 | 114 | 4395 | 1 | 90265917 | 94733311 |
| 106 | 227.1 | 4 | 939 | 190 | 7359 | 9 | 53598148 | 60378895 |
| 432 | 227.2 | 4 | 701 | 135 | 4875 | 9 | 60379896 | 65011726 |
| 448 | 227.3 | 4 | 576 | 90 | 4512 | 9 | 65012727 | 69221205 |
| 425 | unknown | 5 | 218 | 51 | 1871 | 0 | 9718511 | 11319527 |
| 427 | unknown | 5 | 236 | 35 | 2033 | 0 | 11479785 | 13269771 |
| 429 | unknown | 5 | 166 | 41 | 1759 | 0 | 13443808 | 14680007 |
This is a partial list. The full list is in Table S5.
Code: 0, chromosomal assignment is known, but not ordered and oriented; 1, ordered, but not oriented; 2, genetically anchored and oriented; 3, anchored and oriented with assistance from optical map; 4, the block was anchored, but order and orientation are unknown; 5, unknown chromosomal context.
positions in B73 RefGen_v1.
Figure 3Direct comparison of sequence overlap between adjacent clones before (A) and after (B) the semi automated AGP pipeline.
Sequence summary of the maize chromosomes in B73 RefGen_v1.
| Chr | Length (bp) | Scaffold | Contig | ||||
| Number | Length (bp) | Average (bp) | Number | Length (bp) | Average (bp) | ||
|
| 14680007 | 647 | 14588907 | 22549 | 1206 | 14531607 | 12049 |
|
| 300239041 | 8696 | 299312341 | 34420 | 17683 | 298405441 | 16875 |
|
| 234752839 | 6661 | 234044439 | 35137 | 13694 | 233333939 | 17039 |
|
| 230558137 | 6612 | 229865037 | 34765 | 13509 | 229167937 | 16964 |
|
| 247095508 | 6834 | 246365608 | 36050 | 13975 | 245638708 | 17577 |
|
| 216915529 | 6547 | 216219929 | 33026 | 13148 | 215551729 | 16394 |
|
| 169254300 | 5257 | 168698300 | 32090 | 10986 | 168119000 | 15303 |
|
| 170974187 | 5239 | 170418187 | 32529 | 10858 | 169851287 | 15643 |
|
| 174515299 | 5452 | 173935699 | 31903 | 11450 | 173330599 | 15138 |
|
| 152350485 | 4653 | 151852185 | 32635 | 9243 | 151386685 | 16379 |
|
| 149686045 | 4563 | 149202145 | 32698 | 9573 | 148697745 | 15533 |
|
|
|
|
|
|
|
|
|
total of all unanchored contigs.