| Literature DB >> 23537070 |
Mi-Kyung Lee1, Yang Zhang, Meiping Zhang, Mark Goebel, Hee Jin Kim, Barbara A Triplett, David M Stelly, Hong-Bin Zhang.
Abstract
BACKGROUND: Cotton, one of the world's leading crops, is important to the world's textile and energy industries, and is a model species for studies of plant polyploidization, cellulose biosynthesis and cell wall biogenesis. Here, we report the construction of a plant-transformation-competent binary bacterial artificial chromosome (BIBAC) library and comparative genome sequence analysis of polyploid Upland cotton (Gossypium hirsutum L.) with one of its diploid putative progenitor species, G. raimondii Ulbr.Entities:
Mesh:
Year: 2013 PMID: 23537070 PMCID: PMC3623804 DOI: 10.1186/1471-2164-14-208
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Analysis of BIBACs randomly selected from the Upland cotton cv. TM-1 BIBAC library. The BIBACs were digested with Not I and fractionated on a pulsed-field gel (A) and plotted according to their insert sizes (B). M, lambda ladder PFG marker; V, vector fragment; N, number of random clones analyzed for insert size estimation of the library.
Positive clones of the Upland cotton TM-1 BIBAC library identified by library screening with RGAs closely linked with the resistance to bacterial blight pathogen, pv. [43,44]
| B004B14 | B011P19 | B013N15 | B017B02 | B017H07 | B019I04 |
| B025M21 | B026F24 | B030I22 | B041O11 | B046C16 | B046H18 |
| B048P23 | B050K21 | B051D16 | B051D23 | B052F10 | B055I19 |
| B059G06 | B059N17 | B060P23 | B061B24 | B062I17 | B063I15 |
| B063L01 | B065M06 | B068H24 | B069J11 | B071B09 | B071D22 |
| B072P14 | B075A15 | B076D08 | B080N11 | B085C16 | B085D09 |
| B085G19 | B087G17 | B089E06 | B091L11 | B092C13 | B092P12 |
| B093J03 | B097P19 | B104A01 | B105F15 | B105G23 | B108L23 |
| B108P19 | B110J23 | B111C17 | B111K03 | B112G15 | B112K13 |
| B117K22 | B118M16 | B119B09 | B120H01 | B121B01 | B125O02 |
| B133A14 | B134N15 | B135C10 | B136C02 | B136O19 | B138A12 |
| B138A18 | B138D03 | B139E13 | B140B05 | B140J23 | B145L08 |
| B147F15 | B148P17 | B151K12 | B155E18 | B160G13 | B161M18 |
| B162E05 | B162H06 | B163B09 | B165D12 | B169M17 | B170D18 |
| B171A18 | B175N17 | B177F17 | B177M15 | B178C14 | B179B04 |
| B179D06 | B179J12 | B179N18 | B184A18 | B186F07 | B188D16 |
| B188E15 | B188G13 | B188I15 | B189B12 | B189J03 | B192O18 |
| B197K02 | B199N18 | | | | |
| B006G23 | B007N18 | B008N22 | B017B02 | B017K11 | B022M07 |
| B037I22 | B015L11 | B016M13 | B013N15 | B014N21 | B015O20 |
| B011P19 | B025B03 | B032B23 | B028D04 | B030D15 | B026F24 |
| B031I22 | B042J09 | B047M01 | B045O16 | B048P23 | B050F06 |
| B049K06 | B050K21 | B052L03 | B055P20 | B071D19 | B069E09 |
| B069F11 | B069K14 | B070M07 | B066O02 | B068P02 | B085C16 |
| B085G19 | B057I10 | B061L07 | B059M03 | B061O06 | B061O22 |
| B075A15 | B080E20 | B077L10 | B075N13 | B073N21 | B096D03 |
| B093J03 | B100J14 | B115E13 | B117K23 | B120M11 | B115P05 |
| B133J15 | B107D12 | B111K03 | B142J06 | B148M10 | B166A18 |
| B165D12 | B163H11 | B161M18 | B184A18 | B181A23 | B183D05 |
| B179O02 | B154N23 | B171A18 | B175N14 | B176O01 | B198J14 |
| B125E12 | |||||
The positive BIBAC clones of the genes identified by screening the Upland cotton cv. TM-1 BIBAC library using overgo probes designed from the gene sequences
| B130C07, B016N19 | 2 | |
| B024E21, B072G17, B086A07, B083I01, B073K01, B073L02, B096F08 | 7 | |
| CelA6 | B098I23, B116B12, B017I05, B035E07, B115K16, B005M05, B086H21 | 7 |
| B099A19, B024H22, B014D05, B027E13, B026F22, B092E14 | 6 | |
| B099A19, B024H22, B014D05, B027E13, B026F22, B092E14 | 6 | |
| B175F03, B050B20, B187C03, B186M07, B016L07, B016L07, B166C05, B161G09 | 8 | |
| B138P05, B080A19 | 2 | |
| B174C01, B192C23 | 2 | |
| B026F22, B007F03 | 2 | |
| B046G15,B048N17, B085P21, B162F12 | 4 | |
| B070C20, B173F02 | 2 | |
| B009G10, B065P05, B075L23, B097O20, B108L10, B146H15, B170C06, B178M15 | 8 | |
| Unnamed | B008D22, B145L18, B164E04, B165A23 | 4 |
Summary of the Upland cotton BIBAC end sequences (BESs) generated and their exon and SSR contents
| Total BESs sequenced | 10,752 |
|---|---|
| Q20 (50–842 bp) | 9,711 |
| Paired-ends | 4,441 |
| GC level | 35.50% |
| Total length | 3,842,009 bp |
| Average read of BESs | 395.63 bp |
| Number of exons: | |
| GenScan | 3,022 |
| GeneMark | 2,912 |
| Microsatellite (di, tri, tetra, penta and hexa): | |
| Four or more repeat units | 1,269 |
| Five or more repeat units | 313 |
| Six or more repeat units | 103 |
Characteristics of repeat elements contained in the Upland cotton BIBAC end sequences (BESs)
| Retroelements | 776 | 208,081 | 5.42 |
| LINEs: | 21 | 4,469 | 0.12 |
| | 21 | 4,469 | 0.12 |
| LTR elements: | 755 | 203,612 | 5.39 |
| | 166 | 40,878 | 1.06 |
| | 578 | 162,030 | 4.22 |
| DNA transposons | 5 | 302 | 0.01 |
| MuDR-IS905 | 3 | 152 | 0.00 |
| Total interspersed repeats | 208,383 | 5.42 | |
| Small RNA | 131 | 36,531 | 0.95 |
| Simple repeats | 201 | 10,777 | 0.28 |
| Low complexity | 2,289 | 118,328 | 3.08 |
Figure 2Frequency of the ten most abundant SSR motifs in the Upland cotton genome. The SSR motifs are di-, tri-, tetra-, penta- and hexa-mers, with four or more repeats.
Comparative sequence analysis of Upland cotton BESs with the D genome sequence of (DOE Joint Genome Institute, Cotton D V1.0,http:// http://www.phytozome.net/cotton.php )
| | Continuous MinMatch (bp) | 100 | 100 | 100 | 100 | 100 | 100 |
| All AD-subgenome BESs | Sequence identity (%) | 100 | 95 | 90 | 80 | 70 | 60 |
| (No. of BESs = 9,711; total sequence length = 3,842,009 bp) | No. of BESs aligned | 57 | 4,587 | 6,588 | 7,118 | 7,257 | 7,277 |
| | % of BESs aligned | 0.59 | 47.24 | 67.84 | 73.30 | 74.73 | 74.94 |
| | Total sequence length of BESs aligned to the | 15,539 | 1,367,385 | 1,960,413 | 2,048,311 | 1,996,820 | 1,975,511 |
| | % of sequence length of BESs aligned to the | 0.41 | 35.59 | 51.10 | 53.31 | 51.95 | 51.42 |
| | Continuous MinMatch (bp) | 100 | 100 | 100 | 100 | 100 | 100 |
| A-subgenome-specific BESs | Sequence identity (%) | 100 | 95 | 90 | 80 | 70 | 60 |
| (No. of BESs = 418; total sequence length = 169,533 bp) | No. of BESs aligned | 7 | 205 | 294 | 310 | 317 | 317 |
| | % of BESs aligned | 1.67 | 49.04 | 70.33 | 74.16 | 75.84 | 75.84 |
| | Continuous MinMatch (bp) | 100 | 100 | 100 | 100 | 100 | 100 |
| D-subgenome-specific BESs | Sequence identity (%) | 100 | 95 | 90 | 80 | 70 | 60 |
| (No. of BESs = 184; total sequence length = 71,410 bp) | No. of BESs aligned | 1 | 88 | 128 | 136 | 139 | 139 |
| % of BESs aligned | 0.54 | 47.83 | 69.57 | 73.91 | 75.54 | 75.54 | |