| Literature DB >> 33969207 |
Mei Jiang1, Yuning Li2, Haimei Chen1, Bin Wang2, Chang Liu1.
Abstract
Lactuca raddeana is a biennial plant of the Lactuca genus belonging to the Asteraceae family. The classification of Lactuca is controversial, and no consistent conclusions have been reached based on the analysis of morphological characters and different molecular markers. Here, we sequenced and assembled the complete chloroplast genome of L. raddeana. This genome has a total length of 152,339 bp, a conservative quartile structure that is composed of a large single-copy (LSC) region of 83,976 bp, a small-copy (SSC) region of 18,521 bp, and a pair of inverted repeats (IRs) of 24,921 bp. The genome contains 112 unique genes, including 79 protein-coding, four rRNA, and 29 tRNA genes. Repeat analysis obtained 17 microsatellite, 16 tandem, and 17 interspersed repeats. Comparison of sequence divergence between L. raddeana and L. sativa found the intergenic spacer trnC-petN exhibited the highest degree of variation. Three phylogenetic trees based on the 72 shared protein, matK gene, and rbcL gene sequences showed that L. raddeana and L. sativa are clustered together. The acquisition and comparative analysis of the chloroplast genome provide a valuable resource for the taxonomic and phylogenetic studies of Lactuca.Entities:
Keywords: Chloroplast genome; DNA barcode; Lactuca raddeana; Phylogeny
Year: 2021 PMID: 33969207 PMCID: PMC8079048 DOI: 10.1080/23802359.2021.1911700
Source DB: PubMed Journal: Mitochondrial DNA B Resour ISSN: 2380-2359 Impact factor: 0.658
Figure 1.The schematic representation of the chloroplast genome of L. raddeana was created by using CPGAVAS2. The map contains four rings. From the center going outward, the first circle shows the scattered forward and reverse repeats connected with red and green arcs, respectively. The next circle shows the tandem repeats marked with short bars. The third circle shows the microsatellite sequences identified. The fourth circle shows the gene structure of the chloroplast genome. The genes were colored based on their functional categories, which are shown in the left corner.
Gene composition in the chloroplast genome of L. raddeana.
| Category of genes | Group of genes | Name of genes |
|---|---|---|
| rRNA | ||
| tRNA | 29 trna genes (6 contain an intron) | |
| Photosynthesis | Subunits of ATP synthase | |
| Subunits of photosystem II | ||
| Subunits of NADH-dehydrogenase | ||
| Subunits of cytochrome b/f complex | ||
| Subunits of photosystem I | ||
| Subunit of rubisco | ||
| Self replication | Large subunit of ribosome | |
| DNA dependent RNA polymerase | ||
| Small subunit of ribosome | ||
| Other genes | Subunit of Acetyl-CoA-carboxylase | |
| c-type cytochrom synthesis gene | ||
| Envelop membRNAe protein | ||
| Protease | ||
| TRNAslational initiation factor | ||
| Maturase | ||
| Unknown | Conserves open reading frames |
The ‘2’ symbol after gene name reflect the genes located on the IR regions.
Microsatellite repeats in the chloroplast genome of L. raddeana.
| SSR type | Structure of SSRs | Size (bp) | Start | End | Region (gene names) |
|---|---|---|---|---|---|
| p1 | (A)10 | 10 | 4408 | 4417 | IGS (matK-rps16) |
| p1 | (T)15 | 15 | 12,680 | 12,694 | IGS (trnE-UUC- rpoB) |
| p1 | (A)11 | 11 | 16,599 | 16,609 | Intron (rpoC1) |
| p1 | (A)10 | 10 | 18,409 | 18,418 | Exon (rpoC1) |
| p1 | (T)10 | 10 | 23,213 | 23,222 | IGS (rpoC2- rps2) |
| p2 | (AT)6 | 12 | 26,596 | 26,607 | IGS (atpH- atpF) |
| p1 | (A)10 | 10 | 43,666 | 43,675 | Intron (ycf3) |
| p1 | (T)10 | 10 | 50,222 | 50,231 | IGS (ndhC- trnM-CAU) |
| p1 | (T)10 | 10 | 59,176 | 59,185 | IGS (psaI- ycf4) |
| p1 | (T)10 | 10 | 67,305 | 67,314 | IGS (psaJ- rpl33) |
| p2 | (TA)7 | 14 | 67,647 | 67,660 | IGS (rpl33- rps18) |
| p1 | (T)11 | 11 | 70,358 | 70,368 | Exon (clpP) |
| p1 | (T)10 | 10 | 71,329 | 71,338 | Intron (clpP) |
| p1 | (T)10 | 10 | 77,876 | 77,885 | Exon (rpoA) |
| p1 | (A)10 | 10 | 116,247 | 116,256 | IGS (ndhD- psaC) |
| p1 | (T)10 | 10 | 124,260 | 124,269 | Exon (ycf1) |
| p1 | (T)10 | 10 | 124,985 | 124,994 | Exon (ycf1) |
p1: perfect repeat of mononulceotide; p2: perfect repeat of dinucleotides. The structure of SSRs is presented as repeat units surrounded with parenthesis, followed with the number of repeat units.
Tandem repeats in the chloroplast genome of L. raddeana.
| Start-end sites of tandem repeats | Repeat units | Region (gene name) | ||
|---|---|---|---|---|
| Size (bp) | Percent of match | Sequence (copy number) | ||
| 4871–4908 | 19 | 100 | TAAAGAACTTCGATTCCTT (2) | IGS ( |
| 8639–8682 | 21 | 100 | TATGTTACATTACATATCAAG (2) | IGS ( |
| 11,771–11,808 | 19 | 94 | ACCAATGGACCATAGGGGC (2) | Exon ( |
| 12,249–12,305 | 28 | 100 | ATTGAGCCCTAAATAATACTGTCAATTG (2) | IGS ( |
| 31,073–31,106 | 17 | 94 | GTTAGAGTATATTAGTT (2) | IGS ( |
| 31,492–31,531 | 19 | 100 | ATAATATTTCTATATTTAG (2) | IGS ( |
| 56,199–56,248 | 25 | 92 | ATTTGAGTTTCAGGCAATGGATACT (2) | Exon ( |
| 58,617–58,646 | 15 | 100 | TTCAATATATTCAAT (2) | IGS ( |
| 58,620–58,655 | 15 | 90 | AATATATTCAATATA (2) | IGS ( |
| 58,743–58,790 | 24 | 100 | AATAATATCTCATCCATATCCTTT (2) | IGS ( |
| 58,879–58,914 | 18 | 100 | TATTAAGTAATAATAATA (2) | IGS ( |
| 65,047–65,095 | 24 | 100 | AATAGAAATGACTAGTCTTAGAAT (2) | IGS ( |
| 65,473–65,528 | 27 | 100 | TTTTATACAGTTGTGACTTTATCCTTG (2) | IGS ( |
| 66,338–66,369 | 16 | 100 | GTCCAATGTGAATTGA (2) | IGS ( |
| 67,780–67,820 | 21 | 95 | TAAATCCAAGCGAACTTTTCT (2) | Exon ( |
| 68,264–68,319 | 26 | 96 | TATTCTTATCATATTCTATTCATTTT (2) | IGS ( |
| 81,570–81,611 | 21 | 100 | TTTGTATAGATAACTAAATAT (2) | IGS ( |
| 91,223–91,282 | 18 | 100 | CGATATTGATGCTAGTGA (3) | Exon ( |
| 106,912–106,973 | 32 | 93 | CATTGTTCAACTCTTTAACAACACGAAAAAAC (2) | Exon ( |
| 107,809–107,839 | 15 | 100 | AGTAGCATAACAAAA (2) | IGS ( |
| 107,947–107,984 | 18 | 100 | TAATTGGATAGTTGTAAA (2) | IGS ( |
| 112,130–112,177 | 22 | 96 | TTATATTATAGACAAATATTAT(2) | IGS ( |
| 128,332–128,369 | 18 | 100 | TATTTACAACTATCCAAT (2) | IGS ( |
| 128,477–128,507 | 15 | 100 | TTTTTGTTATGCTAC (2) | IGS ( |
| 129,343–129,404 | 32 | 93 | TTTTTCATGTTGTCAAAGAGTTGAACAATGGT (2) | Exon ( |
| 145,034–145,093 | 18 | 100 | ATATCGTCACTAGCATCA (3) | Exon ( |
Figure 2.Comparison of the chloroplast genomes of L. raddeana and L. sativa by using mVASTA. The gray line and arrow indicated the orientation of genes. The coloring of the different conserved regions corresponds to the annotation of the region. The genes were represented as gray arrows on the top of the alignment. Other regions are labeled with different colors. The pink regions are ‘Conserved Non-Coding Sequences’ (CNS), the dark blue regions are exons, and the light-blue regions are ‘Untranslated regions’ (UTR). The Y-scale represents the percent identity ranging from 50% to 100%.
Figure 3.K2p distances between the IGS regions of L. raddeana and L. sativa.
Figure 4.LSC, SSC, and IR regions' boundaries in the chloroplast genomes of L. raddeana and L. sativa. The genes on the positive and negative strands are presented above and below the tracks, respectively. The JLB, JSB, JSA, and JLA represent junction sites of IRb and LSC, IRb and SSC, SSC and IRa, and IRa and LSC, respectively.
Figure 5.The phylogenetic tree of species from Cichorieae constructed using the maximum likelihood (ML) method using the 72 shared proteins present among the seventeen chloroplast genomes. The number above each node represents are the bootstrap value.