| Literature DB >> 21939539 |
Xiao-Wei Wang1, Jun-Bo Luan, Jun-Min Li, Yun-Lin Su, Jun Xia, Shu-Sheng Liu.
Abstract
BACKGROUND: Invasive species are valuable model systems for examining the evolutionary processes and molecular mechanisms associated with their specific characteristics by comparison with closely related species. Over the past 20 years, two species of the whitefly Bemisia tabaci species complex, Middle East-Asia Minor 1 (MEAM1) and Mediterranean (MED), have both spread from their origin Middle East/Mediterranean to many countries despite their apparent differences in many life history parameters. Previously, we have sequenced the transcriptome of MED. In this study, we sequenced the transcriptome of MEAM1 and took a comparative genomic approach to investigate the transcriptome evolution and the genetic factors underlying the differences between MEAM1 and MED.Entities:
Mesh:
Year: 2011 PMID: 21939539 PMCID: PMC3189941 DOI: 10.1186/1471-2164-12-458
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary for the MEAM1 whitefly transcriptome
| Total number of reads | 17,049,500 |
| Total base pairs (bp) | 1,278,712,500 |
| Average read length (bp) | 75 |
| Total number of contigs | 123,055 |
| Mean length of contigs (bp) | 269 |
| Total number of scaffolds | 104,722 |
| Mean length of scaffolds (bp) | 326 |
| Clusters | 135 |
| Singletons | 57,606 |
| Total unique sequences | 57,741 |
| Sequences with E-value < 10-5 | 15,922 |
Figure 1Length distribution of distinct sequences. The numbers of distinct sequences are shown on the top of each bar.
Figure 2Characteristics of homology search of assembled sequences against the nr database. A. E-value distribution of best BLASTx hits for each distinct sequence with a cut-off E-value of 1.0E-5. B. Similarity distribution of the best BLAST hit for each sequence. C. Species distribution is shown as a percentage of the total homologous sequences with an E-value of at least 1.0E-5. We used the first hit of each sequence for analysis. Homo: Homo sapiens; Mus: Mus musculus; Rat: Rattus norvegicus.
Figure 3Identification of the orthologous gene pairs between MEAM1 and MED. The bidirectional best hit method was used to identify genes that are putatively orthologs. Coding sequences (CDS) of the orthologous genes were determined by BLASTx against all known proteins in Swissprot database using a threshold of 1.0E-5. After removing the UTR regions, sequences shorter than 150 bp and with unexpected codons in the CDS region were further filtered.
Sequence divergence between the MED and MEAM1 cryptic species
| % Differences | |||||||
|---|---|---|---|---|---|---|---|
| % CpG | % GC | Loci | Mean | SE | Compared | ts/tve | |
| 5' UTRsa | 6.3 | 37.12 | 188 | ||||
| All | 1.66 | 0.153 | 21.11 | 1.41 | |||
| No CpG | 1.46 | 0.141 | 19.78 | 1.28 | |||
| CpG | 5.29 | 0.775 | 1.33 | 2.32 | |||
| CDSb | 6.57 | 42.84 | 3585 | ||||
| All | 0.83 | 0.014 | 1730.64 | 3.07 | |||
| No CpG | 0.61 | 0.011 | 1616.93 | 2.6 | |||
| CpG | 3.99 | 0.818 | 113.71 | 4.59 | |||
| nd sitesc | 5.72 | 43.62 | 3585 | ||||
| All | 0.2 | 0.007 | 1024.65 | 1.34 | |||
| No CpG | 0.18 | 0.007 | 966.05 | 1.2 | |||
| CpG | 0.58 | 0.051 | 58.6 | 2.63 | |||
| 4d sitesd | 11.03 | 37.2 | 3585 | ||||
| All | 2.68 | 0.051 | 238.02 | 2.38 | |||
| No CpG | 1.73 | 0.041 | 211.77 | 1.69 | |||
| CpG | 12.16 | 0.332 | 26.25 | 3.94 | |||
| 3' UTRs | 3.74 | 35.19 | 286 | ||||
| All | 1.43 | 0.114 | 43.1 | 1.56 | |||
| No CpG | 1.1 | 0.102 | 41.48 | 1.36 | |||
| CpG | 7.32 | 0.848 | 1.61 | 3.18 | |||
aUTRs: untranslated regions.
bCDS: coding sequence.
cnd sites: non-degenerative sites.
d4d sites: fourfold-degenerate sites where no changes cause any amino acid replacement.
ets/tv: ration of transitions (ts) over transversions (tv).
List of genes with Ka/Ks larger than one
| MEAM1 Gene IDs | S-Sub | N-Sub | Ka | Ks | Ka/Ks | Protein homolog |
|---|---|---|---|---|---|---|
| BT_B_ZJU_Singletons23470 | 16 | 2 | 0.0413 | 0.0142 | 2.9039 | Alpha-trehalose-phosphate synthase |
| BT_B_ZJU_Singletons22749 | 5 | 1 | 0.0310 | 0.0117 | 2.6482 | Hydroxyacylglutathione hydrolase |
| BT_B_ZJU_Singletons82918 | 16 | 3 | 0.0822 | 0.0342 | 2.4027 | Oligopeptide transporter |
| BT_B_ZJU_Singletons9172 | 8 | 2 | 0.0429 | 0.0261 | 1.6434 | RNA-directed DNA polymerase |
| BT_B_ZJU_Singletons22262 | 2 | 1 | 0.0148 | 0.0092 | 1.6172 | Keratin 9 |
| BT_B_ZJU_Singletons97129 | 5 | 1 | 0.0184 | 0.0115 | 1.5985 | Diamine acetyltransferase |
| BT_B_ZJU_Singletons86701 | 8 | 2 | 0.0367 | 0.0238 | 1.5386 | Putative ankyrin repeat protein |
| BT_B_ZJU_Singletons25715 | 4 | 1 | 0.0061 | 0.0045 | 1.3570 | UPF0580 protein C15orf58 homolog |
| BT_B_ZJU_Singletons98955 | 4 | 1 | 0.0075 | 0.0057 | 1.3165 | Coactivator of PPAR-gamma-like |
| BT_B_ZJU_Singletons14447 | 4 | 1 | 0.0063 | 0.0049 | 1.2796 | Phosphatidylserine synthase 1 |
| BT_B_ZJU_Singletons8887 | 4 | 1 | 0.0205 | 0.0162 | 1.2653 | Cysteine proteinase |
| BT_B_ZJU_Singletons102961 | 4 | 1 | 0.0125 | 0.0100 | 1.2567 | tRNA (cytosine-5-)-methyltransferase |
| BT_B_ZJU_Singletons103698 | 3 | 1 | 0.0104 | 0.0084 | 1.2325 | tRNA (uracil-5-)-methyltransferase |
| BT_B_ZJU_Singletons103804 | 4 | 1 | 0.0116 | 0.0094 | 1.2288 | Cyclin-K |
| BT_B_ZJU_Singletons69824 | 3 | 1 | 0.0210 | 0.0173 | 1.2133 | Extracellular domains-containing protein |
| BT_B_ZJU_Singletons102115 | 16 | 4 | 0.0226 | 0.0192 | 1.1816 | Retinol dehydrogenase |
| BT_B_ZJU_Singletons97093 | 3 | 1 | 0.0153 | 0.0130 | 1.1709 | RNA-directed DNA polymerase |
| BT_B_ZJU_Singletons96386 | 3 | 1 | 0.0084 | 0.0076 | 1.1098 | Mitogen-activated protein kinase organizer |
| BT_B_ZJU_Singletons102891 | 10 | 3 | 0.0120 | 0.0110 | 1.0931 | Gamma-glutamyltranspeptidase 1 |
| BT_B_ZJU_Singletons103844 | 3 | 1 | 0.0062 | 0.0058 | 1.0832 | Serine/threonine-protein kinase PINK1 |
| BT_B_ZJU_Singletons22561 | 3 | 1 | 0.0059 | 0.0055 | 1.0693 | Uncharacterized protein |
| BT_B_ZJU_Singletons102559 | 7.74 | 2.26 | 0.0241 | 0.0226 | 1.0660 | Cathepsin B-like cysteine proteinase 4 |
| BT_B_ZJU_Singletons24476 | 3 | 1 | 0.0169 | 0.0166 | 1.0200 | Krueppel homologous protein 1 |
| BT_B_ZJU_Singletons15873 | 6 | 2 | 0.0304 | 0.0299 | 1.0162 | Inactive purple acid phosphatase 29 |
The GeneIDs of the MED orthologs are lists in the Additional file 4.
S-Sub: Synonymous substitutions; N-Sub: Nonsynonymous substitutions; Ka: Nonsynonymous substitution rate; Ks: Synonymous substitution rate.
Figure 4Distribution of Ka and Ks. Sequences with Ka/Ks > 1 fall above the solid line; while sequence with Ka/Ks between 0.5 -1 fall between the solid and dashed lines. Analysis was performed using the method of Yang & Nielsen (2000).
Figure 5Amino acid sequence alignments of alpha-trehalose-phosphate synthase and cytochrome P450. A. Amino acid sequences alignment of the MEAM1 and MED alpha-trehalose-phosphate synthase (TPS). The putative catalytic domain is underlined. B. Amino acid sequences alignment of the MEAM1 and MED cytochrome P450 4C1 (CYP4C). The different amino acid residues are show in light blue.