| Literature DB >> 33339505 |
Alejandra Carla Scannapieco1, Claudia Alejandra Conte1, Máximo Rivarola2, Juan Pedro Wulff1, Irina Muntaabski1, Andrés Ribone2, Fabián Milla1, Jorge Luis Cladera1, Silvia Beatriz Lanzavecchia3.
Abstract
BACKGROUND: Anastrepha fraterculus sp. 1 is considered a quarantine pest in several American countries. Since chemical control applied in an integrated pest management program is the only strategy utilized against this pest, the development of pesticide-free methods, such as the Sterile Insect Technique, is being considered. The search for genes involved in sex-determination and differentiation, and in metabolic pathways associated with communication and mating behaviour, contributes with key information to the development of genetic control strategies. The aims of this work were to perform a comprehensive analysis of A. fraterculus sp. 1 transcriptome and to obtain an initial evaluation of genes associated with main metabolic pathways by the expression analysis of specific transcripts identified in embryos and adults.Entities:
Keywords: Differential gene expression; Fruit fly; Microsatellite markers; RNA-Seq analysis; Transcript annotation
Mesh:
Year: 2020 PMID: 33339505 PMCID: PMC7747455 DOI: 10.1186/s12863-020-00943-2
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Summary of the A. fraterculus sp. 1 transcriptome
| Total number of reads (72 h embryos/ Adult females/ Adult males) | 5,519,495/ 4,765,195/ 6,322,876 |
| Number of discarded reads (72 h embryos/ Adult females/ Adult males) | 6640/ 6653/ 6356 |
| Mean read length (bp) | 151 |
| Smallest transcript length (bp) | 201 |
| Number of largest transcript length (bp) | 12,701 |
| Number of overall average length (bp) | 682.8 |
| Median transcript length (bp) | 410 |
| N50 (bp) | 1020 |
| Total assembled sequences | 86,925 |
| Number of loci | 57,000 |
| Number of GO annotated sequences | 28,756 |
| Number of BLAST annotated sequences | 52,655 |
| GC% (72 h embryos/ Adult females/ Adult males) | 44/ 44/ 42 |
| Complete BUSCOsa | 1762 (62.9%) |
| Complete and single-copy BUSCOs a | 827 (29.5) |
| Complete and duplicated BUSCOs a | 935 (33.4%) |
| Fragmented BUSCOs a | 484 (17.3%) |
| Missing BUSCOsa | 553 (19.8%) |
aBUSCO completeness assessment results for A. fraterculus transcriptome. Results are based on a complete set out of 2799 single-copy orthologs. bp base pairs
Fig. 1Gene Ontology (GO) assignment of A. fraterculus sp. 1 transcripts. The results are summarized in the three main categories “cellular component”, “molecular function” and “biological process”. GO was assigned to 28,756 transcripts in total. The percentage (left Y-axis) and total number (right Y-axis) of transcripts in each category (the second GO level) are shown. Y-axes are in log (10) scale. WEGO was used to produce the graph
Ortholog prediction analysis for the A. fraterculus sp. 1 transcriptome compared to three closely related species (B. oleae and R. zephyria and C. capitata) (Additional File 3). The table shows the number of sequences (predicted proteins) from each species with orthologs in every other proteome
| … with orthologs in … | |||||
|---|---|---|---|---|---|
| Number of proteins from … | 16,420 | 16,931 | 16,508 | ||
| 12,627 | 15,444 | 16,047 | |||
| 15,289 | 19,023 | 19,664 | |||
| 15,205 | 20,057 | 19,413 | |||
Fig. 2Descriptive analysis of differentially expressed transcripts in paired-comparisons between libraries. Values inside each bar indicate the number of differentially expressed transcripts (Y-axis) for each comparison (72 h embryo/ female, 72 h embryos/ male, female/ male). 72 h embryos/ females: the number of transcripts over or under expressed in 72 h embryos compared to females. 72 h embryos/ males: the number of transcripts over or under expressed in 72 h embryos compared to males. Females/ males: the number of transcripts over or under expressed in females compared to males. Threshold criteria: from the transcriptome assembly only transcripts with > 10 cpm were considered; FC > 10 over-represented transcripts; FC < 0.1 under-represented transcripts
Description, annotation and qPCR characteristics of selected transcripts
| ID/ gene namea | GO term assigned | Accession no.b | Primer sequence 5′-3′ | Product size (bp) | qPCR efficiency |
|---|---|---|---|---|---|
| Trinity DN12595_c0_g1_i2 | 0016020/ 0007349/ 0005737 | JQ599256 | Fw: TGAGAGTTTGCGCACAGTGA Rv: CACTGCTGCACCTGAGTCAT | 154 | 1.882 |
| Trinity DN15480_c0_g1_i3 | – | KY204955 | Fw: TGGAAATCGATGATCGCCGT Rv: GACGACGGGAGCGATAATCA | 144 | 1.957 |
| Trinity DN10186_c0_g1_i4 | 0005875/ 0008340/ 0042026/ 0042595 | XM011197636 | Fw: TTTCGGCTTTGGCTTGCATC Rv: TGCACACTTGGAAGCCATCT | 200 | 1.929 |
| Trinity DN11516_c0_g1_i3 | – | JQ048622 | Fw: TTTACGGCGTTGGTAGTGCC Rv: CCGATCAACTCAGCTTTGGG | 106 | 1.924 |
| Trinity DN11027_c0_g1_i1 | – | KU317977 | Fw: GCTGCAGTCGTTGTCCATTG Rv: GTGTCGACATGCAACTCAGC | 85 | 1.934 |
| Trinity DN15916_c0_g4_i3 | 0012505/ 0016021 | XM029042282 | Fw: GCGTGTTGGATGCAGTGTTT Rv: CGTTTGGGATAGGCACGGAA | 109 | 1.905 |
| Trinity DN12527_c0_g1_i1 | – | XM011180327 | Fw: CCAGCCCGAATACGGTACAA Rv: CGGTCGACGGACTTCTGAAA | 138 | 1.907 |
| Trinity DN6544_c0_g1_i2 | 0000022/ 0003735/ 0022625/ 0006412 | NM139834 | Fw: GAAAGGCTCCTGGTGTTCCA Rv: TTTGTAACCGCAGCTTGACC | 104 | 1.963 |
| Trinity DN12142_c0_g1_i2 | 0008340/ 0003746/ 0006414/ 0005525/ 0006184/ 0005829/ 0005853 | GU339154 | Fw: GCACCACGAAGCTTTAGCAG Rv: ACTGGAGTGTAACCGTTGGC | 198 | 1.959 |
a Gene name from the best BLAST hit or GO annotation
b Accession number from the annotated sequence with the best BLAST hit
Fig. 3Comparative expression profiles of candidate genes obtained by qPCR among E (72 h embryos), sexually mature and virgin adults females (F) and males (M). NRQ are Expression Units. Different letters indicate significant differences (P < 0.05) between stages (t-test results). Letters marked with an asterisk (*) showed statistically marginal differences (P = 0.08). Reference genes for qPCR: ribosomal protein L18 (rpL18) and elongation factor-1a (ef-1a)
Fig. 4Comparative expression profiles of candidate genes obtained by qPCR among virgin vs mated females (vF, mF) and males (vM, mM). NRQ are Expression Units. Different letters indicate significant differences (P < 0.05) between treatments (t-test results). Letters marked with an asterisk (*) showed statistically marginal differences (P = 0.08). Reference genes for qPCR: ribosomal protein L18 (rpL18) and elongation factor-1a (ef-1a)
Fig. 5Microsatellite marker prediction. a Distribution of microsatellite motif by repeat type classes. Numbers represent percentage of transcripts within each repeat class (2: di-, 3: tri-, 4: tetra- and 5: pentanucleotide motifs, respectively). b Distribution and composition of di-nucleotide repeats. c Distribution and composition of tri-nucleotide repeats