| Literature DB >> 32517321 |
Cheyenne N McKinley1, Sarah E Lower1.
Abstract
Identifying the basis of phenotypic variation is a key objective of genetics. This work has been mostly limited to model systems with a plethora of genetic manipulation and functional characterization tools. With the development of high-throughput sequencing and new computational tools, it is possible to identify candidate genes related to phenotypic variation in non-model organisms. Fireflies are excellent for studying phenotypic variation because of their diverse and well-characterized behaviors. Most adult fireflies emit a single mating flash pattern and do not eat. In contrast, adult females of many species in the genus Photuris employ multiple flash patterns and prey upon mate-seeking males of other firefly species. To investigate the genetic basis for this variation, we used comparative transcriptomics to identify positively selected genes between a predatory firefly, Photuris sp., and a non-predatory relative, Photuris frontalis, controlling for genes generally under selection in fireflies by comparing to a Photinus firefly. Nine gene families were identified under positive selection in the predatory versus non-predatory Photuris comparison, including genes involved in digestion, detoxification, vision, reproduction, and neural processes. These results generate intriguing hypotheses about the genetic basis for insect behavior and highlight the utility of comparative transcriptomic tools to investigate complex behaviors in non-model systems.Entities:
Keywords: FUSTr; Photinus; dN/dS; molecular evolution
Year: 2020 PMID: 32517321 PMCID: PMC7348864 DOI: 10.3390/genes11060627
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Graphical workflow of computational methods. The specific programs used for each step are shown in italic Courier font. Colored bars represent raw reads that are first downloaded, and then assembled into contigs and quality trimmed. The assemblies are assessed for gene families under positive selection using the posterior probability that β > α. The putative function of the positively selected genes is determined using a homology search. We are interested in the genes involved in predatory behavior, which are in the Photuris frontalis and Photuris sp. comparison, represented by a star.
Assembly statistics for three firefly species. Statistics compiled from Trinity assembly assessment output (script name/flags). N genes = number of unique genes in assembly, N transcripts = number of unique transcripts (including isoforms) in assembly. GC (%) = GC content percentage in assembly. Median length (bp) = median length of longest isoforms per gene. Mean length (bp) = mean length of longest isoforms per gene. N bases = number of total bases in assembly based on longest isoforms per gene only. N50 = N50 statistic based on longest isoforms per gene only. Assembly statistics for all transcripts, not just longest isoforms, are given in (Supplemental Table S2).
| Species | N Genes | N Transcripts | GC (%) | Median Length (bp) | Mean Length (bp) | N Bases | N50 |
|---|---|---|---|---|---|---|---|
|
| 40,547 | 58,028 | 34.65 | 405 | 866.46 | 35,132,369 | 1693 |
| 38,303 | 56,626 | 34.43 | 402 | 901.44 | 34,527,893 | 1820 | |
|
| 130,648 | 188,474 | 38.96 | 346 | 634.52 | 82,898,270 | 902 |
Benchmarking Universal Single-Copy Orthologs (BUSCO) assembly scores. C = Complete BUSCOs. S = Complete and single-copy BUSCOs. D = Complete and duplicated BUSCOs. F = Fragmented BUSCOs. M = Missing BUSCOs.
| Species | C (%) | S (%) | D (%) | F (%) | M (%) |
|---|---|---|---|---|---|
|
| 90.9% | 62.2% | 28.7% | 4.9% | 4.2% |
| 91.3% | 59.6% | 31.7% | 4.4% | 4.3% | |
|
| 93.5% | 46.3% | 47.2% | 3.9% | 2.6% |
Figure 2Homologs of genes under positive selection in each comparison identified by BLAST searches. Each color/slice represents the BLAST homolog for each gene family. Percentages represent the number of genes with that functional annotation divided by the total number of genes under positive selection. Parentheticals following each protein name represent (number of genes, number of gene families). (A) Photuris sp. and Photuris frontalis comparison. (B) Photuris frontalis and Photinus pyralis comparison. (C) Photuris sp. and Photinus pyralis comparison. Detailed results, including representation of sequences from each species in the comparison, can be found in supplemental Tables S4–S6.
FUSTr results. Comparison = the two species FUSTr compared in its analysis. N input transcripts = number of transcripts input into FUSTr. N isoforms disregarded = number of isoforms removed from analysis. N transcripts used = number of transcripts used by FUSTr. N families = number of gene families found in each comparison. N families with > 15 sequences = number of gene families with greater than 15 sequences. N families with β > α = number of gene families found to be under strong positive selection.
| Comparison | N input Transcripts | N Isoforms Disregarded | N Transcripts Used | N Families | N Families with > 15 Sequences | N Families with β > α |
|---|---|---|---|---|---|---|
| 114,654 | 29,237 | 46,371 | 36,387 | 44 | 9 | |
| 245,100 | 49,882 | 71,700 | 62,055 | 49 | 12 | |
| 246,502 | 49,025 | 74,039 | 64,354 | 48 | 8 |