Literature DB >> 28384348

De novo transcriptome analysis of the excretory tubules of Carausius morosus (Phasmatodea) and possible functions of the midgut 'appendices'.

Matan Shelomi1,2.   

Abstract

The Malpighian tubules are the insect excretory organs, responsible for ion and water homeostasis and elimination of nitrogenous wastes. Post-genomic assays suggest they also metabolize and detoxify xenobiotic compounds and have antimicrobial properties. The Phasmatodea have an additional, unique set of excretory organs referred to predominantly as midgut appendices. Their function and how it compares to phasmid and other insect Malpighian tubules is unknown. Hypotheses include carbonic anhydrase activity, calcium and metal cation sequestration, and xenobiotic transport. This work presents the first comparative transcriptomic analysis of the Phasmatodean excretory organs, using the model insect Carausius morosus. I produced de novo transcriptomes of the midgut appendices, midgut wall, and Malpighian tubules, and looked for differentially expressed genes associated with putative organ functions. The appendices differentially and highly express lipid transport and metabolism proteins, and the biomineralization gene otopetrin. The Malpighian tubules differentially and highly express acid phosphatases and multiple transporter types, while appendices express fat-soluble vitamin and peptide transporters. Many defense proteins such as multidrug resistance proteins, ABC transporters, cytochrome P450's, and glutathione-S-transferases were differentially expressed in specific excretory organs. I hypothesize that the appendices and Malpighian tubules both have defensive / xenobiotic metabolism functions, but each likely target different substrates. Phasmid Malpighian tubules excrete as in other insects, while the appendices may predominantly regulate amino acids, fats, and fat-soluble compounds. Lipid metabolism in insects is poorly understood, and the Phasmatodea may thus serve as a model for studying this further.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28384348      PMCID: PMC5383107          DOI: 10.1371/journal.pone.0174984

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The main organs of insect excretion, analogous to the human nephrons, are the Malpighian tubules, which usually arise at the junction of the midgut and hindgut. Not only do they eliminate nitrogenous wastes and regulate water/ion balances in the hemolymph, but also they function in the elimination and metabolism of xenobiotics such as plant secondary toxins [1]. Pioneering work on the excretory physiology of Malpighian tubules was done by J.A. Ramsay in the 1950’s, in which he perfected techniques of collecting urine directly from a single tubule dissected and placed in a drop of saline under liquid paraffin, using the model insects Rhodnius [2] and the laboratory or Indian walking stick, Carausius morosus (previously Dixippus morosus) [3-6]. In the 1970’s, Taylor performed a series of physiological and ultrastructural studies on the Malpighian tubules, again using Carausius [7-10]. Tracking the fate of injected dyes or alkaloids as they are absorbed by the tubules for elimination is another technique with considerable history [11-12], which confirmed that the tubules excrete such xenobiotic solutes via active transport [13-16]. More recently, electrophysiological [17] and post-genomics era assays, primarily on model insects Drosophila and Manduca, have identified the proteins associated with the various transport functions of the tubules, namely V-ATPase proton pumps working in concert with cation/proton antiporters, demonstrating that the tubules can actively transport solutes [18-21]. They also revealed a large complement of defense proteins such as multidrug resistance proteins (a class of ATP-binding cassette, or ABC, transporters) [22], NO synthase (used in immune sensing) [23], Diptericin (an antimicrobial peptide effective against gram-negative bacteria whose expression is upregulated by NO synthase) [24], and xenobiotic metabolism/detoxification/conjugation genes such as cytochrome P450s, glutathione-S-transferases and alcohol dehydrogenases [24-26]. As many of these genes have human homologues involved in kidney disorders, the Malpighian tubules have been declared a model for human renal disease [21, 27]. In addition to the Malpighian tubules, some types of excretion are performed as ancillary functions of other organs such as the midgut [28] or pericardial tissue [29]. However, a separate and unique excretory organ system evolved in the Phasmatodea, including Carausius. They have been given many names, but are referred to mostly as “appendices of the midgut” or some translation thereof [30-36]. They consist of long, coiled tubules approximately 1/3 the diameter of Malpighian tubules that end blindly in the hemolymph near the anal end of the insect, and are proximally connected to pear-shaped ampullae 300–500μm long that project from and open into the posterior end of the midgut (Fig 1, S1 Fig). The tubules are highly motile, well-tracheated [32] with muscular elements spiraling across their lengths as in Malpighian tubules [36] and showing simple harmonic motion. The ampullae often appear filled with a yellow fluid, but neither they nor the tubules contain microbial symbionts [30]. An autapomorphy of the Phasmatodea [31], these “appendices” are not, as their name would suggest, vestigial, nor are they a displaced variant of the Malpighian tubules, though the two are likely homologous in origin. Evidence with vital staining confirms that the appendices actively transport solutes. However they do not transport the same stains as the Malpighian tubules, such as indigo carmine or methyl green [14, 28], nor stains associated with pericardial tissue, such as ammonium carmine [12, 29]. Instead they absorb and eliminate into the midgut lumen a unique set of predominantly cationic but otherwise chemically disparate stains, suggesting that, though excretory, their exact function has no known analogue in arthropod excretory tissue [32]. The only other clue to their function is an assay by Monteiro et al. [33] finding the highest specific activity of the alkalinizing enzyme carbonic anhydrase in the “Midgut protuberances,” which corresponds with the known alkalinity of the posterior midgut at and following the origin of the midgut tubules [30].
Fig 1

Digestive and excretory system of the Phasmatodea.

A) Schematic and B) dissection of the alimentary canal from Aretaon asperrimus, (Heteropterygidae) typical of other Phasmatodea [32]. The insect was vitally stained with New Methylene Blue N and dissected 6 days later. The appendices [violet] appear on the posterior midgut. The Malpighian tubules [colorless] originate at the midgut/hindgut junction, trailing over the posterior midgut before going towards the posterior end of the insect. The gut section between the two, the “post-posterior midgut,” was used for our midgut wall (MGWall) samples, excluding any tubules. Key: AMG = anterior midgut. HG = hindgut, MMG = middle midgut. PMG = posterior midgut. PPMG = post-posterior midgut. The schematic is reused with permission from this author’s previously published work [32].

Digestive and excretory system of the Phasmatodea.

A) Schematic and B) dissection of the alimentary canal from Aretaon asperrimus, (Heteropterygidae) typical of other Phasmatodea [32]. The insect was vitally stained with New Methylene Blue N and dissected 6 days later. The appendices [violet] appear on the posterior midgut. The Malpighian tubules [colorless] originate at the midgut/hindgut junction, trailing over the posterior midgut before going towards the posterior end of the insect. The gut section between the two, the “post-posterior midgut,” was used for our midgut wall (MGWall) samples, excluding any tubules. Key: AMG = anterior midgut. HG = hindgut, MMG = middle midgut. PMG = posterior midgut. PPMG = post-posterior midgut. The schematic is reused with permission from this author’s previously published work [32]. Thus, despite their having been described over a century ago [34, 35], we still know little about what the appendices actually do. Ramsey himself never successfully used his urine-collecting methods on the appendices despite having developed them on the Phasmatodea, or at least never published the results of his efforts. This is likely due to the appendices’ degradation in Ringer and other medias [36], and the difficulty of performing Ramsay assays on tubules with too narrow a lumen [19, 37]. Several authors studying Phasmatodea Malpighian tubules made no mention of the appendices altogether, including Taylor in his work on Carausius [7-10]. The extant hypotheses for the appendices’ function is excretion of solutes the Malpighian tubules cannot process, which could include xenobiotics, as well as carbonic anhydrase activity. Shelomi & Kimsey [32] predicted the appendices function in calcium regulation and/or organic alkaloid sequestration, while rejecting functions known for other insect midgut outcroppings such as microbial crypts [30]. Next-generation sequencing technologies such as transcriptomics can rapidly test these hypotheses and develop new insights into the function of these enigmatic organs, just as they revealed hitherto unknown or only hypothesized functions in the Malpighian tubules [24, 27]. Tissue-specific analyses can also identify tissue-specific functions and differentially expressed genes, including tightly expressed genes that could otherwise be overlooked in whole-organism tests [21, 26]. In this experiment, I returned to the model organism Carausius morosus, and generated de novo transcriptomes for three tissue types: the Malpighian Tubules (MpgT), appendices of the midgut (AoM), and the midgut wall itself (MGwall). Using RNA-Seq, I identified the highest and/or most differentially expressed genes in each tissue type, with the goal of determining what the function of the AoMs is on a molecular level.

Results

De novo transcriptome assembly and assessment

A total of 9 C. morosus cDNA libraries—three each from AoMs, MpgTs, and MGwalls—were produced from adult, female [the species is mostly parthenogenetic] insects. Illumina sequencing and processing generated 20 million reads (4.00 gigabases) of data per library, available in the Sequence Read Archive (SRA) of NCBI [see data availability statement]. The transcriptome assembly from the quality tested, trimmed, and pooled data produ was refined via duplications testing and fusion of contiguous sequences with the program CAP3 [38] to 73,143 non-redundant contigs (Tables 1 and 2). All contigs are either named “Contig#” [post-CAP3 contigs] or “Carausius_C#” [unchanged from original assembly]. The species most commonly represented in BLAST hits for the assembly as determined via Blast2GO v3.0.11 [39] were all animals and almost all insects, with the majority of identified hits belonging to the termite Zootermopsis nevadensis (S2 Fig), as expected given the close relationship of Phasmatodea with Blattodea/Isoptera relative to the other insects on the list [31].
Table 1

Transcriptome assembly contig length measurements.

with scaffolded regionswithout scaffolded regions
N75599468
N501206932
N2523221796
Minimum25050
Maximum41,52141,521
Average863662
Count85,006108,805
Table 2

Transcriptome assembly summary statistics.

CountAverage LengthTotal Bases
Reads161,455,31894.5615,267,714,112
Matched126,641,49394.5411,972,103,583
Not matched34,813,82594.663,295,610,529
Contigs85,00686373,393,572
Reads in Pairs69,329,932326.56
Broken paired reads57,311,56194

RNA-Seq and differential expression

Mapping the original library reads to the contigs from the CAP3 transcriptome using Cufflinks v2.2.1 resulted in 64,747 contigs with mapped reads. Cufflinks then identified 9196 contigs differentially expressed most highly in MpgTs relative to the other two tissues, 3240 such contigs in the MGWall, and 5084 in the AoMs (Fig 2). It also found 4577 contigs differentially expressed in both excretory organs (AoM + MpgT) relative to the midgut wall, 3240 to the midgut tissues (AoM + MGWall), and 1303 to the MGWall and MpgT. By defining high expression as any RPKM (reads per kilo base per million mapped reads) value ten times above the mean, I found 881 contigs highly expressed in the MpgTs, 924 in the AoMs, and 758 in the MGwall, including those highly expressed in more than one tissue type (318 highly expressed in all three). Combining these statistics I found 379 contigs highly and differentially expressed in the MpgT alone, 361 in the AoMs, and 427 in the midgut (Fig 2). From these, 126, 131 and 109 were unidentifiable by BLASTx searches.
Fig 2

Venn diagram showing the distribution of transcripts in the tissue libraries.

The number of sequences differentially expressed (and, in parentheses, the number highly expressed) in the midgut “appendices,” Malpighian tubules, the midgut wall, and pairs of these tissues are given. The center shows the number of sequences equally expressed in all three tissues (meaning no differential expression), with the number highly expressed in all three tissues in parentheses. “Zero” indicates the 153 reads that Cufflinks could not map.

Venn diagram showing the distribution of transcripts in the tissue libraries.

The number of sequences differentially expressed (and, in parentheses, the number highly expressed) in the midgut “appendices,” Malpighian tubules, the midgut wall, and pairs of these tissues are given. The center shows the number of sequences equally expressed in all three tissues (meaning no differential expression), with the number highly expressed in all three tissues in parentheses. “Zero” indicates the 153 reads that Cufflinks could not map. Among the differentially and highly expressed genes in the appendices were multiple fatty-acid- and lipid storage droplet surface binding proteins, triaglycerol lipases, V-type proton ATPases, a retinol (Vitamin A) dehydrogenase, several alpha tocopherol (Vitamin E) transfer proteins, two glutathione-S-transferases, many cytochrome p450s, an otopetrin, beta-actins, and various proteases. Among the differentially and highly expressed genes in the Malpighian tubules were annexins, many cytochrome p450s and multidrug resistance proteins, prostatic acid phosphatases, many solute carrier transporter family members, and other transporters of calcium, protons, sodium, phosphates, monocarboxylates, anions, cations, trehalose, and vesicular glutamate. Among the differentially and highly expressed genes in the midgut wall were a great many ribosomal proteins, an alpha tocopherol transfer protein, triaglycerol lipases, peritrophin, ferritin, many antennal esterases, some cytochrome P450s, and many digestive proteins such as beta-galactosidase, glucosylceramidase, cellulase, maltase, polygalacturonase (pectinase), and various proteases (Table 3, S1 Table).
Table 3

Top identified, over-expressed transcripts of the Carausius appendices of the midgut.

Contig NameDescription [Blast2GO]Hit Accession #e-valuesim meanlength (bp)mean AoM RPKMlog 2 fold change (vs MpgT)log 2 fold change (vs MGwall)
Carausius_C45Chymotrypsin BIKDR14900.12.22E-590.656117396.6-8.86-1.16
Carausius_C67zinc metalloproteinaseAGM32350.14.13E-610.69015579.6-6.72-8.86
Carausius_C392lipocalin cytosolic fatty-acid bindingAGM32122.13.28E-620.826803895.0-3.57-2.32
Contig52Lipase member HKDR14939.14E-420.5519193860.6-8.93-1.19
Carausius_C126Zinc metalloproteinase nas-14KDR19395.18.8E-600.75993595.2-1.56-8.27
Contig51Glutathione S-transferase omega-1KDR22870.16.22E-170.518592743.4-4.45-1.23
Carausius_C197lipocalin cytosolic fatty-acid bindingAGM32122.12.95E-610.829732669.2-3.65-2.11
Carausius_C912Triaglycerol lipaseKDQ97822.19.08E-450.5513612631.0-8.89-1.17
Carausius_C444V-type proton ATPase subunit partialKDR08717.100.9415732448.7-0.82-1.43
Carausius_C198lipocalin cytosolic fatty-acid bindingAGM32122.14.3E-590.8110052346.3-3.37-2.87
Carausius_C103beta actin-5CNP_511052.14.47E-1240.996532310.8-0.94-1.10
Carausius_C1664zinc metalloproteinaseAGM32350.13.33E-250.614832225.6-6.15-8.70
Contig348Natterin-3 [kininogenase]KDR19911.15.07E-480.6610411933.0-1.88-2.74
Contig49beta actin, partialADZ52965.14.00E-1760.925421608.6-0.78-0.88
Contig245Zinc carboxypeptidase A 1KDR22871.16.11E-930.7121401495.5-4.64-4.48
Carausius_C1260vacuolar-type H+-ATPaseAGO46410.100.9834561286.8-0.66-1.24
Carausius_C1234zinc metalloproteinaseAGM32350.12.08E-610.715311212.7-1.88-8.14
Carausius_C395Glutathione S-transferase omega-1KDR22870.11.29E-190.511971124.9-4.05-1.81
Contig292Na-dependent nutrient amino acid transporterKDR22766.100.7815191122.5-7.38-8.80
Carausius_C27CD63 antigenKDR11258.11.16E-270.6417091049.6-2.70-4.11
Contig235Retinol dehydrogenase 11KDR07942.12.15E-930.759991032.2-3.91-2.85
Carausius_C1903Cytochrome P450 6k1KDR14071.11.17E-270.74346950.1-3.15-2.24
Contig356juvenile hormone esterase Est1ACT53736.12.45E-690.6898941.0-4.12-3.01
Carausius_C25CD63 antigenKDR11258.14.67E-310.651665922.6-2.53-4.06
Contig209Cytochrome P450 partialKDQ77054.12.56E-1140.651195907.1-2.52-1.43

Only genes that were both highly expressed [RPKM values >10x the mean] and differentially expressed in the appendices alone are included.

Only genes that were both highly expressed [RPKM values >10x the mean] and differentially expressed in the appendices alone are included. Distribution of the gene ontology (GO) categories for only the both highly and differentially expressed transcripts of the three tissue types is summarized in Fig 3. Many of these transcripts in the midgut wall encoded for the aforementioned hydrolases. The Malpighian tubules transcripts were predominantly membrane or membrane-bound compounds. The differentially and highly expressed transcript profiles for the appendices are more similar to those of the Malpighian tubules to the midgut, suggesting similar function types despite the significantly different genes involved. The results of the KEGG [40] mapping of the most highly and differentially expressed genes per tissue are available in S2 Table and summarized as follows: Unique to the highly and differentially expressed genes in AoMs were enzymes involved in metabolism of glutathione (thioredoxin peroxidase, Enzyme Commission (EC) #1.11.1.15), alpha-linolenic acid (an omega-3 fatty acid) (EC 3.1.1.32) and glycerophospholipid (EC 1.1.1.8). Unique to the MpgTs were multiple enzymes used in sugar metabolism and antibiotic synthesis (EC’s 2.7.1.1, 4.1.2.13, 4.2.1.3, and 5.3.1.9), a retinol metabolism dehydrogenase (EC 1.1.1.105), and aminobenzoate degrading phosphatases (EC 3.1.3.2). Unique to the MGWall were psychosine hydrolases (EC 3.2.1.45), pectin depolymerase (EC 3.2.1.15), and alpha-fucosidase (EC 3.2.1.51). Both AoM and MGWall samples had differentially expressed chitodextrinases (EC 3.2.1.14) and hexosaminidases (EC 3.2.1.52).
Fig 3

Pie charts of level 3 GO term categories for the differentially and highly expressed genes of the three tissue types.

Specific protein types

Based on the BLASTx results and on past hypothesis for AoM function, I performed targeted searches for genes with specific functions, such as those involved in defense/multidrug resistance, biomineralization, and transport. I also performed a hidden Markov model (HMM) protein family domain search [41, 42] specifically for ATP-binding cassette (ABC) transporters, which are linked to xenobiotic elimination in Malpighian tubules [27] and which were hypothesized to be differentially expressed in Phasmatodea excretory organs [32]. The resulting lists of contigs are likely overestimates of the true gene number for each protein type, as some of these are likely alleles or variants of the same gene, or have their differences in the noncoding regions that our analysis software could not detect.

Multixenobiotic resistance and defense

I identified 187 ABC transporter / multidrug resistance protein contigs from several sub-families (S3 Table). Most of those with high expression (>100 RPKM) were expressed in the Malpighian tubules, and only one (Contig2190, an ABCE1) was also highly expressed in the other two. Several were significantly differentially expressed in certain tissues (Table 4), but none were both highly and differentially expressed. Of those BLASTx putatively identified to subfamily and member, two ABCA1’s and several ABCG4’s were differentially expressed in the AoMs. Both of those ABC transporters are cholesterol efflux transporters in other insects, known to work in concert to transport cholesterol and lipoproteins [43, 44]. Most of the other subfamilies either had differentially expressed representatives in multiple tissues or were broadly expressed. This includes several multidrug resistance proteins (MRPs), a class of the ABCC subfamily known as ATP-driven xenobiotic pumps [45].
Table 4

Differential expression of putative ATP-Binding Cassette (ABC) transporters in Carausius excretory tissues.

CategoryTotalAoMMpgTMgWallAoM + MpgTMpgT + MGWallAoM + MgWall
ABC[#]21
ABCA142
ABCA211
ABCA3134241
ABCB102
ABCB621
ABCB71
ABCB821
ABCC111
ABCC81
ABCC-MRP[#]55733314
ABCC-MRP136718125
ABCC-MRP3211
ABCC-MRP410133
ABCC-MRP497331
ABCC-MRP57
ABCC-MRP721
ABCD21
ABCD332
ABCE12
ABCF11
ABCF22
ABCF31
ABCG143
ABCG1411
ABCG2043
ABCG2332
ABCG41351112
ABCG533
ABCG811

ABC transporters were identified with a profile hidden Markov model search [41] of the transcriptome with an ABC transporter PFAM domain query [42]. ABC transporter names are given with subfamily letter and subgroup number when identifiable. Values are the number of total sequences per subcategory in the transcriptome, and the number differentially expressed in each tissue or possible tissue pair. None of these sequences were highly expressed. For the full list, see S3 Table. AoM = Appendices of the Midgut, MGWall = Midgut Wall, MpgT = Malpighian tubules. MRP = multidtrug resistance protein.

ABC transporters were identified with a profile hidden Markov model search [41] of the transcriptome with an ABC transporter PFAM domain query [42]. ABC transporter names are given with subfamily letter and subgroup number when identifiable. Values are the number of total sequences per subcategory in the transcriptome, and the number differentially expressed in each tissue or possible tissue pair. None of these sequences were highly expressed. For the full list, see S3 Table. AoM = Appendices of the Midgut, MGWall = Midgut Wall, MpgT = Malpighian tubules. MRP = multidtrug resistance protein. I identified 181 other transcripts for proteins with noted functions in multidrug resistance / xenobiotic metabolism / defense. These included cytochrome p450s, glutathione-S-transferase, gram negative bacterial binding proteins (GNBBP), toll-related proteins, carboxylesterases, and one antimicrobial peptide (AMP) identified as having an attacin domain (Carausius_C61676). This is the first AMP identified in Phasmatodea, which I have named by convention for insect AMPs as Carausicin (Genbank Accession # KY271086), and which is differentially expressed in the MpgTs. Forty-seven defense genes, predominantly cytochrome p450s and gluthathione-s-transferases, were highly expressed in at least one tissue (Fig 4). One gluthathione-s-transferases (Contig51) was the 12th most highly expressed gene in the AoMs. (S1 Table).
Fig 4

Expression heatmap for highly expressed, putative multixenobiotic detoxificaton genes in Carausius excretory tissue.

All are cytochrome P450’s except the two labeled glutathione S-transferases [24]. Only genes with high expression in at least one tissue type (approximately >100 RPKM) were included. The original dendrogram, with branch lengths shortened for space, is provided, and the rows ordered accordingly.

Expression heatmap for highly expressed, putative multixenobiotic detoxificaton genes in Carausius excretory tissue.

All are cytochrome P450’s except the two labeled glutathione S-transferases [24]. Only genes with high expression in at least one tissue type (approximately >100 RPKM) were included. The original dendrogram, with branch lengths shortened for space, is provided, and the rows ordered accordingly.

Fatty acid / Lipid and other transporters

I searched for any lipid metabolism genes based in part on Canavoso et al. [46]. This includes lipases and phospholipases, di- and triacylglycerol lipases, elongation of very long chain fatty acids” proteins, fatty-acid amide hydrolases and synthases, and fatty acid transporters. Of the 180 such transcripts identified, 46 were differentially expressed in the AoMs alone and another 30 in the AoMs and another tissue (Table 5). Some of these were among the most highly expressed transcripts of the AoMs, including several triacylglycerol lipases (Contig52, 8th; Carausius_C912, 15th) and fatty-acid binding proteins (Carausius_C392, 7th; Carausius_C197, 14th; Carausius_C198, 19th) (S1 Table). Several lipases were differentially expressed in the MGWall. All lipid phosphate phosphohydrolases and lipid storage droplets surface-binding proteins were differentially expressed in the AoMs, as well as several long-chain fatty acid transport proteins and phospholipases.
Table 5

Putative lipid/fatty acid/lipophilic substance metabolism/ and transport genes in Carausius excretory tissue.

CategoryTotal# Differentially Expressed (# Highly Expressed)
AoMMpgTMgWallAoM + MpgTMpgT + MGWallAoM + MGWall
AB-hydrolases21 (1)1
Acidic Lipase21 (2)1
Ca-Independent Phospholipase2
Diacylglycerol Lipase321
Elongation of VLCFAs Protein921111
Fat Storage-Inducing Transmembrane1
Fatty Acid Amide Hydrolase8311
Fatty Acid Binding Protein64 (3)(3)(3)1
Fatty Acid Hydroxylase4211
Fatty Acid Synthase2925
Hormone-sensitive Lipase1
Lipase Maturation Factor21
Lipase, Other411
Lipid Export Permease1
Lipid Phosphate Phosphohydrolase33 (1)
Lipid Storage Droplets Surface-Binding55 (2)
Long-Chain Fatty Acid CoA Ligase62 (1)111
Long-Chain Fatty Acid Transport91 (1)42
Lysophospholipase321
Lysophospholipid Acyltransferase95
Non-Specific Lipid-Transfer Protein42
Patatin-like Phospholipase44
Phospholipase Activator3
Phospholipase Inhibitor11
Phospholipase, Other141 (1)4 (2)112
Phospholipid Hydroperoxide Glutathione Peroxidase3(3)(3)(1)2
Phospholipid Scramblase211
Phospholipid-Transporting ATPase1441 (1)13
stAR-related Lipid Transfer Protein31
Triacylglycerol Lipase234 (3)17 (6)23

Gene categories chosen in part based on Canavoso et al. [46]. Values are the number of total sequences per subcategory in the transcriptome, and the number differentially expressed in each tissue or possible tissue pair, with the number highly expressed in each tissue given in parentheses. VLCFA = very long chain fatty acid. AoM = Appendices of the Midgut, MGWall = Midgut Wall, MpgT = Malpighian tubules.

Gene categories chosen in part based on Canavoso et al. [46]. Values are the number of total sequences per subcategory in the transcriptome, and the number differentially expressed in each tissue or possible tissue pair, with the number highly expressed in each tissue given in parentheses. VLCFA = very long chain fatty acid. AoM = Appendices of the Midgut, MGWall = Midgut Wall, MpgT = Malpighian tubules. I expected to find lipophorins, as they are supposed to be the predominant lipid transport molecule in insects [46, 47]. However, BLASTx did not identify any. I then ran a profile HMM search for insect lipophorins as I did for ABC transporters, but still found none. I identified 493 other putative transport protein genes. These included amino acid, anion, cation, Ca/Na/K, carboxylate, peptide, phosphate, phospholipid, sugar, sulfate, vitamin, water [aquaporins and aquaglyceroporins], and zinc transporters, plus 62 unspecified solute carriers. Of these, 93 were highly expressed with RPKMs >100 (Fig 5). Of those showing differential expression, most were over-expressed in the MpgTs. Zinc transporters were more expressed in the MGWall, while peptide and protein transporters more expressed in the AoMs. While some amino acid transporters were more expressed in the MpgT, others were very highly and differentially expressed in the AoMs, including one (Contig292), a sodium-dependent nutrient amino acid transporter and the 57th most highly expressed transcript in the AoMs. Several V-ATPase transporters (proton pumps) were very highly expressed in the AoMs, some differentially, but most also highly in the other tissues as well. Many vitamin A transporters were expressed in the AoM and/or MGWall, while other vitamin transporters were predominantly in the MpgTs.
Fig 5

Expression heatmap for highly expressed, putative transporters in Carausius excretory tissue.

Due to space constraints, only genes with high expression in at least one tissue type (approximately >100 RPKM) were included. Rows ordered alphabetically by gene function subcategory.

Expression heatmap for highly expressed, putative transporters in Carausius excretory tissue.

Due to space constraints, only genes with high expression in at least one tissue type (approximately >100 RPKM) were included. Rows ordered alphabetically by gene function subcategory.

Biomineralization, calcium regulation, and phosphatases

I searched for biomineralization genes based on the listing of Livingston et al. [48], due to the hypothesized existence in the AoMs of carbonic anhydrase [33] and otopetrin [49], and the known role of the Phasmatodea MpgTs in calcium elimination and deposition of unique calcium phosphate and/or calcium oxalate layers on the eggs [50-52]. These include bone morphogenic proteins, collagen, and regucalcin (one of which was highly and differentially expressed in the MpgTs, another in the AoMs), and matrix proteases (Fig 6). One of the latter (Carausius_C45) was identified as a collagenolytic serine protease, and was the third most highly expressed transcript in the AoMs and 27th in the MGWall (S1 Table). I identified four otopetrin genes, two differentially expressed each in the AoMs and MpgTs, but only one (Carausius_C5193) highly expressed (325th highest in AoM). Several calcium-binding phosphoproteins were differentially and highly expressed in the MpgT, while carbonic anhydrases were mostly expressed in the AoMs, though not highly.
Fig 6

Expression heatmap for biomineralization genes in Carausius excretory tissue.

Gene categories chosen in part based on Livingston et al. [48]. All identified biomineralization-type genes are included. Rows ordered alphabetically by gene function subcategory.

Expression heatmap for biomineralization genes in Carausius excretory tissue.

Gene categories chosen in part based on Livingston et al. [48]. All identified biomineralization-type genes are included. Rows ordered alphabetically by gene function subcategory. A total of 318 contigs were identified as phosphatases. A few were highly expressed (S4 Table) nearly all in the MpgTs, including some of the most highly expressed of transcripts of the entire transcriptome: a lysosomal acid phosphatase (4th most highly expressed MpgT contig and most highly expressed of all MpgT specific enzymes) and three prostatic acid phosphatase-like genes (6th, 8th, and 13th most highly expressed MpgT contig). In contrast, alkaline and some protein phosphatases were more differentially expressed in the AoM, and nucleotide phosphatases in the MGWall.

Discussion

The de novo transcriptome produced here provides the first clues to how the AoMs differ from the MpgTs on a molecular biology level. The data confirms what was known about MpgTs: they eliminate ions and other wastes from the hemolymph, and are actively involved in defense against and metabolism of xenobiotic and toxic compounds. The abundance of calcium-regulatory transcripts there matches what is known about Phasmatodea MpgTs and their relationship to calcium [50, 51]. The transcriptome also identified the known digestive enzymes in the midgut, including the Phasmatodea-specific pectinases [49], suggesting sufficient coverage. That the majority of identified transcripts had closest sequence homology to other Polyneoptera insect genes suggests low contamination and little chance that these are not endogenous Carausius genes. The presence of phosphatases in the MpgTs is expected based on prior work with insects [53, 54], and their high abundance in Phasmatodea is likely because the order excretes excess calcium as a phosphate rather than as a carbonate as in most insects [50]. The highly expressed MpgTs alkaline phosphatases (S4 Table) thus likely function in calcium elimination. The highly expressed prostatic acid phosphatases, which in humans are produced by the prostate and are a serum tumor marker for prostate cancer, have many possible roles including excretion [54]. The possible use of Phasmatodea as models for human medical research is not unprecedented, as transcriptomes of Drosophila MpgTs promoted their use as models for human renal disease [21]. As predicted, the different tissues each express their own, unique set of genes involved in the transport, metabolism, and elimination of xenobiotics: ABC-transporters, glutathione-S-transferases, and cytochrome P450s (Table 4, Fig 4). The differential and often high expression of these proteins in the AoMs strongly suggests that the AoMs serve a function in multixenobiotic resistance, as hypothesized based on their physiology [32]. I do not reject the hypothesis that the observed differential staining patterns in Phasmatodea [32] can be due to these variations in xenobiotic compound transporters between AoMs and MpgTs, as has also been suggested by O’Donnell [1, 55]. The transcriptomes did not contain p-glycoproteins (ABCB1), a well-characterized xenobiotic pump in insects [1], but did contain an antimicrobial peptide with an attacin domain [56]. I am working towards identifying further antimicrobial peptides in the Phasmatodea. Notably, I reject the hypotheses that the AoMs are involved in ion homeostasis, including the cation hypothesis from Shelomi & Kimsey [32], based on their relative lack of such transporters. Instead, the AoMs differentially and highly express peptide and protein transporters and phosphatases, and several V-type proton ATPases (Fig 5) that may or may not be involved in xenobiotic compound transport. I cannot rule out a function for the AoMs in peptide homeostasis. V-ATPases are known to be responsible for normal formation of uric acid concretions and calculi in insect MpgTs [57], and may explain the concretions observed in Peruphasma schultei AoMs [30]. Otherwise, they are critical to the transport function of Malpighian tubules [55, 58], and likely drive the transport in the AoMs as well. Although the AoMs seem to differentially absorb cationic stains [32], they did not differentially express cation transporters, suggesting that those stains were picked up through the action of other transporter types: the ABC transporters and/or an unidentified pump driven by the electric potential generated by V-ATPases. The differential and often high expression of lipid- and fatty acid metabolism, binding-, and transport genes in the AoMs (Table 5), including cholesterol and lipoprotein specific ABC transporters (Table 4) [43, 44], suggests a key role for these organs in movement and homeostasis of lipids, lipophilic substances (including fat soluble vitamins), and/or lipoproteins. This is a completely novel hypothesis for AoM function. The known yellow color of the AoM ampule contents and the lipophilic nature of the P. schultei concretion [30] further suggest that the AoMs may contain a nonpolar fluid or at least be well suited to the transport of lipophilic substances. However, the AoMs are known to pick up water-soluble stains with ease [32]. The BLASTx and nhmmr results did not show lipophorins in any tissues, despite their key role in lipid transport in insects [46, 59], but did find di- and triacylglycerol transporters [47]. Carausius may express lipophorins in the hemolymph, which would have been washed away in our dissections and not sampled. Alternatively, some or all Phasmatodea use a different lipid transport system than generally known in insects, which warrants further study. Vitellogenins are another possible insect lipoprotein [59], and I found some differentially but not highly expressed in the MGWalls (S1 Table). de Sinéty [60] and Savage [52] reported that certain Phamsatodea calciferous Malpighian tubules terminate in the fat body (cells of Sidorot), but neither de Sinéty nor Shelomi et al. [30] found evidence that the AoMs terminate at any tissues. They may nonetheless interact with fat bodies and pass near them, transporting lipophilic substances to and/or from the midgut lumen and fat body or hemolymph, possibly having evolved this ability in absence of lipophorins. Certain insect lipoproteins in the hemolymph are known to bind to xenobiotics or enable coagulation of the hemolymph following microbial infection [59], so the AoM lipid-related genes may thus also be defensive. Lipid droplets in Drosophila are known to serve as protein storage depots as well [61]. The combination of lipid storage droplet binding sites and peptide transporters among the differentially and highly expressed AoM genes, including a highly expressed sodium-dependent nutrient amino acid transporter, would thus suggest a role of the AoMs in transporting lipid-droplet bound proteins and peptides. Lastly, the high and/or differential expression of a retinol (a fat soluble vitamin) dehydrogenase (Table 3) and several alpha tocopherol (a fat soluble vitamin) transfer proteins (S1 Table) suggests the AoMs may function in the transport of fat-soluble vitamins as well as nutrient amino acids. Ultimately the literature on fat metabolism and transport in insects is incomplete [46,47], though perhaps the Phasmatodea have just proven their worth as model insects for further study of this field. The presence of beta-actins in the AoMs is expected given the high motility of the tubules. The actins are likely involved in the harmonic motion of the AoM tubules in the hemolymph. I also absolutely confirm the predictions of Monteiro et al. [33] that the AoMs express carbonic anhydrase (Fig 6), which alkalinizes the gut lumen. The effects of this alkalinization could be to counteract or disable midgut digestive enzymes, but it may also have functions within the AoM tubules and ampules themselves. Once again identified in the AoMs [49] is high and differential expression of the otolith formation and biomineralization gene, otopetrin [62] (Fig 6). The two otopetrins differentially expressed in the AoMs (Carausius_C5193 & Carausius_C8672) have the 1st and 7th greatest log2 fold change recorded between the AoM and MpgT (S1 Table). Only Carausius_C5193 is highly expressed [mean RPKM of 274.4 in the AoMs]. In insects, otopetrin is highly expressed in glowworm light organs along with carbonic anhydrase and several defensive compounds [63], and otopetrin is upregulated in mosquitoes after topical juvenile hormone application [64]. More generally, otopetrin is known to regulate calcium homeostasis [62], but as the calcium transport genes were predominantly expressed in the MpgTs, I still cannot explain what this gene does in the AoMs. In mice, otopetrin is part of a signaling pathway in adipose tissue that reduces obesity [65], so it is possible that Phasmatodea AoM otopetrin are part of the lipid metabolism/transport functionality rather than biomineralization. Future tests using RNAi or other knockdown methods targeting otopetrin could answer this question, but RNAi has not yet been attempted in a Phasmatodea.

Conclusion

By combining prior physiological assays of the Phasmatodea-specific midgut appendices with the new transcriptomic data, I can provide the first conclusions (Table 6) for what these enigmatic organs do since their earliest recorded description over a century ago. The main functions of the appendices include, but are not limited to: 1) Multixenobiotic resistance by metabolizing and transporting such compounds out of the hemolymph, targeting different compounds from the Malpighian tubules. 2) Transport and homeostasis of lipids and fatty acids, lipoproteins, and/or lipophilic substances such as nutrients, xenobiotics, and lipid storage droplet-bound proteins, possibly in absence of lipophorin. 3) Contributing to the alkalization of the posterior midgut by carbonic anhydrase activity rather than cation excretion.
Table 6

First conclusions for the functions of the Phasmatodea appendices of the midgut and supporting evidence.

Function of the AppendicesEvidence (•) and Hypothesized Links (*)Source
1. Multixenobiotic resistance (metabolism, excretion, and defense against foreign compounds, toxins, microbes)• Elimination of injected stains unlike those of Malpighian tubulesShelomi & Kimsey [32]
• Differential expression of ABCC xenbiotic export pumps (Multidrug resistance proteins)Table 4, S3 Table
• High and differential expression of cytochrome P450s and glutathione S-transferases (metabolism and detoxification genes)Fig 4
* High and differential expression of V-type proton ATPase (transport gene), but not of waste/homeostasis transportersFig 5
2. Lipophilic compound (nutrients, lipoproteins, xenobiotics) transport and regulation• High and differential expression of fatty acid binding proteins, lipid storage droplet binding proteins, triaglycerol lipases, and fat-soluble vitamin transfer genesTable 3
• High and differential expression of alpha-linolenic acid and glycerophospholipid metabolism enzymes (based on KEGG mapping)S2 Table
• Differential expression of cholesterol and lipoprotein transporting ABC transportersTable 4, S3 Table
• Differential expression of lipid phosphate phosphohydrolases, long-chain fatty acid transport proteins, and a number of other lipid-related genes.Table 5
• High and differential expression of (fat soluble) vitamin A and vitamin E transportersTable 3, Fig 5
* Absence of lipophorins demands alternative transport method-
3. Alkalization of the posterior midgut via biomineralization proteins• pH of midgut reaches or passes neutral at the start of the AppendicesShelomi et al. [30],Monteiro et al. [33]
• High carbonic anhydrase activity measuredMoneteiro et al. [33]
• Differential expression of carbonic anhydrase, but not cation transportersFigs 5 and 6
* Differential expression of calcium-carbonate related gene OtopetrinFig 6

Materials and methods

Insect dissection and RNA extraction

Adult Carausius morosus females were obtained from private cultures maintained on blackberry (Rubus sp.) leaves. Three biological replicates of three insects each were prepared. Tissue removal was performed on ice with tools sterilized in ethanol and treated with RNaseZap. Insects had their heads and last abdominal segments removed with scissors and the entire gut pulled out using watchmaker’s forceps and placed into a Petri dish of 70% ethanol. If necessary, a full-length, lateral incision was made with Castroviejo scissors and the entire digestive tract removed. Gut contents within their peritrophic membrane could often be pulled from the entire midgut via the anterior opening. Incisions were made at the beginning and end of the posterior midgut [defined as the region studded with appendix ampules], taking care not to cut through the appendix tubules, and just before and after the origin of the Malpighian tubules. The section of midgut between these, the “post-posterior midgut” [30], was used as the midgut wall. Tissue rings were washed in a separate dish of ethanol to remove contaminants, with care to ensure all appendix tubules were removed from among the Malpighian tubules. Tissues (approximately 20μg each) were then immediately placed into Invitrogen™ RNAlater™ Stabilization Solution (ThermoFischer) and macerated in a frozen Tissue Lyser with metal beads. RNA was extracted with an innuPREP RNA MiniKit (Analytik-Jena AG) and purified with theRNeasy® MinElute® cleaning kit (Qiagen®) following the manufacturers’ protocols. RNA quality was tested with an Experion™ RNA chip (Bio-Rad) on an Agilent 2100 Bioanalyzer following the manufacturers’ protocols.

Sequencing, de novo transcriptome assembly

The three replicates of three tissues each were sent over dry ice to the Max Planck-Genome-centre, Cologne, Germany (http://mpgc.mpipz.mpg.de/home/) for RNA library production including polyA enrichment with an Illumina HiSeq 2500 Sequencer with 100bp paired-end reads, followed by raw data processing, data quality filtering, and data de-multiplexing. This generated 20 million reads (4.00 gigabases) of data per library. The libraries were quality tested, trimmed (minimum contig length 250bp), and pooled for assembly of a de novo transcriptome using CLC Genomics Workbench v8.0 (Qiagen®). This transcriptome was tested for duplications, revealing 64,323 singlets and 20,683 contigs forming an overlap or contained within another. The resulting contigs were fused using CAP3 [38] with standard parameters. This assembled transcriptome of 73,143 contigs made up of approximately 70 Mbp is available on Dryad.

RNA-Seq and data analysis

RNA-Seq analysis, which has been successfully used in Phasmatodea before [49, 66], was performed by the Max Planck-Genome-centre with the Cufflinks v2.2.1 suite of tools (http://cole-trapnell-lab.github.io/cufflinks/) by mapping the original library reads to the CAP3 de novo transcriptome with TopHat, assembling with Cufflinks, merging with Cuffmerge, and calculating differential expression between each possible pair of tissue types with Cuffdiff [67]. The expression values were calculated as RPKM (reads per kilo base per million mapped reads), and the mean RPKM for each tissue type and gene is available in S1 Table. Contigs were tentatively identified via BLAST using the CLC Genomics Workbench v8.0. I identified contigs that Cufflinks analysis demonstrated were statistically significantly (p<0.05) differentially highly expressed in one tissue type compared to the other two. These are labeled in S1 Table and were matched to transcriptome sequences using the online bioinformatics software Galaxy version 1.0.2 to manipulate the data and produce a fasta file. This was further annotated via Blast2GO v3.0.11 [39] using BLASTx (e-value<1e-10) to search against the GenBank non-redundant protein database. Transcripts were also scanned for conserved protein domains via an InterProScan and mapped and annotated within Blast2GO.

Functional domain search for specific gene types

To search for transcripts with ABC transporter or lipophilin domains, I obtained insect query sequences for such proteins using the PFAM database (http://pfam.xfam.org/) [42], eliminating putative and uncharacterized proteins. These were aligned with MUSCLE v3.3 [68], converted to Stockholm format (.sto), and backtranslated into a nucleotide.sto file with EMBOSS Backtranambig software (http://www.ebi.ac.uk/Tools/st/emboss_backtranambig/), and converted into a pHMM query file with the hmmbuild package of HMMER v3.1. This file was used to query the transcriptome with nhmmer [41]. These identified contigs were matched to their expression data from the RNA-Seq dataset. To make heatmaps I used the heatmap3 package for R v3.3.1 [69].

Digestive and excretory system of Carausius morosus.

A) Schematic and B) dissection of the alimentary canal from Carausius morosus, typical of other Phasmatodea [32]. The gut is presented unstained, so tubules are not visible to the naked eye. The appendices appear on the posterior midgut. The Malpighian tubules originate at the midgut/hindgut junction, trailing over the posterior midgut before going towards the posterior end of the insect. The gut section between the two, the “post-posterior midgut,” was used for our midgut wall (MGWall) samples, excluding any tubules. Key: AMG = anterior midgut. AoM = appendices of the midgut. MMG = middle midgut. MpgT = Malpighian tubules. PMG = posterior midgut. PPMG = post-posterior midgut. (TIFF) Click here for additional data file.

Top-Hit species distribution for BLASTx results of the Carausius excretory tissue de novo transcriptome assembly.

(TIFF) Click here for additional data file.

Descriptions and differential expression statistics for C. morosus excretory tissue transcripts.

(XLS) Putative descriptions based on BLASTx results. RKPM = Mean reads per kilobase per million mapped reads. Contigs are “highly expressed” in their tissue type if the RPKM is greater than 10x the mean. For each tissue type pair, the log2 fold change in expression based on Cufflinks analysis is given, with negative values indicating higher expression in the first tissue type of the pair as ordered in the heading. “Differential expression” is based on significantly (p<0.05 based on Cufflinks analysis of the test statistic) higher differential expression in the tissues as follows: AoM = Appendices of the Midgut, excretory = AoM & MpgT, MGWall = Midgut Wall, midgut = AoM & MGwall, MpgT = Malpighian tubules, no = no differential expression, rear = MpgT+MGWall, zero = no significant expression in any tissue type. (XLS) Click here for additional data file.

Distribution of KEGG pathways of the highly and differentially expressed transcripts of the C. morosus excretory tissue transcripts.

(XLS) Click here for additional data file.

C. morosus transcriptome contigs identified as ATP-binding cassette (ABC) transporters.

ABC transporters were identified with a profile hidden Markov model search [41] of the transcriptome with an ABC transporter PFAM domain query [42]. ABC transporter names are given with subfamily letter and subgroup number when identifiable. “Differential expression” is based on significantly (p<0.05 based on Cufflinks analysis of the test statistic) higher differential expression in the tissues as follows: AoM = Appendices of the Midgut, excretory = AoM & MpgT, MGWall = Midgut Wall, midgut = AoM & MGwall, MpgT = Malpighian tubules, no = no differential expression, rear = MpgT+MGWall, zero = no significant expression in any tissue type. (XLS) Click here for additional data file.

Putative phosphatase genes in Carausius excretory tissue.

“Differential expression” is based on significantly (p<0.05 based on Cufflinks analysis of the test statistic) higher differential expression in the tissues as follows: AoM = Appendices of the Midgut, excretory = AoM & MpgT, MGWall = Midgut Wall, midgut = AoM & MGwall, MpgT = Malpighian tubules, no = no differential expression, rear = MpgT+MGWall, zero = no significant expression in any tissue type. (XLS) Click here for additional data file.
  49 in total

1.  Active transport of water by insect Malpighian tubules.

Authors:  Simon Maddrell
Journal:  J Exp Biol       Date:  2004-02       Impact factor: 3.312

2.  Vital staining of the stick insect digestive system identifies appendices of the midgut as novel system of excretion.

Authors:  Matan Shelomi; Lynn S Kimsey
Journal:  J Morphol       Date:  2013-12-14       Impact factor: 1.804

Review 3.  The developmental, molecular, and transport biology of Malpighian tubules.

Authors:  Klaus W Beyenbach; Helen Skaer; Julian A T Dow
Journal:  Annu Rev Entomol       Date:  2010       Impact factor: 19.686

Review 4.  Lipid transport in insects.

Authors:  J P Shapiro; J H Law; M A Wells
Journal:  Annu Rev Entomol       Date:  1988       Impact factor: 19.686

5.  Structural and functional changes in the Malpighian tubules of Carausius morosus during dehydration and starvation.

Authors:  H H Taylor
Journal:  Z Zellforsch Mikrosk Anat       Date:  1973-08-14

Review 6.  Alkaline phosphatase isozymes in insects and comparison with mammalian enzyme.

Authors:  M Eguchi
Journal:  Comp Biochem Physiol B Biochem Mol Biol       Date:  1995-06       Impact factor: 2.231

Review 7.  Too much of a good thing: how insects cope with excess ions or toxins in the diet.

Authors:  M J O'Donnell
Journal:  J Exp Biol       Date:  2009-02       Impact factor: 3.312

8.  Separate control of anion and cation transport in malpighian tubules of Drosophila Melanogaster.

Authors:  M J O'Donnell; J A Dow; G R Huesmann; N J Tublitz; S H Maddrell
Journal:  J Exp Biol       Date:  1996-05       Impact factor: 3.312

9.  Tissue-specific transcript profiling for ABC transporters in the sequestering larvae of the phytophagous leaf beetle Chrysomela populi.

Authors:  Anja S Strauss; Ding Wang; Magdalena Stock; René R Gretscher; Marco Groth; Wilhelm Boland; Antje Burse
Journal:  PLoS One       Date:  2014-06-02       Impact factor: 3.240

10.  Differential expression of endogenous plant cell wall degrading enzyme genes in the stick insect (Phasmatodea) midgut.

Authors:  Matan Shelomi; W Cameron Jasper; Joel Atallah; Lynn S Kimsey; Brian R Johnson
Journal:  BMC Genomics       Date:  2014-10-21       Impact factor: 3.969

View more
  2 in total

1.  Blattella germanica displays a large arsenal of antimicrobial peptide genes.

Authors:  Francisco J Silva; Maria Muñoz-Benavent; Carlos García-Ferris; Amparo Latorre
Journal:  Sci Rep       Date:  2020-12-03       Impact factor: 4.379

2.  Gene expression profiles in Malpighian tubules of the vector leafhopper Psammotettix striatus (L.) revealed regional functional diversity and heterogeneity.

Authors:  Feimin Yuan; Cong Wei
Journal:  BMC Genomics       Date:  2022-01-21       Impact factor: 3.969

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.