| Literature DB >> 30086708 |
C Llorens1, S C Arcos2, L Robertson2, R Ramos3, R Futami1, B Soriano1, S Ciordia4, M Careche5, M González-Muñoz6, Y Jiménez-Ruiz2, N Carballeda-Sangiao6, I Moneo6, J P Albar4, M Blaxter7, A Navas8.
Abstract
BACKGROUND: Anisakis simplex sensu stricto and Anisakis pegreffii are sibling species of nematodes parasitic on marine mammals. Zoonotic human infection with third stage infective larvae causes anisakiasis, a debilitating and potentially fatal disease. These 2 species show evidence of hybridisation in geographical areas where they are sympatric. How the species and their hybrids differ is still poorly understood.Entities:
Keywords: A. Pegreffii; Allergens; Anisakis simplex; Gene expression; Transcriptome
Mesh:
Substances:
Year: 2018 PMID: 30086708 PMCID: PMC6080401 DOI: 10.1186/s12864-018-4970-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Metrics for de novo assembly transcriptomes
| Summary |
|
| Hybrid | Merged |
|---|---|---|---|---|
| Total transcriptome size | 88,007,524 | 68,071,234 | 50,568,936 | 67,459,080 |
| Unigenes (Loci) | 36,645 | 31,988 | 29,656 | -NA- |
| Transcripts (isoforms) | 121,907 | 91,541 | 76,848 | 75,380 |
| Longest transcript (bp) | 10,774 | 11,724 | 10,798 | 16,240 |
| shortest transcript (bp) | 100 | 100 | 100 | 107 |
| % transcripts >1Kb | 20.3% | 21.7% | 17.9% | 28.7% |
| N50 (bp) | 973 | 1026 | 885 | 1276 |
| L50 (bp) | 25,878 | 19,118 | 16,598 | 15,385 |
| %A | 30.87% | 30.25% | 30.11% | 30.41% |
| %C | 19.60% | 20.03% | 20.36% | 18.64% |
| %G | 19.64% | 19.98% | 20.49% | 20.60% |
| %T | 29.89% | 29.74% | 29.04% | 29.87% |
| %Ns | 0% | 0% | 0% | 0% |
Fig. 1Annotations. a Blast top hits per subject species using NR as refseq database for unigene annotation. b Distribution of annotated and un-annotated transcripts and unigenes annotated via blast using NR as refseq database. c Venn diagram showing unique and shared features of non-redundant annotations per transcriptome (GIs, KOGs, InterPros, GOs, ECs)
Summary of annotations per transcriptome
| Sequences with annotations |
|
| Hybrids | |||
|---|---|---|---|---|---|---|
| transcripts | unigenes | transcripts | unigenes | transcripts | unigenes | |
| Gene Identifiers (GIs) from NR/NT | 62,483 | 14,057 | 60,816 | 14,393 | 53,741 | 14,183 |
| KOG clusters | 38,717 | 9999 | 29,977 | 9444 | 28,114 | 9928 |
| InterPro domains | 17,093 | 5938 | 13,514 | 5122 | 12,884 | 5125 |
| GO Terms | 14,701 | 5165 | 11,773 | 4485 | 11,101 | 4501 |
| ECs | 5416 | 1995 | 4237 | 1671 | 3913 | 1617 |
| Predicted proteins | 49,755 | 14,384 | 41,324 | 13,506 | 38,782 | 13,834 |
| Predicted signal-P domains | 1124 | 647 | 603 | 394 | 482 | 318 |
| Predicted TM domains | 11,613 | 6338 | 8906 | 4676 | 7545 | 4109 |
Fig. 2Transcriptome differential expression patterns. a BCV versus the average log of counts per million (CPM). b MDS plot based on logFC of differences among taxa sources and the sequenced replicates per taxa (small and medium insert size fastqs). c MA-plots (one per each differential expression analysis) representing the log Fold Change (logFC) against the log-average log CPM per each transcript cluster across each pair of compared samples. Differentially expressed clusters supported by FDR < 0.05 are plotted in red. Each MA-plot is accompanied by a summarization of the results of each differential expression comparison
Fig. 3Enrichment of metabolic pathways. a Scatter plot for the enrichment of metabolic pathways per differential enrichment comparison at FDR < 0.05. In the x-axis, the ratio between number of DE transcripts and total number of transcripts per pathway using the Wallenius approximation; in the y-axis, the log P-value for differential enrichment in each pathway. b Box plots based on the median, quartiles, maximum and minimum and outlier values of the LogFCs of all transcripts contributing to the enrichment of the metabolic pathways shown in A in each differential expression comparison. Red lines highlight the logFC value of zero
Fig. 4Differential enrichment of Gene Ontologies. a Barplot based on the ratio between the number of differentially expressed transcripts and the assayed transcripts with annotation of biological process GOs detected as differentially enriched in at least 1 of the 3 comparisons performed to asses GO enrichment. Plots based on the comparison performed between the hybrids and A. pegreffii is colored blue; plots for the comparison between the hybrids and A. simplex s.s. are colored red; and those based on the comparison between A. pegreffii and A. simplex s.s. are colored green. GOs that were significant at FDR < 0.05 are highlighted with an asterisk. b Same Bar plot representation based on Molecular Function GOs; c Cellular components
Fig. 5Nematode intermediary metabolism, as described for C. elegans and other species. The main pathways and GOs found as enriched among the 3 Anisakis transcriptomes are highlighted
Fig. 6LogFCs for differentially enriched Cellular Component GOs. Box plots based on the median, quartiles, maximum and minimum and outlier values of the LogFC of all transcripts contributing to the enrichment of Cellular Component GOs per differential expression comparison. Red lines highlight the logFC value of zero
Fig. 7Expression patterns of excretory/secretory genes. Heatmap with double clustering based on the LogFCs of 70 representative ES genes (rows) identified as differentially expressed at a FDR < 0.05 in at least 2 of the 3 differential expression comparisons (columns). Quantiles 0.05, 0.5 and 0.95 were used as breaks to color the transcript counts in red, yellow and blue, respectively. Below is, the color scale used to color the gradient of LogFC values. Columns and rows were clustered using the complete linkage with euclidean distance measure. For a logFC summary of the 356 genes assayed see Additional file 5
Distribution of Allergen Families (AllFams) and logFC results from differential expression
| Allergen Families | FamID | LogFCs Hh vs | LogFCs Hh vs | LogFCs | Con Seqs | Allergen Name | Realm | Type of allergen |
|---|---|---|---|---|---|---|---|---|
| 24 kDa protein | Not yet assigned |
|
| − | 20 | NA | Animal | NA |
| 60S acidic ribosomal protein | AF070 |
| 0,25,574 | − 0,44,982 | 4 | Asp f 8, Alt a 12, Asp f 8, Fus c 1 | Fungi | Minor |
| Aldehyde dehydrogenase | AF040 | 0,06989 | −0,10,196 | − 0,17,186 | 26 | Blo t 4 | Animal | Minor |
| Alpha-amylase | AF033 | −0,05472 | 0,32,169 | 0,37,641 | 6 | Alt a 10, Cla h 10 | Fungi | Minor |
| Ani s 10 allergen | Not yet assigned |
| 0,06603 | − | 1 | Ani s 10 | Animal | NA |
| Ani s 11 allergen | Not yet assigned | − |
|
| 6 | Ani s 11, Ani s 11 L1, Ani s 11 L2 | Animal | NA |
| Ani s 12 allergen | Not yet assigned | −0,22,028 | −0,25,132 | − 0,03103 | 4 | Ani s 12 | Animal | Major |
| Animal Kunitz serine protease inhihibitor | AF003 |
|
|
| 9 | Ani s 1 | Animal | Major |
| ARM-like | Not yet assigned |
|
|
| 44 | Ani s 7, Ani s 14 | Animal | Major, NA |
| ATP:guanido phosphotransferase | AF049 | 0,28,397 | 0,03604 | −0,24,793 | 5 | Pen m 2, Der p 20, Plo i 1 | Animal | Major, Minor |
| ATP synthase | Not yet assigned | 0,41,629 | 0,19,251 | −0,22,378 | 3 | Bos d OSCP | Animal | Minor |
| BCL7 family | AF121 | −0,42,399 | 0,29,518 | 0,71,916 | 1 | Hom s 3 | Animal | NA |
| Calreticulin family | AF055 | −0,27,705 |
| − 0,37,927 | 7 | Pen ch 31 | Animal | NA |
| CAP family | AF044 |
|
|
| 12 | sol i 3, Dol a 5, Vesp c5, Poly s 5, Pol d 5, Dol m 5, allrgn_V5/Tpx1 | Animal | Major, NA |
| Carboxylesterase | AF140 | −0,07113 | −0,09074 | − 0,01961 | 50 | Api m 8 | Animal | Minor |
| Casein kinase II regulatory subunit | Not yet assigned | −0,54,905 |
| − 0,26,768 | 2 | Gal d Phosvitin | Animal | Na |
| Catalase | AF047 |
|
|
| 1 | Pen c 30 | Fungi | Minor |
| Chitinase class III and peritrophin-like | AF077 | −0,55,642 | −0,22,104 | 0,33,539 | 12 | Der p 15, Der f 15 | Animal | Major |
| Collagen | AF097 | −0,20,422 | 0,10,867 | 0,31,289 | 2 | Bos d alpha 2 l | Animal | NA |
| Cyclophilin | AF038 | 0,18,030 | 0,27,158 | 0,09128 | 13 | Asp f 27 | Fungi | NA |
| Cystatin | AF005 |
| −0,32,427 |
| 4 | Ani s 4 | Animal | NA |
| Cysteine-rich trypsin inhibitor-like domain | AF027 |
|
|
| 5 | Ani s 6 | Animal | NA |
| Cytochrome b-c1 | Not yet assigned |
| 0,94,815 | 0,35,621 | 6 | NA | Animal | NA |
| Cytochrome B5 | Not yet assigned | 1 | 0,37,530 |
| 3 | NA | Animal | NA |
| Cytochrome c | AF006 | 0,14,607 |
|
| 5 | Cur l 3 | Fungi | Major |
| EF hand family | AF007 | 0,20,495 |
| 0,19,922 | 34 | Bla g 6, Per a 6, Lit v 3, Ani s Troponin C | Animal | Major, Minor, NA |
| Enolase | AF031 |
| −0,22,686 | 0,29,484 | 5 | rho m 1, hev b 9, Alt a 6 | Fungi,Plant | Minor |
| Eukaryotic aspartyl protease | AF004 | −0,11,780 | −0,35,758 | − 0,23,979 | 14 | Bla g 2 | Animal | Major |
| Eukaryotic elongation factor 1 | AF011 | 0,03983 | −0,05788 | −0,09771 | 8 | Pen c 24 | Fungi | NA |
| Fe/Mn superoxide dismutase | AF020 | 0,00817 | 0,19,412 | 0,18,595 | 2 | Hev b 10 | Plant | Minor |
| Gelsolin family | AF074 |
|
|
| 17 | Der f 16 | Animal | Minor |
| GILT family | AF155 |
|
| −0,68,276 | 3 | Tri a 27 | Plant | NA |
| Globin | AF009 |
|
|
| 3 | Ani s 13 | Animal | Minor |
| Glutathione S-transferase | AF010 | 0,06592 | 0,47,962 | 0,41,370 | 11 | Bla g 5, Der p 8 | Animal | Major |
| Glyoxalase superfamily | AF082 | −0,40,931 | 0,77,379 |
| 2 | Ory Glyosalase l | Plant | NA |
| GMC oxidoreductase | AF081 | 0,37,676 | 0,45,602 | 0,07926 | 10 | Mala s 12 | Fungi | Major |
| Heat shock protein Hsp70 | AF002 | −0,17,513 |
| − | 45 | Pen c 10, Cla h HSP70 | Fungi | Minor |
| Heat shock protein Hsp90 | AF042 | −0.19335 | 0.11856 |
| 15 | Asp f 12 | Fungi | NA |
| Hevein-like domain | AF043 | 0.35419 | −0.39541 |
| 1 | Hev b 11 | Plant | Minor |
| Histidine acid phosphatase | AF062 | −0.03153 |
|
| 15 | Api m 3 | Animal | Minor |
| Hyaluronidase | AF103 |
| −0.32364 | − | 10 | Pol a 2, Api m 2 | Animal | NA. Major |
| Intermediate filament protein | AF008 |
|
|
| 13 | Hom s 5 | Animal | NA |
| Lactate/malate dehydrogenase | AF014 |
|
| 0.21566 | 3 | Cit la MDH | Plant | Major |
| Larval allergen (Brugia malayii) | Unclassified | 0.32808 |
|
| 2 | NA | Animal | NA |
| Lipase | AF037 | 0.09487 |
|
| 15 | Dol m1, Pol d 1, rhi o Lipase, The l lipase | Animal. Plant | Major. NA |
| Lipocalin | AF015 |
| 0.05554 |
| 3 | tyr p 13 | Animal | Minor |
| Major allergen (Loa loa) | Unclassified | −0.00868 | −0.22329 | − 0.21461 | 4 | NA | Animal | NA |
| Myosin heavy chain | AF100 |
| 0.07600 |
| 179 | Ani s 2, Blo t 11, Der f 11, der p 11 | Animal | Major. Minor |
| NAC domain | AF107 |
| 0.39473 | −0.19002 | 3 | Hom s 2 | Animal | NA |
| Nitrophorin | Unclassified | 0.27507 | 0.31344 | 0.03836 | 5 | Cim I Nitrophorin | Animal | NA |
| Papain-like cysteine protease | AF030 | −0.01475 | 0.08466 | 0.09941 | 24 | Act d 1, Ana c 2, car p 1, Gly m Bd 30 k, blo t 2 | Animal.Plant | Major. NA |
| Patatin family | AF104 | −0.27609 |
|
| 1 | Hev b 7 | Plant | Major |
| Phosphoglycerate kinase | Unclassified | −0.22025 |
| − 0.33607 | 5 | Can a PGK | Fungi | NA |
| Profilin family | AF051 | 0.19390 | −0.08435 | −0.27824 | 1 | Par j 3 | Plant | Major |
| Prolamin superfamily | AF050 | 0.07050 | −0.07100 | −0.14148 | 1 | Tri a 26 | Plant | Major |
| Prolyl oligopeptidase family | AF061 | −0.04633 | 0.30623 | 0.35255 | 6 | Ves v 3, Tri r 4 | Animal, Fungi | Major. NA |
| Proteasome subunit beta type | Unclassified | 0 |
| 0.04012 | 3 | Zea m 20S | Plant | Minor |
| Protein kinase | AF017 |
| −0.00592 |
| 101 | Sal k 2 | Plant | Major |
| Ribosomal protein L3 | AF058 |
|
| 0.21599 | 2 | Asp f 23 | Fungi | Minor |
| SART-1 family | AF116 | −0.01191 | −0.21989 | −0.20797 | 1 | Hom s 1 | Animal | NA |
| Serpin serine protease inhibitor | AF018 |
| 0.38305 |
| 12 | Gal d 2 | Animal | Major |
| Short-chain dehydrogenase | AF028 | −0.09468 | 0.02563 | 0.12031 | 19 | Alt a 8, Cla h 8 | Fungi | Major. Minor |
| Subtilisin-like serine protease | AF021 | 0.06703 |
|
| 1 | Rho m 2 | Fungi | Major |
| SXP/RAL-2 family | AF137 |
|
|
| 15 | Ani s 8, Ani s 9, Ani s 5 | Animal | Minor |
| Thioredoxin | AF023 | 0.57209 |
| 0.17728 | 1 | Asp f 28 | Fungi | Major |
| Translationally controlled tumour protein | AF136 | 0.40520 | −0.40064 |
| 3 | Hom s TCTP | Animal | NA |
| Triosephosphate isomerase | AF032 | 0.01199 | 0.03933 | 0.02734 | 2 | Tri a 31 | Plant | Minor |
| Tropomyosin | AF054 |
|
| −0.07586 | 17 | Ani s 3, Bal r 1, Pat y 1, Hal d 1, Hel as 1, Por s 1, Der p 10, Pan s 1, Cra g 1 | Animal | Major. Minor. NA |
| Troponin I and T | AF146 | 0.23760 | − 0.36364 |
| 2 | Pon l 7 | Animal | Minor |
| Trypsin-like serine protease | AF024 | −0.33264 | 0.37800 |
| 4 | Blo t 3, Api m 7, Der f 6 | Animal | Minor. NA |
| Tuber Storage Protein | Unclassified | 0.26964 |
|
| 5 | Dio o TSP | Plant | NA |
| Tubulin/FtsZ family | AF025 | −0.22307 |
| −0.03802 | 46 | Lep d alpha Tubulin, Tyr p alpha Tubulin | Animal | Minor |
| WD-40 repeat | AF142 | 0.00385 | −0.15642 | −0.16027 | 15 | For t 2 | Plant | Major |
| Zn-containing dehydrogenase | AF029 | 0.02937 | −0.01838 | −0.04775 | 5 | can a 1 | Fungi | Major |
List from A to Z, Significant differential expressed AllFams supported by FDR < 0,05 are higlighted bold, NA means not avaiable information