| Literature DB >> 30070993 |
Colette T Dooley1, Tatiana Ferrer2, Heidi Pagán2, Gregory M O'Corry-Crowe2.
Abstract
The Major Histocompatibility Complex (MHC) is a critical element in mounting an effective immune response in vertebrates against invading pathogens. Studies of MHC in wildlife populations have typically focused on assessing diversity within the peptide binding regions (PBR) of the MHC class II (MHC II) family, especially the DQ receptor genes. Such metrics of diversity, however, are of limited use to health risk assessment since functional analyses (where changes in the PBR are correlated to recognition/pathologies of known pathogen proteins), are difficult to conduct in wildlife species. Here we describe a means to predict the binding preferences of MHC proteins: We have developed a model positional scanning library analysis (MPSLA) by harnessing the power of mixture based combinatorial libraries to probe the peptide landscapes of distinct MHC II DQ proteins. The algorithm provided by NNAlign was employed to predict the binding affinities of sets of peptides generated for DQ proteins. These binding affinities were then used to retroactively construct a model Positional Scanning Library screen. To test the utility of the approach, a model screen was compared to physical combinatorial screens for human MHC II DP. Model library screens were generated for DQ proteins derived from sequence data from bottlenose dolphins from the Indian River Lagoon (IRL) and the Atlantic coast of Florida, and compared to screens of DQ proteins from Genbank for dolphin and three other cetaceans. To explore the peptide binding landscape for DQ proteins from the IRL, combinations of the amino acids identified as active were compiled into peptide sequence lists that were used to mine databases for representation in known proteins. The frequency of which peptide sequences predicted to bind the MHC protein are found in proteins from pathogens associated with marine mammals was found to be significant (p values <0.0001). Through this analysis, genetic variation in MHC (classes I and II) can now be associated with the binding repertoires of the expressed MHC proteins and subsequently used to identify target pathogens. This approach may be eventually applied to evaluate individual population and species risk for outbreaks of emerging diseases.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30070993 PMCID: PMC6072028 DOI: 10.1371/journal.pone.0201299
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Method for model positional scanning library analysis (MPSLA).
Nine steps that take the researcher from genetic sequence data, through MHC binding analysis, to protein and pathogen prediction.
Frequency of selected alleles for bottlenose dolphin from Pagán et al.
| A. | ||||||
| 18 | 5 | 0 | 7 | |||
| 16 | 36 | 42 | 45 | |||
| 23 | 34 | 50 | 45 | |||
| n = 44 | n = 44 | n = 36 | n = 60 | |||
| 20 | 6 | 0 | 7 | |||
| 10 | 31 | 47 | 45 | |||
| 3 | 6 | 6 | 17 | |||
| 13 | 31 | 38 | 28 | |||
| n = 30 | n = 36 | n = 34 | n = 58 | |||
| B. | ||||||
| 1 | — | — | 2 | DQ1-1 | ||
| — | 1 | — | — | DQ1-8 | ||
| — | 3 | 3 | 1 | DQ1-10 | ||
| — | 2 | 5 | 5 | DQ2-4 |
DQA and DQB peptide binding regions (exon 2) were genotyped in bottlenose dolphins from Florida Mosquito Lagoon (ML), North Indian River Lagoon (NIRL), and South IRL (SIRL) as well as the adjacent Atlantic coast (ATL). (A) Frequency of allele in sample population (n) expressed as a percentage. (B) The DQA/DQB haplotypes were determined from homozygous individuals or inferred from heterozygotes. STD; standard.
Fig 2Predicted binding affinities of peptide sequences derived from proteins encoded by cetacean MHC II DQ alleles.
Binding affinities of 7,634 peptide sequences predicted by NNAlign were compared by counting the number of peptides below 100, 500, 1000, 5000 and 10,000nM thresholds. The algorithm was supplied with a 7,647 amino acid sequence and the DQA and B protein sequences from cetaceans (killer whale, sperm whale, finless dolphin) obtained from Genbank and from bottlenose dolphins in the Indian River Lagoon (IRL) and adjacent Atlantic coast (ATL) (DQ1-1; Protein derived from DQA 1*01 DQB 1*01 DQ1-8; Protein derived from DQA 1*01 DQB 1*08 DQ1-10; Protein derived from DQA 1*01 DQB 1*10 and DQ2-4; Protein derived from DQA 1*02 DQB 1*04).
Fig 3Distribution of active sequences derived from dolphin DQ proteins from the IRL.
Sequences with a binding affinity below 10,000nM for each of the datasets for the four proteins (DQ1-1, DQ1-8, DQ1-10 and DQ2-4) from dolphins in the Indian River Lagoon and adjacent Atlantic coast were compiled and the frequency of sequences identified uniquely binding to one protein or shared by 2, 3 or all 4 proteins is shown.
Model positional scanning library and ranking for MCH II DP.
| A | Position (calculated affinities nM) | B | Position (aa) | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AA | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | Rank | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
| 835 | 534 | 725 | 430 | 878 | 592 | 385 | 551 | ||||||||||||
| 1818 | 620 | 745 | 410 | 328 | 601 | 885 | 670 | 414 | F | Y | W | V | F | W | L | V | G | ||
| 1018 | 615 | 540 | 898 | 388 | 795 | 421 | 424 | 744 | V | L | Y | F | Y | Y | Y | L | Y | ||
| 358 | 308 | 338 | 217 | 853 | L | V | V | W | V | L | E | A | D | ||||||
| 1277 | 802 | 483 | 449 | 929 | 862 | 1042 | 803 | 340 | W | H | F | D | D | D | W | I | I | ||
| 655 | 422 | 781 | 414 | 408 | 1118 | 543 | 449 | 450 | I | T | L | H | E | V | K | E | V | ||
| 629 | 635 | 556 | 475 | 576 | 731 | 920 | 400 | 435 | H | W | G | G | L | R | H | K | H | ||
| 866 | 732 | 645 | 1336 | 683 | 716 | 479 | 428 | 528 | A | A | E | I | H | K | M | H | L | ||
| 492 | 342 | 462 | 751 | 395 | 399 | 357 | 368 | 466 | K | E | I | T | A | I | A | R | M | ||
| 1058 | 677 | 875 | 792 | 707 | 803 | 560 | 679 | 498 | T | D | K | A | S | E | T | Q | K | ||
| 2110 | 659 | 1187 | 850 | 591 | 889 | 752 | 831 | 1561 | E | I | T | L | I | M | N | D | A | ||
| 2042 | 2809 | 1931 | 2024 | 1377 | 847 | 1702 | 1634 | 990 | M | N | D | M | N | T | Q | M | W | ||
| 2250 | 1997 | 1066 | 1247 | 1138 | 944 | 762 | 623 | 1648 | G | M | H | N | K | P | D | T | E | ||
| 1775 | 1244 | 931 | 1678 | 869 | 712 | 1062 | 603 | 2127 | R | K | M | E | M | G | V | W | T | ||
| 1805 | 929 | 1322 | 1005 | 563 | 917 | 1893 | 1544 | 1625 | S | G | R | S | T | A | I | G | P | ||
| 926 | 430 | 650 | 713 | 801 | 816 | 705 | 712 | 959 | D | S | Q | Q | R | N | G | N | N | ||
| 476 | 403 | 278 | 281 | 317 | 660 | 906 | 330 | 443 | P | R | N | K | G | S | R | F | S | ||
| 552 | 445 | 262 | 377 | 267 | 436 | 734 | 659 | N | Q | S | R | Q | Q | P | S | Q | |||
| 305 | 263 | 247 | 280 | 384 | 408 | Q | P | P | P | P | H | S | P | R | |||||
(A) The calculated binding affinities for mixtures (nM) derived from the sequences affinities generated by NetMHCIIpan 3.1 for each of the 19 amino acids at nine positions of the core binding peptide for the human HLA DP2 protein. (B) The amino acids (single letter code A-Y) defined in the mixtures are sorted by mixture affinity. Mixtures with the lowest value and highest affinity (those ranked 1) are in blue on both sides, those ranked 1–5 are in gray on right.
Fig 4Comparison of amino acid ranking from physical and model positional scanning libraries.
Binding affinities for the (19 x 9 = 171) mixtures obtained from the screening of the physical protein or modeled protein encoded by HLA-DP2 (HsDPA1*0103, HsDPB1*0201) were ranked from 1–19, where 1 represents the lowest value and therefore highest binding affinity. Correlations were performed on scatterplots, Coefficients of determination (r2) derived from Pearson coefficients (r) are recorded in upper right corner.
Coefficients of determination (r2) derived from pearson correlation coefficients (r) from scatterplots of amino acid rankings.
| Scatterplots of amino acid rankings | Position | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Physical Library | v | MPSLA | 0.2255 | 0.2690 | 0.2639 | 0.5276 | 0.0428 | 0.1671 | 0.3017 | 0.2233 | 0.0040 |
| * | * | * | *** | ns | ns | * | * | ns | |||
| Physical Library | v | DS-QMnap | 0.3491 | 0.2532 | 0.1682 | 0.0052 | 0.0018 | 0.0808 | 0.2615 | 0.0257 | 0.0021 |
| ** | * | ns | ns | ns | ns | * | ns | ns | |||
| DS-QMnap | MPSLA | 0.5042 | 0.4159 | 0.1505 | 0.0025 | 0.0562 | 0.1266 | 0.0182 | 0.0439 | 0.0110 | |
| *** | *** | ns | ns | ns | ns | ns | ns | ns | |||
| DQ 1–10 | v | DQ 1–8 | 0.6741 | 0.7193 | 0.7607 | 0.628 | 0.5470 | 0.6840 | 0.7792 | 0.8195 | 0.8442 |
| DQ 1–10 | v | DQ | 0.8086 | 0.5813 | 0.7142 | 0.6643 | 0.2359 | 0.2187 | 0.3600 | 0.6068 | 0.6643 |
| DQ 1–1 | v | DQ | 0.6256 | 0.5452 | 0.2330 | 0.3422 | 0.4201 | 0.6424 | 0.3404 | 0.5586 | 0.4640 |
| DQ 1–1 | v | DQ | 0.6717 | 0.5541 | 0.5744 | 0.7193 | 0.7581 | 0.8359 | 0.7872 | 0.9004 | 0.8581 |
aNeo.ph; Neophocaena phocaenoides
bPhy.ma; Physeter macrocephalus
Statistical significance as defined by Graphpad;
P values 0.1234 (ns), 0.0332(*), 0.0021(**), 0.0002 (***) and <0.00001(****).
A. Coefficients of determination (r2) were derived from Pearson coefficient values for each of the 9 positions and statistical significance. Correlations were performed for amino acid ranking values obtained from human HLA-DP alleles using 3 methods; Physical Positional Scanning Library, Model Positional scanning library (MPSLA) and Model of amino acid preference (DS-QMnap). B. DQ proteins from the Indian River Lagoon that differed significantly from the standard DQ (correlations generated low Pearson coefficient values) were found to have a higher degree of relatedness (higher Pearson coefficient values) when compared to 2 other cetacean species (Yangtze finless porpoise and sperm whale).
Fig 5Comparison of rankings for dolphin (standard) to four IRL dolphins and 3 other cetaceans.
Correlations were performed on scatterplots of amino acid ranking obtained from our standard DQ protein of Bottlenose dolphin (Tursiops truncatus), with each of the four proteins found in the IRL (DQ1-1, DQA 1–8, DQA1-10 and DQ2-4). Correlations were also performed comparing the amino acid ranking from DQ proteins for standard bottlenose dolphin to those of killer whale (Orcinus orca), Yangtze finless porpoise (Neophocaena phocaenoides) and sperm whale (Physeter microcephalus). Coefficients of determination (r2) derived from Pearson coefficients (r) are plotted for each of the 9 positions of the MPSLs.
Fig 6Comparison of amino acid ranking from MPSLs derived for DQ1-10 and DQ2-4 in dolphin.
Predicted binding affinities for the (20 x 9 = 180) mixtures obtained from the model positional scanning libraries derived from proteins encoded by dolphin alleles DQA1*01DQB 1*10 (DQ1-10) and DQA1*02 DQB1*04 (DQ2-4) were ranked from 1–20, where 1 represents the lowest value and therefore highest binding affinity. Correlations were performed on scatterplots, coefficients of determination (r2) derived from Pearson coefficients (r) are recorded in upper right corner. Vertical and horizontal lines demark amino acids ranked below 5.
Fig 7Selection of amino acids for combination sequences.
(A) To determine the highest ranked amino acids selective for, or common to MPSLs for 3 IRL dolphin proteins (DQ1-08 □, DQ1-10 Δ and DQ2-04 ○), each of the 9 positions the ranking data for all 3 MPSLs were superimposed on a single graph (sample graph for position 2 is shown). (B). The top 5 amino acids at each position were given a value 1–3 depending on their rank in the composite graph For example, Tyrosine (Y) in position 2 would be assigned a value of 1 for DQ2-4, 2 in DQ1-10, and is not in the top 5 for DQ1-8; Methionine (M) would be given a value of 1 for both DQ1-8 and DQ1-10. (C) Combinations of the common or most selective amino acids amino acids used to generate sequences a total of 13,392 sequences were generated.
Summary of hits from search of UniProtKB database (90,645,980 million entries).
| # combinations | # sequences | # sequences in list | |||
|---|---|---|---|---|---|
| 3,456 | 1,090 | 32% | 393 | 36% | |
| 3,888 | 540 | 14% | 129 | 24% | |
| 2,586 | 223 | 9% | 27 | 12% | |
| 3,456 | 178 | 5% | 29 | 16% | |
Proteins and pathogens identified by MPSLA for dolphins from the IRL.
| Organism | # | a | Protein | # | a | Sequence | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | Uncharacterized protein | ||||||||
| 2 | 9 | + | Cell division protein FtsQ | 5 | + | |||||
| Competence protein ComE | ||||||||||
| Invasion protein expression up-regulator SirB | ||||||||||
| LPS export ABC transporter periplasmic protein LptC | ||||||||||
| Surface polysaccharide O-acyltransferase, integral membrane enzyme | ||||||||||
| 3 | 3 | + | Sec-independent protein translocase protein TatB | 2 | ||||||
| Tail length tape-measure protein | ||||||||||
| 4 | 17 | + | ABC transporter ATP-binding protein | 6 | + | |||||
| ComEA protein | ||||||||||
| Membrane protein, PF03706 family (Fragment) | ||||||||||
| Permease, cytosine/purine, uracil, thiamine, allantoin family | ||||||||||
| Putative stage III sporulation protein E | ||||||||||
| Signal recognition particle receptor FtsY | ||||||||||
| 5 | 1 | Uncharacterized protein | + | |||||||
| 6 | 4 | 1,3-beta-glucanosyltransferase | 5 | + | ||||||
| Zinc finger protein klf1 | ||||||||||
| C2H2 finger domain-containing protein | ||||||||||
| 7 | 30 | + | ABC transporter permease | 12 | + | |||||
| AI-2E family transporter | ||||||||||
| Alkyl hydroperoxide reductase | ||||||||||
| Bacitracin transport permease bcrb | ||||||||||
| Cadmium efflux P-type ATPase | ||||||||||
| D-ribose ABC transporter substrate-binding protein | ||||||||||
| DUF1453 domain-containing protein | ||||||||||
| Endoribonuclease | ||||||||||
| Lipoprotein signal peptidase | ||||||||||
| Phage protein | ||||||||||
| 8 | 10 | + | ABC transporter permease | 5 | + | |||||
| Flp pilus assembly protein CpaB | IIVVVALAA | LMVVAVLVV | VIVVAALVA | WIVVAVLIA | ||||||
| Receptor family ligand-binding protein | ||||||||||
| Stress protection protein MarC | ||||||||||
| 9 | 10 | + | ABC transporter permease | 1 | ||||||
| 10 | 4 | + | BAX inhibitor (BI)-1 like protein (UPF0005 domain) | 3 | + | |||||
| GntP family permease | ||||||||||
| D-glycerate transporter (Predicted) | ||||||||||
| Uncharacterized protein | ||||||||||
| 11 | 1 | Disulfide bond formation protein B | 1 | |||||||
| 12 | 12 | + | Uracil permease | 6 | + | |||||
| Copper-exporting P-type ATPase A | ||||||||||
| PTS system beta-glucoside-specific EIIBCA component | ||||||||||
| Signal peptidase I | ||||||||||
| Sortase | ||||||||||
| Sporulation integral membrane protein YtvI | ||||||||||
| 13 | 36 | + | Arabinosyltransferase C | 14 | + | |||||
| Beta-carotene 15,15'-monooxygenase | ||||||||||
| Competence protein ComE-like protein | ||||||||||
| CvpA family protein | ||||||||||
| YceI-like domain protein | ||||||||||
| CytoChrome c oxidase Caa3 assembly factor | ||||||||||
| Major facilitator transporter | ||||||||||
| Septum formation | ||||||||||
| Serine protease | ||||||||||
| Signal peptidase I | ||||||||||
| Sodium/hydrogen exchanger | ||||||||||
| Phage tail tape measure protein, TP901 family (Fragment) | ||||||||||
| Phosphatidate cytidylyltransferase | ||||||||||
| Polyisoprenoid-binding protein YceI | ||||||||||
| 14 | 4 | Lipoprotein signal peptidase | ||||||||
| 15 | 5 | + | Inner membrane peptidase. Serine peptidase. MEROPS family S49 | 6 | + | |||||
| Major facilitator superfamily MFS_1 | ||||||||||
| Methyl-accepting chemotaxis sensory transducer TarH | ||||||||||
| Probable lipid II flippase MurJ | ||||||||||
| 16 | 7 | + | Multidrug ABC transporter permease | 2 | + | |||||
| Alpha/beta hydrolase | ||||||||||
| 17 | 3 | Glutathione-regulated potassium-efflux system protein (K(+)/H(+)antiporter) | 4 | + | ||||||
| Hydrogenase-4 component B / Formate hydrogenlyase subunit 3 | ||||||||||
| 18 | 2 | Amino acid transporter | 2 | |||||||
| Glutamyl-tRNA amidotransferase subunit A | ||||||||||
| 19 | 1 | Hemagglutinin | 1 | |||||||
| 20 | 1 | + | Proline and betaine transporter | 2 | + | |||||
| 21 | 1 | Uncharacterized protein | + | |||||||
| 22 | 3 | Lipoprotein releasing system transmembrane LolC | 3 | + | ||||||
| 23 | 3 | + | ATP-binding cassette, subfamily B | 3 | + | |||||
| Predicted arginine uptake transporter | ||||||||||
| 24 | 3 | Mechanosensitive ion channel protein MscS | 2 | + | | |||||
| Cytochrome c oxidase accessory protein CcoG | ||||||||||
| 25 | 2 | Iron ABC transporter permease | 3 | |||||||
| 26 | 1 | Uncharacterized protein | + | | ||||||
| 27 | 132 | + | ABC transporter permease | 61 | + | 157 | ||||
| Acyl-CoA dehydrogenase | ||||||||||
| Adenosylcobinamide-GDP ribazoletransferase | ||||||||||
| Arsenic transporter | ||||||||||
| Cadmium-translocating P-type ATPase | ||||||||||
| ComE operon protein 1 | ||||||||||
| Cytochrome C-type biogenesis protein ccdA | ||||||||||
| Dipeptide-binding protein DppE precursor | ||||||||||
| Ethanolamine permease | ||||||||||
| Exopolyphosphatase | ||||||||||
| Flotillin | ||||||||||
| Haloacid dehalogenase | ||||||||||
| Long-chain-acyl-CoA dehydrogenase | ||||||||||
| MCE-family protein MCE1A (Fragment) | ||||||||||
| Modulator of FtsH protease HflK | ||||||||||
| Murein biosynthesis integral membrane MurJ | ||||||||||
| Oxidoreductase molybdopterin-binding protein | ||||||||||
| Protein-export membrane protein SecF | ||||||||||
| Thioredoxin | ||||||||||
| Type VII secretion integral membrane protein EccD | ||||||||||
| UDP-phosphate galactose phosphotransferase | ||||||||||
| Virulence factor Mce | ||||||||||
| 28 | 10 | + | Cytochrome C oxidase assembly factor CtaG-related | 5 | + | |||||
| HTH-type transcriptional repressor | ||||||||||
| MFS transporter | ||||||||||
| Thioredoxin | ||||||||||
| Type VII secretion integral membrane protein EccD-like protein | ||||||||||
| 29 | 5 | + | Macrolide export ATP-binding/permease protein MacB | 2 | + | |||||
| Electron transport complex subunit B | ||||||||||
| 30 | 5 | NADH-ubiquinone/plastoquinone complex I subunit | 3 | + | ||||||
| LemA family protein | ||||||||||
| 31 | 61 | + | Acyltransferase family protein | 15 | + | 61 | ||||
| Allantoin permease | ||||||||||
| Arabinose efflux permease family protein | ||||||||||
| TrbK entry exclusion protein | ||||||||||
| Chemotaxis sensory transducer | ||||||||||
| Cytochrome o ubiquinol oxidase subunit IV | ||||||||||
| Deoxyribonuclease | ||||||||||
| TspO and MBR related proteins | ||||||||||
| Endolytic murein transglycosylase | ||||||||||
| Heat-shock protein | ||||||||||
| 32 | 17 | + | ABC transporter permease | 10 | + | |||||
| Arabinosyltransferase | ||||||||||
| Cell wall arabinan synthesis protein | ||||||||||
| FMN-binding glutamate synthase family protein | ||||||||||
| Histidine kinase | ||||||||||
| Methylamine utilization protein MauD | ||||||||||
| NADH-Ubiquinone/plastoquinone (Complex I), various chains family protein | ||||||||||
| Pilus assembly protein TadE | ||||||||||
| Sensor histidine kinase DcuS | ||||||||||
| 33 | 1 | Uncharacterized protein | + | |||||||
| 34 | 10 | + | Macrolide export protein MacA | 4 | + | |||||
| D-galactonate transporter | ||||||||||
| 35 | 2 | Autophagy protein | 1 | + | ||||||
| 36 | 3 | Multidrug MFS transporter | 2 | + | ||||||
| 37 | 8 | + | Major facilitator transporter | 4 | + | |||||
| PTS system beta-glucoside-specific IIA Glc family | ||||||||||
| Septation ring formation regulator EzrA | ||||||||||
| Phosphate transport system permease protein PstA | ||||||||||
| 38 | 16 | + | Acriflavin resistance protein | 6 | + | |||||
| DeoR faimly transcriptional regulator | ||||||||||
| Dipeptide and tripeptide permease A | ||||||||||
| Flagellar basal body-associated protein FliL | ||||||||||
| Homoserine/homoserine lactone efflux protein | ||||||||||
| Thiol-disulfide isomerase | ||||||||||
| Putative ABC transporter, permease component | ||||||||||
The 3,456 sequences derived from amino acids common to the 3 Model Positional scanning libraries for (DQ1-8, DQ1-10 and DQ2-4) were searched for protein matches in the UniProtKB database through the Protein Information Resource (PIR). The search generated 1090 matches with 393 sequences identified in proteins of microbes associated with marine mammals. Sequence matches for proteins originating from reported pathogens in marine mammals are summarized here. Columns listed as (#) refer to numbers identified, or (a) list includes undefined species or proteins. Full details are supplied in S12 Table.
Marine pathogens identified from sequences derived from MPSLs for dolphins from the IRL.
| Org. | Genus | Reported species identified | Org. | Genus | Reported species identified | ||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | IRL | Aeromonas sp. | ✓ | 28 | Cet | Absidia sp. | ✓ | ||
| 2 | IRL | Bacillus sp. | ✓ | 29 | Cet | Acinetobacter sp. | ✓ | ||
| 3 | IRL | Campylobacter sp. | ✓ | 30 | Pin | Corynebacterium sp. | ✓ | | |
| 4 | IRL | Candida. sp. | ✓ | 31 | Cet | Fusarium sp. | ✓ | ||
| 5 | IRL | Clostridium sp. | ✓ | 32 | Pin | Bordetella sp. | ✓ | ||
| 6 | IRL | Edwardsiella sp. | ✓ | 33 | Cet | Citrobacter sp. | ✓ | ||
| 7 | IRL | Enterobacter sp. | ✓ | 34 | Cet | Influenza A | ✓ | ||
| 8 | IRL | Escherichia sp. | ✓ | 35 | Cet | Kingella sp. | ✓ | ||
| 9 | IRL | Helicobacter sp. | ✓ | 36 | Pin | Leptospira sp. | ✓ | ||
| 10 | IRL | Klebsiella sp. | ✓ | 37 | Cet | Micrococcus sp. | ✓ | ||
| 11 | IRL | Plesiomonas sp. | 38 | Pin | Mycoplasma sp. | ✓ | |||
| 12 | IRL | Pseudomonas sp. | ✓ | 39 | Cet | Moraxella sp. | ✓ | ||
| 13 | BD | Actinomyces sp. | ✓ | 40 | Cet | Mortierella sp. | ✓ | ||
| 14 | BD | Ajellomyces sp. | ✓ | 41 | Pin | Rhodococcus sp. | ✓ | ||
| 15 | BD | Aspergillus sp. | ✓ | 42 | Cet | Rhizopus sp. | ✓ | ||
| 16 | BD | Brucella sp. | ✓ | 43 | Cet | Serratia sp. | ✓ | ||
| 17 | BD | Enterococcus sp. | ✓ | 44 | Cet | Sporothrix sp. | ✓ | ||
| 18 | BD | Morganella sp. | ✓ | 45 | BD | Blastomyces Sp. | |||
| 19 | BD | Mycobacterium sp. | ✓ | 46 | BD | Coccidioides sp. | |||
| 20 | BD | Nocardia sp. | ✓ | 47 | BD | Trycophyton sp. | |||
| 21 | BD | Photobacterium sp. | ✓ | 48 | BD | Lacazia sp. | |||
| 22 | BD | Proteus sp. | ✓ | 49 | Pin | Dermatophilus sp. | |||
| 23 | BD | Providencia sp. | ✓ | 50 | Cet | Pasteurella sp. | |||
| 24 | BD | Salmonella sp. | ✓ | 51 | Cet | Abiotrophia sp. | |||
| 25 | BD | Streptococcus sp. | ✓ | 52 | Cet | Actinobacillus sp. | |||
| 26 | BD | Staphylococcus sp. | ✓ | 53 | Cet | Cetobacterium sp. | |||
| 27 | BD | Vibrio sp. | ✓ | 54 | Cet | Mucor sp. | |||
| 55 | Pin | Bisgaardia sp. |
*Reported in bottlenose dolphin in Indian River Lagoon IRL, other bottlenose dolphin, BD; other cetacean, Cet; or pinniped, Pin.
Sequences derived from Model Positional scanning libraries for 3 dolphin proteins (DQ1-8, DQ1-10 and DQ2-4) were searched for protein matches in the UniProtKB database through Protein Information Resource (PIR). The bacterial, fungal or viral sources of the matching proteins that have been reported to infect marine mammals are summarized.
Fig 8Relative occurrence of genera in UniProt database and in current search for marine related pathogens.
A. Percentage of entries for named genera in the database compared to the percentage of sequences identified in named genera derived from MPSLA. B. Fold increase in identification of named genera.
Fig 9Frequency of amino acids at each position of a decamer peptide sequence.
The frequency of each of the 20 amino acids (single letter code) was determined for each of the 10 positions of the peptide (one symbol for each position) from (A) a list of 616 sequences designed to have near equal distribution or (B) a list of 616 peptides derived from viral proteins.