| Literature DB >> 26380841 |
Samantha J Emery1, Ernest Lacey2, Paul A Haynes1.
Abstract
Eight Assemblage A strains from the protozoan parasite Giardia duodenalis were analysed using label-free quantitative shotgun proteomics, to evaluate inter- and intra-assemblage variation and complement available genetic and transcriptomic data. Isolates were grown in biological triplicate in axenic culture, and protein extracts were subjected to in-solution digest and online fractionation using Gas Phase Fractionation (GPF). Recent reclassification of genome databases for subassemblages was evaluated for database-dependent loss of information, and proteome composition of different isolates was analysed for biologically relevant assemblage-independent variation. The data from this study are related to the research article "Quantitative proteomics analysis of Giardia duodenalis Assemblage A - a baseline for host, assemblage and isolate variation" published in Proteomics (Emery et al., 2015 [1]).Entities:
Keywords: Assemblage A; Giardia duodenalis; Label-free quantitative shotgun proteomics; Parasite proteomics; Variable genome; Variant Surface Protein
Year: 2015 PMID: 26380841 PMCID: PMC4556777 DOI: 10.1016/j.dib.2015.08.003
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Classification information for the eight G. duodenalis strains used in this study including subassemblage, geographic origin, and the host species the strain was isolated from. Strain identification coincides with those previously published in the literature.
| BRIS/83/HEPU 106 | A1 | Brisbane, Australia | Human |
| BRIS87/HEPU/713 | A1 | Brisbane, Australia | Human |
| OAS1 | A1 | Canada | Sheep ( |
| Bac2 | A1 | Australia | Cat ( |
| BRIS/95/HEPU/2041 | A1 | Victoria, Australia | Cockatoo ( |
| BRIS/89/HEPU/1065 | A1 | Brisbane, Australia | Human |
| WB | A1 | Afghanistan | Human |
| BRIS/89/HEPU/1003 | A2 | Brisbane, Australia | Human |
Assemblage A1 genome strain (ATCC 50803).
Fig. 1Distribution of shared and unique proteins in the A1 subassemblage between the 1197 non-redundant proteins identified within the seven isolates analysed. The 1197 proteins were reproducibly identified in at least one isolate, with 149 (12.4%) of these proteins identified within only one isolate, and therefore considered to be uniquely expressed. Part A (left) shows the distribution of these 149 uniquely expressed proteins by isolate in the seven A1 isolates analysed in this study. Part B (right) shows the distribution of the shared proteins between the seven subassemblage A1 isolates. A total of 503 (42%) proteins were identified in all seven isolates examined in this study, and are considered common between isolates of the A1 subassemblage. The remaining segments indicates proteins common within decreasing numbers of isolates, while the final elevated segment indicates the 149 isolate-unique proteins.
| Subject area | Biology |
| More specific subject area | Quantitative proteomic data of 8 |
| Type of data | Table, Figure, Supplementary Tables |
| How data was acquired | Protein extracts from biological triplicates were digested in solution, and fractionated online using GPF with mass range fraction optimised for the |
| Data format | Raw data, reproducibly identified proteins. |
| Experimental factors | 8 |
| Experimental features | Sample triplicates were combined to produce reproducibly identified proteins and spectral counts of each protein were used to calculate NSAF values for each protein. |
| Data source location | Sydney, NSW, Australia |
| Data accessibility | Data is available from |
First proteomic baseline for taxonomy and isolate variation in Assemblage A strains. Provides proteome coverage of isolates from animal and human hosts, both A1 and A2 subassemblages, with an emphasis on Australian isolates. Evaluates database-dependent losses based on new genome reclassifications and releases in Assemblage A. Identifies sources of inter- and intra-assemblage A isolate variation and its impacts. |