| Literature DB >> 30029559 |
Rocio Garcia-Rubio1, Sara Monzon2, Laura Alcazar-Fuoli3, Isabel Cuesta4, Emilia Mellado5.
Abstract
Aspergillus fumigatus is a ubiquitous saprophytic mold and a major pathogen in immunocompromised patients. The effectiveness of triazole compounds, the A. fumigatus first line treatment, is being threatened by a rapid and global emergence of azole resistance. Whole genome sequencing (WGS) has emerged as an invaluable tool for the analysis of genetic differences between A. fumigatus strains, their genetic background, and antifungal resistance development. Although WGS analyses can provide a valuable amount of novel information, there are some limitations that should be considered. These analyses, based on genome-wide comparative data and single nucleotide variant (SNV) calling, are dependent on the quality of sequencing, assembling, the variant calling criteria, as well as on the suitable selection of the reference genome, which must be genetically close to the genomes included in the analysis. In this study, 28 A. fumigatus genomes sequenced in-house and 73 available in public data bases have been analyzed. All genomes were distributed in four clusters and showed a variable number of SNVs depending on the genome used as reference (Af293 or A1163). Each reference genome belonged to a different cluster. The results highlighted the importance of choosing the most suitable A. fumigatus reference genome to avoid misleading conclusions.Entities:
Keywords: Aspergillus fumigatus; azole resistance; cyp51A; reference genomes; whole genome sequencing
Year: 2018 PMID: 30029559 PMCID: PMC6071029 DOI: 10.3390/genes9070363
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Aspergillus fumigatus strains whole genome sequenced in our laboratory. The assigned lineage was based on the phylogenetic study.
| Samples | Origin | AZL SC | Mating Type | Source | Lineage | |
|---|---|---|---|---|---|---|
| AF293 | SP | 5SNPs | S | M1.2 | CL | 3 |
|
| SP | WT | S | M1.1 | CL | 1.1 |
| ATCC204305 | SP | I242V | S | M1.1 | CL | 1.2 |
| ATCC46645 | SP | WT | S | M1.1 | CL | 2.1 |
| CEA10 | SP | WT | S | M1.1 | CL | 1.1 |
| CM2141 | SP | WT | S | M1.1 | CL | 2.2 |
| CM237 | SP | WT | S | M1.1 | CL | 1.1 |
| CM2495 | SP | 3SNPs | S | M1.1 | CL | 4 |
| CM2730 | SP | 3SNPs | S | M1.2 | CL | 4 |
| CM2733 | SP | 3SNPs | S | M1.1 | CL | 4 |
| CM3248 | SP | N248K | S | M1.1 | CL | 1.1 |
| CM3249 | SP | 3SNPs | S | M1.1 | CL | 4 |
| CM3249b | SP | 3SNPs | S | M1.1 | CL | 4 |
| CM3262 | SP | 3SNPs | S | M1.1 | CL | 4 |
| CM3720 | SP | 3SNPs | S | M1.1 | CL | 4 |
| CM4602 | SP | 3SNPs | S | M1.2 | CL | 4 |
| CM4946 | SP | 3SNPs | S | M1.2 | CL | 4 |
| CM5419 | SP | WT | S | M1.2 | CL | 2.2 |
| CM5757 | SP | WT | S | M1.2 | CL | 1.1 |
| CM6126 | SP | WT | S | M1.2 | CL | 1.1 |
| CM6458 | SP | WT | S | M1.2 | CL | 1.3 |
| CM7510 | SP | WT | R | M1.1 | CL | 1.1 |
| CM7555 | SP | WT | R | M1.2 | CL | 1.3 |
| CM7560 | SP | 3SNPs | S | M1.1 | CL | 4 |
| CM7570 | SP | 3SNPs | S | M1.1 | CL | 4 |
| CM7632 | SP | 5SNPs | S | M1.1 | CL | 3 |
| TP12 | SP | 5SNPs | S | M1.2 | CL | 3 |
| TP32 | SP | 3SNPs | S | M1.1 | CL | 4 |
SP (Spain), SNPS (single nucleotide polymorphisms), 3SNPs cyp51A modifications (F46Y, M172V, E427K) and 5SNPs cyp51A modifications (F46Y, M172V, N248T, D255E, E427K), AZL susceptibility (azole susceptibility), S (susceptible), R (resistant), CL (clinical).
Aspergillus fumigatus genomes from public data bases included in this study. The assigned lineage is based on the phylogenetic study.
| Samples | Origin | AZL SC | Mating Type | Source | Lineage | Ref. | |
|---|---|---|---|---|---|---|---|
| 08-12-12-13 | NT | TR34/L98H, S297T, F495I | R | M1.1 | CL | 2.2 | [ |
| 08-19-02-10 | NT | TR34/L98H | R | M1.2 | ENV | 2.2 | [ |
| 08-19-02-30 | NT | WT | S | M1.1 | ENV | 1.3 | [ |
| 08-19-02-46 | NT | TR34/L98H | R | M1.1 | ENV | 2.2 | [ |
| 08-19-02-61 | NT | TR34/L98H | R | M1.1 | ENV | 2.2 | [ |
| 08-31-08-91 | NT | TR34/L98H | R | M1.1 | CL | 2.2 | [ |
| 08-36-03-25 | NT | TR34/L98H, S297T, F495I | R | Unclear | CL | 2.2 | [ |
| 09-7500806 | UK | WT | S | M1.2 | CL | 1.3 | [ |
| 10-01-02-27 | NT | TR34/L98H | R | M1.2 | CL | 2.2 | [ |
| 12-7504462 | UK | WT | S | M1.2 | CL | 1.3 | [ |
| 12-7504652 | UK | WT | S | M1.1 | CL | 2.1 | [ |
| 12-7505054 | UK | WT | S | M1.1 | CL | 2.1 | [ |
| 12-7505220 | UK | TR34/L98H | R | M1.1 | CL | 2.2 | [ |
| 12-7505446 | UK | TR34/L98H | R | M1.2 | CL | 2.2 | [ |
| Af293 | UK | 5SNPs | S | M1.2 | CL | 3 | [ |
| AF41 | UK | WT | S | M1.2 | CL | 1.1 | [ |
| Af65 | UK | WT | S | M1.2 | CL | 2.1 | [ |
| AF72 | UK | G54E | R | M1.2 | CL | 1.1 | [ |
| AF90 | UK | M220V | R | M1.1 | CL | 1.1 | [ |
| Afu1042-09 | IN | TR34/L98H | R | M1.1 | CL | 2.2 | [ |
| Afu124-E11 | IN | TR34/L98H | R | M1.1 | ENV | 2.2 | [ |
| Afu166-E11 | IN | TR34/L98H | R | M1.1 | ENV | 2.2 | [ |
| Afu218-E11 | IN | TR34/L98H | R | M1.1 | ENV | 2.2 | [ |
| Afu257-E11 | IN | TR34/L98H | R | M1.1 | ENV | 2.2 | [ |
| Afu343-P-11 | IN | TR34/L98H | R | Unclear | CL | 2.2 | [ |
| Afu591-12 | IN | TR34/L98H | R | M1.1 | CL | 2.2 | [ |
| Afu942-09 | IN | TR34/L98H | R | M1.1 | CL | 2.2 | [ |
| F12041 | UK | G138C | R | M1.2 | CL | 2.1 | [ |
| F12219 | UK | G54R | R | M1.2 | CL | 1.1 | [ |
| F12636 | UK | G54E | R | M1.1 | CL | 2.1 | [ |
| F13535 | UK | G138C | R | M1.2 | CL | 2.1 | [ |
| F13619 | UK | H147Y, G448S | R | M1.1 | CL | 2.1 | [ |
| F13952 | UK | G138C | R | M1.2 | CL | 2.1 | [ |
| F14403 | UK | G54R | R | M1.1 | CL | 1.1 | [ |
| F14513G | UK | G138C | R | M1.2 | CL | 2.1 | [ |
| F14532 | UK | M220T | R | M1.1 | CL | 2.1 | [ |
| F14946G | UK | WT | S | M1.2 | CL | 1.1 | [ |
| F15390 | UK | M220T | R | M1.1 | CL | 2.1 | [ |
| F15927 | CN | 3SNPs | S | M1.1 | CL | 4 | [ |
| F16134 | DN | M220K | R | M1.2 | CL | 1.1 | [ |
| F16216 | UK | TR34/L98H | R | M1.2 | CL | 2.1 | [ |
| F16311 | UK | WT | S | M1.1 | CL | 1.1 | [ |
| F17582 | UK | WT | S | M1.2 | CL | 2.1 | [ |
| F17729 | UK | WT | S | M1.2 | CL | 1.1 | [ |
| F17729W | UK | WT | S | M1.2 | CL | 1.1 | [ |
| F17764 | CN | WT | S | M1.1 | CL | 1.1 | [ |
| F18085 | UK | WT | S | M1.1 | CL | 1.1 | [ |
| F5211G | UK | WT | S | M1.1 | CL | 2.1 | [ |
| F7763 | UK | 5SNPs | S | M1.1 | CL | 3 | [ |
| IFM55369 | JP | WT | S | M1.1 | CL | 1.2 | [ |
| IFM58026 | JP | N248K | S | M1.2 | CL | 1.1 | [ |
| IFM58029 | JP | WT | S | M1.2 | CL | 1.2 | [ |
| IFM58401 | JP | WT | S | M1.2 | CL | 1.1 | [ |
| IFM59056 | JP | WT | S | M1.1 | CL | 1.1 | [ |
| IFM59073 | JP | WT | S | M1.2 | CL | 1.2 | [ |
| IFM59359 | JP | WT | S | Unclear | CL | 1.2 | [ |
| IFM59361 | JP | WT | S | M1.1 | CL | 1.3 | [ |
| IFM59365 | JP | WT | S | M1.2 | CL | 1.1 | [ |
| IFM59777 | JP | WT | S | M1.1 | CL | 1.2 | [ |
| IFM60514 | JP | WT | S | M1.2 | CL | 1.1 | [ |
| IFM61118 | JP | N248K | S | M1.1 | CL | 1.1 | [ |
| IFM61407 | JP | WT | S | Unclear | CL | 1.1 | [ |
| IFM61578 | JP | P216L | S | M1.2 | CL | 1.1 | [ |
| IFM61610 | JP | WT | S | M1.2 | CL | 1.1 | [ |
| IFM62516 | JP | P329P | S | M1.1 | CL | 1.2 | [ |
| SL143435 | PT | N248K | S | M1.2 | UNK | 1.1 | * |
| SL143436 | PT | WT | S | M1.2 | UNK | 1.1 | * |
| SL143437 | PT | WT | S | M1.1 | UNK | 1.3 | * |
| SL143438 | PT | N248K | S | M1.2 | UNK | 1.1 | * |
| SL143439 | PT | WT | S | M1.1 | UNK | 1.3 | * |
| SL143440 | PT | 3SNPs | S | M1.1 | UNK | 4 | * |
| SL143441 | PT | WT | S | M1.1 | UNK | 1.1 | * |
| SL146112 | PT | N248K | S | M1.2 | UNK | 1.1 | * |
NT (the Netherlands), UK (the United Kingdom), IN (India), CN (Canada), JP (Japan), PT (Portugal). SNPs (single nucleotide polymorphisms), 3SNPs cyp51A modifications (F46Y, M172V, E427K) and 5SNPs cyp51A modifications (F46Y, M172V, N248T, D255E, E427K), AZL SC (azole susceptibility), S (susceptible), R (resistant), CL (clinical), ENV (environmental), UNK (unknown), * NCBI SRA.
Figure 1Whole genome phylogenetic analysis of 101 Aspergillus fumigatus genomes included in the study. Dendrogram was performed using Af293 as reference genome.
Figure 2Whole genome phylogenetic analysis of 101 Aspergillus fumigatus genomes included in the study. Dendrogram was performed using A1163 as reference genome.
Mean of single nucleotide variants (SNVs). Variants within clusters and subclusters compared to both reference genomes.
| Phylogenetic Groups | SNVs vs. A1163 Reference Genome | SNVs vs. Af293 Reference Genome |
|---|---|---|
| Subcluster I.1 | 34,544 | 75,653 |
| Subcluster I.2 | 40,124 | 65,121 |
| Subcluster I.3 | 46,811 | 79,899 |
| Subcluster II.1 | 81,478 | 98,423 |
| Subcluster II.2 | 80,994 | 97,749 |
| Cluster III | 82,235 | 35,268 |
| Cluster IV | 158,154 | 159,742 |
Single nucleotide variant differences compared to both reference genomes. Variants of each A. fumigatus genome included in this study.
| Samples | A1163 | Af293 | Samples | A1163 | Af293 | Samples | A1163 | Af293 |
|---|---|---|---|---|---|---|---|---|
| 08-12-12-13 | 80,791 | 91,956 | CM2495 | 151,089 | 148,524 | F17582 | 79,598 | 97,024 |
| 08-19-02-10 | 75,401 | 89,762 | CM2730 | 157,505 | 159,177 | F17729 | 44,054 | 79,106 |
| 08-19-02-30 | 42,487 | 87,831 | CM2733 | 154,361 | 160,494 | F17729W | 43,663 | 78,416 |
| 08-19-02-46 | 78,449 | 96,243 | CM3248 | 22,834 | 74,568 | F17764 | 25,812 | 70,013 |
| 08-19-02-61 | 80,244 | 96,416 | CM3249 | 157,242 | 159,900 | F18085 | 51,241 | 78,553 |
| 08-31-08-91 | 88,972 | 98,911 | CM3249b | 154,510 | 157,780 | F5211G | 82,091 | 99,726 |
| 08-36-03-25 | 83,921 | 102,006 | CM3262 | 161,695 | 160,926 | F7763 | 84,764 | 55,332 |
| 09-7500806 | 45,286 | 74,134 | CM3720 | 156,600 | 158,781 | IFM55369 | 42,849 | 64,003 |
| 10-01-02-27 | 72,487 | 93,299 | CM4602 | 155,381 | 161,767 | IFM58026 | 22,186 | 68,777 |
| 12-7504462 | 48,927 | 75,301 | CM4946 | 158,692 | 161,082 | IFM58029 | 36,521 | 63,106 |
| 12-7504652 | 80,203 | 93,323 | CM5419 | 89,354 | 99,771 | IFM58401 | 26,669 | 63,489 |
| 12-7505054 | 86,952 | 101,471 | CM5757 | 36,027 | 74,420 | IFM59056 | 30,841 | 61,481 |
| 12-7505220 | 75,479 | 100,229 | CM6126 | 55,871 | 86,902 | IFM59073 | 43,230 | 68,775 |
| 12-7505446 | 86,194 | 100,736 | CM6458 | 42,482 | 71,670 | IFM59359 | 39,309 | 67,118 |
| Af293 | 83,486 | 313 | CM7510 | 33,559 | 79,283 | IFM59361 | 42,795 | 73,228 |
| AF293 | 78,045 | 659 | CM7555 | 51,657 | 77,679 | IFM59365 | 21,830 | 74,248 |
| AF41 | 34,879 | 72,914 | CM7560 | 158,657 | 153,769 | IFM59777 | 40,117 | 65,300 |
| Af65 | 90,143 | 103,614 | CM7570 | 160,908 | 162,292 | IFM60514 | 39,629 | 75,765 |
| AF72 | 35,271 | 73,632 | CM7632 | 81,519 | 25,490 | IFM61118 | 21,905 | 70,230 |
| AF90 | 40,262 | 73,395 | F12041 | 78,812 | 93,712 | IFM61407 | 32,025 | 76,034 |
| Afu1042-09 | 80,346 | 96,348 | F12219 | 35,440 | 73,872 | IFM61578 | 26,201 | 67,946 |
| Afu124-E11 | 81,115 | 97,509 | F12636 | 80,359 | 98,501 | IFM61610 | 41,690 | 74,102 |
| Afu166-E11 | 80,948 | 97,350 | F13535 | 78,741 | 93,777 | IFM62516 | 41,229 | 67,364 |
| Afu218-E11 | 80,571 | 96,955 | F13619 | 85,412 | 107,985 | SL143435 | 22,037 | 81,031 |
| Afu257-E11 | 80,499 | 97,008 | F13952 | 78,466 | 93,926 | SL143436 | 52,604 | 84,188 |
| Afu343-P11 | 73,797 | 101,305 | F14403 | 27,476 | 74,454 | SL143437 | 61,289 | 91,394 |
| Afu591-12 | 80,781 | 97,039 | F14513G | 78,647 | 94,213 | SL143438 | 22,676 | 81,649 |
| Afu942-09 | 80,827 | 97,149 | F14532 | 79,931 | 102,436 | SL143439 | 39,562 | 87,955 |
|
| 1043 | 78,152 | F14946G | 44,442 | 79,719 | SL143440 | 162,993 | 165,788 |
| ATCC 204305 | 37,614 | 60,180 | F15390 | 79,743 | 101,967 | SL143441 | 38,498 | 72,994 |
| ATCC 46645 | 84,994 | 103,941 | F15927 | 161,157 | 164,076 | SL146112 | 21,782 | 81,137 |
| CEA10 | 1009 | 75,854 | F16134 | 19,660 | 77,718 | TP12 | 83,363 | 24,982 |
| CM2141 | 88,704 | 107,238 | F16216 | 78,073 | 90,736 | TP32 | 163,367 | 162,032 |
| CM237 | 48,402 | 83,874 | F16311 | 51,387 | 78,630 | F17582 | 79,598 | 97,024 |
Types of SNVs and features. Variants found within each cluster compared to both genomes.
| Variants | A1163 Reference Genome | Af293 Reference Genome | ||||||
|---|---|---|---|---|---|---|---|---|
| I | II | III | IV | I | II | III | IV | |
| Frameshift variants | 256 | 512 | 477 | 828 | 446 | 587 | 133 | 819 |
| Inframe deletions | 149 | 278 | 264 | 511 | 246 | 330 | 94 | 527 |
| Inframe insertions | 191 | 369 | 291 | 561 | 325 | 447 | 91 | 556 |
| Intergenic variants | 6405 | 11,107 | 12,819 | 6730 | 13,428 | 16,394 | 3,785 | 8792 |
| Intron variants | 1568 | 3668 | 3418 | 9066 | 3120 | 4322 | 960 | 8670 |
| Missense variants | 4342 | 10,251 | 10,008 | 23,107 | 8766 | 11,973 | 2562 | 22,746 |
| Protein altering | 8 | 19 | 12 | 18 | 16 | 22 | 4 | 21 |
| Start lost | 9 | 16 | 23 | 57 | 21 | 24 | 6 | 57 |
| Stop gained | 95 | 230 | 264 | 298 | 141 | 218 | 43 | 288 |
| Stop lost | 13 | 27 | 32 | 61 | 25 | 33 | 9 | 61 |
| Stop retained | 10 | 22 | 21 | 52 | 17 | 24 | 5 | 52 |
| Synonymous variants | 3675 | 9026 | 9005 | 26,080 | 8189 | 11,058 | 2365 | 25,418 |
Figure 3Circos representation of whole-genome depth of coverage of 20 A. fumigatus genomes, averaged over 10,000-bp bins. Red marks showed null coverage, black marks a range of coverage between 1–500, and green marks showed a coverage higher than 500. Color legend shows the selected genomes of each cluster: I (blue), II (pink), III (orange) and IV (green).