| Literature DB >> 34292068 |
Hatim Almutairi1,2, Michael D Urbaniak1, Michelle D Bates1, Narissara Jariyapan3, Godwin Kwakye-Nuako4, Vanete Thomaz-Soccol5, Waleed S Al-Salem2, Rod J Dillon1, Paul A Bates1, Derek Gatherer1.
Abstract
We present the LGAAP computational pipeline, which was successfully used to assemble six genomes of the parasite subfamily Leishmaniinae to chromosome-scale completeness from a combination of long- and short-read sequencing data. LGAAP is open source, and we suggest that it may easily be ported for assembly of any genome of comparable size (∼35 Mb).Entities:
Year: 2021 PMID: 34292068 PMCID: PMC8297458 DOI: 10.1128/MRA.00439-21
Source DB: PubMed Journal: Microbiol Resour Announc ISSN: 2576-098X
FIG 1Graphical representation of the LGAAP protocol.
Assembly metrics for Leishmania genome assemblies deposited in GenBank
| Organism | NCBI assembly no. | Strain | Sequencing technology(ies) | Assembly method | No. of scaffolds | Total length (bp) | |
|---|---|---|---|---|---|---|---|
| 209-622 | PacBio RS II | CANU | 118 | 33,648,436 | 763,733 | ||
| L147 | Illumina | Allpaths-LG | 160 | 31,630,816 | 1,001,864 | ||
| 210-660 | PacBio RS II | CANU | 92 | 33,504,997 | 850,106 | ||
| NA | Roche 454, Illumina | Newbler, Velvet, Zorro | 2,627 | 29,029,348 | 22,901 | ||
| UA301 | Illumina | SMALT | 34 | 32,156,470 | NA | ||
| LEM1108 | Illumina | AllPaths-LG | 168 | 31,269,090 | 1,057,807 | ||
| IOC-L 3564 | IonTorrent | SPAdes | 1,029 | 38,003,648 | 758,103 | ||
| MHOM/BR/75/M2903 | Roche 454 | Newbler | 744 | 35,210,150 | 1,030,512 | ||
| MHOM/BR/75/M2904 | Sanger | NA | 138 | 32,068,771 | 992,961 | ||
| MHOM/BR/75/M2904 | PacBio, Illumina | NA | 35 | 32,301,632 | NA | ||
| MCER/BR/1981/M6445/Salvaterra | Illumina | SOAPdenovo | 36 | 31,924,566 | 1,043,794 | ||
| MHOM/HD/2017/M32502/Amapala | Illumina | SOAPdenovo | 36 | 31,924,975 | 1,043,719 | ||
| BHU 1220 | Illumina | Bowtie | 36 | 32,414,853 | 1,024,085 | ||
| BPK282A1 | Roche 454, Illumina | NA | 36 | 32,444,968 | 1,024,085 | ||
| FDAARGOS_360 | PacBio, Illumina | CANU | 71 | 34,011,430 | 828,097 | ||
| FDAARGOS_361 | PacBio, Illumina | CANU | 56 | 33,453,722 | 1,033,854 | ||
| HU3 | Illumina | NA | 36 | 33,035,865 | NA | ||
| Ld 2001 | SOLiD | Velvet | 14,518 | 27,466,456 | 3,370 | ||
| Ld 39 | SOLiD | Velvet | 16,323 | 23,683,296 | 1,772 | ||
| LdCL | PacBio, Illumina | HGAP, Celera Assembler, CANU | 36 | 32,959,864 | NA | ||
| MHOM/IN/1983/AG83 | Illumina | AllPaths, STLab-assembler | 36 | 32,148,377 | 1,015,993 | ||
| MHOM/IN/1983/AG83 | Illumina | AllPaths | 36 | 32,196,393 | 1,029,368 | ||
| Pasteur | PacBio | HGAP | 37 | 33,545,875 | 1,079,609 | ||
| LEM3045 | Illumina | AllPaths-LG | 495 | 30,761,861 | 868,233 | ||
| MCAV/BR/2001/CUR178, LV763 | ONT, Illumina | LGAAP | 54 | 33,318,864 | 1,075,649 | ||
| LEM452 | Illumina | AllPaths-LG | 492 | 31,398,648 | 379,527 | ||
| 204-365 | PacBio RS II | CANU | 123 | 33,816,023 | 683,170 | ||
| HUUFS14 | Illumina | ABySS | 2,507 | 32,578,914 | 29,848 | ||
| JPCM5 | Sanger | NA | 76 | 32,122,061 | 1,043,848 | ||
| JPCM5 | PacBio, Illumina | NA | 36 | 32,803,248 | NA | ||
| TR01 | Illumina | Geneious | 36 | 32,009,138 | NA | ||
| 216-34 | PacBio RS II | CANU | 137 | 34,152,029 | 638,860 | ||
| Friedlin | Sanger | NA | 36 | 32,855,089 | NA | ||
| LV39c5 | Roche 454 | Newbler | 849 | 32,327,517 | 978,401 | ||
| SD 75.1 | Roche 454 | Newbler | 36 | 31,242,750 | 1,022,795 | ||
| LEM2494 | Illumina | AllPaths-LG | 251 | 30,813,970 | 873,628 | ||
| MHOM/TH/2012/LSCM1, LV760 | ONT, Illumina | LGAAP | 42 | 32,413,670 | 1,046,741 | ||
| 215-49 | PacBio RS II | CANU | 55 | 32,057,209 | 825,953 | ||
| MHOM/GT/2001/U1103 | Sanger | NA | 588 | 32,108,741 | 1,044,075 | ||
| MHOM/TH/2014/LSCM4, LV768 | ONT, Illumina | LGAAP | 98 | 34,194,276 | 1,120,138 | ||
| MHOM/COL/81/L13 | Illumina | SOAP denovo | 952 | 31,263,945 | 156,905 | ||
| MHOM/PA/94/PSC-1 | Roche 454, Illumina | Newbler, PAGIT | 35 | 30,688,794 | 1,043,456 | ||
| LEM-1537 | NA | NA | 37 | 33,890,200 | 1,047,715 | ||
| PAB-4377 | NA | NA | 37 | 32,907,781 | 1,015,393 | ||
| AIIMS/LM/SS/PKDL/LD-974 | Illumina | A5 assembly pipeline | 1,100 | 27,848,322 | 61,709 | ||
| MHOM/GH/2012/GH5, LV757 | ONT, Illumina | LGAAP | 116 | 35,953,538 | 1,100,365 | ||
| MPRO/NA/1975/252, LV425 | ONT, Illumina | LGAAP | 67 | 34,118,624 | 1,066,046 | ||
| Parrot Tar II | PacBio RS II | HGAP | 179 | 35,416,496 | 663,019 | ||
| Parrot Tar II | Roche 454 | Newbler | 7,227 | 31,556,583 | 7,432 | ||
| ATCC 50129 | Illumina | CLC Genomics Workbench | 1,928 | 30,870,161 | 32,161 | ||
| CDC216-162 | PacBio RS II, Illumina | Flye | 43 | 32,700,668 | 1,070,514 | ||
| L590 | Illumina | AllPaths-LG | 448 | 32,989,014 | 303,214 | ||
| MHOM/LB /2017/IK | Illumina | CLC NGS Cell | 9,499 | 32,139,927 | 13,854 | ||
| MHOM/LB/2015/IK | Illumina | CLC NGS Cell | 17,013 | 32,280,712 | 7,721 | ||
| LEM423 | Illumina | AllPaths-LG | 336 | 32,320,007 | 397,299 | ||
| MCOE/PA/1965/C119, LV43 | ONT, Illumina | LGAAP | 74 | 34,958,538 | 967,170 |
Asterisks indicate the six genomes assembled using LGAAP. NA, either not applicable to the technology used or not available from the GenBank record.
SOLiD, sequencing by oligonucleotide ligation and detection.