| Literature DB >> 24441737 |
Yat T Tang1, Xin Gao1, Bruce A Rosa1, Sahar Abubucker1, Kymberlie Hallsworth-Pepin1, John Martin1, Rahul Tyagi1, Esley Heizer1, Xu Zhang1, Veena Bhonagiri-Palsikar1, Patrick Minx1, Wesley C Warren1,2, Qi Wang1, Bin Zhan3,4, Peter J Hotez3,4, Paul W Sternberg5,6, Annette Dougall7, Soraya Torres Gaze7, Jason Mulvenna8, Javier Sotillo7, Shoba Ranganathan9,10, Elida M Rabelo11, Richard W Wilson1,2, Philip L Felgner12, Jeffrey Bethony13, John M Hawdon13, Robin B Gasser14, Alex Loukas7, Makedonka Mitreva1,2,15.
Abstract
The hookworm Necator americanus is the predominant soil-transmitted human parasite. Adult worms feed on blood in the small intestine, causing iron-deficiency anemia, malnutrition, growth and development stunting in children, and severe morbidity and mortality during pregnancy in women. We report sequencing and assembly of the N. americanus genome (244 Mb, 19,151 genes). Characterization of this first hookworm genome sequence identified genes orchestrating the hookworm's invasion of the human host, genes involved in blood feeding and development, and genes encoding proteins that represent new potential drug targets against hookworms. N. americanus has undergone a considerable and unique expansion of immunomodulator proteins, some of which we highlight as potential treatments against inflammatory diseases. We also used a protein microarray to demonstrate a postgenomic application of the hookworm genome sequence. This genome provides an invaluable resource to boost ongoing efforts toward fundamental and applied postgenomic research, including the development of new methods to control hookworm and human immunological diseases.Entities:
Mesh:
Year: 2014 PMID: 24441737 PMCID: PMC3978129 DOI: 10.1038/ng.2875
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Summary of N. americanus genomic features
| Estimated genome size (Mega bases) | 244 |
|---|---|
|
| |
| Total number of supercontigs (>=1 kb) | 11,713 |
| Total number of base pairs (bp) in supercontigs | 244,009,025 |
| Number of N50 supercontigs | 283 |
| N50 supercontig length (bp) | 213,095 |
| Number N90 supercontigs | 1,336 |
| N90 supercontig length (bp) | 29,214 |
| GC content of whole genome (%) | 40.20% |
| Repetitive sequences (%) | 23.50% |
|
| |
| Total number of protein coding genes | 19151 |
| Avg. gene loci footprint (bp) | 4289 |
| Avg. # exons per gene | 6.4 |
| Avg. exon size (bp) | 125 |
| Avg. intron size (bp) | 642 |
| Avg. intergenic space (bp) | 6631 |
N50: number-50% of all nucleotides in the assembly are in 283 supercontigs, length-50% of the genome is in supercontigs with a minimum length of 213kb; N90: number-90% of all nucleotides in the assembly are within 1,336 supercontigs, length-90% of the genome is in supercontigs with a minimum length of 29kb.
Figure 1N. americanus gene feature organization compared to C. elegans
a, The average exon of N. americanus genes is significantly shorter and the average intron length is significantly larger than for C. elegans genes. b, Orthologous genes have significantly more introns than non-orthologous genes in both species. c, Introns are longer for orthologous genes in C. elegans at every intron position (compared to non-orthologous genes). In a-c, Error bars indicate standard error values. d, N. americanus genes in operons and conserved with C. elegans shown on the C. elegans chromosomes.
Figure 2Molecular functions enriched among N. americanus genes, stage-enriched genes and the N. americanus degradome
a, “Molecular Function” Gene Ontology (GO) terms enriched in life-cycle stages and in N. americanus compared to other species. Included are (i) categories enriched in the iL3 or adult life cycle stages in N. americanus (ii) categories significantly over-represented or depleted in N. americanus compared to at least two of the comparison species, and (iii) second-order root nodes. b, Expression profiling of N. americanus proteases with C.-elegans orthologs. c, Expression profiling of N. americanus proteases with no C. elegans orthologs.