Dan Zhou, Donghui Zhang, Guohui Ding, Linna Shi, Qing Hou, Yuting Ye, Yang Xu, Huayun Zhou, Chunrong Xiong, Shengdi Li, Jing Yu, Shanchao Hong, Xinyou Yu, Ping Zou, Chen Chen, Xuelian Chang, Weijie Wang, Yuan Lv, Yan Sun, Lei Ma, Bo Shen, Changliang Zhu.
Abstract
BACKGROUND: Anopheles sinensis is an important mosquito vector of Plasmodium vivax, which is the most frequent and widely distributed cause of recurring malaria throughout Asia, and particularly in China, Korea, and Japan.
Year: 2014 PMID: 24438588 PMCID: PMC3901762 DOI: 10.1186/1471-2164-15-42
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of the raw reads of the sequencing analysis of Anopheles sinensis
| Single-end | 5,171,177 | 153,380 | 5,017,797 | 383 |
| 3 K paired end | 6,302,769 | 270,609 | 6,032,160 | 205 |
| 8 K paired end | 2,829,232 | 80,660 | 2,748,572 | 207 |
| 20 K paired end | 864,365 | 14,201 | 850,164 | 347 |
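The table above lost its column headers. As a quick consistency check, the sketch below tests whether, in every row, the second column minus the third equals the fourth. Reading the three columns as total, removed, and retained reads is an assumption, not something the source states, and the name `check_rows` is purely illustrative.

```python
# Rows copied from the table above: (library, column 2, column 3, column 4).
# Interpreting the columns as total / removed / retained reads is an assumption.
rows = [
    ("Single-end", 5_171_177, 153_380, 5_017_797),
    ("3 K paired end", 6_302_769, 270_609, 6_032_160),
    ("8 K paired end", 2_829_232, 80_660, 2_748_572),
    ("20 K paired end", 864_365, 14_201, 850_164),
]


def check_rows(rows):
    """Return the libraries where column 2 minus column 3 differs from column 4."""
    return [name for name, total, removed, kept in rows if total - removed != kept]


mismatches = check_rows(rows)                       # empty list if every row is consistent
total_kept = sum(kept for _, _, _, kept in rows)    # sum of the fourth column
print(mismatches, total_kept)
```

All four rows satisfy the relation, which supports (but does not prove) that reading of the columns.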
Estimated genome size of Anopheles sinensis based on K-mer analysis
| 13 | 1,874,145,919 | 7 | 267,735,131 | 1,934,235,439 | 5,007,460 |
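The numbers in this row are consistent with the usual K-mer estimate, in which genome size is the total K-mer count divided by the peak K-mer depth. The sketch below assumes the second column is the K-mer count, the third the peak depth, and the fourth the estimated genome size in bp; the column headers did not survive extraction, so that mapping is an assumption.

```python
def estimate_genome_size(kmer_count: int, peak_depth: int) -> int:
    """Estimate genome size (bp) as total K-mer count divided by peak K-mer depth."""
    if peak_depth <= 0:
        raise ValueError("peak_depth must be positive")
    # Integer division: the estimate is reported as a whole number of bases.
    return kmer_count // peak_depth


# Values taken from the table row above (column meaning assumed, see lead-in).
size = estimate_genome_size(1_874_145_919, 7)
print(f"{size:,} bp")
```

Under that reading, the result reproduces the fourth column of the row, roughly 268 Mb.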
Statistics for the assembly of the Anopheles sinensis genome
| | Contig size (bp) | Contig number | Scaffold size (bp) | Scaffold number |
|---|---|---|---|---|
| N90 | 2,384 | 10,962 | 30,600 | 582 |
| N80 | 7,384 | 6,003 | 149,975 | 249 |
| N70 | 13,407 | 3,858 | 338,010 | 149 |
| N60 | 20,357 | 2,558 | 537,812 | 98 |
| N50 | 30,137 | 1,685 | 814,231 | 66 |
| Longest | 357,810 | | 5,918,260 | |
| Total Size | 214,524,114 | | 220,784,734 | |
| Total Number (>100 bp) | | 27,488 | | 9,596 |
| Total Number (>2 Kb) | | 12,156 | | 2,038 |
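For readers unfamiliar with the N50 to N90 rows, the following sketch shows the standard definition: sort sequences from longest to shortest and report the length, and the number of sequences, at which the running total first reaches the given percentage of the assembly. The function name `nx` and the toy lengths are illustrative only; this is not the assembler's own code.

```python
def nx(lengths, percent=50):
    """Return (length, count) for the Nx statistic.

    length: size of the sequence at which the cumulative total, taken from the
            longest sequence downward, first reaches `percent` of the assembly.
    count:  number of sequences needed to reach that point.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    total = sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # Integer comparison avoids floating-point rounding at the threshold.
        if running * 100 >= percent * total:
            return length, count


toy_contigs = [100, 80, 60, 40, 20]   # hypothetical lengths, 300 bp in total
n50 = nx(toy_contigs, 50)
n90 = nx(toy_contigs, 90)
print(n50, n90)
```

Each Nx row in the table pairs a size with a number in the same way, for contigs and for scaffolds.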
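Characteristics of the genomes of Anopheles sinensis, Anopheles gambiae, Aedes aegypti, and Culex quinquefasciatus
| | An. sinensis | An. gambiae | Ae. aegypti | Cx. quinquefasciatus |
|---|---|---|---|---|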
| Genome size (Mbps) | 220.8 | 278 | 1376 | 540 |
| Genome coverage (×) | 18.8 | 10.2 | 7.6 | 6.1 |
| Number of contig | 27,497 | 18,962 | 36,206 | 48,671 |
| Number of scaffold | 9,594 | 8,987 | 4,758 | 3,171 |
| Contig N50 (Kbps) | 30.1 | --- | 82.6 | 28.6 |
| Scaffold N50 (Kbps) | 814.2 | --- | 1.5 | 486.8 |
| Average contig size (bps) | 7800 | 13878 | 36184 | --- |
| Average scaffold size (bps) | 23012 | 30930 | 290873 | --- |
| GC (%) | 42.6 | 40.9 | 38.2 | 37.4 |
| Number of gene (protein-coding) | 16766 | 12457 | 15419 | 18883 |
| Percentage of gene Length (%) | 26.4 | 23.1 | 17.4 | 18.5 |
| Percentage of exon region length (%) | 11.0 | 7.2 | 1.9 | 4.4 |
Figure 1. Repetitive elements in Anopheles sinensis.
Figure 2. Anopheles sinensis protein length plotted against the orthologous protein length for Anopheles gambiae. The red dashed line shows the results of a linear regression. The concordance of the two variables is presented with 95% confidence intervals. Perfect concordance (1.0) would indicate that all points fall on the line where x = y, depicted by the edge of the grey shading.
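The caption does not name the concordance statistic. One common measure that equals 1.0 exactly when every point lies on x = y is Lin's concordance correlation coefficient, sketched here as an assumption about what was plotted; the protein lengths below are made-up illustration values, not data from the study.

```python
from statistics import fmean


def concordance_ccc(x, y):
    """Lin's concordance correlation coefficient for paired samples x and y."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must be paired and contain at least two values")
    n = len(x)
    mx, my = fmean(x), fmean(y)
    # Population (divide-by-n) variances and covariance.
    sxx = sum((a - mx) ** 2 for a in x) / n
    syy = sum((b - my) ** 2 for b in y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    denom = sxx + syy + (mx - my) ** 2
    if denom == 0:
        raise ValueError("concordance is undefined for constant, identical inputs")
    return 2 * sxy / denom


lengths_a = [100, 250, 400, 800]            # hypothetical protein lengths (aa)
lengths_b = [a + 50 for a in lengths_a]     # same lengths shifted off the x = y line
identical = concordance_ccc(lengths_a, lengths_a)
shifted = concordance_ccc(lengths_a, lengths_b)
print(identical, shifted)
```

Unlike the Pearson correlation, this coefficient penalizes a systematic offset, so the shifted series scores below 1 even though it is perfectly linear.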
Figure 3. Gene ontology classifications for Anopheles sinensis at level two. Gene ontology classifications included three components: biological processes, cellular components and molecular functions.
Figure 4. Ortholog delineation among the protein-coding gene repertoires of the four mosquito species and Drosophila melanogaster. The categories of orthologous groups are depicted as follows: (i) 1:1:1:1:1 indicates single-copy orthologs in all species; (ii) N:N:N:N:N indicates multi-copy orthologs in all species; (iii) N in 1, N in 2, etc. indicates multi-copy orthologs in one species, two species, etc.; (iv) x:x:x:x:0, x:0:x:x:x, x:x:0:x:0 etc. indicates (by a 0) which of the five species, in the order listed above, did not contain single-copy or multi-copy orthologs. The remaining proportion for each species exhibited no orthologs with genes in the other species (depicted as species-specific in the figure).
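The category labels in this caption can be read as a rule applied to the per-species gene counts of each orthologous group. The sketch below is one interpretation of that rule, not the authors' pipeline; `classify_group`, the label used for genes without orthologs elsewhere, and the example counts are all illustrative assumptions.

```python
def classify_group(counts):
    """Label an orthologous group from its gene counts per species (fixed species order)."""
    present = [c > 0 for c in counts]
    n_present = sum(present)
    if n_present == 0:
        raise ValueError("a group must contain at least one gene")
    if n_present == 1:
        # Genes found in a single species only: no orthologs in the others.
        return "species-specific"
    if not all(present):
        # Mark absent species with 0 and present species with x, e.g. x:x:x:x:0.
        return ":".join("x" if p else "0" for p in present)
    multi = sum(c > 1 for c in counts)
    if multi == 0:
        return ":".join("1" for _ in counts)   # single copy everywhere
    if multi == len(counts):
        return ":".join("N" for _ in counts)   # multi-copy everywhere
    return f"N in {multi}"                     # multi-copy in some species only


examples = {
    (1, 1, 1, 1, 1): classify_group((1, 1, 1, 1, 1)),
    (2, 3, 2, 4, 2): classify_group((2, 3, 2, 4, 2)),
    (1, 3, 1, 1, 1): classify_group((1, 3, 1, 1, 1)),
    (1, 1, 1, 1, 0): classify_group((1, 1, 1, 1, 0)),
    (0, 0, 4, 0, 0): classify_group((0, 0, 4, 0, 0)),
}
for counts, label in examples.items():
    print(counts, label)
```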
Characteristics of microsynteny blocks between Anopheles sinensis, Anopheles gambiae, Aedes aegypti, Culex quinquefasciatus, and Drosophila melanogaster
| 927 | 8,457 | ||
| 1,668 | 6,792 | ||
| 1,690 | 7,087 | ||
| 1,031 | 2,658 | ||
Figure 5. The coverage of the microsynteny blocks of Anopheles sinensis on the chromosomes of Anopheles gambiae.
Figure 6. The inferred supertree for four mosquito species and Drosophila melanogaster. The topology of the supertree was evaluated by bootstrap percentages. Distances are in millions of years.