| Literature DB >> 29329394 |
Jason R Miller1,2, Sergey Koren3, Kari A Dilley1, Vinita Puri1, David M Brown1, Derek M Harkins1, Françoise Thibaud-Nissen4, Benjamin Rosen5, Xiao-Guang Chen6, Zhijian Tu7, Igor V Sharakhov8,9, Maria V Sharakhova8,9, Robert Sebra10, Timothy B Stockwell11, Nicholas H Bergman11, Granger G Sutton1, Adam M Phillippy3, Peter M Piermarini1,12, Reed S Shabman1,13.
Abstract
Background: The 50-year-old Aedes albopictus C6/36 cell line is a resource for the detection, amplification, and analysis of mosquito-borne viruses including Zika, dengue, and chikungunya. The cell line is derived from an unknown number of larvae from an unspecified strain of Aedes albopictus mosquitoes. Toward improved utility of the cell line for research in virus transmission, we present an annotated assembly of the C6/36 genome.Entities:
Mesh:
Year: 2018 PMID: 29329394 PMCID: PMC5869287 DOI: 10.1093/gigascience/gix135
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Three karyotypes of the C6/36 Ae. albopictus cell line. Chromosomes are labeled chr1 for the shortest, chr2 for the longest, and chr3 for the intermediate-size chromosomes within each image. A, Cell has 3 normal paired chromosomes. An additional acrocentric chromosome pair has a short arm, indicating deletion or translocation elsewhere. An additional short haploid chromosome is unpaired. B, Cell has chromosomes with pairs slightly separated. Chr1 appears normal. The other chromosomes are abnormal, possibly due to translocation of most of 1 arm of chr2 to 1 arm of chr3. C, Cell with chromosome pairs separated. Chr1, chr2, and chr3 appear normal. The chr1 homologous pairs overlap. There are several additional short metacentric and acrocentric chromosomes, shown by arrows. The 5-μm scale bar applies to all 3 images.
Figure 2:Contig size vs coverage. Each bar in the scatter plot represents 1 contig to which short reads were mapped. The 1832 contigs with mode coverage in the ×18 ± ×6 range collectively span 95.82% of the assembled bases, but there are a few large contigs with ×36 ± ×6 coverage. The apparent bimodal distribution suggests that the ∼×18 contigs could contain separate representations of heterozygous loci, while the ∼×36 contigs could contain the consensus of both alleles from less heterozygous loci.
BUSCO gene analysis.
| No. of Instances | Genes found | Genes on contigs with ×18 | Genes on contigs with ×36 | Genes only on contigs with ×18 | Genes only on contigs with ×36 | Instances on contigs with ×18 | Instances on contigs with ×36 | Instances on other contigs |
|---|---|---|---|---|---|---|---|---|
| 1 | 825 | 704 | 113 | 704 | 113 | 704 | 113 | 8 |
| 2 | 1668 | 1662 | 57 | 1600 | 2 | 3262 | 59 | 15 |
| 3 | 98 | 98 | 9 | 80 | 0 | 275 | 10 | 9 |
| 4 | 24 | 24 | 1 | 23 | 0 | 94 | 2 | 0 |
| 5–8 | 9 | 9 | 0 | 9 | 0 | 57 | 0 | 0 |
| Total | 2624 | 2497 | 180 | 2416 | 115 | 4392 | 184 | 32 |
BUSCO genes are presumed single-copy in eukaryotic genomes, but most occur twice in the C6/36 assembly. BUSCO arthropod genes were searched against C6/36 contigs. Two instances were found for 1668 genes. The genes were further evaluated for whether any of their instances occurred in contigs with short-read coverage in the ×18 ± ×6 or ×36 ± ×6 range. These coverage values suggest haplotype separation and collapse, respectively, within the contig sequences. Of genes with exactly 2 instances in the assembly, 1662 (99.6%) had at least 1 instance on a ×18-range contig, while only 57 had at least 1 instance on a ×36-range contig. This supports the characterization of ×18-range contigs as containing sequences specific to a haplotype. Table S13 gives the coordinates and short-read coverage of every instance.
Figure 3:The C6/36 Dicer locus harbors a pseudogene allelic to a gene deletion. The dot plot illustrates 30-Kbp regions of contigs 192 and 1238. Red and blue dots indicate forward and reverse strand local alignments, respectively. These contigs were identified as paired contigs based on sharing of mapped short reads, indicating putative capture of alternate haplotypes. Five previously published A. albopictus Dicer 2 transcripts (inset) have full-length alignments to contig 192. These alignments span a single-base deletion, apparent in the contig sequence, corresponding to a previously reported null mutation in C6/36 cells. The same transcripts have only short alignments at their 5’ termini to contig 1238, indicating a previously unrecognized gene loss in the cognate allele. Additional images of the region are offered in Fig. S3.
Enrichment scores for functional clusters among the highest-expressed transcripts in C6/36 cells at rest.
| Category | Functional cluster | Score |
|---|---|---|
| Transcription and translation | ||
| Ribosome | 28.65 | |
| Translation factor activity | 9.98 | |
| Protein folding | 6.35 | |
| rRNA binding | 2.26 | |
| Elongation factor | 2.18 | |
| Regulation of translation | 1.63 | |
| Heat shock protein 70 | 1.58 | |
| RNA recognition motif | 1.47 | |
| Protein sorting and trafficking | ||
| Protein transport | 2.39 | |
| Proteolysis | ||
| Proteasome | 1.62 | |
| Ubiquitin | 1.47 | |
| Ubiquitin-mediated proteolysis | 1.31 | |
| ATP metabolism | ||
| Oxidative phosphorylation | 2.69 | |
| Glycolysis | 2.15 | |
| Cytochrome-c oxidase activity | 1.70 | |
| Cytoskeleton | ||
| Regulation of cytoskeleton organization | 2.09 | |
| Actin binding | 1.39 | |
| Cell signaling | ||
| GTP binding | 3.94 | |
|
| 1.73 | |
| Generic | ||
| Cellular homeostasis | 2.52 | |
| Nucleotide binding | 2.15 | |
| Proteasome component region | 1.46 | |
Functional clusters were generated and scored by DAVID and categorized manually. The “generic” functional clusters contain transcripts with no specific or consistent functional theme.