| Literature DB >> 29763462 |
Zihao Yuan1, Tao Zhou1, Lisui Bao1, Shikai Liu1, Huitong Shi1, Yujia Yang1, Dongya Gao1, Rex Dunham1, Geoff Waldbieser2, Zhanjiang Liu3.
Abstract
Channel catfish (Ictalurus punctatus) is a highly adaptive species and has been used as a research model for comparative immunology, physiology, and toxicology among ectothermic vertebrates. It is also economically important for aquaculture. As such, its reference genome was generated and annotated with protein coding genes. However, the repetitive elements in the catfish genome are less well understood. In this study, over 417.8 Megabase (MB) of repetitive elements were identified and characterized in the channel catfish genome. Among them, the DNA/TcMar-Tc1 transposons are the most abundant type, making up ~20% of the total repetitive elements, followed by the microsatellites (14%). The prevalence of repetitive elements, especially the mobile elements, may have provided a driving force for the evolution of the catfish genome. A number of catfish-specific repetitive elements were identified including the previously reported Xba elements whose divergence rate was relatively low, slower than that in untranslated regions of genes but faster than the protein coding sequences, suggesting its evolutionary restrictions.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29763462 PMCID: PMC5953449 DOI: 10.1371/journal.pone.0197371
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The proportion of major categories of repetitive elements within the channel catfish repeatome.
Fig 2The distribution of Tc1/Mariner transposons cross channel catfish genome.
Color key is indicated at the lower right of the figure, with blue color to indicate low and red color to indicate high levels of the transposons in the chromosomal regions. Each color bar represented a physical distance of 1 Mb DNA.
Fig 3The distribution of microsatellites and satellites along the chromosomes of the channel catfish genome.
Color key is indicated at the lower right of the figure, with blue color to indicate low and red color to indicate high levels of the microsatellites and satellites in the chromosomal regions. Each color bar represented a physical distance of 1 Mb DNA.
Fig 4The divergence distribution of channel catfish Xba elements (blue) and DNA/TcMar-Tc1 transposons (pink).
The X-axis represents the average number of substitutions per site (%), and the Y-axis represents the percentage sequences that comprise the whole genome (%).
Fig 5The divergence time of the channel catfish and blue catfish with fossil calibrations (orange nodes) based on the mitochondrial cytochrome b sequences.
The major novel repetitive elements and their characteristics in the channel catfish repeatome.
| Size | AT content | Representative scaffold | Representative scf _start | Representative scf_end |
|---|---|---|---|---|
| 1,417K | 64.1% | IpCoco_scf00172 | 16,372,640 | 16,373,425 |
| 1,316K | 45.2% | lcl|IpCoco_scf00610 | 3,078 | 6,525 |
| 1,081K | 46.5% | IpCoco_scf00517 | 9,296,133 | 9,298,172 |
| 977K | 46.0% | IpCoco_scf00474 | 141,399 | 143,674 |
| 963K | 64.8% | IpCoco_scf00563_1077_652_655 | 1,427,889 | 1,431,117 |
| 924K | 60.8% | IpCoco_scf00369 | 2,537,431 | 2,538,007 |
| 899K | 50.1% | IpCoco_scf00203 | 3,637,116 | 3,638,889 |
| 881K | 64.7% | IpCoco_scf00789 | 5,607 | 11,243 |
| 870K | 64.9% | IpCoco_scf00540 | 509 | 4,548 |
| 842K | 59.5% | IpCoco_scf00570_502_500 | 2,136,706 | 2,137,523 |
| 793K | 44.5% | IpCoco_scf00419 | 7,439,807 | 7,440,803 |
| 690K | 49.2% | IpCoco_scf00021 | 1,307,023 | 1,308,497 |
| 682K | 47.2% | IpCoco_scf00077_78 | 260,739 | 262,006 |
| 630K | 65.4% | IpCoco_scf00799 | 9,869 | 11,749 |
| 589K | 53.3% | IpCoco_scf00354_353 | 643,250 | 644,319 |
| 546K | 60.0% | IpCoco_scf00563_1077_652_655 | 1,372,133 | 1,373,248 |
| 522K | 43.9% | IpCoco_scf00077_78 | 1,199,900 | 1,201,056 |
| 518K | 66.6% | IpCoco_scf00172 | 2,251,598 | 2,252,801 |
The major repetitive protein domains characterized from the channel catfish repeatome.
| Rank | Total Abundance | Gene Ontology |
|---|---|---|
| 1 | 2,990K | Adaptive immunity, Immunity |
| 2 | 1,046K | Adaptive immunity, Immunity |
| 3 | 375K | Cellular morphogenesis. |
| 4 | 353K | Nucleus, cell junction, nucleoplasm |
| 5 | 255K | Integral component of membrane |
| 6 | 232K | Unknown |
| 7 | 214K | Kinase, Transferase |
| 8 | 204K | DNA replication, mismatch repair |
| 9 | 203K | Kinase, Transferase |
| 10 | 174K | Transferase, Ubl conjugation, pathway, zinc binding |
| 11 | 145K | GTP-binding, Nucleotide-binding |
| 12 | 132K | Structural molecule activity |
| 13 | 131K | Osteogenesis, Transcription regulation |
| 14 | 129K | Cell adhesion |
| 15 | 124K | Adaptive immunity, Immunity |
| 16 | 118K | Adaptive immunity, Immunity |
| 17 | 114K | Guanine-nucleotide releasing factor |
| 18 | 109K | Integral component of membrane |
| 19 | 109K | Hydrolase, Ligase, Oxidoreductase, Amino-acid biosynthesis, Histidine biosynthesis, Methionine biosynthesis, One-carbon metabolism, Purine biosynthesis, ATP-binding, NADP, Nucleotide-binding |
| 20 | 108K | Hormone |
| 21 | 108K | Zinc binding, Cytosol, Plasma membrane |
| 22 | 107K | Nucleotide-binding, DNA integration |
| 23 | 107K | Endopeptidase inhibitor, liver development |
| 24 | 106K | ATP binding, protein serine/threonine kinase activity |