| Literature DB >> 30347774 |
Bhuwan Khatri1, Ashley M Hayden2, Nicholas B Anthony3, Byungwhi C Kong4.
Abstract
Arkansas Regressor (AR) chickens, unlike Arkansas Progressor (AP) chickens, regress tumors induced by the v-src oncogene. To better understand the genetic factors responsible for this tumor regression property, whole genome resequencing was conducted using Illumina Hi-Seq 2 × 100 bp paired-end read method (San Diego, CA, USA) with AR (confirmed tumor regression property) and AP chickens. Sequence reads were aligned to the chicken reference genome (galgal5) and produced coverage of 11× and 14× in AR and AP, respectively. A total of 7.1 and 7.3 million single nucleotide polymorphisms (SNPs) were present in AR and AP genomes, respectively. Through a series of filtration processes, a total of 12,242 SNPs were identified in AR chickens that were associated with non-synonymous, frameshift, nonsense, no-start and no-stop mutations. Further filtering of SNPs based on read depth ≥ 10, SNP% ≥ 0.75, and non-synonymous mutations identified 63 reliable marker SNPs which were chosen for gene network analysis. The network analysis revealed that the candidate genes identified in AR chickens play roles in networks centered to ubiquitin C (UBC), phosphoinositide 3-kinases (PI3K), and nuclear factor kappa B (NF-kB) complexes suggesting that the tumor regression property in AR chickens might be associated with ubiquitylation, PI3K, and NF-kB signaling pathways. This study provides an insight into genetic factors that could be responsible for the tumor regression property.Entities:
Keywords: SNPs; chicken; resequencing; tumor regression; v-src
Year: 2018 PMID: 30347774 PMCID: PMC6210987 DOI: 10.3390/genes9100512
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Primers used for allele-specific polymerase chain reaction (PCR).
| Gene | Primer Name | Oligo Sequence (5’ → 3’) | Annealing Temperature (°C) |
|---|---|---|---|
|
| BMX-F1 | GAACTTACATACAGATCGTC | 55 |
| BMX-F3 | GAACTTACATACAGATCGTT | 55 | |
| BMX-R | CTTCCAACCCAAGCCATTAC | 55 | |
|
| FAM208B-F1 | CCACTCCTTGGTGGAGTATT | 55 |
| FAM208B-F3 | CCACTCCTTGGTGGAGTATC | 55 | |
| FAM208B-R | AGAAAGATGAGGATCGTGCG | 55 | |
|
| IFT140-F1 | AAATCCATCAAGTTGATTAA | 55 |
| IFT140-F3 | AAATCCATCAAGTTGATTAG | 55 | |
| IFT140-R | TCTTTCTGAGAACGAAAGGG | 55 | |
|
| IGSF-F1 | CAATGGGACTGTGCTGAGTC | 63 |
| IGSF-F2 | CAATGGGACTGTGCTGAGTT | 63 | |
| IGSF-R | TCTCAGGCAGAGGTGATGAT | 63 | |
|
| LAMB4-F1 | TCTCTTATTTGCGTTCAATT | 55 |
| LAMB4-F3 | TCTCTTATTTGCGTTCAATC | 55 | |
| LAMB4-R | TTGCAGATGAGAGTGTGCCT | 55 | |
|
| PIK3R4-F1 | ACTAGGGTGAGATGTTTAAT | 55 |
| PIK3R4-F3 | ACTAGGGTGAGATGTTTAAC | 55 | |
| PIK3R4-R | GGGGATCATCAGAAGTCTGT | 55 | |
|
| THADA-F1 | ACAAACCATGCTGGCATACT | 63 |
| THADA-F3 | ACAAACCATGCTGGCATACC | 63 | |
| THADA-R | CAGGACATGCTAACCTCTGT | 63 | |
|
| TOPAZ1-F1 | AAGCTCTGGTAGGCTACGGG | 63 |
| TOPAZ1-F3 | AAGCTCTGGTAGGCTACGGT | 63 | |
| TOPAZ1-R | CAGGCCAGAATACTGCATCT | 63 |
Data from HiSeq and sequence assembly.
| Line | # of Reads | # of Reads Aligned | # of Reads Not Aligned | Coverage | Total # of SNP |
|---|---|---|---|---|---|
| AP | 69,221,284 | 55,224,050 | 10,903,306 | 14× | 7,372,778 |
| AR | 55,368,344 | 44,328,649 | 8,551,173 | 11× | 7,173,788 |
AP: Arkansas Progressor; AR: Arkansas Regressor.
Figure 1Distribution of single nucleotide polymorphisms (SNPs) in different regions of genome (A) and SNPs associated with different types of mutations (B). CDS: Coding DNA sequences; mRNA: Messenger RNA; ncRNA: Non-coding RNA; tRNA: Transfer RNA.
The 63 reliable marker SNPs that induced amino acid changes showing ≥ 10 read depths.
| Contig ID | C | Ref Pos | Ref Base | Called Base | Impact | SNP% | Feature Name | DNA Change | AA Change | Depth | A Cnt | C Cnt | G Cnt | T Cnt | Del |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NC_006088 | 1 | 1034107 | T | C | N-syn | 1 | c.4777A | p.K1593E | 13 | 0 | 13 | 0 | - | 0 | |
| NC_006088 | 1 | 14729572 | T | C | N-syn | 0.8 |
| c.4640A | p.K1547R | 10 | 0 | 8 | 0 | - | 0 |
| NC_006088 | 1 | 64181733 | G | T | N-syn | 1 |
| c.1585G | p.A529S | 10 | 0 | 0 | - | 10 | 0 |
| NC_006088 | 1 | 65873551 | C | G | N-syn | 0.9 |
| c.960G | p.R320S | 11 | 0 | - | 10 | 0 | 0 |
| NC_006088 | 1 | 104629701 | G | A | N-syn | 1 |
| c.1099G | p.A367T | 11 | 11 | 0 | - | 0 | 0 |
| NC_006088 | 1 | 112303547 | A | C | N-syn | 0.8 |
| c.2209A | p.I737L | 10 | - | 8 | 0 | 0 | 0 |
| NC_006088 | 1 | 121349181 | C | T|C | N-syn | 0.75 |
| c.[1549G | p.D517N, | 12 | 0 | - | 0 | 9 | 0 |
| +[1549G | p.D517D | ||||||||||||||
| NC_006088 | 1 | 127837266 | G | T | N-syn | 0.89 |
| c.449G | p.W150L | 10 | 0 | 0 | - | 9 | 0 |
| NC_006088 | 1 | 179358329 | A | T|A | N-syn | 0.76 |
| c.[1849T | p.Y617N, | 13 | - | 0 | 0 | 10 | 0 |
| +[1849T | p.Y617Y | ||||||||||||||
| NC_006088 | 1 | 185190031 | T | C | N-syn | 0.8 |
| c.983A | p.Q328R | 10 | 0 | 8 | 0 | - | 0 |
| NC_006088 | 1 | 193338454 | A | T | N-syn | 1 |
| c.590T | p.L197Q | 13 | - | 0 | 0 | 13 | 0 |
| NC_006089 | 2 | 19385973 | C | T | N-syn | 1 |
| c.3400G | p.V1134I | 10 | 0 | - | 0 | 10 | 0 |
| NC_006089 | 2 | 41279450 | G | T | N-syn | 1 |
| c.2924C | p.T975N | 10 | 0 | 0 | - | 10 | 0 |
| NC_006089 | 2 | 42201510 | T | T|C | N-syn | 0.75 |
| c.[1937A | p.D646D, | 12 | 0 | 9 | 0 | - | 0 |
| +[1937A | p.D646G | ||||||||||||||
| NC_006089 | 2 | 63518627 | T | C | N-syn | 0.89 |
| c.137T | p.I46T | 10 | 0 | 9 | 0 | - | 0 |
| NC_006089 | 2 | 96924595 | G | A | N-syn | 1 |
| c.934G | p.A312T | 10 | 10 | 0 | - | 0 | 0 |
| NC_006089 | 2 | 1.06E+08 | G | C | N-syn | 0.9 |
| c.17C | p.A6G | 11 | 1 | 10 | - | 0 | 0 |
| NC_006089 | 2 | 1.21E+08 | T | C | N-syn | 0.91 |
| c.194T | p.L65P | 82 | 0 | 75 | 0 | - | 0 |
| NC_006090 | 3 | 24479109 | T | C | N-syn | 1 |
| c.859A | p.S287G | 10 | 0 | 10 | 0 | - | 0 |
| NC_006090 | 3 | 24515241 | T | C | N-syn | 1 |
| c.1522T | p.F508L | 10 | 0 | 10 | 0 | - | 0 |
| NC_006090 | 3 | 37785831 | A | G | N-syn | 1 |
| c.1321A | p.I441V | 10 | - | 0 | 10 | 0 | 0 |
| NC_006090 | 3 | 37785856 | C | T | N-syn | 1 |
| c.1346C | p.T449I | 10 | 0 | - | 0 | 10 | 0 |
| NC_006090 | 3 | 38965704 | G | T | N-syn | 1 |
| c.1027C | p.L343I | 10 | 0 | 0 | - | 10 | 0 |
| NC_006090 | 3 | 105934067 | C | G | N-syn | 0.85 |
| c.6146G | p.R2049T | 14 | 0 | - | 12 | 0 | 0 |
| NC_006091 | 4 | 120829955 | C | T | N-syn | 0.8 |
| c.62G | p.R21Q | 10 | 0 | - | 0 | 8 | 0 |
| NC_006091 | 4 | 31100937 | A | C | N-syn | 0.9 |
| c.903T | p.D301E | 11 | - | 10 | 0 | 0 | 0 |
| NC_006091 | 4 | 56709491 | G | G|A | N-syn | 0.76 |
| c.[631G | p.D211N, | 13 | 10 | 0 | - | 0 | 0 |
| +[631G | p.D211D | ||||||||||||||
| NC_006091 | 4 | 60221486 | C | T | N-syn | 1 |
| c.934C | p.P312S | 12 | 0 | - | 0 | 12 | 0 |
| NC_006092 | 5 | 24297188 | T | C | N-syn | 1 |
| c.7214T | p.M2405T | 13 | 0 | 13 | 0 | - | 0 |
| NC_006092 | 5 | 45169641 | T | C | N-syn | 1 |
| c.2191A | p.I731V | 10 | 0 | 10 | 0 | - | 0 |
| NC_006092 | 5 | 57915337 | A | G | N-syn | 0.86 |
| c.230A | p.D77G | 15 | - | 0 | 13 | 0 | 0 |
| NC_006093 | 6 | 130323 | G | T | N-syn | 1 |
| c.1446G | p.E482D | 10 | 0 | 0 | - | 10 | 0 |
| NC_006093 | 6 | 23801282 | T | C | N-syn | 0.81 |
| c.1528A | p.I510V | 11 | 0 | 9 | 0 | - | 0 |
| NC_006094 | 7 | 4260582 | C | T | N-syn | 0.8 |
| c.2110G | p.A704T | 10 | 0 | - | 1 | 8 | 0 |
| NC_006094 | 7 | 4717753 | G | A | N-syn | 0.8 |
| c.601G | p.V201I | 10 | 8 | 0 | - | 0 | 0 |
| NC_006094 | 7 | 4717765 | G | A | N-syn | 0.9 |
| c.613G | p.V205I | 11 | 10 | 0 | - | 0 | 0 |
| NC_006094 | 7 | 14344085 | A | G | N-syn | 1 |
| c.131A | p.H44R | 13 | - | 0 | 13 | 0 | 0 |
| NC_006094 | 7 | 15047308 | A | G | N-syn | 1 |
| c.1253A | p.Y418C | 10 | - | 0 | 10 | 0 | 0 |
| NC_006094 | 7 | 15047322 | A | G | N-syn | 1 |
| c.1267A | p.K423E | 10 | - | 0 | 10 | 0 | 0 |
| NC_006096 | 9 | 713606 | G | A | N-syn | 1 |
| c.151G | p.V51M | 13 | 13 | 0 | - | 0 | 0 |
| NC_006096 | 9 | 20652272 | T | C | N-syn | 1 |
| c.2905T | p.S969P | 11 | 0 | 11 | 0 | - | 0 |
| NC_006096 | 9 | 21722478 | G | A | N-syn | 0.83 |
| c.1352G | p.G451D | 12 | 10 | 0 | - | 0 | 0 |
| NC_006096 | 9 | 21722484 | C | T | N-syn | 0.8 |
| c.1358C | p.P453L | 10 | 0 | - | 0 | 8 | 0 |
| NC_006096 | 9 | 22653344 | T | G | N-syn | 0.89 |
| c.333T | p.Y111. | 10 | 0 | 0 | 9 | - | 1 |
| NC_006097 | 10 | 11164703 | C | T | N-syn | 1 |
| c.287G | p.R96K | 11 | 0 | - | 0 | 11 | 0 |
| NC_006098 | 11 | 9564860 | G | A | N-syn | 1 |
| c.2206G | p.G736R | 10 | 10 | 0 | - | 0 | 0 |
| NC_006099 | 12 | 5208897 | A | G | N-syn | 1 |
| c.3511A | p.K1171E | 10 | - | 0 | 10 | 0 | 0 |
| NC_006100 | 13 | 14403319 | C | G | N-syn | 0.9 |
| c.260G | p.G87A | 11 | 0 | - | 10 | 0 | 0 |
| NC_006101 | 14 | 1.2E+07 | T | C | N-syn | 1 |
| c.2794A | p.N932D | 10 | 0 | 10 | 0 | - | 0 |
| NC_006101 | 14 | 13770765 | A | G | N-syn | 0.89 |
| c.2372A | p.K791R | 10 | - | 0 | 9 | 0 | 0 |
| NC_006102 | 15 | 10822742 | T | C | N-syn | 0.89 |
| c.1492A | p.S498G | 10 | 1 | 9 | 0 | - | 0 |
| NC_006104 | 17 | 6833124 | C | T | N-syn | 0.83 |
| c.368G | p.R123H | 12 | 0 | - | 0 | 10 | 0 |
| NC_006104 | 17 | 7711131 | A | G | N-syn | 1 |
| c.637A | p.I213V | 11 | - | 0 | 11 | 0 | 0 |
| NC_006106 | 19 | 5656736 | C | T | N-syn | 0.8 |
| c.1174G | p.G392S | 10 | 0 | - | 0 | 8 | 0 |
| NC_006107 | 20 | 43121 | C | T | N-syn | 1 |
| c.412C | p.R138C | 10 | 0 | - | 0 | 10 | 0 |
| NC_006107 | 20 | 478926 | C | T | N-syn | 0.8 |
| c.1472C | p.S491F | 10 | 0 | - | 0 | 8 | 0 |
| NC_006107 | 20 | 4706188 | G | T | N-syn | 0.89 |
| c.619G | p.A207S | 10 | 0 | 0 | - | 9 | 0 |
| NC_006108 | 21 | 1906952 | T | C | N-syn | 1 |
| c.2T | p.M1T | 10 | 0 | 10 | 0 | - | 0 |
| NC_006108 | 21 | 1906991 | G | T | N-syn | 1 |
| c.41G | p.C14F | 11 | 0 | 0 | - | 11 | 0 |
| NC_006108 | 21 | 2451062 | T | G | N-syn | 0.89 |
| c.1912A | p.I638L | 10 | 0 | 0 | 9 | - | 0 |
| NC_006108 | 21 | 3241081 | C | T | N-syn | 1 |
| c.1246G | p.A416T | 11 | 0 | - | 0 | 11 | 0 |
| NC_006108 | 21 | 4662973 | C | A | N-syn | 0.89 |
| c.1070G | p.S357I | 10 | 9 | - | 0 | 0 | 0 |
| NC_006111 | 24 | 130579 | C | T|C | N-syn | 0.81 |
| c.[1196C | p.P399P, | 16 | 0 | - | 0 | 13 | 0 |
| +[1196C | p.P399L |
Contig ID, chromosome (C) numbers, reference position (Ref Pos), reference base (Ref Base), called (SNP) base, impact (kinds of protein mutation), SNP%, feature name (gene name), DNA change, amino acid (AA) change, Depths, and five columns for SNP counts (Cnts) are indicated.
Figure 2Nine candidate markers and different genotypes shown by allele-specific PCR in a larger population of AR and AP chicken lines. RJF: Red Jungle Fowl.
Validation of SNPs using allele-specific PCR in 96 AR and 96 AP line chickens.
| Chr | Ref Pos | Genes | Ref Base | Called Base | Impact | SNP% | Amino Acid Change | Results of 96 Birds Each from AP and AR Lines | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Homozygous RJF | Heterozygote | Homozygous SNP | |||||||||||
| AR | AP | AR | AP | AR | AP | ||||||||
| 1 | 193338454 |
| A | T | N-syn | 1 | p.L197Q | 5 | 93 | 23 | 3 | 68 | 0 |
| 1 | 121349181 |
| C | T|C | N-syn | 0.75 | p.D517N, p.D517D | 16 | 96 | 26 | 0 | 54 | 0 |
| 1 | 1034107 |
| T | C | N-syn | 1 | p.K1593E | 0 | 94 | 0 | 1 | 96 | 1 |
| 14 | 13770765 |
| A | G | N-syn | 0.89 | p.K791R | 9 | 95 | 44 | 1 | 43 | 0 |
| 20 | 43121 |
| C | T | N-syn | 1 | p.R138C | 0 | 63 | 0 | 28 | 96 | 5 |
| 1 | 14729572 |
| T | C | N-syn | 0.8 | p.K1547R | 39 | 84 | 15 | 2 | 42 | 10 |
| 2 | 42201510 |
| T | T|C | N-syn | 0.75 | p.D646D, p.D646G | 16 | 71 | 4 | 16 | 76 | 9 |
| 3 | 24479109 |
| T | C | N-syn | 1 | p.S287G | 0 | 64 | 0 | 30 | 96 | 2 |
| 2 | 41279450 |
| G | T | N-syn | 1 | p.T975N | 0 | 16 | 0 | 78 | 96 | 2 |
Gene name and functions of genes containing amino acid changes showing over 10 depth counts in AR chickens.
| ID | Entrez Gene Name | Location | Type(s) |
|---|---|---|---|
|
| ATP-binding cassette, sub-family A (ABC1), member 12 | Plasma Membrane | Transporter |
|
| arylsulfatase D | Cytoplasm | Enzyme |
|
| B-cell scaffold protein with ankyrin repeats 1 | Extracellular Space | Other |
|
| Bone Marrow on X chromosome (BMX) non-receptor tyrosine kinase | Cytoplasm | Kinase |
|
| Coiled-coil domain containing 157 | Other | Other |
|
| coiled-coil domain containing 67 | Other | Other |
|
| DEAD (Asp-Glu-Ala-Asp) box polypeptide 10 | Nucleus | Enzyme |
|
| DEAH (Asp-Glu-Ala-His) box polypeptide 36 | Cytoplasm | Enzyme |
|
| disrupted in schizophrenia 1 | Cytoplasm | Other |
|
| Endoplasmic reticulum (ER) membrane protein complex subunit 1 | Plasma Membrane | Other |
|
| Family with sequence similarity 13, member C | Other | Other |
|
| Family with sequence similarity 154, member B | Other | Other |
|
| Family with sequence similarity 208, member B | Other | Other |
|
| Fasciculation and elongation protein zeta 1 (zygin I) | Cytoplasm | Other |
|
| Globoside alpha-1,3- | Other | Enzyme |
|
| Gamma-glutamyltransferase 7 | Plasma Membrane | Enzyme |
|
| GTPase, very large interferon inducible pseudogene 1 | Other | Other |
|
| Interferon (α, β and Ω) receptor 2 | Plasma Membrane | transmembrane receptor |
|
| Intraflagellar transport 140 | Extracellular Space | Other |
|
| Immunoglobulin superfamily, member 1 | Plasma Membrane | Other |
|
| Inositol 1,4,5-trisphosphate receptor interacting protein | Extracellular Space | Other |
|
| KIAA0556 | Extracellular Space | Other |
|
| Laminin, beta 4 | Other | Other |
|
| Low density lipoprotein receptor-related protein 3 | Plasma Membrane | transmembrane receptor |
|
| MYC associated factor X (MAX) dimerization protein | Nucleus | transcription regulator |
|
| Phosphatidylinositol-4-phosphate 3-kinase, catalytic subunit type 2 γ | Cytoplasm | Kinase |
|
| Phosphoinositide-3-kinase, regulatory subunit 4 | Cytoplasm | Kinase |
|
| Pleckstrin homology domain containing, family H (with MyTH4 domain) member 2 | Cytoplasm | Other |
|
| Protein phosphatase 1, regulatory subunit 26 | Nucleus | Other |
|
| Protein tyrosine phosphatase, non-receptor type 2 | Cytoplasm | Phosphatase |
|
| RAB17, member RAS oncogene family | Cytoplasm | Enzyme |
|
| Retinitis pigmentosa GTPase regulator | Cytoplasm | Other |
|
| S-antigen; retina and pineal gland (arrestin) | Cytoplasm | Other |
|
| Sodium channel, non-voltage-gated 1, δ subunit | Plasma Membrane | ion channel |
|
| Sucrase-isomaltase (α-glucosidase) | Cytoplasm | Enzyme |
|
| Solute carrier family 2 (facilitated glucose/fructose transporter), member 5 | Plasma Membrane | Transporter |
|
| Solute carrier family 46 (folate transporter), member 1 | Plasma Membrane | Transporter |
|
| Trans-activation response (TAR) (HIV-1) RNA binding protein 1 | Nucleus | transcription regulator |
|
| Thyroid adenoma associated | Other | Other |
|
| Transmembrane protein 52 | Other | Other |
|
| Thioredoxin-related transmembrane protein 1 | Cytoplasm | Enzyme |
|
| Testis and ovary specific Piwi Argonaut and Zwille (PAZ) domain containing 1 | Other | Other |
|
| Tetratricopeptide repeat domain 29 | Other | Other |
|
| Zinc finger protein 385B | Nucleus | Other |
|
| Uncharacterized | N/A | N/A |
|
| Uncharacterized | N/A | N/A |
|
| Uncharacterized | N/A | N/A |
|
| Uncharacterized | N/A | N/A |
|
| Uncharacterized | N/A | N/A |
|
| Uncharacterized | N/A | N/A |
|
| Uncharacterized | N/A | N/A |
|
| Uncharacterized | N/A | N/A |
|
| Uncharacterized | N/A | N/A |
|
| Uncharacterized | N/A | N/A |
Associated network functions of candidate genes.
| ID | Molecules in Network | Score | Top Diseases and Functions |
|---|---|---|---|
| 1 |
| 30 | Developmental Disorder, Hereditary Disorder, Metabolic Disease |
|
| |||
| 2 | 22 | Cell Death and Survival, Hematological System Development and Function, Humoral Immune Response | |
|
| |||
| 3 | 22 | Developmental Disorder, Gastrointestinal Disease, Hereditary Disorder |
Figure 3Gene network #1. Molecular interactions among the important focus molecules are displayed. Gray symbols show the genes found in the list of SNP while white symbols indicate neighboring genes that are functionally associated, but not included, in the gene list of SNP. Symbols for each molecule are presented according to molecular functions and type of interactions.
Figure 4Gene network #2. Molecular interaction and symbols are the same as described in Figure 3.
Figure 5Gene network #3. Molecular interaction and symbols are the same as described in Figure 3.