| Literature DB >> 32617122 |
Janet E Fulton1, David W Burt2,3, Andrew S Mason2,4, Ashlee R Lund1, Paul M Hocking2.
Abstract
BACKGROUND: Endogenous retroviruses (ERVs) are the remnants of retroviral infections which can elicit prolonged genomic and immunological stress on their host organism. In chickens, endogenous Avian Leukosis Virus subgroup E (ALVE) expression has been associated with reductions in muscle growth rate and egg production, as well as providing the potential for novel recombinant viruses. However, ALVEs can remain in commercial stock due to their incomplete identification and association with desirable traits, such as ALVE21 and slow feathering. The availability of whole genome sequencing (WGS) data facilitates high-throughput identification and characterisation of these retroviral remnants.Entities:
Keywords: ALVE; Avian Leukosis virus; Chicken; Endogenous retrovirus; Ev gene; obsERVer
Year: 2020 PMID: 32617122 PMCID: PMC7325683 DOI: 10.1186/s13100-020-00216-w
Source DB: PubMed Journal: Mob DNA
Fig. 1KASP assay primer design rationale for ALVE integration sites. Primer 1 (wildtype) and primer 2 (ALVE) are fluorophore-labelled primers and their amplification enables genotyping direct from solution. The starting sequence for the genotype-specific primers is often the same, but they differ when they cross the target site duplication, with primer 1 continuing through the host genome sequence and primer 2 entering the ALVE insertion. Rounds of elongation are short, so amplification between primer 2 and 1r would be unlikely, even with short insertions. Such short amplification required a four-primer approach rather than the single reverse primer typical in SNP genotyping KASP assays
ALVEs of the Hy-Line elite layer lines
| Name | Location | TSD | Gene | Length | WL1 | WL2 | WL3 | WL4 | WL5 | WPR1 | WPR2 | RIR |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ALVEB5 | 1:10,637,460 | GGTGGT | 7530 (F) | AD | ✓ | ✓ | ||||||
| ALVE1 | 1:65,993,542 | ACGGTT | SOX5 int1 | 7530 (F) | ✓ | ✓ | ✓ | ✓ | ✓ | |||
ALVE_ros001 (COTW55) | 1:101,668,931 | GTTGTG | 7531 (F) | ✓ | ||||||||
ALVE_ros002 (COTW69) | 1:158,775,708 | ATAAGT | – | ✓ | ||||||||
ALVE_ros003 (SGT-24) | 1:163,248,553 | CCTACT | 7528 (F) | ✓ | ||||||||
| ALVE-TYR | 1:187,921,213 | ACACTG | TYR int4 | 7534 (F) | ✓ | ✓ | ||||||
| ALVE-NSAC1 | 2:120,868,843 | CCTGTT | 4838 (P-E-3 L) | ✓ | ✓ | ✓ | ||||||
| ALVE_ros004 | 2:124,432,997 | CTTGAC | 7530 (F) | NSI | NSI | ✓ | ||||||
ALVE_ros005 (New11) | 2:142,480,536 | TTGATA | 280 (SL) | NSI | ✓ | |||||||
| ALVE-NSAC3 | 3:53,639,776 | ATAAAA | – | ✓ | ✓ | |||||||
ALVE_ros006 (N4) | 3:57,337,987 | GGACTC | – | ✓ | ||||||||
| ALVE15 | 3:70,384,294 | GTTTAT | GRIK2 int16 | 280 (SL) | ✓ | ✓ | ✓ | |||||
| ALVE_ros007 | 4:59,843,015 | AATAGA | 1400 (E-3 L) | ✓ | ||||||||
ALVE_ros008 (BK-59) | 4:62,680,158 | CTGTAG | 7529 (F) | ✓ | ||||||||
| ALVE_ros009 | 4:71,095,932 | GTCCAG | – | ✓ | ||||||||
| ALVE9 | 6:33,153,441 | CTCAAA | DOCK1 int35 | 5077 (P-E-3 L) | ✓ | |||||||
| ALVE-NSAC7 | 9:11,714,130 | CTTCTC | 7531 (F) | ✓ | ✓ | |||||||
| ALVE_ros010 | 9:11,871,576 | TCGGAT | – | NSI | ✓ | |||||||
| ALVE3 | 20:10,309,347 | AACCAC | HCK int6 | 5848 (F, RT-) | ✓ | ✓ | ✓ | AD | ||||
| ALVE21 | Z:10,681,671 | GGGTAG | 7529 (F) | ✓ | ✓ | ✓ |
Identified ALVEs are shown with Galgal5 location, target site duplication (TSD), overlap with annotated gene, sequence integrity, and presence in each of the eight analysed lines. Ambiguous prior names are shown. ALVE integrity is shown under length where: F = full, P = polymerase, E = envelope, 3 L = 3’LTR, RT- = missing reverse transcriptase, SL = solo LTR. Five ALVEs were not sequenced. ALVE detection: tick indicates detection by obsERVer; AD shows allelic dropout in the sequencing data, NSI shows that ALVE was present in the line, but not in the sequenced individuals.
Fig. 2High-throughput genotyping of ALVE integrations by KASP assay. Genotype calls are based on relative intensity of the fluorescent tags conjugated to the wildtype and ALVE integration specific primers, with each dot representing a single chicken DNA sample. Red points (top left) represent individuals homozygous for the ALVE integration, blue points (bottom right) for individuals homozygous for the wildtype allele, and green points (top right) are heterozygotes. Black points are sample negative controls and pink points are ambiguous samples either due to their location outside genotype clusters, or their relative fluorescence positions at earlier cycle stages. Three plots are shown as examples, with all twenty in Fig. S4. ALVE3 shows high numbers of individuals in all three genotypes. ALVE-TYR is fixed in both WPR lines and absent in all WLs and the RIR (hence no heterozygotes). Homozygous individuals for ALVE_ros003 were rare, so genotype confidence was derived by pooling data from multiple generations (not shown)
Fig. 3ALVE21 genotyping and identification of the WPR K locus revertant (kR). a IGV alignment view of the empty ALVE21 integration site in a FF WL. b IGV alignment view in the SF WPR. Base mismatches are coloured and show the split reads on either side of the TSD. Approximately 50% of reads align through the TSD, supporting the empty ALVE21 integration site in the tandem repeat of the K locus. c IGV alignment view in the FF WPR showing the ALVE21 integration, but with no reads aligning across the TSD. d ALVE21 KASP assay. All FF WPR individuals appear homozygous for ALVE21. All SF individuals appear heterozygous due to the empty site in the tandem repeat. e KASP assay for the unique bridging sequence between the two tandem repeats is only seen in the SF lines and not in any FF, including the FF WPR. Plot values corrected for representation of the internal control in all groups. f Optic maps generated across the K locus (coordinates show Mbp on the Z chromosome) using the Nt.BspQ1 restriction enzyme. The in silico shows predicted Nt.BspQ1 sites (vertical bars). Cases where predicted sites are very close (red circles) cannot be resolved beyond a single site. Predicted site dropout (open circle) may represent a mutation in that Nt.BspQ1 site. The WL-FF optic map matches the in silico exactly. The WPR-FF shows a ~ 7.5 kbp longer optic map representing the integrated ALVE21. Optic map figures were adapted from IrysView
Fig. 4ALVEs as genetic markers. Cladogram constructed based on ALVE presence/absence data for all sixty-five analysed datasets (Table S1; AF1). WLs dominate and cluster tightly together in the top clade. Brown Leghorns cluster with the brown egg layer WPRs, RIWs and RIRs, as well as the heritage broiler datasets, reflecting the broader genetic diversity of these breeds. The non-commercial datasets are highly diverse