| Literature DB >> 21092319 |
Vicki Whan1, Matthew Hobbs, Sean McWilliam, David J Lynn, Ylva Strandberg Lutzow, Mehar Khatkar, William Barendse, Herman Raadsma, Ross L Tellam.
Abstract
BACKGROUND: About forty human diseases are caused by repeat instability mutations. A distinct subset of these diseases is the result of extreme expansions of polymorphic trinucleotide repeats; typically CAG repeats encoding poly-glutamine (poly-Q) tracts in proteins. Polymorphic repeat length variation is also apparent in human poly-Q encoding genes from normal individuals. As these coding sequence repeats are subject to selection in mammals, it has been suggested that normal variations in some of these typically highly conserved genes are implicated in morphological differences between species and phenotypic variations within species. At present, poly-Q encoding genes in non-human mammalian species are poorly documented, as are their functions and propensities for polymorphic variation.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21092319 PMCID: PMC3014979 DOI: 10.1186/1471-2164-11-654
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Characteristics of bovine poly-Q proteins. Comparison of the sizes of Q-tracts in bovine and human poly-Q orthologs.
Figure 2Representative PCR amplicons for bovine poly-Q encoding genes. Amplicons for a number of bovine poly-Q encoding genes were produced by PCR using genomic DNA derived from a small cattle diversity panel consisting of 8 individuals. Panels (a)-(d) highlight polymorphic genes, while panels (e) and (f) show examples of genes that were monomorphic. Each lane represents one unrelated individual. The lane order representing the 8 animals is the same for each gene. Amplicons were visualised on 3% agarose gels stained with ethidium bromide. More than one poly-Q encoding tract within a gene was signified by extension to the gene symbol (e.g. MLL3-Q3).
Polymorphic variation in bovine poly-Q genes
| Gene Symbol | |||||
|---|---|---|---|---|---|
| BTA23: c28,330,331-28,317,176 | G1A1G9A1 | 3 | NS | NS | |
| BTA26: 23,731,487-23,787,266 | G8 | 1 | - | - | |
| BTA7: 10,407,726-10,660,224 | G6 | 3 | < 0.05 | < 0.01 | |
| BTA25: c3,741,183-3,622,474 | G5A1G3A1G5 | 4 | < 0.001 | NS | |
| BTA10: 83,270,819-83,301,240 | G9A1 | 3 | < 0.0001 | NS | |
| BTA12: c80,851,585-81,459,510 | G10 | 6 | < 0.0001 | NS | |
| BTA12:24,343,037-24,382,185 | G1A1G6 | 3 | < 0.0001 | NS | |
| BTA15: c45,697,647-45,662,984 | G10A1 | 18 | < 0.0001 | < 0.001 | |
| BTA6: 120,080,853-120,204,565 | G15 | 9 | < 0.0001 | <0.01 | |
| BTA25: 38,082,422-38,093,009 | G7 | 3 | < 0.0001 | NS | |
| BTAX: c49,328,169-49,305,901 | G6A1G2A1G1A | 5 | < 0.01 | NS | |
| 1G13A1G1N4G5 | |||||
| BTA17: c75,807,030-75,798,191 | G6NG1NG3N3G | 3 | < 0.001 | NS | |
| 10NG1NG2N3G | |||||
| 3NG1N3G5NG4 | |||||
| BTA21: c5,782,758-5,605,728 | G5 | 1 | - | - | |
| BTA4: c119,034,986-118,841,623 | G5A1 | 4 | < 0.0001 | NS | |
| BTA19: 34,217,888-34,332,169 | G10 | 2 | < 0.0001 | NS | |
| BTA23: 19,237,338-19,466,747 | G3A2G3A1G5A | 1 | - | - | |
| 1G3A1G2 | |||||
| BTA3: c72,520,937-72,308,057 | G7 | 3 | < 0.0001 | NS | |
| BTAUN: WGA3026_4: 46,413-59,300 | G5A1G13 | 4 | < 0.0001 | NS | |
| BTA18: 34,372,276-34,373,950 | G5A1G4A1G1A | 5 | < 0.05 | NS | |
| 1G5 | |||||
| BTAX: c51,674,166-51,881,948 | G3A1G4; | 1 | - | - | |
| G3A1G4 | 1 | - | - | ||
| BTA23: 15,621,415-15,645,446 | G3; | 1 | - | - | |
| G2 | 1 | - | - | ||
| BTA6: 88,472,908-88,481,760 | A1G4 | 1 | - | - |
1BTAU4.0 bovine genome assembly (UCSC genome browser). BTA, chromosome number; UN, chromosome unknown; c, complementary strand.
2Length and purity of repeat triplet codon. G, A and N followed by a number represent CAG, CAA and other triplet codon tracts, respectively, that were present in the BTAU 4.0 bovine genome assembly. Multiple closely linked Q-tracts are shown as a single entity connected by Ns. Multiple discontinuous Q-tracts within a gene that were assayed are separated by a semi-colon. The Q-tract for NFYA traversed a splice site and is represented by two genomic regions each of which was genotyped.
3These genes were not ranked highly for potential polymorphic variation using the hierarchical prioritisation criteria.
4Probability of difference in allele frequencies between Bos taurus taurus and Bos taurus indicus animals (Fisher exact test [25]). NS, not significant.
5Probability of difference in allele frequencies between Bos taurus taurus beef and dairy breeds (Fisher exact test [25]). NS, not significant.
6Bolded gene symbols have been linked with human diseases caused by poly-Q tract expansions or mutations [53].
Figure 3Relationship between allelic diversity and the number of CAG-encoding glutamines within a repeat. The number of alleles identified for each polymorphic bovine gene in the large cattle diversity panel was plotted against the number of poly-Q encoding CAG repeats present within the largest allele of each gene.
Figure 4Interactions of poly-Q proteins. The Innate DB database was used to identify networks of poly-Q proteins that have been documented to physically interact with each other and relationships between the same group of proteins and transcriptional regulation of poly-Q encoding genes [28,29]. (a) Interaction network for bovine poly-Q proteins. Black lines signify documented physical interactions while red lines refer to transcriptional regulation. Singletons representing either self-interactions or cis-acting transcriptional regulation are not shown. The cell is partitioned into cell surface, cytoplasm and nucleus. Protein-gene linkages (red) are shown at the bottom of the diagram. (b) Interaction network for the identified polymorphic bovine poly-Q encoding genes.
Figure 5Expression of . (a) Diagrammatic representation of the NFYA genomic structure in relation to the poly-Q encoding region traversing the NFYA splice site across exons 2 and 3. The PCR amplicons corresponding to the exon 2-intron 2 and intron 2-exon 3 boundary regions when using genomic DNA (gDNA), and the amplicon produced when using cDNA as template are shown as thick black lines. The sequence at each splice boundary is shown with exonic sequence in black and intronic sequence in red. The absolutely conserved intronic sequence elements at the splice sites are underlined. (b) PCR amplicon profiles for four individuals. The left hand and right hand panels show the exon 2 - intron 2 and intron 3 - exon 3 amplicons, respectively, with genomic DNA as template. The central panel shows the amplicons produced using mammary tissue cDNA and primers traversing the splice site junction. Two amplicons encoding 4 or 5 Q were identified. (c) Tissue specific NYFA mRNA splicing across the poly-Q encoding region. Representative results are shown for one of four individuals for four different tissues.