| Literature DB >> 28353203 |
Aleksandra Lipka1,2, Marta Majewska3, Grzegorz Panasiewicz4, Martyna Bieniek-Kobuszewska4, Bozena Szafranska4.
Abstract
The pregnancy-associated glycoprotein-like family (PAG-L) is a large group of chorionic products, expressed in the pre-placental trophoblast and later in the post-implantational chorionic epithelium, and are involved in proper placenta development and embryo-maternal interaction in eutherians. This study describes identification of the PAG-L family in the genome of the Eurasian beaver (Castor fiber L.), named CfPAG-L. We identified 7657 bp of the CfPAG-L gDNA sequence (Acc. No. KX377932), encompassing nine exons (1-9) and eight introns (A-H). The length of the CfPAG-L exons (59-200 bp) was equivalently similar to the only known counterparts of bPAG1, bPAG2, and pPAG2. The length of the CfPAG-L introns ranged 288-1937 bp and was completely different from previously known PAG introns. The exonic CfPAG-L regions revealed 50.3-72.9% homology with equivalent segments of bPAG1 and pPAG2 structure. The intronic CfPAG-L regions alignments revealed a lack of homology. Within the entire CfPAG-L gene, 31 potential single nucleotide variants (SNV: 7 transversions and 24 transitions) were predicted. The identified exonic polymorphic loci did not affect the amino acid sequence of the CfPAG-L polypeptide precursor. This is the first report describing the CfPAG-L gene sequence, structural organization, and SNVs in the Eurasian beaver, one of the largest rodents.Entities:
Keywords: APs; Aspartic proteinases; Beaver; Exon-intron structure; PAGs; SNVs
Mesh:
Substances:
Year: 2017 PMID: 28353203 PMCID: PMC5561160 DOI: 10.1007/s10142-017-0557-9
Source DB: PubMed Journal: Funct Integr Genomics ISSN: 1438-793X Impact factor: 3.410
Specific primers applied for the CfPAG-L amplifications with gDNA templates
| Forward primers | Reverse primers | ||
|---|---|---|---|
| Name | Sequence (5′–3′) | Name | Sequence (5′–3′) |
| ATG_F | ATGAAGTGGATAGTGGTGGCC | ||
| Exon2_F | TTCCTGAAGAMSCACAA | Exon2_R | TCATGGAGAACACTGAAGTC |
| Exon3_F | GCTCCTCCAACCTGTGGGT | Exon3_R | ACCCACAGGTTGGAGGAGCC |
| Exon4_F | ACCTACCACACCGATAAGAAGA | Exon4_R | TCTTCTTATCGGTGTGGTAGGT |
| Exon5_F | GATGGCATCMTGGGSCTGGCCTA | Exon5_R | TAGGCCAGSCCCAKGATGCCATC |
| Exon6_F | GTGTGGACAAGCGCCTGTACAA | Exon6_R | CTCCCCCATCCTTAGAGCCTTC |
| Exon7_F | TTGTKGACACMGGCACCTCTCTG | Exon7_R | CAGAGAGGTGCCKGTGTCMACAA |
| Exon8_F | CTGCCCACACTCGACTTCATCA | Exon8_R | TGATGAAGTCGAGTGTGGGCAG |
| Exon9_F | ACTGCATGGTGGGAATCCAG | Exon9_R | GAAGACATCWCCMAGGATCCAA |
| IntronA_F | GCAGCCCTTGAATGACAGGTAT | IntronA_R | CCTCCTTTTTCAGTATGAGTGCAA |
| IntronB_F | CTTGTGAGCTAACAGCCTGCC | IntronA_R2 | TGGCTCTCACTTACAATCTCCATG |
| IntronD_F | GCAAGGTTGAAGAATGAGCCTAG | IntronA_R3 | TTCCTCCATGACACCAACAAAGAC |
| IntronF_F | CCACAGGTGTCCTCTGCATGA | IntronE_R | TTCGGGTTTCATGTAGAGTGAG |
| IntronG_F | GGGTTAGAGCAAAATTACTGGAAC | 3’utr_R | CCAGAGAAGAGGCACAGATAGA |
Fig. 1Exon-intron structure of the identified genomic CfPAG-L sequence. Abbreviations: SP signal peptide, AS active site sequences coding domains 1 and 2 of the catalytic cleft
Characteristics of exon-intron junctions of the CfPAG-L gDNA sequence
| Donor splice sites | Acceptor splice sites | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| exon | 5′- > 3’ | Phase | Intron | 5′- > 3’ | Intron | 5′- > 3’ | Phase | Exon | 5′- > 3’ |
| 1 | CAATATCCAG | 0 | A |
| A | TGACTTAC | 1 | 2 | TGACTTACAG |
| 2 | TTACATGGAT | 0 | B |
| B | CTGCCCCC | 0 | 3 | GCTACTTACT |
| 3 | AGCGGCTGCT | 1 | C |
| C | CCCTCTGC | 1 | 4 | CCACACATCC |
| 4 | CACTCTGACC | 0 | D |
| D | TCCCCTCC | 1 | 5 | GTCAGAACAT |
| 5 | ACCTTGGCAG | 2 | E |
| E | TCTCATGC | 2 | 6 | GAAAGAAGGC |
| 6 | AAATCGAGGA | 2 | F |
| F | TTCCATTC | 2 | 7 | GTTCCTCGTT |
| 7 | AGGAGAACCT | 2 | G |
| G | CTCGCTGC | 0 | 8 | TATTTTGTGA |
| 8 | CATCATTAGT | 0 | H |
| H | TGTTGAGC | 0 | 9 | GACGATGGCT |
Nucleotide lengths of the exons and introns of the CfPAG-L, bPAG1, bPAG2, and pPAG2 gDNA templates
| Segment name |
|
|
|
|
|---|---|---|---|---|
| Exon 1 | 59 | 53 | 53 | 53 |
| Intron A | 1937 | 1100 | 1300 | 1350 |
| Exon 2 | 160 | 151 | 151 | 166 |
| Intron B | 385 | 1000 | 1000 | 1200 |
| Exon 3 | 118 | 118 | 118 | 118 |
| Intron C | 917 | 100 | 100 | 90 |
| Exon 4 | 119 | 119 | 119 | 119 |
| Intron D | 451 | 1200 | 1200 | 1200 |
| Exon 5 | 200 | 194 | 194 | 200 |
| Intron E | 1138 | 900 | 1100 | 850 |
| Exon 6 | 123 | 117 | 117 | 117 |
| Intron F | 288 | 1900 | 1700 | 1800 |
| Exon 7 | 148 | 142 | 142 | 136 |
| Intron G | 681 | 100 | 100 | 85 |
| Exon 8 | 99 | 99 | 99 | 99 |
| Intron H | 603 | 1700 | 1700 | 292 |
| Exon 9 | 147 | 150 | 150 | 156 |
Homology of the CfPAG-L exons with equivalents of bPAG1 and pPAG2 gDNA
|
|
|
|
|---|---|---|
| Exon 1 | 60.7 | 64.3 |
| Exon 2 | 50.3 | 60.4 |
| Exon 3 | 66.1 | 72.9 |
| Exon 4 | 56.9 | 59.2 |
| Exon 5 | 54.8 | 60.4 |
| Exon 6 | 56.9 | 62.2 |
| Exon 7 | 55.8 | 54.2 |
| Exon 8 | 57 | 57.4 |
| Exon 9 | 53.5 | 56.7 |
Identified SNVs, genotypes and allele frequency (px) within the CfPAG-L sequence
| Type of SNV | Position | Segment | Homozygous | Heterozygous | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| nt | n | px | nt | n | px | nt | n | px | ||||
|
| Transition | 109 | Intron A | gg | 4 | 0.27 | aa | 0 | 0 | ga | 11 | 0.73 |
|
| Transversion | 112 | Intron A | tt | 4 | 0.27 | gg | 0 | 0 | tg | 11 | 0.73 |
|
| Transition | 2634 | Exon 3 | gg | 0 | 0 | aa | 0 | 0 | ga | 15 | 1.0 |
|
| Transition | 2655 | Exon 3 | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |
|
| Transition | 2668 | Intron C | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |
|
| Transition | 2671 | Intron C | gg | 0 | 0 | aa | 0 | 0 | ga | 15 | 1.0 |
|
| Transition | 2676 | Intron C | gg | 0 | 0 | aa | 0 | 0 | ga | 15 | 1.0 |
|
| Transition | 2702 | Intron C | gg | 0 | 0 | aa | 0 | 0 | ga | 15 | 1.0 |
|
| Transition | 2721 | Intron C | gg | 0 | 0 | aa | 0 | 0 | ga | 15 | 1.0 |
|
| Transition | 2738 | Intron C | gg | 0 | 0 | aa | 0 | 0 | ga | 15 | 1.0 |
|
| Transition | 3803 | Intron D | tt | 0 | 0 | cc | 0 | 0 | tc | 15 | 1.0 |
|
| Transition | 3822 | Intron D | gg | 0 | 0 | aa | 0 | 0 | ga | 15 | 1.0 |
|
| Transition | 3890 | Intron D | gg | 0 | 0 | aa | 0 | 0 | ga | 15 | 1.0 |
|
| Transition | 3912 | Intron D | gg | 2 | 0.14 | aa | 0 | 0 | ga | 13 | 0.86 |
|
| Transversion | 3936 | Intron D | gg | 4 | 0.27 | cc | 0 | 0 | gc | 11 | 0.73 |
|
| Transversion | 4098 | Intron D | gg | 0 | 0 | tt | 1 | 0.07 | gt | 14 | 0.93 |
|
| Transversion | 4124 | Intron D | aa | 1 | 0.07 | cc | 1 | 0.07 | ac | 13 | 0.86 |
|
| Transition | 4131 | Intron D | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |
|
| Transition | 4141 | Intron D | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |
|
| Transversion | 4221 | Exon 5 | aa | 0 | 0 | cc | 0 | 0 | ac | 15 | 1.0 |
|
| Transversion | 6101 | Intron G | tt | 0 | 0 | gg | 0 | 0 | tg | 15 | 1.0 |
|
| Transition | 6152 | Intron G | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |
|
| Transition | 6948 | Intron H | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |
|
| Transition | 6981 | Intron H | aa | 0 | 0 | gg | 0 | 0 | ag | 15 | 1.0 |
|
| Transition | 6993 | Intron H | cc | 12 | 0.8 | tt | 0 | 0 | ct | 3 | 0.2 |
|
| Transversion | 7138 | Intron H | aa | 0 | 0 | tt | 0 | 0 | at | 15 | 1.0 |
|
| Transition | 7140 | Intron H | tt | 0 | 0 | cc | 0 | 0 | tc | 15 | 1.0 |
|
| Transition | 7345 | Intron H | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |
|
| Transition | 7406 | Intron H | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |
|
| Transition | 7537 | Exon 9 | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |
|
| Transition | 7549 | Exon 9 | cc | 0 | 0 | tt | 0 | 0 | ct | 15 | 1.0 |