| Literature DB >> 26976328 |
Chih-Ming Hung1, Ai-Yun Yu2, Yu-Ting Lai2, Pei-Jen L Shaner2.
Abstract
Microsatellites have a wide range of applications from behavioral biology, evolution, to agriculture-based breeding programs. The recent progress in the next-generation sequencing technologies and the rapidly increasing number of published genomes may greatly enhance the current applications of microsatellites by turning them from anonymous to informative markers. Here we developed an approach to anchor microsatellite markers of any target species in a genome of a related model species, through which the genomic locations of the markers, along with any functional genes potentially linked to them, can be revealed. We mapped the shotgun sequence reads of a non-model rodent species Apodemus semotus against the genome of a model species, Mus musculus, and presented 24 polymorphic microsatellite markers with detailed background information for A. semotus in this study. The developed markers can be used in other rodent species, especially those that are closely related to A. semotus or M. musculus. Compared to the traditional approaches based on DNA cloning, our approach is likely to yield more loci for the same cost. This study is a timely demonstration of how a research team can efficiently generate informative (neutral or function-associated) microsatellite markers for their study species and unique biological questions.Entities:
Mesh:
Year: 2016 PMID: 26976328 PMCID: PMC4791680 DOI: 10.1038/srep23087
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Characteristics of 24 microsatellite loci genotyped in Apodemus semotus.
| Locus | Ch | Primer sequence | Motif | N | Size | NA | HE | HO | PHWE | Protein coding gene |
|---|---|---|---|---|---|---|---|---|---|---|
| 1A720 | 1 | F-GATAGACATCTCAGTGCCAAAC | (AAAG) | 22 | 335–345 | 4 | 0.354 | 0.318 | 0.641 | |
| R-AGTCCAAAGAGAATCAGAGTTC | ||||||||||
| 2A1340 | 2 | F-TTGGAGAGGCAGAATTAACTTG | (AAAG) | 17 | 296–321 | 7 | 0.713 | 0.529 | 0.025 | |
| R-CATGTAAATGTGAGCAAACCAC | ||||||||||
| 2A2910 | 2 | F-CATCAATTTATCCTCCACCCTC | (AAAC) | 24 | 318–335 | 5 | 0.696 | 0.708 | 0.974 | NA |
| R-ATTTGTAGCTTGGGTTTGTCTC | ||||||||||
| 3A2393 | 3 | F-CCCAGAAACTTAGAAGCTAGTG | (AGAT) | 21 | 244–260 | 8 | 0.828 | 0.667 | 0.172 | |
| R-CATTAGAGTGTCACGGAAGAG | ||||||||||
| 4A2131 | 4 | F-ATTTCCATTCCAGAATCTCCAC | (ACTC) | 23 | 134–156 | 9 | 0.831 | 0.783 | 0.802 | |
| R-TTGTTTAAAGGTGCAAGGTTTG | ||||||||||
| 5A672 | 5 | F-AAAGGTTTACAACTCCATACCC | (AGAT) | 24 | 230–346 | 5 | 0.751 | 0.75 | 0.839 | NA |
| R-GAAGGAGTAAGATGCACAGAAC | ||||||||||
| 5B59 | 5 | F-ATGCTGGTATTGTGTAGGATTG | (AAAG) | 24 | 186–207 | 17 | 0.918 | 0.833 | 0.435 | NA |
| R-TTAGTGTAGAGGAATGAGAGGC | ||||||||||
| 6A496 | 6 | F-GTAAAGTTGTGCAATGTCAGC | (AGAT) | 23 | 437–457 | 6 | 0.82 | 0.913 | 0.941 | |
| R-TATAATGTCCTAGCTCTGTAGG | ||||||||||
| 6A565 | 6 | F-AGTTAATTCAGTGCTTGTTGGG | (AGAT) | 24 | 246–318 | 9 | 0.862 | 0.708 | 0.028 | NA |
| R-ATCTGATCTCCTCTTCTGTCAG | ||||||||||
| 8A401 | 8 | F-TCAACACTTTCGAGGTTTAGTC | (AAAC) | 24 | 352–368 | 4 | 0.574 | 0.5 | 0.725 | |
| R-CTTTGCTTTGATTGTGACCATG | ||||||||||
| 8A1226 | 8 | F-TCATTCCATTTCCAACTCAGAC | (AGAT) | 24 | 418–422 | 2 | 0.422 | 0.417 | 1 | NA |
| R-CTTTGCTTTGATTGTGACCATG | ||||||||||
| 8A472 | 8 | F-AAAGGGAGGAGGAAGAAAGAAC | (ACAT) | 22 | 349–403 | 19 | 0.947 | 0.864 | 0.012 | NA |
| R-CCATTAGCACCATCTCTATTCG | ||||||||||
| 9A1141 | 9 | F-GATCTGGTCTGAGTTGTCTG | (AACT) | 21 | 227–235 | 3 | 0.528 | 0.238 | 0.007 | |
| R-CCATTAGCACCATCTCTATTCG | ||||||||||
| 9B878 | 9 | F-AAGAGACAGTATTGAAAGCATG | (AGAT) | 24 | 407–439 | 9 | 0.87 | 0.75 | 0.422 | |
| R-AGCTGAATTTACTCCAAGCATC | ||||||||||
| 10B1562 | 10 | F-CAGCACTAAACCTAACTACACC | (AGAT) | 22 | 427–428 | 2 | 0.304 | 0.182 | 0.106 | |
| R-AGCTGAATTTACTCCAAGCATC | ||||||||||
| 11A2041 | 11 | F-TCTAAATTCTTGATGCACCTGG | (AATG) | 9 | 348–372 | 7 | 0.81 | 0.667 | 0.1563 | NA |
| R-AGCTGAATTTACTCCAAGCATC | ||||||||||
| 12A1292 | 12 | F-TCATCTATTGATTGATCCACCC | (AGAT) | 20 | 272–315 | 14 | 0.933 | 0.9 | 0.540 | |
| R-CAGATATAGACACGGAGGTAGG | ||||||||||
| 12A1851 | 12 | F-CCACCCTTCCATCTATTCATTC | (ACAT) | 21 | 329–369 | 10 | 0.835 | 0.381 | 0 | |
| R-CAGATATAGACACGGAGGTAGG | ||||||||||
| 14A594 | 14 | F-CCTATGGAAGCTTTGTGAGTTG | (ACAG) | 22 | 445–471 | 12 | 0.915 | 0.909 | 0.927 | |
| R-ATAATTCAACCAAACCGTGTCC | ||||||||||
| 15B141 | 15 | F-CAAGAACAGGAGAAGAGTCAAG | (AAGG) | 21 | 410–419 | 3 | 0.354 | 0.333 | 1 | |
| R-TATATTCAACTGAGTCACTGCC | ||||||||||
| 15B636 | 15 | F-CACAAGTGTAAGGGTATTGG | (AGAT) | 24 | 310–314 | 4 | 0.301 | 0.25 | 0.093 | |
| R-CTAGGGACAATGAACTGACATG | ||||||||||
| 17A410 | 17 | F-ACATATCTAGTTTCAAGCCAGC | (AAAC) | 19 | 172–181 | 7 | 0.858 | 0.842 | 0.410 | |
| R-CAAGTCTCATTGGGTCTATCTG | ||||||||||
| 17A501 | 17 | F-AGAGAATACAATATGGCACTGC | (ACAT) | 18 | 348–371 | 9 | 0.867 | 0.889 | 0.536 | |
| R-CAAGTCTCATTGGGTCTATCTG | ||||||||||
| 18A352 | 18 | F-CCAAATTTAAAGGGAGGCAATG | (AAGG) | 24 | 277–286 | 2 | 0.254 | 0.042 | 0.001 | |
| R-CAAGTCTCATTGGGTCTATCTG |
Ch indicates the chromosome (of Mus musculus) where the locus is located. N, Size, NA, HE and HO indicate sample size of A. semotus, size range of amplified fragments, number of alleles, expected and observed heterozygosity, respectively. PHWE indicates p value of testing for deviation from Hardy-Weinberg equilibrium (HWE), where
*indicates deviation from HWE (p < 0.05). Protein coding gene indicates the closest blasted gene within a distance of 100 kbp from the microsatellite, where NA indicates that no gene is identified in the 200-kbp genomic region because there is no blasting result fitting the criteria (Method; see Supplementary Fig. S1 for relative locations of the microsatellites and corresponding protein coding genes).
Figure 1Distribution of the distance between a microsatellite locus and its nearest exon in the Mus musculus genome (identified using Map Viewer) or between a microsatellite locus and its nearest coding region in Apodemus semotus scaffolds (identified using BLASTX).
Figure 2Distribution of the number of genes located within 100 kbp (upstream and downstream) from a microstatellite locus.
Genes in the Mus musculus genome are identified using Map Viewer and that in Apodemus semotus scaffolds are identified using BLASTX.
Figure 3Numbers of amplified loci.
A total of 59 loci generated from Apodemus semotus scaffolds are tested on 14 rodent species. Darker color indicates higher amplification success rates of loci, which are estimated based on 24 samples for A. semotus and five samples for each of the other 14 species. The cladogram (modified from Fabre et al. 2012) on the left indicates the phylogenetic relationship among these species.