| Literature DB >> 17300727 |
Jon Slate1, Matthew C Hale, Timothy R Birkhead.
Abstract
BACKGROUND: Passerines (perching birds) are widely studied across many biological disciplines including ecology, population biology, neurobiology, behavioural ecology and evolutionary biology. However, understanding the molecular basis of relevant traits is hampered by the paucity of passerine genomics tools. Efforts to address this problem are underway, and the zebra finch (Taeniopygia guttata) will be the first passerine to have its genome sequenced. Here we describe a bioinformatic analysis of zebra finch expressed sequence tag (EST) Genbank entries.Entities:
Mesh:
Year: 2007 PMID: 17300727 PMCID: PMC1804275 DOI: 10.1186/1471-2164-8-52
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Relative abundance of SSR repeats within zebra finch ESTs.
| SSR type | ≥ 20 bp | ≥ 30 bp | ||
| N | Freq | N | Freq | |
| di- | 173 (58) | 0.27 | 51 (14) | 0.35 |
| tri- | 224 (115) | 0.35 | 50 (16) | 0.35 |
| tetra- | 86 (60) | 0.13 | 9 (2) | 0.06 |
| penta- | 155 (122) | 0.24 | 34 (15) | 0.24 |
| Total | 638 (355) | 144 (47) | ||
| Prop. | 3.7% | 0.8% | ||
The number of SSRs of length greater than or equal to 20 bp and 30 bp are reported. The number in parentheses is the number of perfect, i.e. uninterrupted, repeats. Repeats with less than 90% purity are unreported. Prop. is the proportion of all unique sequences that contained an SSR of at least 20 bp or 30 bp.
The most abundant motifs among EST-SSRs ≥ 20 bp and ≥ 30 bp
| SSR ≥ 20 bp | SSR ≥ 30 bp | ||||
| Motif | N | Freq | Motif | N | Freq |
| AT | 110 | 0.17 | AT | 34 | 0.24 |
| AGG | 74 | 0.12 | AAT | 18 | 0.13 |
| CCG | 56 | 0.08 | AGG | 12 | 0.08 |
| AC | 43 | 0.07 | AC | 11 | 0.08 |
| AAAAC | 37 | 0.06 | AGC | 10 | 0.07 |
| AGC | 35 | 0.05 | CCG | 8 | 0.06 |
| AAT | 28 | 0.04 | AGCCG | 7 | 0.05 |
| CCCGG | 26 | 0.04 | AG | 6 | 0.04 |
| AAAC | 21 | 0.03 | ACGGC | 5 | 0.03 |
| AG | 19 | 0.03 | AAAAC | 4 | 0.03 |
| AAAAT | 4 | 0.03 | |||
| Total | 449 | 0.70 | Total | 119 | 0.83 |
Frequency is the proportion of all EST-SSRs represented by each motif.
Figure 1Regression of number of . Chicken chromosomes are generally numbered largest first, so chromosome 1 is the data point in the top right corner. Chromosome length is a good predictor of the number of mapped EST-SSR loci. However, microchromosomes are relatively EST-SSR abundant relative to macrochromosomes (chromsomes 1–5 and Z).
EST-SSR loci with significant homology to previously described passerine markers
| EST-SSR | Passerine marker | GenBank Accession | % similarity | E value |
| DV951916 | 91 | 5e-44 | ||
| DV958593 | 79 | 1e-112 | ||
| Contig 26 | 93 | 9e-52 | ||
| Contig 35 | 96 | 1e-127 | ||
| Contig 125 | 82 | 1e-130 | ||
| Contig 131 | 79 | 3e-30 |
Amplification success of ten EST-SSR markers in two zebra finch and two house sparrow populations.
| GI number | Primers | Expected size | Zebra finch | House sparrow | ||||||||
| Size range | Sheffield | Australia | Size range | Lundy | Aldra | |||||||
| k | Ho | k | Ho | k | Ho | k | Ho | |||||
| CK304956 | F-GGAATGCGTGGACCTTCTGGT | 184 | 184–187 | 3 | 0.444 | Not tested | Not Tested | 181–185 | 4 | 0.60 | 4 | 0.76 |
| DV952809 (contig23) | F-CACATCAAAGGATGTGGTGG | 192 | 189–191 | 2 | 0.25 | 2 | 0.38 | 195–197 | 2 | 0.44 | 1 | 0.00 |
| DV952125 | F-TGGATTACCTGTCTGAAAGACC | 206 | 202–210 | 5 | 0.86 | 5 | 0.5 | 151–171 | 2 | 0.28 | 2 | 0.43 |
| DV955012 | F-TCTCAGCATCAGTGTGTTCTGTAC | 289 | 282–307 | 6 | 0.91 | 3 | 0.52 | 296–298 | 2 | 0.35 | 1 | 0.00 |
| CK313552 | F-GGAGTTGGAGCGCAAGAAGC | 235 | - | - | - | - | - | - | - | - | - | - |
| DV946651 | F-CACAGAAAAGTGAGTGCATTCC | 249 | 243–262 | 3 | 0.35 | 5 | 0.89 | 287–308 | 5 | 0.64 | 5 | 0.60 |
| CK307697 (contig35) | F-CGAGTAGCGTATTTGTAGCGA | 180 | 170–183 | 6 | 0.79 | 4 | 0.62 | 171–181 | 6 | 0.55 | 4 | 0.66 |
| DV948303 | F-CTTCCTGCTATAAGGCACAGG | 118 | 107–117 | 3 | 0.38 | 6 | 0.74 | 96–105 | 3 | 0.50 | - | - |
| CK312585 (contig 73) | F-ATTCTCAGGTCAATCGCTCACACT | 434 | - | - | - | - | - | - | - | - | - | - |
| DV959049 (contig 206) | F-GGGGACGTTTTATCTGTTACC | 244 | 207–230 | 4 | 0.47 | 9 | 0.59 | - | - | - | - | - |
GI number is the Genbank identification number of each EST sequence. k = number of alleles, Ho = observed heterozygosity. 24 birds were typed from each population. All were in Hardy-Weinberg Equilibrium. All are dinucleotides except CK313552 which is a tetranucleotide. Further details on these EST-SSRs (including repeat motif and length, and predicted chromosomal assignments) are available in Additional Files 1 and 2.