
Transcriptome-facilitated development of SNPs for the Sonoran Desert rock fig, Ficus petiolaris (Moraceae).

Nicholas G Davis1, Derek D Houston1, John D Nason1.   

Abstract

PREMISE OF THE STUDY: Single-nucleotide polymorphism (SNP) primers were developed for a native North American desert fig, Ficus petiolaris (Moraceae), to provide markers for population genetic studies designed to quantify patterns of gene flow across a complex landscape.
METHODS AND RESULTS: Transcriptome sequencing and bioinformatic protocols were implemented to discover SNPs in single-copy protein-coding genes. Multiplexes of 30 nuclear and 24 organellar (chloroplast and mitochondrial) SNPs were selected for primer development and genotyping on the Sequenom MASSArray System. Of these 54 loci, 49 reliably amplified across a panel of 96 F. petiolaris individuals.
CONCLUSIONS: This study has provided SNP primers that can be applied in future studies investigating population genetics of F. petiolaris and its coevolution with associated pollinating and nonpollinating fig wasps.

Keywords:  Ficus petiolaris; Moraceae; RNA sequencing; population genomics; single nucleotide polymorphism; transcriptome sequencing

Year:  2015        PMID: 26191464      PMCID: PMC4504724          DOI: 10.3732/apps.1500028

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


The genus Ficus L. (Moraceae) is a diverse (>750 species) and ecologically important lineage of tropical woody plants. Many organisms depend on figs to carry out portions of their life cycles, particularly fig wasp pollinators and parasites, which are often host fig specific. Despite substantial interest in the coevolution of figs and fig wasps (Herre et al., 2008), genomic resources for these nonmodel organisms are largely lacking. Next-generation sequencing technologies have facilitated the development of genomic resources, such as single-nucleotide polymorphisms (SNPs), for nonmodel organisms. SNPs are biallelic markers that can yield valuable insight into ecological, genetic, and coevolutionary processes (Morin et al., 2004; Pool et al., 2010; Steiner et al., 2013). The Sonoran Desert rock fig, F. petiolaris Kunth, is the only widespread, desert-adapted fig species in North America. It is also the northernmost naturally distributed Ficus in the New World, reaching a latitude of 31°N in the state of Sonora, Mexico. Ficus petiolaris supports a community of obligately associated fig wasps, including a pollinator (Pegoscapus) and several nonpollinators (Aepocerus, Heterandrium, Idarnes, and Physothorax). To enable ecological and evolutionary genetic studies, we sequenced the transcriptome of F. petiolaris to develop SNP markers optimized for high-throughput genotyping on the Sequenom MASSArray System (Agena Bioscience, San Diego, California, USA).

METHODS AND RESULTS

RNA was extracted from nine F. petiolaris plants grown from seeds sampled from five populations distributed across the species’ range in Baja California, Mexico (Appendix 1). Five milligrams of leaf tissue was sampled per individual, samples were pooled and homogenized in liquid nitrogen with a mortar and pestle, and RNA was extracted using the Spectrum Plant Total RNA Kit (Sigma-Aldrich, St. Louis, Missouri, USA). Extracted RNA was quantified using a NanoDrop 1000 Spectrophotometer (Thermo Fisher Scientific Inc., Waltham, Massachusetts, USA) and then submitted to the Iowa State University (ISU) DNA Facility, where it was quantified a second time using the Agilent RNA 6000 Nano Kit (Agilent Technologies, Santa Clara, California, USA). A cDNA library was prepared from the mRNA templates using the Illumina TruSeq RNA Sample Preparation Kit V2 (Illumina, San Diego, California, USA), with library construction verified using the Agilent DNA 7500 Kit (Agilent Technologies), before transcriptome sequencing at the ISU DNA Facility on an Illumina MiSeq (Illumina) with 250-cycle paired-end reads.

Illumina sequencing produced 33,294,480 reads, with an average read length of 215 bp, for a total of 7,147,200,749 bp sequenced. Low-quality reads were removed using Sickle v.1.33 (Joshi and Fass, 2011). The F. petiolaris transcriptome was de novo assembled using Trinity release 2013-11-10 (Grabherr et al., 2011). The final assembly contained 125,493 contigs, with a mean length of 1176 bp, mean coverage depth of 48×, N50 and N90 of 2011 bp and 478 bp, respectively, and a total length of 147,624,931 bp. Reads were mapped to the assembled transcriptome using the program Bowtie2 v.2.1.0 (Langmead and Salzberg, 2012). SNP calling was performed using the Genome Analysis Toolkit (GATK) v.2.7-2 (McKenna et al., 2010). GATK input files were prepared using SAMtools v.1.1 (Li et al., 2009) and Picard v.1.97 (The Broad Institute; freely available at http://broadinstitute.github.io/picard/).
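For readers less familiar with the N50 and N90 assembly statistics reported above, the following is a minimal sketch of how they are conventionally computed from a list of contig lengths. The function name `nx` and the toy lengths are illustrative assumptions; this is not the code used in the study.

```python
def nx(lengths, fraction):
    """Return the Nx statistic: the contig length L such that contigs of
    length >= L together contain at least `fraction` of all assembled bases."""
    if not lengths:
        raise ValueError("no contig lengths given")
    ordered = sorted(lengths, reverse=True)
    target = fraction * sum(ordered)
    running = 0
    for length in ordered:
        running += length
        if running >= target:
            return length
    return ordered[-1]


# Toy contig lengths (hypothetical values, not taken from the assembly).
toy_lengths = [100, 200, 300, 400, 500]
n50 = nx(toy_lengths, 0.5)
n90 = nx(toy_lengths, 0.9)
print(n50, n90)
```

Because N90 requires 90% of the bases to be covered, it is always less than or equal to N50, as with the reported values of 478 and 2011.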
GATK identified 139,254 putative SNPs, which were filtered bioinformatically using customized Python scripts. Initial SNP filtering was based on the following criteria: (1) sequence depth at the SNP position was ≥10; (2) the GATK quality score was ≥30; (3) there were no ambiguous bases, indels, or other SNPs located within 100 bp flanking the SNP; and (4) the minor allele was represented in at least 1% of the reads (to minimize ascertainment bias). This initial filtering yielded a set of 21,228 putative SNPs.

SNPs occurring in single-copy protein-coding genes were identified as follows: (1) Primary protein transcripts for Arabidopsis thaliana (L.) Heynh., Oryza sativa L., and Vitis vinifera L. were obtained from the U.S. Department of Energy database (http://jgi.doe.gov). (2) Single-copy nuclear gene variants identified by Duarte et al. (2010) as shared among a diverse sampling of seed plants were retrieved from the primary protein transcripts. (3) A local BLASTX of F. petiolaris transcripts against the single-copy nuclear gene variant database was performed. BLAST results were filtered by E-value (≤1e-100), identity score (≥70%), and having hits to two or more species. This filtering yielded 3200 putative SNPs in 927 single-copy nuclear gene contigs. For contigs containing multiple SNPs, the one with the highest coverage was selected if it was also located ≥60 bp from the contig’s ends and ≥20 bp from the nearest neighboring SNP.

SNPs in organellar genomes were identified by performing tBLASTX against the mitochondrial genomes of Malus domestica Borkh. (GenBank no. FR714868), V. vinifera (FM179380), Ricinus communis L. (HQ874649), Carica papaya L. (EU431224), and A. thaliana (Y08501), and the chloroplast genomes of A. thaliana (NC_000932), Populus trichocarpa Torr. & A. Gray (NC_009143), and V. vinifera (NC_007957).
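The four initial SNP filtering criteria listed above can be sketched in Python as follows. The authors' own scripts are not reproduced here; the `Snp` record layout, the `passes_initial_filter` function, and the example values are hypothetical, and only the thresholds (depth ≥10, quality ≥30, 100-bp clean flanks, minor allele in ≥1% of reads) come from the text.

```python
from dataclasses import dataclass


@dataclass
class Snp:
    contig: str       # contig identifier (hypothetical naming)
    pos: int          # SNP position within the contig
    depth: int        # read depth at the SNP position
    qual: float       # variant quality score
    minor_reads: int  # reads supporting the minor allele


def passes_initial_filter(snp, other_variant_positions, flank=100,
                          min_depth=10, min_qual=30.0,
                          min_minor_fraction=0.01):
    """Apply the four criteria: depth, quality, clean flanks, minor allele share.

    `other_variant_positions` holds positions on the same contig of ambiguous
    bases, indels, and SNPs (the focal SNP itself may be included).
    """
    if snp.depth < min_depth or snp.qual < min_qual:
        return False
    if snp.minor_reads / snp.depth < min_minor_fraction:
        return False
    return not any(p != snp.pos and abs(p - snp.pos) <= flank
                   for p in other_variant_positions)


# Hypothetical records for illustration only.
good = Snp("contig_A", 500, 40, 55.0, 8)
shallow = Snp("contig_A", 500, 8, 55.0, 2)
rare = Snp("contig_A", 500, 400, 55.0, 3)
print(passes_initial_filter(good, [500, 900]))     # clean 100-bp flanks
print(passes_initial_filter(good, [500, 560]))     # neighbor 60 bp away
print(passes_initial_filter(shallow, [500]))       # depth below 10
print(passes_initial_filter(rare, [500]))          # minor allele below 1%
```

Checking depth first also guarantees that the minor allele fraction is never computed with a zero denominator.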
The BLAST results for organellar genomes were filtered based on E-value (≤1e-50), identity score (≥70%), and hits to three or more mtDNA genomes, or two or more cpDNA genomes. After filtering out SNPs located near contig ends, a set of 31 putative organellar SNPs was obtained. This relatively small number of SNPs is likely due to the generally low levels of polymorphism in maternally inherited plant genomes. The 927 nuclear and 31 organellar SNPs from F. petiolaris that were submitted to the Sequenom MASSArray System software for primer design had a minimum minor allele frequency of 9%, which should reduce the likelihood of false-positive SNP calls, particularly given the accuracy of the Illumina sequencing platform (Ross et al., 2013). The nuclear SNPs formed 31 multiplexes ranging from 27 to 30 loci, and the organellar SNPs formed two multiplexes of 24 loci and seven loci. The Sequenom software could not effectively multiplex the nuclear and organellar SNPs together, but given that genotypes were accurately scored, it is unlikely that loci in separate multiplexes would give rise to systematic bias. For genotyping, we selected two of the 33 total multiplexes: one nuclear multiplex of 30 loci, which had the highest confidence score (78.1%), and the organellar multiplex of 24 loci (confidence score 82.6%) (Table 1; Appendix 2).
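The BLAST hit filtering described for the nuclear and organellar searches can be illustrated with a short Python sketch. It assumes the standard 12-column tabular BLAST output (query ID, subject ID, percent identity, ..., E-value in column 11, bit score in column 12) and treats the E-value cutoff as an upper bound, which is the usual direction for such a threshold. The `species|gene` subject naming and the function name are hypothetical conventions chosen for the example, not details from the study.

```python
from collections import defaultdict


def contigs_passing(blast_lines, max_evalue=1e-100, min_identity=70.0,
                    min_species=2):
    """Return query contigs with qualifying hits to at least `min_species` species.

    Assumes tab-separated rows in the standard 12-column layout and subject
    IDs written as 'species|gene' (an assumption made for this sketch).
    """
    species_hit = defaultdict(set)
    for line in blast_lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        query, subject = fields[0], fields[1]
        identity, evalue = float(fields[2]), float(fields[10])
        if evalue <= max_evalue and identity >= min_identity:
            species_hit[query].add(subject.split("|", 1)[0])
    return sorted(q for q, s in species_hit.items() if len(s) >= min_species)


# Hypothetical hits: contig_1 matches two species; contig_2 only one;
# contig_3 matches two species but one hit fails the identity threshold.
rows = [
    "contig_1\tAthaliana|g1\t85.0\t300\t45\t0\t1\t900\t1\t300\t1e-120\t400",
    "contig_1\tVvinifera|g7\t88.0\t300\t36\t0\t1\t900\t1\t300\t1e-130\t420",
    "contig_2\tAthaliana|g2\t90.0\t300\t30\t0\t1\t900\t1\t300\t1e-140\t450",
    "contig_3\tOsativa|g3\t65.0\t300\t105\t0\t1\t900\t1\t300\t1e-110\t380",
    "contig_3\tVvinifera|g9\t80.0\t300\t60\t0\t1\t900\t1\t300\t1e-115\t390",
]
nuclear_pass = contigs_passing(rows)
print(nuclear_pass)
```

For the organellar searches the same function could be called with `max_evalue=1e-50` and `min_species=3` (mtDNA) or `min_species=2` (cpDNA), counting reference genomes rather than species.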
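Table 1.

Information for the 54 SNPs validated through genotyping a panel of 96 Ficus petiolaris individuals.

| SNP ID | Multiplex^b | Contig | Base position^c | Major allele | Minor allele | Minor allele frequency | % Amplified | Polymorphic^d |
|---|---|---|---|---|---|---|---|---|
| Fpet.01 | 1 (nuclear) | 29,925 | 408 | A | C | 0.29 | 16 | No (A) |
| Fpet.02 | 1 (nuclear) | 22,889 | 893 | A | G | 0.13 | 100 | Yes |
| Fpet.03 | 1 (nuclear) | 27,895 | 714 | T | C | 0.14 | 100 | Yes |
| Fpet.04 | 1 (nuclear) | 24,924 | 2212 | G | A | 0.19 | 100 | Yes |
| Fpet.05 | 1 (nuclear) | 30,715 | 1145 | C | A | 0.11 | 99 | Yes |
| Fpet.06 | 1 (nuclear) | 20,750 | 598 | T | C | 0.25 | 99 | Yes |
| Fpet.07 | 1 (nuclear) | 24,920 | 340 | T | C | 0.37 | 100 | Yes |
| Fpet.08 | 1 (nuclear) | 23,050 | 1493 | A | G | 0.15 | 91 | Yes |
| Fpet.09 | 1 (nuclear) | 30,628 | 318 | C | T | 0.47 | 99 | Yes |
| Fpet.10 | 1 (nuclear) | 30,820 | 1147 | G | A | 0.40 | 100 | Yes |
| Fpet.11 | 1 (nuclear) | 18,553 | 748 | C | T | 0.38 | 79 | Yes |
| Fpet.12 | 1 (nuclear) | 18,592 | 469 | T | A | 0.11 | 98 | Yes |
| Fpet.13 | 1 (nuclear) | 24,973 | 636 | T | C | 0.20 | 0 | N/A |
| Fpet.14 | 1 (nuclear) | 28,670 | 1480 | T | C | 0.11 | 100 | Yes |
| Fpet.15 | 1 (nuclear) | 17,060 | 273 | T | C | 0.49 | 99 | Yes |
| Fpet.16 | 1 (nuclear) | 26,868 | 3628 | C | T | 0.12 | 100 | Yes |
| Fpet.17 | 1 (nuclear) | 26,617 | 1811 | C | T | 0.11 | 100 | Yes |
| Fpet.18 | 1 (nuclear) | 27,253 | 339 | G | T | 0.24 | 100 | Yes |
| Fpet.19 | 1 (nuclear) | 28,976 | 1599 | A | G | 0.18 | 96 | Yes |
| Fpet.20 | 1 (nuclear) | 22,155 | 293 | G | C | 0.28 | 100 | Yes |
| Fpet.21 | 1 (nuclear) | 23,811 | 845 | A | G | 0.12 | 100 | No (G) |
| Fpet.22 | 1 (nuclear) | 22,385 | 2212 | T | C | 0.14 | 0 | N/A |
| Fpet.23 | 1 (nuclear) | 21,647 | 1908 | T | C | 0.48 | 86 | Yes |
| Fpet.24 | 1 (nuclear) | 25,125 | 247 | C | T | 0.23 | 84 | Yes |
| Fpet.25 | 1 (nuclear) | 22,988 | 1450 | C | T | 0.27 | 95 | Yes |
| Fpet.26 | 1 (nuclear) | 28,413 | 935 | G | A | 0.32 | 100 | Yes |
| Fpet.27 | 1 (nuclear) | 28,379 | 559 | T | C | 0.27 | 85 | Yes |
| Fpet.28 | 1 (nuclear) | 21,679 | 382 | G | A | 0.44 | 100 | Yes |
| Fpet.29 | 1 (nuclear) | 22,737 | 3130 | C | T | 0.09 | 100 | Yes |
| Fpet.30 | 1 (nuclear) | 30,983 | 611 | A | C | 0.20 | 0 | N/A |
| Fpet.31 | 2 (mtDNA) | 22,102 | 197 | C | A | 0.45 | 99 | No (C) |
| Fpet.32 | 2 (mtDNA) | 30,053 | 546 | A | G | 0.16 | 99 | No (G) |
| Fpet.33 | 2 (mtDNA) | 25,896 | 588 | A | G | 0.30 | 100 | No (G) |
| Fpet.34 | 2 (mtDNA) | 25,564 | 1258 | C | T | 0.19 | 98 | Yes |
| Fpet.35 | 2 (cpDNA) | 25,544 | 4155 | G | A | 0.40 | 100 | Yes |
| Fpet.36 | 2 (mtDNA) | 23,811 | 1056 | G | A | 0.29 | 100 | No (G) |
| Fpet.37 | 2 (mtDNA) | 25,564 | 906 | A | T | 0.43 | 98 | Yes |
| Fpet.38 | 2 (cpDNA) | 28,687 | 1168 | G | T | 0.31 | 99 | Yes |
| Fpet.39 | 2 (mtDNA) | 30,053 | 245 | T | A | 0.43 | 98 | No (T) |
| Fpet.40 | 2 (mtDNA) | 25,564 | 501 | C | T | 0.40 | 99 | Yes |
| Fpet.41 | 2 (mtDNA) | 14,845 | 1153 | G | A | 0.49 | 100 | No (G) |
| Fpet.42 | 2 (cpDNA) | 30,714 | 1292 | T | C | 0.24 | 99 | No (C) |
| Fpet.43 | 2 (cpDNA) | 23,204 | 378 | T | C | 0.35 | 100 | Yes |
| Fpet.44 | 2 (mtDNA) | 19,365 | 1456 | C | T | 0.50 | 100 | No (C) |
| Fpet.45 | 2 (cpDNA) | 771259 [contig and base position run together in source] | | T | C | 0.47 | 100 | No (C) |
| Fpet.46 | 2 (mtDNA) | 19,365 | 1577 | T | C | 0.28 | 100 | No (C) |
| Fpet.47 | 2 (cpDNA) | 30,714 | 407 | T | C | 0.30 | 100 | No (C) |
| Fpet.48 | 2 (cpDNA) | 15,049 | 2950 | A | T | 0.32 | 100 | Yes |
| Fpet.49 | 2 (mtDNA) | 14,845 | 647 | G | A | 0.18 | 99 | No (G) |
| Fpet.50 | 2 (mtDNA) | 25,564 | 681 | C | G | 0.40 | 20 | No (C) |
| Fpet.51 | 2 (cpDNA) | 24,260 | 787 | A | G | 0.33 | 100 | No (G) |
| Fpet.52 | 2 (cpDNA) | 25,544 | 288 | G | T | 0.38 | 100 | Yes |
| Fpet.53 | 2 (cpDNA) | 25,544 | 5259 | C | A | 0.33 | 95 | Yes |
| Fpet.54 | 2 (mtDNA) | 32,131 | 293 | A | G | 0.30 | 1 | No |

Major and minor alleles and minor allele frequencies were determined from the assembled transcriptome data, whereas percentage of samples that amplified and whether the SNP was polymorphic were determined through genotyping.

b Sequenom multiplex number and source genome (in parentheses).

c Base position within the contig.

d Whether the SNP was polymorphic in the diversity panel (if monomorphic then the observed allele is listed in parentheses).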

SNPs were verified by genotyping 96 F. petiolaris individuals representing the species range in Baja California, Mexico (Appendix 1). Genomic DNA was extracted from silica-dried leaf tissue using an AutoGen Prep 740 DNA extraction robot (AutoGen, Holliston, Massachusetts, USA). DNA concentration was standardized to 20–25 ng/μL, then individuals were genotyped using the Sequenom MASSArray instrument at the ISU Genomic Technologies Facility. Of the 30 nuclear SNPs, 26 (87%) amplified successfully, of which 25 were polymorphic (Table 1). The one monomorphic SNP was likely due to poor amplification on the diversity panel (16% amplification; see Table 1). Of the 24 maternally inherited SNPs, 23 (96%) amplified successfully, of which only nine were polymorphic (Table 1). The relatively low number of polymorphic mtDNA and cpDNA SNPs may be an artifact of having a number of full siblings in our diversity panel, although further testing on additional samples is needed to verify that this is the case.
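The per-locus quantities reported in Table 1 (percentage of samples that amplified, and whether a locus was polymorphic) can be derived from raw genotype calls along the following lines. This is a minimal sketch under stated assumptions: genotype calls are represented as allele strings such as "AG" (diploid nuclear) or "C" (haploid organellar), with `None` for a failed call, and the function name and example data are hypothetical rather than taken from the study.

```python
def summarize_locus(calls):
    """Return (percent of samples with a genotype call, polymorphic flag)."""
    if not calls:
        raise ValueError("no samples given")
    called = [c for c in calls if c is not None]
    pct_amplified = 100.0 * len(called) / len(calls)
    # A locus is polymorphic if more than one allele is seen among the calls.
    alleles = {allele for call in called for allele in call}
    return pct_amplified, len(alleles) > 1


# Hypothetical calls for four samples at two loci.
nuclear_example = ["AA", "AG", None, "GG"]
organellar_example = ["C", "C", "C", None]
print(summarize_locus(nuclear_example))
print(summarize_locus(organellar_example))

# The success rates quoted in the text follow from simple proportions.
nuclear_rate = round(100 * 26 / 30)
organellar_rate = round(100 * 23 / 24)
print(nuclear_rate, organellar_rate)
```

A locus with very few successful calls can appear monomorphic simply because too few individuals were sampled, which is why the amplification percentage is worth reading alongside the polymorphism flag.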

CONCLUSIONS

We successfully developed primers for 49 SNPs that amplified reliably in F. petiolaris individuals sampled across a broad geographic range. These SNPs can be applied to future ecological, genetic, and coevolutionary studies of F. petiolaris and its associated pollinating and nonpollinating fig wasps.
Appendix 1.

Source locality information for samples included in this study.

| Sampling locality | Geographic coordinates | Tissue voucher no.^a | N |
|---|---|---|---|
| San Bartolo, Baja California Sur, Mexico | 23.736520°N, 109.843830°W | Fpet.70.08.3.A-J^DNA | 10 |
| | | Fpet.70.38.4^DNA | 1 |
| | | Fpet.70.56.2^RNA | 1 |
| | | Fpet.70.56.3^DNA | 1 |
| Fig Canyon (San Isidro), Baja California Sur, Mexico | 26.357880°N, 111.803510°W | Fpet.95.10^DNA | 1 |
| | | Fpet.95.17B^DNA | 1 |
| La Paz Summit, Baja California Sur, Mexico | 24.048400°N, 110.150080°W | Fpet.96.34.5^DNA | 1 |
| | | Fpet.96.34.36^RNA | 1 |
| | | Fpet.96.36.20^DNA | 1 |
| Mesa La Caguama, Baja California Sur, Mexico | 27.56675°N, 113.07373°W | Fpet.112.101^DNA | 1 |
| | | Fpet.112.102^DNA | 1 |
| | | Fpet.112.104^DNA | 1 |
| Santa Agueda, Baja California Sur, Mexico | 27.086955°N, 112.516378°W | Fpet.113.01.01^RNA, DNA | 1 |
| | | Fpet.113.4N.17^RNA, DNA | 1 |
| Aguijito Higuera, Baja California, Mexico | 29.261530°N, 114.016780°W | Fpet.158.6A.15^RNA, DNA | 1 |
| | | Fpet.158.29.18.A-T^DNA | 19 |
| | | Fpet.158.29.30^RNA | 1 |
| La Lagunita, Baja California Sur, Mexico | 28.2172°N, 113.18943°W | Fpet.170.05.A-B^DNA | 2 |
| Bahia San Francisquito Rd., Baja California Sur, Mexico | 28.291410°N, 113.111680°W | Fpet.172.2.11^DNA | 1 |
| | | Fpet.172.2.23^RNA | 1 |
| | | Fpet.172.4.13^RNA | 1 |
| | | Fpet.172.4.18.A-Y^DNA | 25 |
| | | Fpet.172.30.17.A-V^DNA | 21 |
| | | Fpet.172.30.18.A-D^DNA | 4 |
| El Ranchito, Baja California Sur, Mexico | 25.375988°N, 111.316845°W | Fpet.201.14^DNA | 1 |
| | | Fpet.201.15^DNA | 1 |

Note: N = number of samples.

a Tissue vouchers are deposited in the laboratory of J. Nason. The superscript RNA denotes samples that were used for RNA extraction and transcriptome sequencing, whereas the superscript DNA denotes samples that were used for DNA extraction and SNP genotyping.

Appendix 2.

SNP primer table including the marker’s ID, GenBank accession number (NCBI ss#), polymorphism type, sequence capture primers 1 and 2, Sequenom extension primer, and cellular location.

| SNP ID | NCBI ss# | SNP type | Capture primer 1 (5′–3′) | Capture primer 2 (5′–3′) followed by extend primer (5′–3′) [boundary not recoverable from source] | Cellular source |
|---|---|---|---|---|---|
| Fpet.01 | 1573990490 | A/C | 1-GGCGCCGGAGGGCTCCAT | 2-TCCTTCAAGTCCACCATCTCCCAACCCCCTCCACAAC | Nucleus |
| Fpet.02 | 1573990591 | A/G | 1-CTCCAAACTATCTTACGGTG | 2-GCCAAGCAAAGCCTTTTCACATGCCTTGTCAAGCATC | Nucleus |
| Fpet.03 | 1573990721 | C/T | 1-CACACAAAATTTGCACCCCC | 2-TGTCATCCTTGCGTTGAATCTGAATCAAAGGCTCTCC | Nucleus |
| Fpet.04 | 1573990894 | A/G | 1-GGAGTTGAACTAAGGGTCTG | 2-TATACCCCTTCGCGCCAAACCCAAACCTCCATTCACTC | Nucleus |
| Fpet.05 | 1573991012 | A/C | 1-GGATACCCTCTTCCTTTCTC | 2-TGTCGCCATTCTCAAAGAGGTAGTACCAAAACAACGGG | Nucleus |
| Fpet.06 | 1573991131 | C/T | 1-GATGACTCTCGAGAAACTGC | 2-CTTGTCAGCCAATTGAACTCCCAATTGAACTCTCTTCAC | Nucleus |
| Fpet.07 | 1573991257 | C/T | 1-GGTTACTTGCCATCATCCAG | 2-ACGGTATACCAAGCGACAACTTTTGCGTGACGCCACAAT | Nucleus |
| Fpet.08 | 1573991353 | A/G | 1-GAAGAGATTCTGGCGAAAGG | 2-TTCCTCACCCTTACACCAACCTATTCTTCTTCTCCTCCCT | Nucleus |
| Fpet.09 | 1573991462 | C/T | 1-ATGAAACGCCTTGTCCAGTC | 2-AGTGGCTCTGGTATTCTGTCAGGTGAGCTGGCGCAACAG | Nucleus |
| Fpet.10 | 1573991529 | A/G | 1-GGGTGTATGGATAAGTTGC | 2-ACGGATCACGCTTCTTTGACTTTTGCCAAACTGCGACCAA | Nucleus |
| Fpet.11 | 1573991680 | C/T | 1-AGCGTTGTTAGGATCAGGAG | 2-GTGAGATGTGACAGGCTTAGAAGGCTTTATACTCCTCGGC | Nucleus |
| Fpet.12 | 1573991771 | A/T | 1-AATGTTCCAACATGGCACCG | 2-TAACCTGCCTGTTCTTCACGCACTTCAACCTTGTTTCCACA | Nucleus |
| Fpet.13 | 1573991863 | C/T | 1-TGCTGAAGGTTTCTCTGAAC | 2-TCATATCCTTGAGCTTCACCGGTGTATTCAACCAAAGCAAC | Nucleus |
| Fpet.14 | 1573991951 | C/T | 1-GGCAGATCGAGTCAGTTATG | 2-CAAACTGCTGTTTGAGCTCCAAATCTCCTCTACCCTCCACTC | Nucleus |
| Fpet.15 | 1573992101 | C/T | 1-GAACCTACGGTGTGGTTTAC | 2-GCTCCAAACGGATCTTCTTCTTCTCTTCAAAGCAATTGTCTC | Nucleus |
| Fpet.16 | 1573992221 | C/T | 1-AGTACAAGTCCCCAACTGTC | 2-GGCAAGATAATGGTGGATTGATTGCATTGAAAATATTCCTGC | Nucleus |
| Fpet.17 | 1573992381 | C/T | 1-GGTGGCCCTATCGGTTTAAT | 2-GCTCTCCAACTCTCCATCTGACTATTCTCCTATTCTCAACATC | Nucleus |
| Fpet.18 | 1573992495 | G/T | 1-TCACACATTTTCTGATTCCG | 2-CCGGGGACAACTGATAACTTTGTACAACTGATAACTTCCAAAA | Nucleus |
| Fpet.19 | 1573992599 | A/G | 1-CTGTTTTTACTCCTAAGGAAG | 2-ATAAAGTTTCCTTATGGGCAGGGCCTTATGGGCAATGCTAAT | Nucleus |
| Fpet.20 | 1573992755 | C/G | 1-AGGGTCGGCACGTATGAATC | 2-TCGTTGTCACTCATCTCTGGGGGATGGTTAAAAGAACCAGAAG | Nucleus |
| Fpet.21 | 1573992856 | A/G | 1-TTCCCTAGGACTGCTATAAC | 2-ATGCATGCTAATGGGGCAAGACACATATTTTTCGTGGTCTATAT | Nucleus |
| Fpet.22 | 1573993008 | C/T | 1-TTGTCTCCAATGCACCATCC | 2-TGAAGACATCTGCATGAGCGTTTCCGGGCAGCGATGATAATCCT | Nucleus |
| Fpet.23 | 1573993170 | C/T | 1-TGGGAGAGCAGTTATCGTTG | 2-TTTCTACTGCACTGCACAGGCGGTGGACTTGAGAGAAAGTAGAA | Nucleus |
| Fpet.24 | 1573993274 | C/T | 1-GATGTTGAAGTTAGCGTCCC | 2-CATAGACGGTCCACTTATGCCCCCTTTACCAAGCCAAAACGCAAT | Nucleus |
| Fpet.25 | 1573993392 | C/T | 1-GTTTGCCAAACTAGATGGTC | 2-CTGCCGCTCATGATGTATTGGGGACCTGGCTTATAAGAACGTATC | Nucleus |
| Fpet.26 | 1573993496 | A/G | 1-AAGCTCTACACCGAAGACTG | 2-GATTTCCTGACACGCTTACGAGAACCGCTTACGAATTTAACTTTCC | Nucleus |
| Fpet.27 | 1573993653 | C/T | 1-CCAACTCCCTCAGAGTAATC | 2-CCAGCCAAGGTTCATAAAGCGAAATAATGTTCTTAGAGGTCTGCAT | Nucleus |
| Fpet.28 | 1573993755 | A/G | 1-TCAGGCTGAGTTGGTTTTGG | 2-CTATCGTCCAAGTAATCCCCGAACTCACCAATCTTATTCTCTTTCCT | Nucleus |
| Fpet.29 | 1573993864 | C/T | 1-CCAAAGGTGACCCAAGAATC | 2-CAACTTCTCTCCAAACGACCAACCCCTCCAAACGACCAAAGCTCTTC | Nucleus |
| Fpet.30 | 1573993967 | A/C | 1-CTCCATATTCCATCCTCTTC | 2-TGTCAAACCAGAGGGATATGACATTATTCAATTCCTGCTGAAATTCC | Nucleus |
| Fpet.31 | 1573994123 | A/C | 1-GCCTTTCTTGTACTAATACC | 2-ATTCCGGTACCCCCGTGTTACCCCCGTGTTACTCCTT | Mitochondria |
| Fpet.32 | 1573994227 | A/G | 1-AGAATACGTTCTCGCATCGC | 2-GAATGAAGTGGGTCAACCTCCAACCTCTTTTTGGCTT | Mitochondria |
| Fpet.33 | 1573994336 | A/G | 1-GAGTTATGGCATTCAATCTC | 2-CAACCATTTTTGCTCGTGCTGCTCGTGCTAGTGCCCC | Mitochondria |
| Fpet.34 | 1573994438 | C/T | 1-CTTTTATCTGTTGGCTTTGG | 2-CGAAAGTAGCTCTCAAGAACGAAAAGAAATCGCCCATT | Mitochondria |
| Fpet.35 | 1573994550 | A/G | 1-CTTAACAATAGGACCTGGAG | 2-GCATCTAAAGCCCCCTTTACAGCAATAGCATGATGAAC | Chloroplast |
| Fpet.36 | 1573994673 | A/G | 1-TTATCCAACCCCGAGCAATC | 2-GCTAAAAAAACGCCAGTCACCCATACCAGCTAACGAACC | Mitochondria |
| Fpet.37 | 1573994759 | A/T | 1-AGCCCTTGCTCATGGTTTTG | 2-TGTACCAACCCAACACACACCAGCACTCTCTCCCACATTT | Mitochondria |
| Fpet.38 | 1573994857 | G/T | 1-CCGGGTCACAATTTGTATCG | 2-CGGCTCTTCGAGAATGTATCCTAACTTTGGGAATTCCCAC | Chloroplast |
| Fpet.39 | 1573994962 | A/T | 1-TGACATAGCGTTCCTGATAG | 2-CAAAGCAGGACTTCTTTGGCTGGCAAAAAGAACTTGAATA | Mitochondria |
| Fpet.40 | 1573995030 | C/T | 1-CTCCATAAATCAAGCTCTCC | 2-GCCTGGCACTAAGTGCAATGACCTTCCTGCTAGTATTCCTA | Mitochondria |
| Fpet.41 | 1573995130 | A/G | 1-ACTTTTCCGGAAAGACCACC | 2-TTGGCAATCCTTGGTAGAGCCCAGATGATTTCGTGCTGAAC | Mitochondria |
| Fpet.42 | 1573995228 | C/T | 1-TGAGATACAGAGGAATAAGC | 2-GATGATAGTCGGCACAATTCCCCCATGCAGCTTTAACATCTC | Chloroplast |
| Fpet.43 | 1573995397 | C/T | 1-GAAACTCGCCGTAAAAAAATG | 2-TCAGTACAGTAGATATTCCCCACACGTCCCTTTCTGTCTGA | Chloroplast |
| Fpet.44 | 1573995511 | C/T | 1-AATGATGGATTTCGCGCCAC | 2-AATTGCTTTAGCGGGAGCTGCGGTCGGTATTGGAAACGTCTT | Mitochondria |
| Fpet.45 | 1573995594 | C/T | 1-GATCGGTATAAACATCAAC | 2-AGGTAGGATTTTTTTGGCCCACTTATTTGTTGAGGAGAAACT | Chloroplast |
| Fpet.46 | 1573995675 | C/T | 1-CGATCAGTCCAATTGAAACC | 2-AGCTATTGCATTGTTTGCCCCTTCTGCCCTAATGATGGCCTTT | Mitochondria |
| Fpet.47 | 1573995798 | C/T | 1-AGGGAACCTGCAAATATTGG | 2-GGGTTTTTCTGGTCCAAGTGCCAAGTGTATCTTGTTTTTACTA | Chloroplast |
| Fpet.48 | 1573995908 | A/T | 1-CGACAAGGAATTTCGCTACC | 2-GAAGTTGGTGACCTGATGACGGGATGTGAACGGCGGCCGTAAC | Chloroplast |
| Fpet.49 | 1573996018 | A/G | 1-GGAGATTTATAGCATCATTC | 2-GGTCTGGAATTAGGTGTAGCAGTACAAGCTTATGTTTTTACGAT | Mitochondria |
| Fpet.50 | 1573996173 | C/G | 1-TAGGAAAGTTGTTGTAGCTG | 2-TGCTGATGCAATCACCATACCATCACCATACTAGTACACTTAATA | Mitochondria |
| Fpet.51 | 1573996288 | A/G | 1-GTTTGGTGATTAAGGCGAAG | 2-AAAAGGCGCTCAGCCTACAGCCTACAGGAACTGTTTATGATATTT | Chloroplast |
| Fpet.52 | 1573996427 | G/T | 1-ATTCCTTAACTATTGGCGGG | 2-GTTCCGGCGAACGAATAATCATCGTTCCGGACAACACATACAAAGA | Chloroplast |
| Fpet.53 | 1573996541 | A/C | 1-GGGTCCTCTATGATCGATG | 2-TTGTTGCCCGGAGCAACAAGGAGCCAGTTGGTAAGTATTAAAATCC | Chloroplast |
| Fpet.54 | 1573996656 | A/G | 1-ACTCGCTCTGTAGTGTTGTC | 2-TGCTTTCCTAGATCTTCTCCCTTCCTAGATCTTCTCCTTAATGTATT | Mitochondria |

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  Population genetic inference from genomic sequence variation.

Authors:  John E Pool; Ines Hellmann; Jeffrey D Jensen; Rasmus Nielsen
Journal:  Genome Res       Date:  2010-01-12       Impact factor: 9.043

3.  Conservation genomics of threatened animal species.

Authors:  Cynthia C Steiner; Andrea S Putnam; Paquita E A Hoeck; Oliver A Ryder
Journal:  Annu Rev Anim Biosci       Date:  2013-01-03       Impact factor: 8.923

4.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

5.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

6.  Identification of shared single copy nuclear genes in Arabidopsis, Populus, Vitis and Oryza and their phylogenetic utility across various taxonomic levels.

Authors:  Jill M Duarte; P Kerr Wall; Patrick P Edger; Lena L Landherr; Hong Ma; J Chris Pires; Jim Leebens-Mack; Claude W dePamphilis
Journal:  BMC Evol Biol       Date:  2010-02-24       Impact factor: 3.260

7.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

8.  Characterizing and measuring bias in sequence data.

Authors:  Michael G Ross; Carsten Russ; Maura Costello; Andrew Hollinger; Niall J Lennon; Ryan Hegarty; Chad Nusbaum; David B Jaffe
Journal:  Genome Biol       Date:  2013-05-29       Impact factor: 13.583
