Literature DB >> 22768984

Tissue-specific temporal exome capture revealed muscle-specific genes and SNPs in Indian buffalo (Bubalus bubalis).

Subhash J Jakhesara¹, Viral B Ahir, Ketan B Padiya, Prakash G Koringa, Dharamshibhai N Rank, Chaitanya G Joshi.

Abstract

Whole genome sequencing of buffalo is yet to be completed, and in the near future it may not be possible to identify an exome (coding region of genome) through bioinformatics for designing probes to capture it. In the present study, we employed in solution hybridization to sequence tissue specific temporal exomes (TST exome) in buffalo. We utilized cDNA prepared from buffalo muscle tissue as a probe to capture TST exomes from the buffalo genome. This resulted in a prominent reduction of repeat sequences (up to 40%) and an enrichment of coding sequences (up to 60%). Enriched targets were sequenced on a 454 pyro-sequencing platform, generating 101,244 reads containing 24,127,779 high quality bases. The data revealed 40,100 variations, of which 403 were indels and 39,218 SNPs containing 195 nonsynonymous candidate SNPs in protein-coding regions. The study has indicated that 80% of the total genes identified from capture data were expressed in muscle tissue. The present study is the first of its kind to sequence TST exomes captured by use of cDNA molecules for SNPs found in the coding region without any prior sequence information of targeted molecules.

Entities: Chemical Disease Gene Species

Mesh：

Substances：
DNA, Complementary

Year: 2012 PMID： 22768984 PMCID： PMC5054198 DOI： 10.1016/j.gpb.2012.05.005

Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN： 1672-0229 Impact factor: 7.691

Introduction

Efficient functioning of a whole multicellular organism involves complex but specific interactions between different tissues of the body in a timely manner. A number of genes are turned off and on at particular points of time in response to various physiological processes. To fully understand how interactions between genes expressed in different tissues result in disease, increased milk or meat production or any other physiological response, it is important to study temporal gene expression in the tissues involved at different times/stages of the physiological process in question. Sequencing of the coding part of the genome (exome) helps in identifying variations which give rise to phenotypic variation. Collection of all the genes expressed in a single tissue at a specific point of time can be denoted as tissue specific temporal exome (TST exome). Massively parallel sequencing technologies available currently are rapidly enhancing existing knowledge about the genomes of various organisms. Nonetheless, progress in genome sequencing efforts for buffalo is very limited. Buffalo is an important farm animal for dairying and agriculture particularly in Asia and contributes nearly 55% of total bovine milk production in India. The buffalo genome sequencing is expected to be completed in the near future (http://www.ncbi.nlm.nih.gov/projects/genome/guide/buffalo/). Up till now, only a total of 66,935 nucleotide sequences for the water buffalo (Bubalus bubalis) have been deposited in the GenBank database [1], out of which 64,212 genome shotgun sequences (ACZF01000001-ACZF01064212) were generated by our group. Recently, researchers from the National Bureau of Animal Genetic Resources, Karnal, India have published the first whole genome sequence assembly of water buffalo with a read depth of 17–19× and ∼91–95% coverage in comparison to the cattle assembly Btau 4.0 [2]. Buffalo whole genome sequencing and re-sequencing is almost cost prohibitive for a single institution. Therefore, targeted sequencing of selected genomic regions is the best alternative for characterization of SNPs [3], genome-wide association studies [4] and re-sequencing specific exons believed to be involved in specific disease processes and many others [5]. Of these, the re-sequencing of specific disease-associated exons seems very promising as it consists of less than 1% of total genome size [6] and is also functionally important. However, it is necessary to have knowledge of completely sequenced exomes in a species for exome re-sequencing. In the absence of complete genomic information regarding water buffalo, this approach may not be applicable. However, sequencing of the coding region of genome alone represents a possible and economically viable project to create a database containing all expressed genes in an organism that can be utilized later on for genome-wide association or mutation scanning procedures. The coding region of a genome can be isolated and enriched by genome capture after hybridization with suitable baits. Various approaches have been reported for exome capture including microarray, in which an array containing all coding sequences is hybridized with genomic DNA to capture whole exome [7], [8]. Unfortunately, such an array is not available in buffalo at present. Another approach is to selectively amplify targeted regions with PCR for sequencing. Although a very sensitive procedure, PCR is difficult to use on a larger scale and it would be very cumbersome to amplify whole portions of exomes by PCR with uniformity. Furthermore, such multiplex amplification of exomes suffers from uneven representation of sequencing targets and poor reproducibility [5]. For these reasons this strategy cannot be applied in the case of buffalo. In this study, we have used the cDNA prepared from buffalo muscle tissue as a bait to capture muscle TST exomes in buffalo and then sequence them with the 454 sequencer based on pyro-sequencing technology. This represents a simple method involving use of customized baits and, if optimized, can result in a robust method which can be used routinely in genome sequencing laboratories.

Results

Sequences containing low quality bases were trimmed so that high quality data can be used for analysis and subsequent SNP finding. As a result, we obtained 101,244 reads containing 24,127,779 high quality bases, of which 59,211 were assembled into a total of 750 contigs. Nearly 41% of the reads mapped against the Bos taurus mRNA reference database. Table 1 presents a summary of assembly and mapping results of the sequencing data. 634 unique genes were revealed in 720 out of 750 mapped contigs. Similarly, 20 unique genes were revealed in 38 out of 230 contigs formed after assembly of unmapped and too short reads; whereas, 541 unique genes were revealed in 819 out of 29,273 singletons.

Table 1

Summary of sequencing data obtained from Bubalus bubalis muscle TST exome capture

gsMapper assembly (with Bos taurus reference mRNA database)
No. of reads	101,244
No. of bases	24,127,779
No. of mapped reads	59,211	41.40%
No. of mapped bases	6389,585	26.48%
No. of unmapped reads	41,530	41.02%
No. of too short reads	503	0.50%
No. of contigs	750
No. of bases	151,663

De novo assembly for unmapped and too short reads
No. of contigs	230
No. of bases	102,080
No. of singleton reads	29,273
No. of singleton bases	6503,535

Note: Reads available after sequencing were mapped against the Bos taurus reference mRNA database. Unmapped reads after merging with too short reads are subjected to de novo assembly to form contigs. Assembled contigs were utilized for analysis.

Functional annotation of identified genes

Genes identified from blastn were subjected to KEGG analysis for identification of pathways. Out of 634 unique genes identified from mapped contigs, pathways could be attributed to 411 genes. Likewise, for unmapped and too short reads and singletons, pathways were identified for 15 and 395 genes, respectively. KEGG pathway analysis is summarized in Table 2. The majority of pathways identified in our experiment were involved in metabolism followed by other categories. This pathway analysis suggests that the dataset analyzed was collected from a metabolically active state. Genes identified in annotation were subjected to gene ontology (GO) to summarize the information contained in the genes and top 10 GO terms were plotted in Table 3. In our dataset, the most enriched category was molecular function (24%) followed by metabolism (17%), intracellular (16%), binding (13%) and other categories.

Table 2

Pathway classification of genes based on KEGG analysis

Category	Mapped contigs	Unmapped and too short contigs	Singletons	Total	%
Metabolism	114	7	109	230	28.01
Organismal systems	90	3	89	182	22.16
Human diseases	68	0	75	143	17.41
Environmental information processing	53	4	52	109	13.27
Cellular processes	49	1	39	89	10.84
Genetic information processing	37	0	31	68	8.28
Total	411	15	395	821

Note: Percentage of enriched pathways in buffalo muscle tissue was calculated after analysis with KEGG pathway. Importantly, genes related with metabolic pathways were enriched in muscle tissue.

Table 3

Top 10 GO terms enriched in muscle TST exome

Go category	Proportion in muscle TST exome (%)
Molecular function	24
Metabolism	17
Intracellular	16
Binding	13
Catalytic activity	7
Cytoplasm	5
Biosynthesis	5
Development	5
Protein binding	4
Nucleic acid metabolism	4

Genes involved in muscle-specific gene expression

Searching against the TiGER database demonstrated that 269 out of 634 unique genes had no records in the database. Of the remaining 365 genes, 290 (80%) genes were expressed in muscle tissue to different extents, while 75 genes are not expressed in human muscle tissue at all. The top 10 genes with preferential expression in muscle tissue are TNNI1, NRAP, NOS1, SMYD1, FBXO40, MYOM1, IKZF5, FNDC5, UBA5 and Fam13A1. Amongst these, important ones are (1) TNNI1, which encodes for Troponin-central regulatory protein of striated muscle contraction [9]; (2) NRAP, which encodes for nebulin-related anchoring protein-involved in anchoring the terminal actin filaments in the myofibril to the membrane [10]; (3) NOS1, which encodes for nitric oxide synthase which is highly expressed in skeletal muscle [11]; (4) FBXO40, which encodes for F-box protein 40 with a probable function in myogenesis [12]; (5) MYOM1, which encodes for Myomesin 1 which binds myosin, titin, and light meromyosin and interconnects the major structure of sarcomeres [13] and (6) IKZF5, which encodes for IKAROS family zinc finger 3 transcription factor that plays an important role in the regulation of lymphocyte differentiation [14].

SNP detection

A total of 24.1 Mb of sequence was generated by TST exome capture and used for mapping with Burrows-Wheeler Aligner (BWA) algorithm. BWA is successfully used for mapping and alignment of large and complex genomes sequenced by 454 sequencing [15], repeat finding [16] as well as SNP finding [17]. Using BWA, 97,829 out of 101,244 obtained reads (96%) were successfully mapped to the reference sequence (Btau 4.0). Among them, 72,090 (71%) were uniquely mapped reads, 25,795 (25%) were mapped to multiple sites, while 3415 (4%) remained unmapped. After variant calling with SAMtools, a total of 40,100 variants were detected containing 39,697 SNPs and 403 indels. Figure 1 shows number of uniquely-mapped reads and SNPs found in our dataset after mapping against each bovine chromosome.snpEff version 1.7 was used to predict the effect of candidate SNPs using the ENSEMBL version Btau 4.0.60 database as a reference. Out of 39,697 identified SNPs, 19,642 (48.98%) were located in intergenic regions and 6184 (15.42%) were located upstream or downstream of gene bodies. Out of remaining 14,274 SNPs, 13,953 (34%) were located in intronic regions, while 218 (0.54%) encoded synonymous changes and 195 (0.48%) encoded nonsynonymous changes. Only 479 (1.2%) SNPs were reported and established in bovine according to Btau 4.0.60 and none of them were nonsynonymous.

Figure 1

Number of uniquely-mapped reads and SNPs found in buffalo against each bovine chromosome

Results obtained from repeat masker revealed that there was a prominent reduction of repeats in captured genomic sequences. The comparison between captured genome sequence and whole genome sequences is summarized in Table 4.

Table 4

Percentage of repeat sequences present in Bubalus bubalis muscle TST exome and in whole genome

Repeat	In captured genomic sequence^a (%)	In whole genome sequence^b (%)
Simple repeat	0.09	4.00
Satellite	15.38	0.63
Low complexity	0.30	2.78
LINE	13.89	26.64
SINE	2.44	35.37
LTR	4.07	7.25
DNA elements	1.78	3.91
RNA	0.08	0.1
Unknown	0.02	0.06
Total	38.05	80.74

Note: Analysis of repeat masking of captured data with repeat masker software. Capture resulted in almost 40% reduction in repeat sequences. Percentage of repeats in captured genomic sequences (a) and in whole genome sequence assembly of sequencing data (b) was shown. LINE, long interspersed nuclear element; SINE, short interspersed nuclear element; LTR, long terminal repeat.

Discussion

In our study, TST exomes were captured using mRNA from muscle tissue. It is hence, imperative that whatever genes have been identified should be expressed in muscle tissue. To validate this the genes were searched against the “TiGER” database which contains information on tissue-specific combinational gene regulation, based on transcription factor binding sites, enabling us to perform a large-scale analysis of tissue-specific gene regulation in human tissues. Though, the data generated in this experiment was from buffalo muscle tissue exome, this analysis would give a fair idea about tissue specific gene expression and efficiency of TST exome capture. The results revealed that genes captured in this experiment were expressed in muscle tissue and thus show reliability of this approach for TST exome capture. Not only that, several genes represented by only one or two reads with very low expression in muscle tissue were successfully captured using mRNA molecules as bait. This shows the usefulness of this approach in obtaining a complete gene expression profile for the tissue in question. Comparison against the Btau 4.0.60 database revealed that only 479 (1.2%) SNPs were reported in the reference database while 39,218 (98.8%) candidate SNPs were novel. Previous studies involving novel candidate SNP finding in B. taurus revealed around 87% of novel candidate SNPs [17], [18]. As we have used the B. taurus reference for genomic sequence from buffalo, a comparatively larger number of novel candidate SNPs were reported in our analysis. All 195 nonsynonymous candidate SNPs were homozygous, located in 37 different genes with the highest number found on chromosome (Chr) 11 followed by Chr 19, 21, 29, 30 and 6. Our analysis showed 34 candidate SNPs (17.5%) in the CRB2 gene alone. No reports are available for candidate SNPs in the buffalo CRB2 gene, however, human mutation analysis of the CRB2 gene revealed 11 sequence variants leading to an amino acid substitution causing autosomal recessive disease [19]. CRB2 was followed by PPFIA1 with 10 candidate SNPs. Though PPFIA1 has not been studied for candidate SNP finding, the gene is associated with squamous cell carcinoma in human [20], [21]. It is evident from Table 4 that there was significant reduction in the percentage of repeats in captured sequences versus whole genome sequences [22] for all classes of repeats, except satellite repeats. It is surprising that an increase in satellite repeat sequences occurs in this situation. Satellite repeats are present with more than one copy and they flank the coding sequences in most of the cases. In exons, tri-nucleotide repeats are invariably the most abundant in all taxa, with hexa-nucleotide repeats being the second most common. Inter-genic regions and introns contain more hexa-nucleotide repeats than exons do, but the tri-nucleotide repeats are less common [23]. Hence, it is logical to assume an increase in satellite sequences in TST exome capture data as satellite repeats contain tri-nucleotide repeats and hexa-nucleotide repeats, of which the former repeats are more common in exons. In our experiment, we have not utilized any carrier to block the capturing of repeat sequencing. Nonetheless the decrease in repeat content in enriched sequencing data is significant (47.12%). In future studies, use of some blocking reagent/sequences for repeats may result in further reduction in repeat sequences and in turn capture of highly enriched sequences for the coding region. Several strategies have been tried to reduce genome complexity which include EST sequencing, methyl filtration, and high-Cot DNA selection [24]. These approaches may decrease genome complexity to some extent but it is not possible to have targeted capture as by solution hybridization. In our experiment we noticed that approximately 2/3rd of the captured reads are on the targets, i.e., regions intended to be coding, but a still larger proportion, approximately 1/3rd of captured sequences were off the target. Blocking with repetitive DNA can be employed to increase on-target capture and to reduce off target capture. Other approaches include reducing fragment size of the library thereby reducing chances of secondary capture of off target sequences [25]. However, it is invariably necessary to develop whole genome sequence of buffalo so that high density capture arrays can be prepared for efficient and specific capture of buffalo coding regions. Currently kits for exome capture/target enrichment are commercially available. However, these arrays are only available for humans and mice but not applicable in farm animals. This study shows use of solution hybridization to capture whole TST exome without using any array. Our method can be applied to less explored species or farm animals for which the complete genome sequence is unavailable for examining SNPs in coding regions and tissue specific gene expression. In addition, it is possible to scale up to work with large numbers of samples simultaneously using this method although it needs to be improved for blocking repeat sequences. The present study reports the first instance of utilizing the simple strategy of solution hybridization for capture of exons from buffalo muscle tissue with cDNA prepared from muscle tissue as bait molecules. We have successfully enriched coding sequences present in the buffalo genome and the generated data was used for SNP and indel detection. This approach can be used for TST exome capture in farm animals, which would help to evolve a strategy for selection of animals with better production potential. Moreover, this approach provides flexibility of use with any species irrespective of genome size and prior sequence information and can be applied to the capture and sequencing of any type of variation, either SNPs or indels. With some modification and optimization of hybridizing conditions the method could be potentially applied to any organism. For the species without complete whole genome assembly, we have to develop alternate strategies to study variations in coding sequences and this study will serve as an important step in that direction.

Materials and methods

Sample collection

The thigh muscle tissue samples for the study were collected from a slaughtered male Surti breed of buffalo (B. bubalis) immediately after slaughter. The muscle tissues were collected following aseptic procedures in RNAlater® and transported to the laboratory on ice and stored at −196 °C in liquid nitrogen. Blood was collected from a male Surti buffalo in Vacuette (Greiner Bio-One, NC) containing EDTA and transported to laboratory on ice.

mRNA and cDNA preparation

Total RNA was isolated from both tissues using a standard TRIZOL protocol (Invitrogen™ Life Technologies, Carlsbad, CA) following the manufacturer’s instructions. Contaminating genomic DNA was removed from total RNA by DNAse treatment using DNAseI, RNAse-free (Fermentas, CA) following the manufacturer’s instructions. Total RNA was assayed and quality and quantity was verified on Nanodrop ND1000 (Thermo Fischer Scientific, Wilmington, USA) and Bioanalyzer 2100 (Agilent Technologies, Mississauga, ON). A sample containing total RNA from muscle was used for cDNA preparation using biotinylated OligodT.

Genomic library preparation

Genomic DNA was extracted from a blood sample collected from a mature buffalo bull of the Jaffrabadi breed using and standard phenol/chloroform extraction methods [26]. To prepare a single stranded (sst) library, genomic DNA was fragmented and adapters from GS-FLX Titanium were ligated following the manufacturer’s protocol.

TST exome capture

The sst genomic DNA library prepared was hybridized (1:10) with biotinylated cDNA from muscle tissue. Here, muscle cDNA was used in excess so that exons encoding rarely expressed transcripts can also be captured. Hybridization was performed in 2× hybridization buffer containing 10 × SSPE, 10 × Denhardt’s, 10 mM EDTA and 0.2% SDS [27]. Hybridization was carried out for 24 h at 66 °C and captured genomic regions were recovered using NaOH. The recovered genomic DNA library was purified with MinElute columns (Quiagen, DE) and subjected to emulsion PCR.

Sequencing

DNA-positive beads from emulsion PCR were recovered, enriched and subjected to 454 sequencing standard protocols developed by Roche Diagnostics (Roche Diagnostics, Switzerland). 1/4th region on PicoTitrePlate was utilized for sequencing of capture sequences from buffalo genomic DNA on GS FLX.

Sequence analysis

Among reads generated after sequencing, only reads that contained the correct base key at the start (a portion of the 454 primer used to differentiate reads from internal quality control sequences), including at least 84 flows and less than 5% of flow cycles resulting in an ambiguous base call (N), were retained for analysis. The 3′ ends of the reads were trimmed such that <3% of the remaining flows have ambiguous signal intensity for incorporation. Each base sequenced was also annotated with a Phred equivalent quality score from Roche diagnostics and the average base score for all the reads was found to be 31 indicating the accuracy of base calling to be 99.99%. Available reads were assembled using gsMapper provided by Roche to map obtained reads against the B. taurus reference mRNA database. For efficient utilization of obtained sequences, remaining sequence data of too short and unmapped reads after reference database mapping were again subjected to de novo assembly using inbuilt software tool gsAssembler provided by Roche using default parameters, i.e., overlap length 40 bp, seed step 12 bp and minimum sequence identity 90%.

Sequence annotation

For annotation of available sequence data, all the contigs and unassembled sequences obtained after mapping and de novo assembly were subjected to NCBI Blastn against the B. taurus mRNA reference database using default parameters. Genes found with Blastn were used in the KEGG Mapper for identification of respective pathways [28]. All genes identified were searched against Tissue-specific Gene Expression and Regulation (TiGER) for validation of muscle tissue specific gene expression [29]. All the genes found with Blast search are used to download respective GO terms from http://www.ebi.ac.uk/GOA/cow_release.html of the cow genome. Available GO terms were categorized using CateGOrizer [30] GO_Slim classification system.

SNP finding

For alignment and annotation of the sequence reads, we used the bovine genome chromosome Btau 4.0 (http://www.hgdownload.cse.ucsc.edu/goldenPath/bosTau4/chromosomes/) as a reference source. For long read alignment, we used BWA-SW ver 0.5.9 [31]. The BWA default values for mapping were as follow: Score of a match = 1 (matchScore), Mismatch penalty = 3 (mmPen), Gap open penalty = 5 (gapOpenPen), Gap extension penalty = 2 (gapExtPen), Number of threads in the multi-threading mode = 1 (thresh), Band width in the banded alignment = 33 (zBest), Minimum score threshold divided by a = 37 (nHspRev) and Coefficient for threshold adjustment according to query length = 5.5. After read mapping we discarded repeat sequences and unmapped reads. Only reads mapped to a unique position on the reference genome sequence were used for SNP calling. To call SNPs, we used SAMtools ver 1.1.12a [32] and applied additional filters as follows: reads mapped with minimum mapping quality of 20, minimum read depth = 3. After SNP calling, we annotated the candidate SNPs using SNPeff (http://www.snpeff.sourceforge.net/) software.

Repeat masker analysis

Available sequencing data were subjected to Repeat Masker for repeat prediction, with the DNA source parameter set to cow, comparison species set to human with weak lineage specific masking, advance alignment option to be in repeat orientation and by masking repeats with lower case. The remaining parameters were set to default. The results were obtained in three different text files containing results of repeat data, masked sequences and alignment.

Authors’ contributions

CG conceived the whole experiment, monitored the progress and helped draft manuscript. DN helped in manuscript preparation and made advisory comments. SJ prepared the manuscript and performed the experiment with VB. PG assisted in sequencing of captured library with SJ and VB. KB performed SNP finding. All authors read and approved the final manuscript.

Competing interests

The authors have no competing interests to declare.

29 in total

1. KEGG: kyoto encyclopedia of genes and genomes.

Authors: M Kanehisa; S Goto
Journal: Nucleic Acids Res Date: 2000-01-01 Impact factor: 16.971

2. Characterization of MR-1, a novel myofibrillogenesis regulator in human muscle.

Authors: Tian-Bo Li; Xiu-Hua Liu; Shuang Feng; Yang Hu; Wei-Xi Yang; Yue Han; Yi-Guang Wang; Li-Min Gong
Journal: Acta Biochim Biophys Sin (Shanghai) Date: 2004-06 Impact factor: 3.848

3. A rapid procedure for extracting genomic DNA from leukocytes.

Authors: S W John; G Weitzner; R Rozen; C R Scriver
Journal: Nucleic Acids Res Date: 1991-01-25 Impact factor: 16.971

4. Amplification and overexpression of PPFIA1, a putative 11q13 invasion suppressor gene, in head and neck squamous cell carcinoma.

Authors: Kaia Davis Tan; Yansong Zhu; Hiang Khoon Tan; Vikneswari Rajasegaran; Amit Aggarwal; Jeanie Wu; Hui Yong Wu; Jacqueline Hwang; Dennis T H Lim; Khee Chee Soo; Patrick Tan
Journal: Genes Chromosomes Cancer Date: 2008-04 Impact factor: 5.006

5. Characterization of the Crumbs homolog 2 (CRB2) gene and analysis of its role in retinitis pigmentosa and Leber congenital amaurosis.

Authors: José A J M van den Hurk; Penny Rashbass; Ronald Roepman; Jason Davis; Krysta E J Voesenek; Maarten L Arends; Marijke N Zonneveld; Marga H G van Roekel; Karen Cameron; Klaus Rohrschneider; John R Heckenlively; Robert K Koenekoop; Carel B Hoyng; Frans P M Cremers; Anneke I den Hollander
Journal: Mol Vis Date: 2005-04-15 Impact factor: 2.367

6. PPFIA1 and CCND1 are frequently coamplified in breast cancer.

Authors: Ana-Maria Dancau; Laura Wuth; Marcel Waschow; Frederik Holst; Antje Krohn; Matthias Choschzick; Luigi Terracciano; Sotirios Politis; Stefan Kurtz; Annette Lebeau; Kay Friedrichs; Katharina Wencke; Outi Monni; Ronald Simon
Journal: Genes Chromosomes Cancer Date: 2010-01 Impact factor: 5.006

7. Genomic organization, alternative splicing, and expression of human and mouse N-RAP, a nebulin-related LIM protein of striated muscle.

Authors: Saidi A Mohiddin; Shajia Lu; John-Paul Cardoso; Stefanie Carroll; Sanjaya Jha; Robert Horowits; Lameh Fananapazir
Journal: Cell Motil Cytoskeleton Date: 2003-07

8. Whole-genome resequencing shows numerous genes with nonsynonymous SNPs in the Japanese native cattle Kuchinoshima-Ushi.

Authors: Ryouka Kawahara-Miki; Kaoru Tsuda; Yuh Shiwa; Yuko Arai-Kichise; Takashi Matsumoto; Yu Kanesaki; Sen-ichi Oda; Shizufumi Ebihara; Shunsuke Yajima; Hirofumi Yoshikawa; Tomohiro Kono
Journal: BMC Genomics Date: 2011-02-10 Impact factor: 3.969

9. SNP-specific extraction of haplotype-resolved targeted genomic regions.

Authors: Johannes Dapprich; Deborah Ferriola; Eleni E Magira; Mark Kunkel; Dimitri Monos
Journal: Nucleic Acids Res Date: 2008-07-08 Impact factor: 16.971

10. TiGER: a database for tissue-specific gene expression and regulation.

Authors: Xiong Liu; Xueping Yu; Donald J Zack; Heng Zhu; Jiang Qian
Journal: BMC Bioinformatics Date: 2008-06-09 Impact factor: 3.169

2 in total

1. Assessing signatures of selection through variation in linkage disequilibrium between taurine and indicine cattle.

Authors: Ana M Pérez O'Brien; Yuri T Utsunomiya; Gábor Mészáros; Derek M Bickhart; George E Liu; Curtis P Van Tassell; Tad S Sonstegard; Marcos V B Da Silva; José Fernando Garcia; Johann Sölkner
Journal: Genet Sel Evol Date: 2014-03-04 Impact factor: 4.297

2. Target capture and massive sequencing of genes transcribed in Mytilus galloprovincialis.

Authors: Umberto Rosani; Stefania Domeneghetti; Alberto Pallavicini; Paola Venier
Journal: Biomed Res Int Date: 2014-06-30 Impact factor: 3.411

2 in total