Literature DB >> 28204480

A reference gene set construction using RNA-seq of multiple tissues of Chinese giant salamander, Andrias davidianus.

Xiaofang Geng1,2, Wanshun Li3, Haitao Shang4, Qiang Gou5, Fuchun Zhang2, Xiayan Zang1, Benhua Zeng4, Jiang Li3, Ying Wang5, Ji Ma2, Jianlin Guo1, Jianbo Jian3, Bing Chen5, Zhigang Qiao1, Minghui Zhou5, Hong Wei4, Xiaodong Fang3, Cunshuan Xu1.   

Abstract

BACKGROUND: Chinese giant salamander (CGS) is the largest extant amphibian species in the world. Owing to its evolutionary position and four peculiar phenomenon of life (longevity, starvation tolerance, regenerative ability, and hatch without sunshine), it is an invaluable model species for research. However, lack of genomic resources leads to fewer study progresses in these fields, due to its huge genome of ∼50 GB making it extremely difficult to be assembled.
RESULTS: We reported the sequenced transcriptome of more than 20 tissues from adult CGS using Illumina Hiseq 2000 technology, and a total of 93 366 no-redundancy transcripts with a mean length of 1326 bp were obtained. We developed for the first time an efficient pipeline to construct a high-quality reference gene set of CGS and obtained 26 135 coding genes. BUSCO and homologous assessment showed that our assembly captured 70.6% of vertebrate universal single-copy orthologs, and this coding gene set had a higher proportion of completeness CDS with comparable quality of the protein sets of Tibetan frog.
CONCLUSIONS: These highest quality data will provide a valuable reference gene set to the subsequent research of CGS. In addition, our strategy of de novo transcriptome assembly and protein identification is applicable to similar studies.
© The Author 2017. Published by Oxford University Press.

Entities:  

Keywords:  Andrias davidianus; Assembly; Chinese giant salamander; De novo transcriptome

Mesh:

Year:  2017        PMID: 28204480      PMCID: PMC5467019          DOI: 10.1093/gigascience/gix006

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


Data Description

Background

The Chinese giant salamander (CGS; Andrias davidianus), belonging to order Caudata, family Cryptobranchidae, is the largest extant amphibian species in the world. It is endemic to mainland China and widely distributed in central, south-western, and southern China. It is crowned as a living fossil, because it has existed for more than 350 million years [1]. It is an invaluable model species for research in the fields of evolution and phylogeny, owing to its important evolutionary position representing the transition of animals from aquatic to terrestrial life [2, 3]. However, in the past 50 years, the natural populations of CGS have sharply declined due to habitat destruction, climate change, and overhunting. This endangered amphibian has now been listed in annex I of the Convention on International Trade in Endangered Species of Wild Fauna and Flora and in class II of the national list of protected animals in China [4]. It was also listed as one of the top 10 “focal species” in 2008 by the Evolutionarily Distinct and Globally Endangered project. Natural population decline and high values for scientific conservation and medicinal use lead to its commercial aquaculture in many locations throughout China. Despite their unique life-history characteristics, this species remains poorly characterized at the molecular level. No genomic resources are available for this species, because it has larger genomes with about 50 GB and is extremely difficult to conduct whole-genome de novo assembly even with present sequencing technology. Fortunately, RNA sequencing technologies provide cost-effective alternative approaches for the construction of the transcribed genes. Transcriptome analysis using Illumina sequencing technology has been reported in the skin and spleen of CGS [5-7], but these studies mainly discover genes associated with the immune and inflammatory response, and only two different tissues cannot obtain enough genes to research the specific biology of CGS. Here, we reported the sequenced transcriptome of more than 20 tissues from adult CGS using Illumina Hiseq 2000 technology. Our results showed that a reference gene set with high quality was constructed in this study, and it will serve as a valuable resource for future biological study of CGS.

Samples collection

Adult female CGS with a weight of about 2 kg were obtained from an artificial breeding base, Chongqing Kui Xu Biotechnology Incorporated Company. The giant salamanders were reared in aerated, tap water-supplied tanks at 20°C and fed with diced bighead carp for 2 weeks prior to experiment. Animals were heavily anesthetized by anaesthetic MS-222 and sacrificed by dissection before sample collection. Multiple tissues (abdominal skin, dorsal skin, lateral skin, lung, heart, kidney, liver, pancreas, small intestine, spleen, stomach, brain, spinal cord, cartilage, eye, fingertip, long bone, maxillary, skull, muscle, ovary, fat, tail fat, blood) were collected.

Sequencing

Total RNA (∼10 μg) was extracted from each sample using the TRIzol Reagent (Invitrogen). The cDNA library was contracted using TruSeq®RNA sample prep kit (Illumina) according to the manufacturer's protocol. After quality control of the cDNA libraries, pair-end sequencing was carried out via Illumina HiSeq™ 2000 at the Beijing Genomics Institute in Shenzhen. To ensure the accuracy of de novo assembly, raw reads were filtered by removal of adaptor and low-quality sequences. After preprocessing the reads, up to 156 GB of clean data were obtained in total, at least 6.4 GB of data in each sample with Q20 bases <96% (Table 1).
Table 1:

Summary statistics of sequencing data and Q20 percentage

SamplesClean readsClean dataQ20% of fq1Q20% of fq2
Abdominal skin71 388  2386 424 941 42097.8697.15
Blood73 523 0506 617 074 50097.8396.23
Brain72 150  5626 493 550 58098.1097.24
Cartilage72 085 3006 487 677 00098.0097.33
Dorsal skin71 852 9966 466 769 64097.8296.91
Eye72 360 4226 512 437 98097.9697.44
Fat72 747 6546 547 288 86097.0195.82
Fingertip71 793 2426 461 391 78098.0497.38
Heart71 465 3426 431 880 78098.0096.71
Kidney73 287 4526 595 870 68098.0096.31
Lateral skin71 640 0466 447 604 14097.9597.30
Liver71 772 3526 459 511 68098.1896.90
Long bone72 620 8986 535 880 82097.0696.43
Lung73 629 8646 626 687 76097.8696.46
Maxillary71 368 5826 423 172 38097.9496.73
Muscle73 184 4766 586 602 84097.7396.12
Ovary73 636 4846 627 283 56097.9596.69
Pancreas71 963 5746 476 721 66097.2196.51
Skull73 445 0866 610 057 74097.6096.52
Small intestine71 451 8886 430 669 92097.4096.63
Spinal cord72 208 3986 498 755 82098.1497.30
Spleen71 432 3326 428 909 88097.3796.59
Stomach73 740,5326 636 647 88097.9696.56
Tail fat72 435 8946 519 230 46098.0397.61
Summary statistics of sequencing data and Q20 percentage

Huge RNA-seq data assembly and evaluation

To obtain an integrated transcript set, we firstly put together all clean data and performed a combined assembly strategy by a publicly available program Trinity (V2.0.6; http://trinityrnaseq.sourceforge.net/) with the following parameters: min_kmer_cov = 3, min_glue = 3, group_pairs_distance = 250, path_reinforcement_distance = 85 [8]. It yields a huge number of transcripts, up to 425 357 transcripts output, and it includes many assembly errors and background sequences. To reduce these background and assembly errors, we developed a strict pipeline to filter these sequences (Fig. 1A). (i) Removal of assembly errors. Only each base pair in any sequence covered by at least one read was saved, except 50 bp near to the both end sequences. Any gap was trimmed, no matter where it was. The sequence was split into pieces at gap sites if there were gaps in the middle of the sequence. (ii) Removal of the background sequences. The clean reads were mapped to all the transcripts and fragments per kilobase per million mapped fragments (FPKM) value was calculated. When the expression profiling of sequence reached this standard of ≥1 FPKM in at least two samples or ≥5 FPKM in at least one sample, it was retained. (iii) Removal of isoforms produced by alternative splicing. The high homologous region (identity at least 95%) between two sequences reached this criterion: >40% or 90 bp in length of one sequence, and the shorter one was removed. (iv) Removal of short sequences. The sequence <250 bp in length was discarded. Finally, a total of 93 366 transcripts with a mean length of 1326 bp were obtained (Table 2). The clean reads were mapped to all the transcripts, and the total mapping rate and unique mapping rate ranged from 70.15% to 86.07% and 69.24% to 81.56%, respectively, except the sample ‘long bone’ (43.12% and 42.21%; Fig. 1B).
Figure 1:

Huge RNA-seq data assembly. (A) The pipeline for de novo assembly, quality filter, and gene identification and classification. (B) The statistics of mapping rate before and after transcript filter. Among the total mapped reads, the unique mapped reads were >98% and the multiple mapped reads were <2% after transcript filter, except sample ‘stomach.’ Moreover, the total mapping rate was a slight decrease in comparison to the result before filter.

Table 2:

The statistics of final assembly and coding gene prediction

Total data (Mb)Total length (bp)Total number (≥250 bp)Total number (≥1 kp)Average lengthCoding geneNoncoding genes
156 347123 835 13593 36634 840132626 13567 231
Huge RNA-seq data assembly. (A) The pipeline for de novo assembly, quality filter, and gene identification and classification. (B) The statistics of mapping rate before and after transcript filter. Among the total mapped reads, the unique mapped reads were >98% and the multiple mapped reads were <2% after transcript filter, except sample ‘stomach.’ Moreover, the total mapping rate was a slight decrease in comparison to the result before filter. The statistics of final assembly and coding gene prediction To evaluate our filter pipeline, we compared the total mapping rate and unique mapping rate of transcripts before and after filter, respectively. After filter, the total mapping rate was slightly decreased (<2.6% average) in comparison to the total mapping rate before filter (Fig. 1B), suggesting that we retained a higher completeness rate. On the other hand, among the total mapped reads, the unique mapped reads were >98% and the multiple mapped reads were <2% after transcript filter except sample ‘stomach,’ while the multiple mapped reads were up to 10.39% (average ratio in 24 samples) before transcript filter. These data hinted that most of the redundant sequences were removed and the set of transcripts after filter had very low redundancy. Above all, our filter pipeline was effective, not only removing the assembly errors and redundancy, but also keeping most of the unique expressed sequences.

Functional annotation

A total of 41 874 sequences can be annotated by searching against four function databases, that is, nonredundant protein database (Nr) in NCBI, Swiss-Prot, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database, and Cluster of Orthologous Group (COG) using BLASTX (E-value ≤ 10−5) (Table 3). Gene ontology (GO) classification was analyzed by the Blast2GO software (v2.5.0) based on Nr annotation.
Table 3:

Statistics for functional annotation

FunctionalNumber of sequences
databaseannotated
NR41 043
Swiss-Prot30 049
KEGG32166
COG13 229
GO16 369
Total41 874
Statistics for functional annotation

Identification of coding gene set

To identify high-quality coding genes, we developed the following pipeline to perform (Fig. 2A). Firstly, we predicted the CDS (coding sequences) of at least 60 bp using the following three methods. (i) We predicted the CDS using transdecoder (https://transdecoder.github.io/ version 2.0.1). (ii) All transcripts were searched in protein databases using BLASTX (E-value < 10−5) in the following order: PRD (western clawed frog protein set, 947 proteins of CGS and 554 proteins of newt from NCBI), Nr, SwissProt, and KEGG. Transcripts with sequences having matches in one database were not searched further. We selected CDS from sequences based on the best hit. (iii) All transcripts were used to predict the CDS by ESTScan (http://www.ch.embnet.org/software/ESTScan2.html; v3.0.2). Before prediction, the ESTScan was trained using the CDS data produced by BLASTX alignment method. The transcripts with CDS regions were identified by at least two methods mentioned above, and the longest CDS was chosen. Then, we filtered them with these criteria: the shortest CDS was at least 100 bp and the ratio of CDS/mRNA in length was at least >0.1. These data were defined as primary protein sets, and it was checked in the next step. Secondly, the candidate transcripts were predicted by Coding Potential Calculator software (http://cpc.cbi.pku.edu.cn/). When the transcript was reported as a coding gene and the score was ≥1, it was defined as a true coding gene. Finally, 26 135 sequences (25 965 genes after removing redundancy) passed our criterion, and they were defined as coding genes, and the remaining 67 231 sequences were defined as noncoding genes (Table 2).
Figure 2:

Identification and evaluation of giant salamander gene set. (A) The pipeline of prediction of coding genes. PRD represents Western clawed frog protein set, 947 proteins of CGS and 554 proteins of Newt from NCBI. (B) The results of BUSCO estimation. Asterisk (*) represents the final protein sets; pound (#) represents the primary protein sets. (C) Comparison of the length of homologous region to X. tropicalis and N. parkeri based on single copy othorlogs. The X-axis is the ratio of similarity length, and the Y-axis is the percentage of gene number at this scale. (D) Comparison of the length of CDS to X. tropicalis and N. parkeri based on single copy othorlogs. The X-axis is log base 2 of CDS length ratio, and the Y-axis is the percentage of gene number at this scale.

Identification and evaluation of giant salamander gene set. (A) The pipeline of prediction of coding genes. PRD represents Western clawed frog protein set, 947 proteins of CGS and 554 proteins of Newt from NCBI. (B) The results of BUSCO estimation. Asterisk (*) represents the final protein sets; pound (#) represents the primary protein sets. (C) Comparison of the length of homologous region to X. tropicalis and N. parkeri based on single copy othorlogs. The X-axis is the ratio of similarity length, and the Y-axis is the percentage of gene number at this scale. (D) Comparison of the length of CDS to X. tropicalis and N. parkeri based on single copy othorlogs. The X-axis is log base 2 of CDS length ratio, and the Y-axis is the percentage of gene number at this scale.

Evaluation of coding gene set

To evaluate the completeness of this coding gene set, we employed Benchmarking Universal Single-Copy Orthologs (BUSCO; http://busco.ezlab.org/) to evaluate the gene set of CGS using vertebrata data [9] and compared with two frog species, which have whole genome data available as follows: Western clawed frog (Xenopus tropicalis; http://ftp.ensembl.org/pub/release-81/fasta/xenopus_tropicalis/) and Tibetan frog (Nanorana parkeri; BioProject accession: PRJNA243398). The total number of genes for evaluation is 3023. Nearly 70.6% of total complete and single-copy BUSCOs were identified in this gene set and 73.3% (Tibetan frog) and 90.4% (Western clawed frog) of this indictor in two frogs’ gene sets (Fig. 2B). The complete and duplicated BUSCOs’ nearly zero in CGS comparing to 2.8% and 3.4% in two frogs (Fig. 2B). These data showed that our gene set had low duplicates. And the ratio of fragmented BUSCOs is 5.2%, more than Western clawed frog (3.6%) and less than Tibetan frog (9.1%) (Fig. 2B). These data hinted that we obtained an acceptable gene set of CGS, which has comparable quality of whole genome sequencing of Tibetan frog, although we only used dozens of sample by RNA-seq. We also performed the same analysis using the primary protein sets, which were only identified by three kinds of CDS prediction methods. Fortunately, these two results were much closer (70.6% and 72.3%) (Fig. 2B). These data suggest that the Coding Potential Calculator method is highly effective to remove noncoding RNAs and retain the coding mRNAs. We advanced to evaluate the completeness of a single gene using single copy genes. To identify the single copy gene families, we selected the following reference species: Andrias davidianus, Xenopus tropicalis, Nanorana parkeri, Anolis carolinensis, Pareuchiloglanis sinensis, Danio rerio, Oryzias latipes, and Homo sapiens. For comparative analysis, we used the following pipeline to cluster individual genes into gene families using TreeFam [10]. Firstly, we collected protein sequences longer than 33 amino acids from these eight species. The longest protein isoform was retained from each gene. Secondly, BLASTP was used for all the protein sequence alignments against itself with an E-value of 1E-7. After alignment, fragmental alignments for each gene pair were conjoined using Solar [11]. Thirdly, gene families were constructed. We used average distance for the hierarchical clustering algorithm, requiring the minimum edge weight (H-score) of 10 and the minimum edge density (total number of edges/theoretical number of edges) of >1/3. The result of family classification showed that CGS obtained slightly fewer gene families than two frogs (Table 4). Finally, we identified 6634 single copy genes among three amphibian species mentioned above to evaluate the completeness of single CDS. The results showed that the percentage of CGS’s CDS with at least 90% homologous regions in Western clawed frog's ortholog (>82%) was higher than Tibetan frog vs Western clawed frog (74%; Fig. 2C) and CGS vs Tibetan frog (73%; Fig. 2C). Moreover, the CDS length of CGS was closer to Western clawed frog than Tibetan frog, and they had longer CDS than Tibetan frog (Fig. 2D). Considering differences among species, these data showed that we have obtained a higher proportion of complete CDS in this gene set.
Table 4:

The results of gene family classification

SpeciesTotal genesUnclustered genesGene familiesUnique familiesAverage genes per family
A. davidianus 26 135 (25 965)*634112 1885201.62
X. tropicalis 18 42921813 235211.38
N. parkeri 22 972239113 9863061.47
A. carolinensis 17 76781813 387301.27
P. sinensis 18 16463813 548311.29
D. rerio 26 046145313 8321771.78
O. latipes 19 671146112 4371381.46
H. sapiens 21 375206215 5424091.24

Asterisk (*) represents gene number after correction.

The results of gene family classification Asterisk (*) represents gene number after correction.

Estimation of gene expression

For expression level, the clean reads of each sample were mapped to all transcripts using the Bowtie2 (version 2.2.5) software [12], then we used RSEM (v1.2.12) [13] to count the number of mapped reads and estimate FPKM values [14]. The expressed transcripts ranged from 47.32% to 75.12% of 93 366 total transcripts in each library. The expressed transcripts and genes number was illuminated in Table 5 and detail profiling of all genes was summarized in Additional file 1. The hierarchical clustering of gene expression profiling was analyzed, and the results showed that the coding genes had higher expression levels than noncoding genes (Fig. 3).
Table 5:

The statistics of transcripts and coding genes expressed in each sample

SamplesExpressed transcriptsCoding genesSamplesExpressed transcriptsCoding genes
Abdominal skin53 32420 193Long bone56 28619 754
Dorsal skin60 44621 580Lung70 13222 991
Lateral skin53 28520 437Maxillary59 43121 424
Blood56 54019 994Muscle49 58219 968
Brain66 92322 715Ovary53 34321 072
Cartilage59 72420 979Pancreas44 17718 746
Eye67 76922 826Skull59 93322 206
Fat65 58621 570Small intestine59 15621 588
Fingertip63 58221 626Spinal cord64 80822 423
Heart62 12721 734Spleen64 25821 699
Kidney66 22322 792Stomach58 68821 601
Liver59 75521 622Tail fat63 09021 264
Figure 3:

Hierarchical clustering of gene expression profiling. Coding genes (left); noncoding genes (right). The coding genes had higher expression abundances than noncoding genes.

The statistics of transcripts and coding genes expressed in each sample Hierarchical clustering of gene expression profiling. Coding genes (left); noncoding genes (right). The coding genes had higher expression abundances than noncoding genes.

Conclusions and future directions

In summary, we sequenced 24 samples from adult CGS to construct a good reference gene set in this study, due to CGS with a huge genome size of ∼50 GB, which was hardly constructed well by present sequencing technology. A total of 26 135 coding genes with comparable quality of protein sets of Tibetan frog were identified; CGS had more coding genes than Western clawed frog with 18 429 proteins and Tibetan frog with 22 972 proteins. Moreover, this coding gene set contained approximately 70% of universal single-copy orthologs of vertebrata genes and had a higher proportion of completeness CDS with quality metrics comparable to gene set of Tibetan frog. Gene families obtained in CGS were slightly less than two frog species. Hence, we believe that CGS has more gene copies than Western clawed frog and Tibetan frog. The most likely is that more gene copies were produced with transposon element expansion and low loss rate. Sun et al. [15, 16] reported that LTR retrotransposon expansion contributed to genomic gigantism of several salamanders. A similar mechanism may contribute to CGS’s huge genome size. Obviously, we missed parts of genes, due to the fact that we sequenced 24 samples only from adult CGS. Data from other developmental stages need to be supplemented in the future study. On the other hand, the present gene sets may include some noncoding genes or other noises, even if we used the most strict pipeline to identify the proteins. This is the puzzle of RNA-seq data to identify coding genes. It needs to be verified by other data, e.g. full-length transcripts and protein data, which ought to be produced in the future. In addition, our strategy of de novo transcriptome assembly and protein identification is highly effective, and it is applicable to a wide range of other similar studies.

Availability of supporting data

All the clean reads were deposited in the National Center for Biotechnology Information and can be accessed in the Short Read Archive (SRA accession: SRP092015) linking to BioProject accession number PRJNA350354. The assemblies and annotations data and other relevant data have also been hosted in the GigaScience repository, GigaDB [17].

Competing interests

The authors declare that they have no competing interests.

Author contributions

CSX, XDF, and HW conceived the study and designed the experiments. HTS, QG, XYZ, and JLG performed the experiments. WSL, XFG, JL, and JBJ analyzed the data. BHZ, YW, BC, ZGQ, MHZ, FCZ, JM, and JBJ contributed reagents/materials/analysis tools. XFG and WSL wrote the manuscript with input from all authors. CSX, HW, FCZ, and JM revised the paper. All authors read and approved the final manuscript.

Additional files

Additional file 1: The expression profiling of all genes. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  17 in total

1.  Brief report. The karyotype of the caudate amphibian Andrias davidianus.

Authors:  Bicai Zhu; Zhaojun Feng; Ai Qu; Huan Gao; Yong Zhang; Dongxu Sun; Wenchao Song; Anssi Saura
Journal:  Hereditas       Date:  2002       Impact factor: 3.271

2.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

3.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.

Authors:  Felipe A Simão; Robert M Waterhouse; Panagiotis Ioannidis; Evgenia V Kriventseva; Evgeny M Zdobnov
Journal:  Bioinformatics       Date:  2015-06-09       Impact factor: 6.937

4.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

5.  Transcriptome analysis of the endangered Chinese giant salamander (Andrias davidianus): Immune modulation in response to Aeromonas hydrophila infection.

Authors:  Zhitao Qi; Qihuan Zhang; Zisheng Wang; Tianyi Ma; Jie Zhou; Jason W Holland; Qian Gao
Journal:  Vet Immunol Immunopathol       Date:  2015-11-17       Impact factor: 2.046

6.  Detecting lineage-specific adaptive evolution of brain-expressed genes in human using rhesus macaque as outgroup.

Authors:  Xiao-Jing Yu; Hong-Kun Zheng; Jun Wang; Wen Wang; Bing Su
Journal:  Genomics       Date:  2006-07-18       Impact factor: 5.736

7.  LTR retrotransposons contribute to genomic gigantism in plethodontid salamanders.

Authors:  Cheng Sun; Donald B Shepard; Rebecca A Chong; José López Arriaza; Kathryn Hall; Todd A Castoe; Cédric Feschotte; David D Pollock; Rachel Lockridge Mueller
Journal:  Genome Biol Evol       Date:  2011-12-26       Impact factor: 3.416

8.  RNA-Seq analysis and gene discovery of Andrias davidianus using Illumina short read sequencing.

Authors:  Fenggang Li; Lixin Wang; Qingjing Lan; Hui Yang; Yang Li; Xiaolin Liu; Zhaoxia Yang
Journal:  PLoS One       Date:  2015-04-13       Impact factor: 3.240

9.  A reference gene set construction using RNA-seq of multiple tissues of Chinese giant salamander, Andrias davidianus.

Authors:  Xiaofang Geng; Wanshun Li; Haitao Shang; Qiang Gou; Fuchun Zhang; Xiayan Zang; Benhua Zeng; Jiang Li; Ying Wang; Ji Ma; Jianlin Guo; Jianbo Jian; Bing Chen; Zhigang Qiao; Minghui Zhou; Hong Wei; Xiaodong Fang; Cunshuan Xu
Journal:  Gigascience       Date:  2017-03-01       Impact factor: 6.524

10.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

View more
  6 in total

1.  RNA-seq analysis provides insight into molecular adaptations of Andrias davidianus.

Authors:  Xiaofang Geng; Lu Zhang; Xiayan Zang; Jianlin Guo; Cunshuan Xu
Journal:  Dev Genes Evol       Date:  2019-11-16       Impact factor: 0.900

2.  A reference gene set construction using RNA-seq of multiple tissues of Chinese giant salamander, Andrias davidianus.

Authors:  Xiaofang Geng; Wanshun Li; Haitao Shang; Qiang Gou; Fuchun Zhang; Xiayan Zang; Benhua Zeng; Jiang Li; Ying Wang; Ji Ma; Jianlin Guo; Jianbo Jian; Bing Chen; Zhigang Qiao; Minghui Zhou; Hong Wei; Xiaodong Fang; Cunshuan Xu
Journal:  Gigascience       Date:  2017-03-01       Impact factor: 6.524

3.  Age-related changes in the gut microbiota of the Chinese giant salamander (Andrias davidianus).

Authors:  Mengjie Zhang; Sarah Gaughan; Qing Chang; Hua Chen; Guoqing Lu; Xungang Wang; Liangliang Xu; Lifeng Zhu; Jianping Jiang
Journal:  Microbiologyopen       Date:  2018-12-25       Impact factor: 3.139

4.  Environmental Temperatures Affect the Gastrointestinal Microbes of the Chinese Giant Salamander.

Authors:  Lifeng Zhu; Wei Zhu; Tian Zhao; Hua Chen; Chunlin Zhao; Liangliang Xu; Qing Chang; Jianping Jiang
Journal:  Front Microbiol       Date:  2021-03-19       Impact factor: 5.640

5.  Proteomic analysis of eleven tissues in the Chinese giant salamander (Andrias davidianus).

Authors:  Xiaofang Geng; Jianlin Guo; Xiayan Zang; Cuifang Chang; Haitao Shang; Hong Wei; Cunshuan Xu
Journal:  Sci Rep       Date:  2019-11-11       Impact factor: 4.379

6.  Functional identification of PsMYB57 involved in anthocyanin regulation of tree peony.

Authors:  Yanzhao Zhang; Shuzhen Xu; Yanwei Cheng; Jing Wang; Xiangxiang Wang; Runxiao Liu; Jianming Han
Journal:  BMC Genet       Date:  2020-11-16       Impact factor: 2.797

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.