| Literature DB >> 23637735 |
Kimberly D Spradling1, Jeremy P Glenn, Roy Garcia, Robert E Shade, Laura A Cox.
Abstract
The baboon is an invaluable model for the study of human health and disease, including many complex diseases of the kidney. Although scientists have made great progress in developing this animal as a model for numerous areas of biomedical research, genomic resources for the baboon, such as a quality annotated genome, are still lacking. To this end, we characterized the baboon kidney transcriptome using high-throughput cDNA sequencing (RNA-Seq) to identify genes, gene variants, single nucleotide polymorphisms (SNPs), insertion-deletion polymorphisms (InDels), cellular functions, and key pathways in the baboon kidney to provide a genomic resource for the baboon. Analysis of our sequencing data revealed 45,499 high-confidence SNPs and 29,813 InDels comparing baboon cDNA sequences with the human hg18 reference assembly and identified 35,900 cDNAs in the baboon kidney, including 35,150 transcripts representing 15,369 genic genes that are novel for the baboon. Gene ontology analysis of our sequencing dataset also identified numerous biological functions and canonical pathways that were significant in the baboon kidney, including a large number of metabolic pathways that support known functions of the kidney. The results presented in this study catalogues the transcribed mRNAs, noncoding RNAs, and hypothetical proteins in the baboon kidney and establishes a genomic resource for scientists using the baboon as an experimental model.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23637735 PMCID: PMC3634053 DOI: 10.1371/journal.pone.0057563
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Chromosomal distribution of 20,714 annotated coding RNA transcripts identified from the reference analysis.
| Chr. | Coding Transcripts | Genes with 1 Transcript | Genes with ≥2 Variants | Total Number Genic Genes | Average Variants per Gene |
| 1 | 2,146 | 908 | 457 | 1,365 | 1.6 |
| 2 | 1,359 | 610 | 277 | 887 | 1.5 |
| 3 | 1,265 | 485 | 283 | 768 | 1.7 |
| 4 | 801 | 339 | 167 | 506 | 1.6 |
| 5 | 922 | 429 | 190 | 619 | 1.5 |
| 6 | 1,027 | 494 | 200 | 694 | 1.5 |
| 7 | 979 | 414 | 208 | 622 | 1.6 |
| 8 | 722 | 304 | 145 | 449 | 1.6 |
| 9 | 847 | 347 | 177 | 524 | 1.6 |
| 10 | 820 | 320 | 171 | 491 | 1.7 |
| 11 | 1,288 | 529 | 275 | 804 | 1.6 |
| 12 | 1,089 | 466 | 231 | 697 | 1.6 |
| 13 | 329 | 150 | 72 | 222 | 1.5 |
| 14 | 660 | 275 | 146 | 421 | 1.6 |
| 15 | 666 | 283 | 141 | 424 | 1.6 |
| 16 | 884 | 422 | 181 | 603 | 1.5 |
| 17 | 1,176 | 578 | 222 | 800 | 1.5 |
| 18 | 270 | 120 | 55 | 175 | 1.5 |
| 19 | 1,296 | 678 | 248 | 926 | 1.4 |
| 20 | 561 | 198 | 139 | 337 | 1.7 |
| 21 | 221 | 73 | 50 | 123 | 1.8 |
| 22 | 513 | 199 | 117 | 316 | 1.6 |
| X | 854 | 258 | 205 | 463 | 1.8 |
| Y | 19 | 11 | 3 | 14 | 1.4 |
|
|
|
|
|
|
|
Chromosomal distribution of 1,492 annotated noncoding RNA transcripts identified from the reference analysis.
| Chr. | Noncoding Transcripts | Genes with 1 Transcript | Genes with ≥2 Variants | Total Number Genic Genes | Average Variants per Gene |
| 1 | 149 | 103 | 21 | 124 | 1.2 |
| 2 | 90 | 65 | 10 | 75 | 1.2 |
| 3 | 94 | 62 | 14 | 76 | 1.2 |
| 4 | 48 | 36 | 5 | 41 | 1.2 |
| 5 | 61 | 43 | 8 | 51 | 1.2 |
| 6 | 73 | 44 | 11 | 55 | 1.3 |
| 7 | 100 | 69 | 12 | 81 | 1.2 |
| 8 | 42 | 26 | 6 | 32 | 1.3 |
| 9 | 58 | 38 | 8 | 46 | 1.3 |
| 10 | 73 | 47 | 11 | 58 | 1.3 |
| 11 | 87 | 56 | 9 | 65 | 1.3 |
| 12 | 82 | 51 | 12 | 63 | 1.3 |
| 13 | 21 | 21 | 0 | 21 | 1.0 |
| 14 | 51 | 23 | 8 | 31 | 1.7 |
| 15 | 42 | 32 | 5 | 37 | 1.1 |
| 16 | 59 | 44 | 7 | 51 | 1.2 |
| 17 | 110 | 58 | 17 | 75 | 1.5 |
| 18 | 15 | 13 | 1 | 14 | 1.1 |
| 19 | 71 | 55 | 7 | 62 | 1.2 |
| 20 | 31 | 27 | 1 | 28 | 1.1 |
| 21 | 18 | 14 | 2 | 16 | 1.1 |
| 22 | 45 | 31 | 6 | 37 | 1.2 |
| X | 71 | 49 | 8 | 57 | 1.3 |
| Y | 1 | 1 | 0 | 1 | 1.0 |
|
|
|
|
|
|
|
Chromosomal distribution of 10,913 coding transcript SNPs identified among the 14 baboon kidney transcriptome samples.
| Chr. | Coding Transcripts | Coding Transcript SNPs | SNPs per Coding Transcript | Coding Transcript Average Length | SNPs per Coding RNA Nucleotide |
| 1 | 2,146 | 1,092 | 0.51 | 3,323.7 | 0.0001531 |
| 2 | 1,359 | 797 | 0.59 | 3,649.3 | 0.0001607 |
| 3 | 1,265 | 641 | 0.51 | 3,717.2 | 0.0001363 |
| 4 | 801 | 486 | 0.61 | 3,769.5 | 0.0001610 |
| 5 | 922 | 452 | 0.49 | 3,749.9 | 0.0001307 |
| 6 | 1,027 | 720 | 0.70 | 3,477.4 | 0.0002016 |
| 7 | 979 | 507 | 0.52 | 3,381.6 | 0.0001531 |
| 8 | 722 | 276 | 0.38 | 3,523.1 | 0.0001085 |
| 9 | 847 | 478 | 0.56 | 3,657.6 | 0.0001543 |
| 10 | 820 | 447 | 0.55 | 3,555.0 | 0.0001533 |
| 11 | 1,288 | 727 | 0.56 | 3,142.6 | 0.0001796 |
| 12 | 1,089 | 499 | 0.46 | 3,541.9 | 0.0001294 |
| 13 | 329 | 210 | 0.64 | 3,852.9 | 0.0001657 |
| 14 | 660 | 325 | 0.49 | 3,388.2 | 0.0001453 |
| 15 | 666 | 487 | 0.73 | 3,881.8 | 0.0001884 |
| 16 | 884 | 410 | 0.46 | 3,147.6 | 0.0001474 |
| 17 | 1,176 | 611 | 0.52 | 3,144.8 | 0.0001652 |
| 18 | 270 | 173 | 0.64 | 4,043.5 | 0.0001585 |
| 19 | 1,296 | 555 | 0.43 | 2,686.0 | 0.0001594 |
| 20 | 561 | 315 | 0.56 | 3,089.5 | 0.0001817 |
| 21 | 221 | 139 | 0.63 | 3,547.0 | 0.0001773 |
| 22 | 513 | 312 | 0.61 | 3,083.4 | 0.0001972 |
| X | 854 | 242 | 0.28 | 3,459.6 | 0.0000819 |
| Y | 19 | 12 | 0.63 | 4,252.1 | 0.0001485 |
|
|
|
|
|
|
|
Chromosomal distribution of 1,760 noncoding RNA SNPs identified among the 14 baboon kidney transcriptome samples.
| Chr. | Noncoding RNAs | Noncoding RNA SNPs | SNPs per Noncoding RNA | Noncoding RNA Average Length | SNPs per Noncoding RNA Nucleotide | |||||
| 1 | 149 | 116 | 0.78 | 2,383.0 | 0.0003267 | |||||
| 2 | 90 | 142 | 1.58 | 2,331.2 | 0.0006768 | |||||
| 3 | 94 | 124 | 1.32 | 2,702.5 | 0.0004881 | |||||
| 4 | 48 | 45 | 0.94 | 2,207.4 | 0.0004247 | |||||
| 5 | 61 | 54 | 0.89 | 2,932.4 | 0.0003019 | |||||
| 6 | 83 | 125 | 1.51 | 2,330.5 | 0.0006462 | |||||
| 7 | 100 | 79 | 0.79 | 2,246.0 | 0.0003517 | |||||
| 8 | 42 | 39 | 0.93 | 2,807.5 | 0.0003308 | |||||
| 9 | 58 | 96 | 1.66 | 3,020.7 | 0.0005479 | |||||
| 10 | 73 | 102 | 1.40 | 2,358.4 | 0.0005925 | |||||
| 11 | 87 | 114 | 1.31 | 3,239.5 | 0.0004045 | |||||
| 12 | 83 | 71 | 0.86 | 2,738.4 | 0.0003124 | |||||
| 13 | 21 | 26 | 1.24 | 2,084.1 | 0.0005941 | |||||
| 14 | 51 | 42 | 0.82 | 2,967.5 | 0.0002775 | |||||
| 15 | 42 | 71 | 1.69 | 2,497.2 | 0.0006769 | |||||
| 16 | 59 | 99 | 1.68 | 2,596.3 | 0.0006463 | |||||
| 17 | 110 | 100 | 0.91 | 2,128.6 | 0.0004271 | |||||
| 18 | 15 | 14 | 0.93 | 2,801.8 | 0.0003331 | |||||
| 19 | 71 | 87 | 1.23 | 1,994.4 | 0.0006144 | |||||
| 20 | 31 | 47 | 1.52 | 2,155.4 | 0.0007034 | |||||
| 21 | 18 | 38 | 2.11 | 2,544.4 | 0.0008297 | |||||
| 22 | 45 | 57 | 1.27 | 2,584.5 | 0.0004901 | |||||
| X | 71 | 70 | 0.99 | 2,462.7 | 0.0004003 | |||||
| Y | 1 | 2 | 2.00 | 7,217.0 | 0.0002771 | |||||
|
|
|
|
|
|
| |||||
Chromosomal distribution of 29,813 InDels identified comparing baboon sequences with human hg18 reference assembly.
| Chr. | Baboon to Human hg18 InDels |
| 1 | 3324 |
| 2 | 2227 |
| 3 | 2020 |
| 4 | 1212 |
| 5 | 1627 |
| 6 | 1511 |
| 7 | 1342 |
| 8 | 1040 |
| 9 | 1100 |
| 10 | 1306 |
| 11 | 1635 |
| 12 | 1729 |
| 13 | 598 |
| 14 | 1114 |
| 15 | 1106 |
| 16 | 1031 |
| 17 | 1633 |
| 18 | 460 |
| 19 | 993 |
| 20 | 830 |
| 21 | 281 |
| 22 | 584 |
| X | 1093 |
| Y | 17 |
Figure 1Graphical representation of oxidative phosphorylation and mitochondrial dysfunction pathways revealed significant genes in baboon kidney.
IPA was used to identify canonical pathways from the IPA library that were most significant in the baboon kidney RNA-Seq dataset. Molecules from the dataset that met a fold coverage cutoff of 302.8 and were associated with a canonical pathway in Ingenuity's knowledge Base were considered for the analysis. Oxidative phosphorylation and mitochondrial dysfunction were identified as the two most significant pathways in the dataset. Groups of molecules in the oxidative phosphorylation and mitochondrial dysfunction pathways are represented as various shades of red. The intensity of the node color indicates the degree of fold coverage in the RNA-Seq dataset. Nodes shown in gray represent genes from the dataset that did not meet the fold coverage cutoff, and nodes shown in white represent genes that are in IPA's Knowledge Base but not in the dataset. Members within each significant group of molecules are shown in Table S20.
Figure 2Network of genes associated with cancer, cell death, cellular assembly and organization in baboon kidney.
Each transcript and corresponding fold coverage value was imported into IPA and mapped to its corresponding gene in the IPA Knowledge Base. A fold coverage value of 302.8 was set to limit the number of molecules considered for the analysis. Genes meeting the cutoff criteria were overlaid onto a global molecular network developed from information within the IPA Knowledge Base, and the networks were then algorithmically generated based on their connectivity. Graphical representation of the network reveals genes with highest fold coverage in the baboon kidney. Genes are represented as nodes of various shapes to represent the functional class of the gene product, and the biological relationship between two nodes is represented as a line. The intensity of the node color indicates the degree of fold coverage.