| Literature DB >> 35480326 |
Aamir Khan1, Kalpana Singh1, Sarika Jaiswal1, Mustafa Raza1, Rahul Singh Jasrotia1, Animesh Kumar1, Anoop Kishor Singh Gurjar1, Juli Kumari1, Varij Nayan2, Mir Asif Iquebal1, U B Angadi1, Anil Rai1, Tirtha Kumar Datta2, Dinesh Kumar1.
Abstract
Water buffalo (Bubalus bubalis), belonging to the Bovidae family, is an economically important animal as it is the major source of milk, meat, and drought in numerous countries. It is mainly distributed in tropical and subtropical regions with a global population of approximately 202 million. The advent of low cost and rapid sequencing technologies has opened a new vista for global buffalo researchers. In this study, we utilized the genomic data of five commercially important buffalo breeds, distributed globally, namely, Mediterranean, Egyptian, Bangladesh, Jaffrarabadi, and Murrah. Since there is no whole-genome sequence analysis of these five distinct buffalo breeds, which represent a highly diverse ecosystem, we made an attempt for the same. We report the first comprehensive, holistic, and user-friendly web genomic resource of buffalo (BuffGR) accessible at http://backlin.cabgrid.res.in/buffgr/, that catalogues 6028881 SNPs and 613403 InDels extracted from a set of 31 buffalo tissues. We found a total of 7727122 SNPs and 634124 InDels distributed in four breeds of buffalo (Murrah, Bangladesh, Jaffarabadi, and Egyptian) with reference to the Mediterranean breed. It also houses 4504691 SSR markers from all the breeds along with 1458 unique circRNAs, 37712 lncRNAs, and 938 miRNAs. This comprehensive web resource can be widely used by buffalo researchers across the globe for use of markers in marker trait association, genetic diversity among the different breeds of buffalo, use of ncRNAs as regulatory molecules, post-transcriptional regulations, and role in various diseases/stresses. These SNPs and InDelscan also be used as biomarkers to address adulteration and traceability. This resource can also be useful in buffalo improvement programs and disease/breed management.Entities:
Keywords: CircRNAs; bovine; lncRNA; miRNA; molecular markers; web-resource
Year: 2022 PMID: 35480326 PMCID: PMC9035531 DOI: 10.3389/fgene.2022.809741
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
The list of assemblies of buffalo from public domain.
| Accession | Breed | Submitter | Assembly level | Remarks |
|---|---|---|---|---|
| GCA_003121395.1 | Mediterranean | University of Adelaide | Chromosome-wise | UOA_WB_1 ( |
| GCA_019923935.1 | Murrah | National Dairy Development Board, India | Chromosome-wise | NDDB_SH_1 ( |
| GCA_004794615.1 | Bangladesh | BGI-Shenzhen | Scaffold level | Bubbub1.0 ( |
| GCA_002993835.1 | Egyptian | Egyptian Water Buffalo Genome Consortium (Agriculture Genetic Engineering Research Institute and Nile University) | Scaffold level | ASM299383v1 ( |
| GCA_000180995.3 | Jaffrabadi | Anand Agricultural University, Anand, Gujarat, India | Scaffold level | Bubalus_bubalis_Jaffrabadi_v3.0 ( |
The details of RNA-seq data from the International Water Buffalo Genome Project representing different buffalo tissues along with SRA IDs and mapping %.
| Tissue | SRA IDs | Mapping % | Tissue | SRA IDs | Mapping % |
|---|---|---|---|---|---|
| Tongue | ERR315616 | 95.71 | Ovary-corpus luteum | ERR315632 | 94.30 |
| Rumen | ERR315617 | 93.69 | Ovary follicle | ERR315633 | 97.60 |
| Abomasum | ERR315618 | 95.91 | Oviduct | ERR315634 | 96.67 |
| Small intestine | ERR315619 | 93.88 | Endometrium | ERR315635 | 96.59 |
| Large intestine | ERR315620 | 96.03 | Mammary gland | ERR315636 | 95.18 |
| Obex | ERR315621 | 94.02 | Embryo pool | ERR315637 | 70.87 |
| Hypophysis | ERR315622 | 96.84 | Embryo single | ERR315638 | 73.61 |
| Spinal Cord | ERR315623 | 95.40 | Thymus | ERR315639 | 96.71 |
| WBC | ERR315624 | 97.04 | Mesenteric lymph node | ERR315640 | 96.47 |
| Cerebellum | ERR315625 | 90.61 | Spleen | ERR315641 | 96.07 |
| Bone Marrow | ERR315626 | 95.55 | Liver | ERR315642 | 96.57 |
| Muscle longissimus dorsai | ERR315627 | 96.21 | Pancreas | ERR315643 | 96.70 |
| Muscle semitendinosus | ERR315628 | 96.62 | Kidney | ERR315644 | 95.23 |
| Testis | ERR315629 | 97.40 | Lung | ERR315645 | 96.53 |
| Thyroid | ERR315630 | 96.19 | Testis | SRR527266-72 | 90.02 |
| Heart | ERR315631 | 94.68 | Milk | SRR7091387-98 | 94.88 |
FIGURE 1(A) Database preparation and data retrieval for BuffGR; (B) Layout of data, data options, and data tables of BuffGR.
FIGURE 2Frequencies of SNP/InDels in (A) 31 different buffalo tissues (B) different breeds of buffalo: Common and unique genes with abundance of (C) SNPs and (D) InDels in different breeds of buffalo.
Annotated genes with abundance of extracted SNP/InDels from buffalo tissues.
| Tissue | Genes with SNPs | Genes with InDels | Tissue | Genes with SNPs | Genes with InDels |
|---|---|---|---|---|---|
| Tongue | 14392 | 6944 | Muscle longissimus dorsai | 12381 | 5149 |
| Rumen | 13503 | 5751 | Muscle semitendinosus | 12428 | 5038 |
| Obex | 15038 | 7404 | Small intestine | 14322 | 6867 |
| WBC | 13422 | 6813 | Large intestine | 15438 | 7879 |
| Testis | 16121 | 8588 | Ovary-corpus luteum | 13538 | 5821 |
| Thyroid | 14227 | 6429 | Ovary follicle | 14208 | 6751 |
| Heart | 13279 | 6125 | Cerebellum | 14711 | 7385 |
| Thymus | 14602 | 7179 | Endometrium | 14882 | 7376 |
| Oviduct | 14728 | 7109 | Mesenteric lymph node | 14445 | 7189 |
| Spleen | 14629 | 7479 | Mammary gland | 14674 | 7052 |
| Liver | 13969 | 6593 | Spinal cord | 14583 | 7229 |
| Pancreas | 14620 | 7128 | Bone marrow | 13376 | 6252 |
| Kidney | 14726 | 7303 | Embryo pool | 9531 | 3510 |
| Lung | 14989 | 7600 | Embryo single | 6008 | 1338 |
| Testis | 16121 | 8588 | Hypophysis | 14763 | 7144 |
| Milk | 16090 | 9308 | Abomasum | 14877 | 7514 |
Genes with abundance of extracted tissue/breed SNPs found to be common within the reported candidate genes of QTL traits.
| Genes with abundance of SNPs: Chromosome (reported candidate genes) | Total SNPs (within respective genes) | Tissue/breed of extracted SNPs | QTL trait | Reference |
|---|---|---|---|---|
| SPP1: chr7, SCD: chr23, SREBF1: chr3, STAT1: chr2, TG: chr15, LALBA: chr4, INSIG2: chr2, GHRL: chr21, DGAT1: chr15, CSN1S1: chr7, BTN1A1: chr2, ADRA1A: chr3 | 15, 22, 17, 53, 122, 04, 18, 03, 19, 13, 05, 01 | Milk tissue | Milk production |
|
| COL1A2: chr8, APOB: chr12 | 112, 193 | Murrah, Bangladesh, Egyptian, Mediterranean | Milk yield |
|
| GDF7: chr12 | 1598 | Murrah, Bangladesh, Mediterranean | Milk yield |
|
| KLHL29: chr12 | 1458 | Murrah, Bangladesh, Egyptian, Jaffrabadi, Mediterranean | Milk yield |
|
| RGS22: chr15, VPS13B: chr15 | 3249 | Murrah, Bangladesh, Egyptian, Jaffrabadi, Mediterranean | Milk yield, fat yield, protein yield |
|
| 344 | ||||
| MFSD14A: chr6, SLC35A3: chr6, PALMD: chr6 | 60, 41, 215 | Murrah, Bangladesh, Egyptian, Mediterranean | Fat %, protein % |
|
FIGURE 3(A) Breed-wise frequencies of SSRs. (B) Breed-wise representation of different repeat motifs. (C) Common and unique genes with abundance of SSRs in the five breeds of buffalo.
Breed-wise frequencies of SSRs, their proportions, SSR density, and distance between two SSRs in different repeat motifs.
| Breeds | Repeats | Number | Proportion % | Frequency of SSRs per Mb | Distance between two SSRs in Kb |
|---|---|---|---|---|---|
| Mediterranean | Mono | 515343 | 57.01 | 191.64 | 5.22 |
| Di | 176276 | 19.24 | 65.55 | 15.25 | |
| Tri | 113425 | 12.40 | 42.18 | 23.71 | |
| Tetra | 10120 | 1.10 | 3.76 | 265.72 | |
| Penta | 13514 | 1.48 | 5.03 | 198.98 | |
| Hexa | 289 | 0.03 | 0.11 | 9304.68 | |
| Compound | 79435 | 8.75 | 29.54 | 33.85 | |
| Egyptian | Mono | 436413 | 60.08 | 145.18 | 6.89 |
| Di | 152723 | 21.02 | 50.81 | 19.68 | |
| Tri | 71979 | 9.91 | 23.95 | 41.76 | |
| Tetra | 6521 | 0.90 | 2.17 | 460.96 | |
| Penta | 5122 | 0.71 | 1.70 | 586.87 | |
| Hexa | 107 | 0.01 | 0.04 | 28092.99 | |
| Compound | 53541 | 7.37 | 17.81 | 56.14 | |
| Jaffrabadi | Mono | 580010 | 56.41 | 154.26 | 6.48 |
| Di | 209586 | 20.38 | 55.74 | 17.94 | |
| Tri | 127585 | 12.41 | 33.93 | 29.47 | |
| Tetra | 11688 | 1.14 | 3.11 | 321.70 | |
| Penta | 14154 | 1.38 | 3.76 | 265.65 | |
| Hexa | 343 | 0.03 | 0.09 | 10962.04 | |
| Compound | 84815 | 8.25 | 22.56 | 44.33 | |
| Murrah | Mono | 516017 | 57.63 | 196.77 | 5.08 |
| Di | 174481 | 19.49 | 66.53 | 15.03 | |
| Tri | 112283 | 12.54 | 42.82 | 23.36 | |
| Tetra | 10097 | 1.13 | 3.85 | 259.73 | |
| Penta | 13527 | 1.51 | 5.16 | 193.87 | |
| Hexa | 300 | 0.03 | 0.11 | 8741.53 | |
| Compound | 68658 | 7.67 | 26.18 | 38.20 | |
| Bangladesh | Mono | 533868 | 56.41 | 192.71 | 5.19 |
| Di | 190507 | 20.13 | 68.77 | 14.54 | |
| Tri | 113377 | 11.98 | 40.93 | 24.43 | |
| Tetra | 10575 | 1.12 | 3.82 | 261.96 | |
| Penta | 12246 | 1.29 | 4.42 | 226.22 | |
| Hexa | 268 | 0.03 | 0.10 | 10336.79 | |
| Compound | 85501 | 9.03 | 30.86 | 32.40 |
FIGURE 4(A) Tissue-wise frequencies of circRNAs and lncRNAs (B) chromosome-wise frequencies of miRNAs and circRNAs; (C) length-wise frequencies of lncRNAs in buffalo.
FIGURE 5Web interface of BuffGR.