| Literature DB >> 36229544 |
T Connelley1,2, J Prendergast3,4, A Talenti5, J Powell6, D Wragg6,7, M Chepkwony8,9, A Fisch10, B R Ferreira10, M E Z Mercadante11, I M Santos12, C K Ezeasor13, E T Obishakin14,15, D Muhanguzi16, W Amanyire16, I Silwamba17,18, J B Muma17, G Mainda19, R F Kelly6,7, P Toye8.
Abstract
Structural variants (SV) have been linked to important bovine disease phenotypes, but due to the difficulty of their accurate detection with standard sequencing approaches, their role in shaping important traits across cattle breeds is largely unexplored. Optical mapping is an alternative approach for mapping SVs that has been shown to have higher sensitivity than DNA sequencing approaches. The aim of this project was to use optical mapping to develop a high-quality database of structural variation across cattle breeds from different geographical regions, to enable further study of SVs in cattle. To do this we generated 100X Bionano optical mapping data for 18 cattle of nine different ancestries, three continents and both cattle sub-species. In total we identified 13,457 SVs, of which 1,200 putatively overlap coding regions. This resource provides a high-quality set of optical mapping-based SV calls that can be used across studies, from validating DNA sequencing-based SV calls to prioritising candidate functional variants in genetic association studies and expanding our understanding of the role of SVs in cattle evolution.Entities:
Mesh:
Year: 2022 PMID: 36229544 PMCID: PMC9561109 DOI: 10.1038/s41597-022-01684-w
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Description of the samples.
| Sampling Continent | Sampling Country | Group | Breed | ENA project ID | ENA sample ID |
|---|---|---|---|---|---|
| S. America | Brazil | Indicine | Nelore | PRJEB52551 | ERS11891755 |
| PRJEB52551 | ERS11891754 | ||||
| Africa | Kenya | Indicine | Boran | PRJEB52551 | ERS11891767 |
| PRJEB52551 | ERS11891766 | ||||
| Africa | Nigeria | Indicine | White Fulani | PRJEB52551 | ERS11891768 |
| PRJEB52551 | ERS11891769 | ||||
| Africa | Zambia | Indicine | Angoni | PRJEB52551 | ERS11891764 |
| PRJEB52551 | ERS11891765 | ||||
| Africa | Uganda | Sanga | Ankole | PRJEB52551 | ERS11891756 |
| PRJEB52551 | ERS11891757 | ||||
| Africa | Zambia | Taurine | Barotse | PRJEB52551 | ERS11891762 |
| PRJEB52551 | ERS11891763 | ||||
| Africa | Nigeria | Taurine | N’Dama | PRJEB47998 | ERS8452869 |
| PRJEB47998 | ERS8452868 | ||||
| Europe | United Kingdom | Taurine | Hereford | PRJEB52551 | ERS11891760 |
| PRJEB52551 | ERS11891761 | ||||
| Europe | United Kingdom | Taurine | Holstein-Friesian | PRJEB52551 | ERS11891759 |
| PRJEB52551 | ERS11891758 |
Table describing the breeds and ancestry of samples, with the continent and country of origin. The identifiers, as well as the ENA accession codes, for each of the two animals sampled per breed are also reported.
Raw number of structural variants (SVs) and type detected in the different samples.
| Sample | Deletions | Insertions | Duplications | Inversion breakpoints | Interchr. translocation breakpoints | Intrachr. translocation breakpoints | Total | Insertion/Deletion ratio |
|---|---|---|---|---|---|---|---|---|
| Angoni 1 | 4349 | 4505 | 45 | 91 | 13 | 4 | 9007 | 1.036 |
| Angoni 2 | 4387 | 4673 | 64 | 100 | 11 | 7 | 9242 | 1.065 |
| Ankole 1 | 4314 | 4324 | 67 | 111 | 10 | 8 | 8834 | 1.002 |
| Ankole 2 | 3911 | 3984 | 66 | 101 | 8 | 5 | 8075 | 1.019 |
| Barotse 1 | 3971 | 4044 | 42 | 52 | 11 | 6 | 8126 | 1.018 |
| Barotse 2 | 4199 | 4159 | 67 | 106 | 7 | 9 | 8547 | 0.990 |
| Boran 1 | 4935 | 5087 | 56 | 113 | 15 | 6 | 10212 | 1.031 |
| Boran 2 | 4990 | 5007 | 68 | 138 | 6 | 14 | 10223 | 1.003 |
| Hereford 1 | 2465 | 2380 | 43 | 41 | 11 | 4 | 4944 | 0.966 |
| Hereford 2 | 2435 | 2437 | 77 | 88 | 9 | 7 | 5053 | 1.001 |
| Holstein 1 | 2756 | 2759 | 48 | 52 | 15 | 18 | 5648 | 1.001 |
| Holstein 2 | 2702 | 2801 | 59 | 76 | 9 | 5 | 5652 | 1.037 |
| N’Dama 1 | 3411 | 3481 | 92 | 125 | 10 | 13 | 7132 | 1.021 |
| N’Dama 2 | 3005 | 3082 | 67 | 86 | 6 | 8 | 6254 | 1.026 |
| Nelore 2 | 5294 | 5508 | 58 | 113 | 11 | 5 | 10989 | 1.040 |
| Nelore 1 | 5420 | 5499 | 96 | 136 | 15 | 18 | 11184 | 1.015 |
| White Fulani 1 | 4467 | 4642 | 54 | 114 | 11 | 3 | 9291 | 1.039 |
| White Fulani 2 | 4782 | 4805 | 41 | 45 | 17 | 14 | 9704 | 1.005 |
This table summarises the number of raw SVs detected in each sample, and their classification (e.g. insertion, deletion, duplication, inversion and inter- and intra-chromosomal translocation).
Fig. 1Histogram of the structural variant (SV) sizes. Histogram of the size of the identified SVs in bins of 5Kb.
Fig. 2Upset plot of the structural variants. Upset plot of the structural variants by individual for the 40 sets containing the most SVs.
Fig. 3Density plot of the size of the structural variants found in only one (support = 1) or in more than one (support >1) sample. The strip of lines below the X axis shows the individual variant sizes, the vertical lines indicate the mean variant size, in each of the group.
Fig. 4Gene set enrichment of genes potentially impacted by an SV. FUMA results showing the proportion of genes in sets, their enrichment and the heatmap of the genes in each for A) Hallmark gene sets and B) curated gene sets.
| Measurement(s) | Optical Mapping |
| Technology Type(s) | Optical Mapping |
| Factor Type(s) | Structural variants |
| Sample Characteristic - Organism | Bos taurus |
| Sample Characteristic - Location | United Kingdom • Kenya • Zambia • Uganda • Brazil • Nigeria |