| Literature DB >> 34065739 |
Tugdem Muslu1, Sezgi Biyiklioglu-Kaya1, Bala Ani Akpinar2, Meral Yuce3, Hikmet Budak2.
Abstract
Pan-genomes are efficient tools for the identification of conserved and varying genomic sequences within lineages of a species. Investigating genetic variations might lead to the discovery of genes present in a subset of lineages, which might contribute into beneficial agronomic traits such as stress resistance or yield. The content of varying genomic regions in the pan-genome could include protein-coding genes as well as microRNA(miRNAs), small non-coding RNAs playing key roles in the regulation of gene expression. In this study, we performed in silico miRNA identification from the genomic sequences of 54 lineages of Brachypodium distachyon, aiming to explore varying miRNA contents and their functional interactions. A total of 115 miRNA families were identified in 54 lineages, 56 of which were found to be present in all lineages. The miRNA families were classified based on their conservation among lineages and potential mRNA targets were identified. Obtaining information about regulatory mechanisms stemming from these miRNAs offers strong potential to provide a better insight into the complex traits that were potentially present in some lineages. Future work could lead us to introduce these traits to different lineages or other economically important plant species in order to promote their survival in different environmental conditions.Entities:
Keywords: Brachypodium; microRNA; pan-genome
Year: 2021 PMID: 34065739 PMCID: PMC8156279 DOI: 10.3390/plants10050991
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1The distribution of miRNA families across lineages. The x-axis shows 115 miRNA families identified in 54 Brachypodium lineages, shown in y-axis. Conservation of miRNA families among lineages increased along the x-axis and miRNA families were classified into four groups based on this conservation: Rare miRNA families (purple), moderately conserved miRNA families (pink), highly conserved miRNA families (orange) and common miRNA families (green). The total number of miRNA families identified in each lineage is also shown in parenthesis next to lineage names.
Figure 2The number of precursor miRNA (pre-miRNA) sequences identified for each miRNA family in (a) rare miRNA family group, (b) moderately conserved miRNA families group, (c) highly conserved miRNA families group, and (d) common miRNA families group are shown. Each bar indicates both repetitive pre-miRNAs, defined as containing repetitive elements for more than half of their lengths, with the light shade of gray, and non-repetitive pre-miRNAs with the dark shade of gray.
All miRNA families identified from de novo genome assemblies of 54 Brachypodium lineages are classified based on their presence in multiple lineages. Phylogenetic inference based on miRBase phyla data for each miRNA family is given.
| Magnoliophyta | Coniferophyta | Embryophyta | ||
|---|---|---|---|---|
| Rare | miR1139, miR2873, miR5063, miR5161, miR5167, miR5169, miR5179, miR531, miR5566, miR6224, miR7708, miR7709, miR7717, miR7726, miR7729, miR7745, miR7748, miR7763, miR7765, miR9494 | √ (m) | ||
| Mod. | miR398 | √ | √ | |
| miR5068, miR5184, miR7725, miR7727, miR7744, miR7766, miR9480, miR9490 | √ (m) | |||
| miR8155 | √ (e) | |||
| Highly | miR394 | √ | ||
| miR1133, miR1436, miR5062, miR5163, miR5201, miR7715, miR7722, miR7732, miR7736, miR7740, miR7754, miR7756, miR7771, miR7775, miR7781 | √ (m) | |||
| miR5281, miR845 | √ * | |||
| Common | miR1122, miR1127, miR1128, miR1130, miR1135, miR1432, miR1435, miR1439, miR5049, miR5054, miR5067, miR5070, miR5164, miR5165, miR5171, miR5174, miR5175, miR5176, miR5180, miR5181, miR5182, miR5183, miR5185, miR5198, miR5199, miR5200, miR5202, miR528, miR5522, miR5568, miR6197, miR7716, miR7723, miR7728, miR7731, miR7733, miR7738, miR7755, miR7770, miR7772, miR7773, miR7777, miR9481, miR9485, miR9486, miR9489, miR9493, miR9495, miR9783 | √ (m) | ||
| miR156, miR159, miR160, miR166, miR171, miR395, miR396 | √ | √ | √ | |
| miR164, miR169, miR397 | √ | √ | ||
| miR165 | √ (e) | |||
| miR167 | √ | √ | ||
| miR172, miR393 | √ | |||
| miR2118, miR2275, miR399 | √ * | |||
*: All monocotyledons and eudicotyledons, except Amborella trichopoda; (m): Only in monocotyledons; (e): Only in eudicotyledons.
Figure 3Heatmap displaying the miRNA family commonality between lineages. Darker green cells represent higher numbers of common miRNAs shared between two lineages while darker purple represents fewer numbers of common miRNA families. The heatmap is not symmetrical; it should be read from y > x. For example, the extent of conservation between ABR2 and Uni2, defined as the ratio of common miRNA families between ABR2 and Uni2 over the number of total miRNA families in ABR2 by our approach, is given on the top right corner (indicated by a blue square), while the conservation between Uni2 and ABR2, the ratio over the total number of families in Uni2, is given on the bottom left (red square).
Figure 4Gene Ontology analysis for biological processes for rare, moderately conserved, highly conserved and common miRNA family groups targets, combined and clustered. Predicted targets from five biological processes appear to be collectively targeted by all four miRNA groups.
Figure 5(a) The number of miRNA families having multiple predicted targets were averaged across lineages. For simplicity, targets were considered within 11 bins of certain sizes as given. Most miRNA families seemed to target a minimum of 2 and maximum of 50 coding sequences, whereas only a few miRNA families targeted only one target or had more than 50 predicted targets. (b) The number of predicted targets of only one or multiple miRNA families were averaged across lineages. In contrast to miRNAs targeting multiple targets, each target appeared to be predominantly targeted by one miRNA family.