Literature DB >> 35732627

The evolution of BDNF is defined by strict purifying selection and prodomain spatial coevolution, but what does it mean for human brain disease?

Alexander G Lucaci1, Michael J Notaras2, Sergei L Kosakovsky Pond1, Dilek Colak3,4.   

Abstract

Brain-Derived Neurotrophic Factor (BDNF) is an essential mediator of brain assembly, development, and maturation. BDNF has been implicated in a variety of brain disorders such as neurodevelopmental disorders (e.g., autism spectrum disorder), neuropsychiatric disorders (e.g., anxiety, depression, PTSD, and schizophrenia), and various neurodegenerative disorders (e.g., Parkinson's, Alzheimer's, etc.). To better understand the role of BDNF in disease, we sought to define the evolution of BDNF within Mammalia. We conducted sequence alignment and phylogenetic reconstruction of BDNF across a diverse selection of >160 mammalian species spanning ~177 million years of evolution. The selective evolutionary change was examined via several independent computational models of codon evolution including FEL (pervasive diversifying selection), MEME (episodic selection), and BGM (structural coevolution of sites within a single molecule). We report strict purifying selection in the main functional domain of BDNF (NGF domain, essentially comprising the mature BDNF protein). Additionally, we discover six sites in our homologous alignment which are under episodic selection in early regulatory regions (i.e. the prodomain) and 23 pairs of coevolving sites that are distributed across the entirety of BDNF. Coevolving BDNF sites exhibited complex spatial relationships and geometric features including triangular relations, acyclic graph networks, double-linked sites, and triple-linked sites, although the most notable pattern to emerge was that changes in the mature region of BDNF tended to coevolve along with sites in the prodomain. Thus, we propose that the discovery of both local and distal sites of coevolution likely reflects 'evolutionary fine-tuning' of BDNF's underlying regulation and function in mammals. This tracks with the observation that BDNF's mature domain (which encodes mature BDNF protein) is largely conserved, while the prodomain (which is linked to regulation and its own unique functionality) exhibits more pervasive and diversifying evolutionary selection. That said, the fact that negative purifying selection also occurs in BDNF's prodomain also highlights that this region also contains critical sites of sensitivity which also partially explains its disease relevance (via Val66Met and other prodomain variants). Taken together, these computational evolutionary analyses provide important context as to the origins and sensitivity of genetic changes within BDNF that may help to deconvolute the role of BDNF polymorphisms in human brain disorders.
© 2022. The Author(s).

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35732627      PMCID: PMC9217794          DOI: 10.1038/s41398-022-02021-w

Source DB:  PubMed          Journal:  Transl Psychiatry        ISSN: 2158-3188            Impact factor:   7.989


Introduction

Brain-derived neurotrophic factor (BDNF) is one of the most ubiquitously studied molecules in modern neuroscience [1]. BDNF is a neurotrophin that binds with high affinity to its cognate tyrosine kinase receptor, TrkB [2], to elicit rapid induction of synaptic plasticity [3-5] and neuronal spine remodeling [6, 7]. Additionally, BDNF has been implicated in a variety of brain disorders [1], including depression [8-10], PTSD [11-14], schizophrenia [9, 15–17], Parkinson’s disease [18, 19], and autism spectrum disorders [20-22] amongst many more. BDNF has correspondingly been the primary target, or an ancillary factor, of many novel therapeutics including small molecule mimetics [23, 24] and existing drugs (e.g., antidepressants [25, 26]). Yet, nascent research has provided the humbling reminder that much remains to be discovered about BDNF. In recent years, new BDNF ligands have been discovered [27], new receptor interactions unveiled [27, 28], and mechanisms of behavioral function unlocked [7]. This is a timely reminder that while BDNF has remained a seminal molecule of interest across the broader neuroscience literature, much remains to be discovered about its origins, evolution, function, and disease relevance.

A primer of the molecular biology of BDNF and its functional topology

BDNF is encoded by the BDNF gene [29], whose expression is regulated in humans by an antisense gene (BDNF-AS) that can form RNA-duplexes to attenuate translation [30]. Thus, the natural antisense for BDNF is capable of directly downregulating endogenous expression on demand [31]. The BDNF gene in humans comprises 11 exons [30] and can produce at least 17 detectable transcript isoforms [29]. Different transcripts are induced in response to activity and/or cellular states, allowing the BDNF gene to adjust to environmental stimuli and potential selection pressures. However, all transcripts ultimately yield a singular preproBDNF protein that (prior to intracellular processing, cleavage, and transport) can be partitioned into three domains [11, 29]: a signal peptide, a prodomain, and the mature domain. The signal domain is only 18 amino acid residues long (with ambiguously defined functionality) with the majority of BDNFs functional outputs reflecting sequence specificity to the prodomain and mature domain. The BDNF prodomain encodes binding sites for intracellular transport of both BDNF mRNA [32] and BDNF protein [33], and contains numerous posttranslational modification sites [29]. The BDNF prodomain is also the resident location of a widely studied Single Nucleotide Polymorphism (SNP) in neuroscience (Val66Met, or rs6265) [1], and the Furin consensus sequence (Arg 125) for cleavage to its mature form (including by plasmin [34]). The prodomain is composed of 110 amino acids within the N-terminus, and must be processed via proteases to generate mature BDNF [5]. The mature domain of BDNF is almost exclusively composed of the Nerve growth factor (NGF) domain and is responsible for the canonical trophic actions associated with BDNF (e.g., long-term potentiation, rapid-acting antidepressant effects, etc.). Following intracellular handling, processing, and transport, the preproBDNF isoform is cleaved to yield the mature BDNF peptide (which only contains the mature NGF domain). For many years the prodomain was thought to be degraded following the facilitation of BDNF trafficking. However, recent work has shown that the cleaved prodomain can be secreted and bind as a ligand to novel receptors (e.g., SorCS2) [27]. Thus, the BDNF prodomain can influence brain circuits as well as behavior [7]. For a comprehensive, detailed, analysis of the various intricacies of the BDNF gene, protein, and its regulation, more information is provided in [29].

The conservation of BDNF and neurotrophins: a signal that evolution is important

One of the interesting curiosities surrounding BDNF is its relationship to other neurotrophic (NT) growth factors, comprising NGF, NT-3, and NT-4. Specifically, all neurotrophins retain some intercalated functionality. Neurotrophins also share some commonalities in structure (pre-, pro-, and mature-domains) [29], posttranslational modification potential (e.g., glycosylation [35]), as well as catalytic processing, trafficking, and composition [36]. Specifically, neurotrophins share approximately 50% sequence homology [29], and a comparison of domains and motifs reveals that each comprises a prototypic NGF domain as the principal component of the mature pro-growth peptide (see PFAM database [37]). While each neurotrophin elicits functionality via binding to cognate receptors, neurotrophins also exhibit cross-affinity amongst neurotrophin receptors [38] presumably due to their high rates of structural homology. Not surprisingly then, there is some redundancy in the trophic effects of neurotrophins, yet each still maintains nuanced functionality which remains specific to each factor during central nervous system development [39]. Differences in the evolution and temporal dynamics of regulatory sequences, which target gene products to specific destinations within cell-compartments (e.g., dendrites) [40] or to processing routes (e.g., the activity-dependent release pathway) which alter secretory dynamics and/or bioavailability [41], likely contribute to both similarities and differences between neurotrophins. However, almost nothing is known about how the BDNF prodomain has evolutionarily adapted to specifically regulate BDNF dynamics. While evolution has almost certainly shaped the sequences, structure, and function of BDNF, the modeling of such remains relatively unexplored but could provide important insight into the phylogenetic evolutionary history of BDNF, its selection pressure sensitivity across lineages, and quantitative metrics of evolutionary change across species.

Purpose of this Study

Here, we use computational methods to explore the molecular evolution of BDNF. To reconstruct phylogenetic trees of BDNF, we utilized sequence alignments of over 160 mammalian species (all available mammalian sequences) to determine the genomic attributes of BDNF evolution that are specific to Mammalia. This analysis was specific by being constrained to sequences that have the most direct evolutionary relevance to humans. Notably, we sought to identify sites in BDNF that are subject to pervasive (i.e., consistently across the entire phylogeny) diversifying selection (FEL) or pervasive/episodic (i.e., only on a single lineage or subset of lineages, diversifying selection (MEME). Likewise, utilizing multiple models for the inference of selective pressure and the evaluation of evolutionary change, we identify novel sites within the BDNF prodomain and mature peptide coding regions that are susceptible to synonymous and nonsynonymous changes. Additionally, we investigate which sites in BDNF may be coevolving (BGM). Taken together, these computational evolutionary analyses provide an important context as to the origins and sensitivity of genetic changes within the BDNF gene, which may be important for providing insight into genetic risk factors linked to disease in humans.

Results

We find that unique evolutionary pressures have shaped the BDNF gene across time. These forces have mostly operated through strict purifying selection. Of note, BDNF elicits tight regulation and specific functionality that can be separated from other neurotrophins, yet these growth factors remain closely related in their structure and sequence, especially in the conserved NGF domain.

Evolutionary history of mammalian BDNF

Prior to conducting our primary evolutionary analysis, we ported our mammalian species into a platform (timetree.org, see refs. [42, 43]) to examine the epoch events that may have influenced the analysis described here. This was an important pre-analysis step to frame the age of our genomes, and the broad-stroke evolutionary pressures that these species have been exposed to (which, in theory, could contribute to subsequent purifying selection and coevolution analyses). As expected, this revealed BDNF as an ancient gene that has been preserved throughout the mammalian lineage and has both survived and been shaped under all major evolutionary events of the past ~177 million years (data not shown). We identified several examples of species-level evolutionary epochs that cross-referenced with major earth events (e.g., bottleneck events) that have historically been believed to drive evolutionary adaptation. This included major geologic periods that are cross-referenced against earth impacts, oxygenation changes across time, atmospheric carbon dioxide concentrations, and solar luminosity. This indicates that even under extreme evolutionary pressures, the BDNF gene has exhibited (relatively speaking) very specific adaptation events (see results below) over millions of years within Mammalia. This tracks with the idea that “old genes” tend to be highly conserved, evolve more slowly, and therefore are more likely to exhibit both specific and selective changes as opposed to more dramatic permutations (e.g., gene duplications, etc.).

Predominant purifying selection in BDNF

A common approach to gain an increased understanding of the evolutionary forces that have shaped proteins is to measure the omega ratio ω consisting of the nonsynonymous (β or dN) and synonymous (α or dS) substitution rates, with ω = β/α for each site in a particular gene of interest [44]. We define two major changes for the amino acid being coded for at each site: synonymous changes, which keep the same amino acid coded for at a particular site, and nonsynonymous changes, which change the amino acid coded for at a particular site. Non-synonymous changes can have strong influences on the structural, functional, and fitness measures of an organism. This is in contrast to synonymous changes which leave the amino acid at a particular site unchanged but can confer weak fitness effects through the emergent properties of codon usage bias, mRNA structural stability, translation, and tRNA availability. However, synonymous changes are typically understood to represent neutral selection acting on coding sequences and provide a baseline rate against which nonsynonymous evolutionary rates can be compared. The omega ratio ω of relative rates of nonsynonymous and synonymous substitutions is a common measure in evolutionary biology of the selective pressure acting on protein-coding sequences. These estimates provide increased information availability as to the type of selection (positive, with omega >1 or negative, with omega <1, or neutral with omega =1) that has acted upon any given set of protein-coding sequences. As FEL analysis is a sensitive measure of negative (purifying) selection, for this analysis we observe a predominant amount of purifying selection (over 66% of sites, 174 sites out of 261; Table S1) in our recombination-free alignment for BDNF. The dN/dS estimates for the entire alignment were plotted including 95% lower- and upper-bound estimates (see Fig. 1 or Table S1). Overwhelmingly, the mature NGF domain of the BDNF exhibited evidence of greater pervasive negative purifying selection relative to the prodomain region of BDNF. Thus over the evolutionary history of Mammalia, negative selection has predominantly occurred in the regions of BDNF that encode the functional mature protein that binds TrkB to elicit neurotrophic effects. The mature domain of BDNF has exhibited remarkable conservation across innumerous epochs that have been defined by rapid evolutionary adaptation in other genes and taxa.
Fig. 1

Mammalian BDNF exhibits overwhelmingly strict purifying selection, but also evidence of specific evolutionary pressures at particular sites.

The FEL analysis of the BDNF gene found 174 of 261 (66.7%) sites to be statistically significant (LRT p value ≤ 0.1) for pervasive negative (purifying) selection. We plot the estimated values of omega (dN/dS) for each site in the alignment. Additionally, we plot 95% confidence intervals (CI) for each site. These results are also available in Table S1. We observe a high degree of strict purifying selection in the Human NGF region. The region for Human NGF corresponds to alignment sites 144–254 (NP_001700.2 and https://github.com/aglucaci/AnalysisOfOrthologousCollections/blob/main/tables/BDNF/BDNF_AlignmentMap.csv). This alignment of BDNF across all selected species (Mammalia, see Table 3) reveals a site-specific positive/adaptive diversifying selection and negative purifying selection. The thick line represents the point estimate (i.e., the evolutionary pressure) and the shadings reflect 95% confidence intervals which relate to the upper and lower bound of the point estimates. As shown, the prodomain sites exhibit more pervasive/episodic and positive/diversifying evolutionary selection, consistent with the fact that more disease-associated SNPs occur in this topological region of the BDNF gene in humans (early prodomain mapping not further shown due to nuanced variation across mammalian species).

Mammalian BDNF exhibits overwhelmingly strict purifying selection, but also evidence of specific evolutionary pressures at particular sites.

The FEL analysis of the BDNF gene found 174 of 261 (66.7%) sites to be statistically significant (LRT p value ≤ 0.1) for pervasive negative (purifying) selection. We plot the estimated values of omega (dN/dS) for each site in the alignment. Additionally, we plot 95% confidence intervals (CI) for each site. These results are also available in Table S1. We observe a high degree of strict purifying selection in the Human NGF region. The region for Human NGF corresponds to alignment sites 144–254 (NP_001700.2 and https://github.com/aglucaci/AnalysisOfOrthologousCollections/blob/main/tables/BDNF/BDNF_AlignmentMap.csv). This alignment of BDNF across all selected species (Mammalia, see Table 3) reveals a site-specific positive/adaptive diversifying selection and negative purifying selection. The thick line represents the point estimate (i.e., the evolutionary pressure) and the shadings reflect 95% confidence intervals which relate to the upper and lower bound of the point estimates. As shown, the prodomain sites exhibit more pervasive/episodic and positive/diversifying evolutionary selection, consistent with the fact that more disease-associated SNPs occur in this topological region of the BDNF gene in humans (early prodomain mapping not further shown due to nuanced variation across mammalian species).
Table 3

Tabulation of species included in our analysis, comprising NCBI ortholog gene IDs, symbols, mammalian species, common name, and RefSeq accessions.

Gene IDGene symbolDescriptionScientific nameCommon nameRefSeq transcript accessionsRefSeq protein accessions
627BDNFbrain-derived neurotrophic factorHomo sapienshumanNM_001709.5NP_001700.2
12064Bdnfbrain-derived neurotrophic factorMus musculushouse mouseNM_001048142.1NP_001041607.1
24225Bdnfbrain-derived neurotrophic factorRattus norvegicusNorway ratNM_001270630.1NP_001257559.1
397495BDNFbrain-derived neurotrophic factorSus scrofapigXM_005654684.3XP_005654741.1
403461BDNFbrain-derived neurotrophic factorCanis lupus familiarisdogXM_038429434.1XP_038285362.1
493690BDNFbrain-derived neurotrophic factorFelis catusdomestic catNM_001009828.1NP_001009828.1
503511BDNFbrain-derived neurotrophic factorPan troglodyteschimpanzeeNM_001012441.1NP_001012443.1
554233BDNFbrain-derived neurotrophic factorMonodelphis domesticagray short-tailed opossumXM_007497196.2XP_007497258.1
617701BDNFbrain-derived neurotrophic factorBos tauruscattleXM_005216334.4XP_005216391.1
701245BDNFbrain-derived neurotrophic factorMacaca mulattaRhesus monkeyXM_015114598.2XP_014970084.1
100009689BDNFbrain-derived neurotrophic factorEquus caballushorseNM_001081787.1NP_001075256.1
100081142BDNFbrain-derived neurotrophic factorOrnithorhynchus anatinusplatypusXM_029059317.2XP_028915150.1
100356949BDNFbrain-derived neurotrophic factorOryctolagus cuniculusrabbitXM_017345633.1XP_017201122.1
100409412BDNFbrain-derived neurotrophic factorCallithrix jacchuswhite-tufted-ear marmosetXM_009007854.3XP_009006102.1
100447350BDNFbrain-derived neurotrophic factorPongo abeliiSumatran orangutanXM_002821931.2XP_002821977.1
100467162BDNFbrain-derived neurotrophic factorAiluropoda melanoleucagiant pandaXM_011226480.3XP_011224782.2
100594402BDNFbrain-derived neurotrophic factorNomascus leucogenysnorthern white-cheeked gibbonXM_003254347.2XP_003254395.1
100667885BDNFbrain-derived neurotrophic factorLoxodonta africanaAfrican savanna elephantXM_023550772.1XP_023406540.1
100730257Bdnfbrain-derived neurotrophic factorCavia porcellusdomestic guinea pigXM_013147262.2XP_013002716.1
100768664Bdnfbrain-derived neurotrophic factorCricetulus griseusChinese hamsterXM_007653166.4XP_007651356.1
100934810BDNFbrain-derived neurotrophic factorSarcophilus harrisiiTasmanian devilXM_031944159.1XP_031800019.1
100958946BDNFbrain-derived neurotrophic factorOtolemur garnettiismall-eared galagoXM_012814047.1XP_012669501.1
100983866BDNFbrain-derived neurotrophic factorPan paniscuspygmy chimpanzeeXM_034932318.1XP_034788209.1
101007866BDNFbrain-derived neurotrophic factorPapio anubisolive baboonXM_017948537.3XP_017804026.1
101037414BDNFbrain-derived neurotrophic factorSaimiri boliviensisBolivian squirrel monkeyXM_003919940.3XP_003919989.2
101117275BDNFbrain-derived neurotrophic factorOvis ariessheepXM_012096129.4XP_011951519.2
101134399BDNFbrain-derived neurotrophic factorGorilla gorillawestern gorillaXM_004050851.2XP_004050899.1
101281702BDNFbrain-derived neurotrophic factorOrcinus orcakiller whaleXM_004263941.3XP_004263989.1
101338565BDNFbrain-derived neurotrophic factorTursiops truncatuscommon bottlenose dolphinXM_019947883.2XP_019803442.1
101350169LOC101350169brain-derived neurotrophic factorTrichechus manatus latirostrisFlorida manateeXM_004369733.1XP_004369790.1
101372507BDNFbrain-derived neurotrophic factorOdobenus rosmarus divergensPacific walrusXM_004408653.1XP_004408710.1
101398458LOC101398458brain-derived neurotrophic factorCeratotherium simum simumsouthern white rhinocerosXM_004418542.2XP_004418599.1
101428552BDNFbrain-derived neurotrophic factorDasypus novemcinctusnine-banded armadilloXM_012523604.2XP_012379058.1
101528222BDNFbrain-derived neurotrophic factorOchotona princepsAmerican pikaXM_004585440.1XP_004585497.1
101553155BDNFbrain-derived neurotrophic factorSorex araneusEuropean shrewXM_004607860.1XP_004607917.1
101566518Bdnfbrain-derived neurotrophic factorOctodon degusdeguXM_004644113.2XP_004644170.1
101600646Bdnfbrain-derived neurotrophic factorJaculus jaculuslesser Egyptian jerboaXM_004660846.2XP_004660903.1
101632951BDNFbrain-derived neurotrophic factorCondylura cristatastar-nosed moleXM_004682964.2XP_004683021.1
101643198BDNFbrain-derived neurotrophic factorEchinops telfairismall Madagascar hedgehogXM_004708253.2XP_004708310.1
101687038BDNFbrain-derived neurotrophic factorMustela putorius furodomestic ferretXM_004755891.2XP_004755948.1
101704465Bdnfbrain-derived neurotrophic factorHeterocephalus glabernaked mole-ratXM_004851581.3XP_004851638.1
101837384Bdnfbrain-derived neurotrophic factorMesocricetus auratusgolden hamsterXM_005064810.4XP_005064867.1
101954451Bdnfbrain-derived neurotrophic factorIctidomys tridecemlineatusthirteen-lined ground squirrelXM_040284582.1XP_040140516.1
101993181Bdnfbrain-derived neurotrophic factorMicrotus ochrogasterprairie voleXM_005364052.2XP_005364109.1
102016703Bdnfbrain-derived neurotrophic factorChinchilla lanigeralong-tailed chinchillaXM_005401751.2XP_005401808.1
102126294BDNFbrain-derived neurotrophic factorMacaca fasciculariscrab-eating macaqueXM_005578346.2XP_005578403.1
102180782BDNFbrain-derived neurotrophic factorCapra hircusgoatXM_005690025.2XP_005690082.2
102247090BDNFbrain-derived neurotrophic factorMyotis brandtiiBrandt’s batXM_014545223.1XP_014400709.1
102279714BDNFbrain-derived neurotrophic factorBos mutuswild yakXM_005890667.2XP_005890729.1
102395046BDNFbrain-derived neurotrophic factorBubalus bubaliswater buffaloXM_006068006.2XP_006068068.1
102440705BDNFbrain-derived neurotrophic factorMyotis lucifuguslittle brown batXM_023756374.1XP_023612142.1
102475289BDNFbrain-derived neurotrophic factorTupaia chinensisChinese tree shrewXM_014591303.2XP_014446789.1
102511705BDNFbrain-derived neurotrophic factorCamelus ferusWild Bactrian camelXM_006195083.3XP_006195145.1
102532564BDNFbrain-derived neurotrophic factorVicugna pacosalpacaXM_031681074.1XP_031536934.1
102730927BDNFbrain-derived neurotrophic factorLeptonychotes weddelliiWeddell sealXM_031025064.1XP_030880924.1
102754946BDNFbrain-derived neurotrophic factorMyotis davidiiXM_006753948.2XP_006754011.1
102839210BDNFbrain-derived neurotrophic factorChrysochloris asiaticaCape golden moleXM_006864851.1XP_006864913.1
102870215BDNFbrain-derived neurotrophic factorElephantulus edwardiiCape elephant shrewXM_006883713.1XP_006883775.1
102896087BDNFbrain-derived neurotrophic factorPteropus alectoblack flying foxXM_006907988.2XP_006908050.1
102911929Bdnfbrain-derived neurotrophic factorPeromyscus maniculatus bairdiiprairie deer mouseXM_006980818.2XP_006980880.1
102958723BDNFbrain-derived neurotrophic factorPanthera tigris altaicaAmur tigerXM_015544869.1XP_015400355.1
102977934BDNFbrain-derived neurotrophic factorPhyseter catodonsperm whaleXM_028501134.1XP_028356935.1
103017938BDNFbrain-derived neurotrophic factorBalaenoptera acutorostrata scammoniXM_007180991.1XP_007181053.1
103085069BDNFbrain-derived neurotrophic factorLipotes vexilliferYangtze River dolphinXM_007461679.1XP_007461741.1
103126099BDNFbrain-derived neurotrophic factorErinaceus europaeuswestern European hedgehogXM_016194241.1XP_016049727.1
103206496BDNFbrain-derived neurotrophic factorOrycteropus afer aferXM_007951952.1XP_007950143.1
103238537BDNFbrain-derived neurotrophic factorChlorocebus sabaeusgreen monkeyXM_008003703.2XP_008001894.1
103252117BDNFbrain-derived neurotrophic factorCarlito syrichtaPhilippine tarsierXM_008050727.1XP_008048918.1
103292699BDNFbrain-derived neurotrophic factorEptesicus fuscusbig brown batXM_008148947.2XP_008147169.1
103552694BDNFbrain-derived neurotrophic factorEquus przewalskiiPrzewalski’s horseXM_008522855.1XP_008521077.1
103599960BDNFbrain-derived neurotrophic factorGaleopterus variegatusSunda flying lemurXM_008584165.1XP_008582387.1
103659367BDNFbrain-derived neurotrophic factorUrsus maritimuspolar bearXM_008686880.2XP_008685102.2
103748489Bdnfbrain-derived neurotrophic factorNannospalax galiliUpper Galilee mountains blind mole-ratXM_029555090.1XP_029410950.1
104671126BDNFbrain-derived neurotrophic factorRhinopithecus roxellanagolden snub-nosed monkeyXM_010374529.2XP_010372831.1
104873642Bdnfbrain-derived neurotrophic factorFukomys damarensisDamara mole-ratXM_033764060.1XP_033619951.1
104982427BDNFbrain-derived neurotrophic factorBison bison bisonXM_010831788.1XP_010830090.1
105076551BDNFbrain-derived neurotrophic factorCamelus bactrianusBactrian camelXM_010964743.1XP_010963045.1
105106448BDNFbrain-derived neurotrophic factorCamelus dromedariusArabian camelXM_031459858.1XP_031315718.1
105293721BDNFbrain-derived neurotrophic factorPteropus vampyruslarge flying foxXM_011362565.2XP_011360867.1
105463281BDNFbrain-derived neurotrophic factorMacaca nemestrinapig-tailed macaqueXM_011710310.2XP_011708612.1
105507710BDNFbrain-derived neurotrophic factorColobus angolensis palliatusXM_011936307.1XP_011791697.1
105528120BDNFbrain-derived neurotrophic factorMandrillus leucophaeusdrillXM_011964715.1XP_011820105.1
105572781BDNFbrain-derived neurotrophic factorCercocebus atyssooty mangabeyXM_012031697.1XP_011887087.1
105717219BDNFbrain-derived neurotrophic factorAotus nancymaaeMa’s night monkeyXM_012452238.1XP_012307661.1
105814556BDNFbrain-derived neurotrophic factorPropithecus coquereliCoquerel’s sifakaXM_012649556.1XP_012505010.1
105860148BDNFbrain-derived neurotrophic factorMicrocebus murinusgray mouse lemurXM_020286534.1XP_020142123.1
105988089Bdnfbrain-derived neurotrophic factorDipodomys ordiiOrd’s kangaroo ratXM_013019592.1XP_012875046.1
106824927BDNFbrain-derived neurotrophic factorEquus asinusassXM_014831462.1XP_014686948.1
106970278BDNFbrain-derived neurotrophic factorAcinonyx jubatuscheetahXM_027072960.1XP_026928761.1
107134338Bdnfbrain-derived neurotrophic factorMarmota marmota marmotaAlpine marmotXM_015476449.1XP_015331935.1
107512170BDNFbrain-derived neurotrophic factorRousettus aegyptiacusEgyptian rousetteXM_036221046.1XP_036076939.1
107531942BDNFbrain-derived neurotrophic factorMiniopterus natalensisXM_016206154.1XP_016061640.1
108311306BDNFbrain-derived neurotrophic factorCebus imitatorPanamanian white-faced capuchinXM_017538722.1XP_017394211.1
108401589BDNFbrain-derived neurotrophic factorManis javanicaMalayan pangolinXM_017667612.2XP_017523101.2
108519651BDNFbrain-derived neurotrophic factorRhinopithecus bietiblack snub-nosed monkeyXM_017858805.1XP_017714294.1
109271282BDNFbrain-derived neurotrophic factorPanthera pardusleopardXM_019456577.1XP_019312122.1
109387036BDNFbrain-derived neurotrophic factorHipposideros armigergreat roundleaf batXM_019650718.1XP_019506263.1
109569212BDNFbrain-derived neurotrophic factorBos indicuszebu cattleXM_019974522.1XP_019830081.1
109695020Bdnfbrain-derived neurotrophic factorCastor canadensisAmerican beaverXM_020177315.1XP_020032904.1
110143608BDNFbrain-derived neurotrophic factorOdocoileus virginianus texanusXM_020903256.1XP_020758915.1
110202027BDNFbrain-derived neurotrophic factorPhascolarctos cinereuskoalaXM_020977976.1XP_020833635.1
110285237Bdnfbrain-derived neurotrophic factorMus caroliRyukyu mouseXM_021151236.2XP_021006895.1
110318089Bdnfbrain-derived neurotrophic factorMus paharishrew mouseXM_021192829.2XP_021048488.1
110576669BDNFbrain-derived neurotrophic factorNeomonachus schauinslandiHawaiian monk sealXM_021685880.1XP_021541555.1
111150176LOC111150176brain-derived neurotrophic factorEnhydra lutris kenyoniXM_022507574.1XP_022363282.1
111179820BDNFbrain-derived neurotrophic factorDelphinapterus leucasbeluga whaleXM_022583733.1XP_022439441.1
111554690BDNFbrain-derived neurotrophic factorPiliocolobus tephroscelesUgandan red ColobusXM_023230347.2XP_023086115.1
112303343BDNFbrain-derived neurotrophic factorDesmodus rotunduscommon vampire batXM_024558903.1XP_024414671.1
112412908BDNFbrain-derived neurotrophic factorNeophocaena asiaeorientalis asiaeorientalisYangtze finless porpoiseXM_024764694.1XP_024620462.1
112607193BDNFbrain-derived neurotrophic factorTheropithecus geladageladaXM_025358305.1XP_025214090.1
112667789BDNFbrain-derived neurotrophic factorCanis lupus dingodingoXM_025459816.2XP_025315601.1
112829628BDNFbrain-derived neurotrophic factorCallorhinus ursinusnorthern fur sealXM_025879326.1XP_025735111.1
112865997BDNFbrain-derived neurotrophic factorPuma concolorpumaXM_025929035.1XP_025784820.1
112935019BDNFbrain-derived neurotrophic factorVulpes vulpesred foxXM_026018600.1XP_025874385.1
113199355Bdnfbrain-derived neurotrophic factorUrocitellus parryiiArctic ground squirrelXM_026412272.1XP_026268057.1
113265088BDNFbrain-derived neurotrophic factorUrsus arctos horribilisXM_026512203.1XP_026367988.1
113624889BDNFbrain-derived neurotrophic factorLagenorhynchus obliquidensPacific white-sided dolphinXM_027115138.1XP_026970939.1
113905455BDNFbrain-derived neurotrophic factorBos indicus x Bos taurushybrid cattleXM_027562786.1XP_027418587.1
113913517BDNFbrain-derived neurotrophic factorZalophus californianusCalifornia sea lionXM_027577807.1XP_027433608.1
114036167BDNFbrain-derived neurotrophic factorVombatus ursinuscommon wombatXM_027852578.1XP_027708379.1
114093712Bdnfbrain-derived neurotrophic factorMarmota flaviventrisyellow-bellied marmotXM_027936596.1XP_027792397.1
114220007BDNFbrain-derived neurotrophic factorEumetopias jubatusSteller sea lionXM_028117708.1XP_027973509.1
114500881BDNFbrain-derived neurotrophic factorPhyllostomus discolorpale spear-nosed batXM_028517638.2XP_028373439.1
114620789Bdnfbrain-derived neurotrophic factorGrammomys surdasterXM_028766921.1XP_028622754.1
114702098Bdnfbrain-derived neurotrophic factorPeromyscus leucopuswhite-footed mouseXM_037204438.1XP_037060333.1
114886864BDNFbrain-derived neurotrophic factorMonodon monocerosnarwhalXM_029207934.1XP_029063767.1
115306639BDNFbrain-derived neurotrophic factorSuricata suricattameerkatXM_029957152.1XP_029813012.1
115525821BDNFbrain-derived neurotrophic factorLynx canadensisCanada lynxXM_030333208.1XP_030189068.1
115857612BDNFbrain-derived neurotrophic factorGlobicephala melaslong-finned pilot whaleXM_030864833.1XP_030720693.1
116090620Bdnfbrain-derived neurotrophic factorMastomys couchasouthern multimammate mouseXM_031371313.1XP_031227173.1
116476405BDNFbrain-derived neurotrophic factorHylobates molochsilvery gibbonXM_032166370.1XP_032022261.1
116555191BDNFbrain-derived neurotrophic factorSapajus apellatufted capuchinXM_032283691.1XP_032139582.1
116599082BDNFbrain-derived neurotrophic factorMustela ermineaermineXM_032358760.1XP_032214651.1
116638304BDNFbrain-derived neurotrophic factorPhoca vitulinaharbor sealXM_032414183.1XP_032270074.1
116758559BDNFbrain-derived neurotrophic factorPhocoena sinusvaquitaXM_032641481.1XP_032497372.1
116881443BDNFbrain-derived neurotrophic factorLontra canadensisNorthern American river otterXM_032881246.1XP_032737137.1
116901726Bdnfbrain-derived neurotrophic factorRattus rattusblack ratXM_032903861.1XP_032759752.1
117029758BDNFbrain-derived neurotrophic factorRhinolophus ferrumequinumgreater horseshoe batXM_033118944.1XP_032974835.1
117080419BDNFbrain-derived neurotrophic factorTrachypithecus francoisiFrancois’s langurXM_033205475.1XP_033061366.1
117704234Bdnfbrain-derived neurotrophic factorArvicanthis niloticusAfrican grass ratXM_034496309.1XP_034352200.1
118015716BDNFbrain-derived neurotrophic factorMirounga leoninaSouthern elephant sealXM_035012477.1XP_034868368.1
118546649BDNFbrain-derived neurotrophic factorHalichoerus grypusgray sealXM_036109493.1XP_035965386.1
118582264Bdnfbrain-derived neurotrophic factorOnychomys torridussouthern grasshopper mouseXM_036184983.1XP_036040876.1
118628031BDNFbrain-derived neurotrophic factorMolossus molossusPallas’s mastiff batXM_036258707.1XP_036114600.1
118664943BDNFbrain-derived neurotrophic factorMyotis myotisXM_036325692.1XP_036181585.1
118713266BDNFbrain-derived neurotrophic factorPipistrellus kuhliiKuhl’s pipistrelleXM_036427676.1XP_036283569.1
118853508BDNFbrain-derived neurotrophic factorTrichosurus vulpeculacommon brushtailXM_036763631.1XP_036619526.1
118900201BDNFbrain-derived neurotrophic factorBalaenoptera musculusBlue whaleXM_036862552.1XP_036718447.1
118909009BDNFbrain-derived neurotrophic factorManis pentadactylaChinese pangolinXM_036879079.1XP_036734974.1
118983123BDNFbrain-derived neurotrophic factorSturnira hondurensisXM_037040421.1XP_036896316.1
119049902BDNFbrain-derived neurotrophic factorArtibeus jamaicensisJamaican fruit-eating batXM_037146035.1XP_037001930.1
119255407BDNFbrain-derived neurotrophic factorTalpa occidentalisIberian moleXM_037522538.1XP_037378435.1
119536361BDNFbrain-derived neurotrophic factorCholoepus didactylussouthern two-toed slothXM_037839114.1XP_037695042.1
119815449Bdnfbrain-derived neurotrophic factorArvicola amphibiusEurasian water voleXM_038331579.1XP_038187507.1
119943514BDNFbrain-derived neurotrophic factorTachyglossus aculeatusAustralian echidnaXM_038764545.1XP_038620473.1
120220096BDNFbrain-derived neurotrophic factorHyaena hyaenastriped hyenaXM_039217098.1XP_039073029.1
120605221BDNFbrain-derived neurotrophic factorPteropus giganteusIndian flying foxXM_039866065.1XP_039721999.1
120876831BDNFbrain-derived neurotrophic factorOryx dammahscimitar-horned oryxXM_040258919.1XP_040114853.1
121043625BDNFbrain-derived neurotrophic factorPuma yagouaroundijaguarundiXM_040495572.1XP_040351506.1
121156520BDNFbrain-derived neurotrophic factorOchotona curzoniaeblack-lipped pikaXM_040980297.1XP_040836231.1
121465693Bdnfbrain-derived neurotrophic factorMicrotus oregonicreeping voleXM_041679035.1XP_041534969.1
121476611BDNFbrain-derived neurotrophic factorVulpes lagopusArctic foxXM_041730708.1XP_041586642.1

Specific sites that are evolving non-neutrally

To examine specific sites for episodic adaptive evolutionary selection, we utilized an algorithm known as MEME which is fundamentally similar to our FEL analysis (described above) except that it applies a more sensitive method for the detection of both pervasive (persistent) and episodic selection (transient selection occurring only on one or a subset of branches in the phylogenetic tree) as compared to only pervasive selection which occurs across all branches of the phylogenetic tree. Essentially, only a subset of the lineages (i.e., species) are affected allowing for a more granular/sensitive method of detecting selection (whereas FEL is better geared towards broad changes). This analysis revealed that for all sites, only 2.3% (6 of 261; see Table 1) exhibit evidence for episodic diversifying selection (i.e., positive selection) in at least one branch within the phylogeny. Spatially, these mutations occur outside of the NGF functional region of BDNF. Further, this result is essentially relevant as the MEME analysis is a sensitive measure of episodic selection. The sites we observe as statistically significant were 26, 27, 30, 38, 249, and 254. For comparison, these specific sites were realigned to the respective human sites with indel (insertion/deletion) events accounting for any respective discrepancy in specific site numbers. When mapping these sites to the human BDNF coordinate system, they correspond to sites 26, 27, 29, 36, 238, and 240, respectively.
Table 1

MEME analysis of the BDNF gene found 6 of 261 (2.3%) of sites to be statistically significant (LRT p value ≤ 0.1).

#CodonSiteHuman CodonSitealphabeta-p-beta+p+LRTp value# branches under selectionMEME LogLFEL LogLOmega
126260.140.000.021.120.986.890.019−86.53−86.537.9
227270.250.080.9928.910.014.300.051−39.25−37.01117.6
330290.000.000.000.381.004.630.056−35.88−35.88inf
438360.720.000.938.070.073.800.075−69.22−66.2811.3
52492380.920.000.99203.110.018.250.011−50.74−44.13221.6
62542400.230.000.991357.120.0121.810.001−44.50−31.955850.4
MEME analysis of the BDNF gene found 6 of 261 (2.3%) of sites to be statistically significant (LRT p value ≤ 0.1).

Evidence of coevolutionary forces

To examine the coevolution of sites, i.e., if one particular amino acid was evolving in-tandem with another, we subjected our protein-coding gene sequences to the BGM algorithm which leverages Bayesian graphical models [45]. The BGM algorithm infers substitution history through the use of maximum-likelihood analyses for ancestral sequences and maps these to the phylogenetic tree, which allows for the detection of correlated patterns of substitution [45]. For our BGM analysis, we find evidence for 23 pairs of coevolving sites. This suggests interaction dynamics in tertiary space of the 3D, folded, protein level (see relevant sites in Fig. 2) BDNF protein structure. Alternatively, this data may be evidence that coevolving sites may be related to other fitness consequences (e.g., compensation) for maladaptive changes in another part of the protein sequence that may have occurred. When we review these sites, we notice that several pairs (see Fig. 2) occur within alignment sites, which correspond to the Human BDNF coordinate system (Table 1). These include pair-sites of (89, 184), (94, 155), (103, 233), and (135, 154). Of note, several other sites also display interesting geometric features including triangular relations [(81, 93), (93, 98), and (81, 98)], an acyclic graph network of site connections [(70, 74), (74, 94), (94, 155) and (25, 49), (49, 85), (49, 86)], more complex double-linked coevolutionary sites [(39, 103) and (103, 233)], and triple-linked coevolutionary sites [(30, 119), (33, 119), and (91, 119)]. Additionally, three-dimensional reconstruction—here focusing on a specific heterodimer configuration of BDNF and NT-4 as an example of a spatial protein–protein interaction—highlights that coevolving sites, as well as positively evolving sites, are likely to have been fine-tuned over time to help support BDNF’s cognate functionality (see Fig. 3). Mapping our FEL purifying sites in a structural configuration was not shown due to the overwhelming nature of negative selection acting on BDNF within mammals.
Fig. 2

The BGM analysis of BDNF found 23 pairs of coevolving sites out of 261 total sites to be statistically significant (with a posterior probability threshold of 0.5).

Here, we plot only the statistically significant coevolving pairs. The number of shared substitutions between pairs of coevolving sites is visualized by the size of the circle, with larger circles indicating more shared substitutions. Poster probability of the interaction (coevolving pair) corresponds to the color blue, with dark blue indicating higher values. Individual BDNF sites are mapped on both the X and Y axis so that readers can view which sites are coevolving with another. Once more, note that the coevolution tends to be focally constrained to the broader BDNF prodomain region at a topological level, which is once more consistent with the idea that the NGF domain (site >144; see Fig. 3) is highly conserved and probably deleteriously impacted by variation. However, we did discover four sites of coevolution in the NGF domain (basically, the mature BDNF protein) that are evolving with early prodomain sites. This highlights that both proximal and distal sites in BDNF can, and indeed are, evolving together over time.

Fig. 3

BDNF NT-4 heterodimer structural analysis to highlight coevolving and adaptive sites at the 3D protein level.

Demonstrating the structural configuration of the BDNF (blue) and NT-4 (pink) heterodimer (see https://www.rcsb.org/structure/1B8M), with rotations (arbitrary degrees) shown to accentuate the view of coevolving sites (orange, see also Fig. 2 and Table 2) and positively evolving sites (red; see Table 1). The PDB structure is limited to the NGF domain which limits our ability to highlight sites of interest (SOI), therefore we have limited our annotation only to the modeled sites in the structure. The relative positioning of coevolving and positively evolving sites in this heterodimer visualization are in proximity to looping and other macro tertiary structures of protein. An interactive figure that is rotatable in 3D space, where occupations occur in three dimensions (i.e., teasing out relative proximity in 2D linear space), is available here: https://observablehq.com/@aglucaci/bdnf-structure.

The BGM analysis of BDNF found 23 pairs of coevolving sites out of 261 total sites to be statistically significant (with a posterior probability threshold of 0.5).

Here, we plot only the statistically significant coevolving pairs. The number of shared substitutions between pairs of coevolving sites is visualized by the size of the circle, with larger circles indicating more shared substitutions. Poster probability of the interaction (coevolving pair) corresponds to the color blue, with dark blue indicating higher values. Individual BDNF sites are mapped on both the X and Y axis so that readers can view which sites are coevolving with another. Once more, note that the coevolution tends to be focally constrained to the broader BDNF prodomain region at a topological level, which is once more consistent with the idea that the NGF domain (site >144; see Fig. 3) is highly conserved and probably deleteriously impacted by variation. However, we did discover four sites of coevolution in the NGF domain (basically, the mature BDNF protein) that are evolving with early prodomain sites. This highlights that both proximal and distal sites in BDNF can, and indeed are, evolving together over time.

BDNF NT-4 heterodimer structural analysis to highlight coevolving and adaptive sites at the 3D protein level.

Demonstrating the structural configuration of the BDNF (blue) and NT-4 (pink) heterodimer (see https://www.rcsb.org/structure/1B8M), with rotations (arbitrary degrees) shown to accentuate the view of coevolving sites (orange, see also Fig. 2 and Table 2) and positively evolving sites (red; see Table 1). The PDB structure is limited to the NGF domain which limits our ability to highlight sites of interest (SOI), therefore we have limited our annotation only to the modeled sites in the structure. The relative positioning of coevolving and positively evolving sites in this heterodimer visualization are in proximity to looping and other macro tertiary structures of protein. An interactive figure that is rotatable in 3D space, where occupations occur in three dimensions (i.e., teasing out relative proximity in 2D linear space), is available here: https://observablehq.com/@aglucaci/bdnf-structure.
Table 2

The BGM analysis of BDNF found 23 pairs of coevolving sites out of 261 total sites to be statistically significant (with a posterior probability threshold of 0.5).

#Site 1Site 2P [Site 1 –> Site 2]P [Site 2 –> Site 1]P [Site 1 < –> Site 2]Site 1 subsSite 2 subsShared subs
15670.520.320.84111
225490.0220.670.69863
3271090.120.450.57211
4301190.20.560.756124
5331190.120.720.843123
6391030.220.580.8843
749850.480.0490.53632
849860.360.550.91653
950690.310.270.58121
1062640.250.270.52111
1170740.170.380.557144
1270950.590.260.857215
1374940.0340.640.681474
1478900.460.380.84111
1581930.270.310.58111
1681980.270.330.61111
1782910.0450.920.9615299
18891840.240.50.74262
19911190.220.670.8929128
2093980.310.30.61111
21941550.280.290.57722
221032330.40.130.53432
231351540.40.430.82111

Discussion

In this study, we explore the evolutionary history of the BDNF gene in Mammalia. The BDNF gene is implicated in a number of human diseases including a variety of brain disorders such as neurodevelopmental disorders (e.g., autism spectrum disorder), neuropsychiatric disorders (e.g., depression, PTSD, and schizophrenia), and some neurodegenerative disorders [1]. By using orthologous BDNF sequences within the Mammalia taxonomic group, our results indicate that unique site-specific changes within BDNF have evolved over time. We performed a number of comparative evolutionary analyses to tease out signals from our orthologous gene collection in BDNF. Of note, the BDNF gene elicits tight regulation and specific functionality that can be separated from other neurotrophins, yet these growth factors remain closely related in their structure and sequence and conservation of the NGF functional domain. In the NGF domain, we observe a high degree of conservation (via purifying selection) across species, owing to the functional importance of this region in protein–protein interactions. This work additionally provides broad comparative insights into the evolutionary history of the BDNF gene family. Our MEME method identified novel substitutions (see Table 1) in regions of BDNF that may provide significant areas of interest for designing molecular therapeutic approaches, and their potential broader significance are outlined in further detail below.

Predominant purifying selection across BDNF in Mammalia

Over time, evolution drives the divergence of genetic sequences, but what can we learn from the direct comparison of the sequences of the BDNF gene in Mammalia? By comparing the BDNF products of orthologous sequences in different species, we observe the accumulation of mutations at different sites with varying degrees of insight into both BDNF functionality (see [29] for site annotation) and potential disease [1]. These are summarized in full within Table S1 and Table 1. Coding sequences with highly constrained structures are expected to fix nonsynonymous mutations at a slower rate due to the maladaptive nature of changes such as what we observe with FEL negatively selected sites across BDNF. Additionally, we observe a high degree of negative (purifying) selection across the main functional domain (NGF) of BDNF. While structures for the NGF domain in most species under analysis do not exist, based on our findings we expect a highly conserved tertiary structure. Based on the high degree of purifying selection observed across BDNF, we hypothesize that BDNF plays a critical role in the underlying network of genes governing homeostasis and normal organismal development. This may have happened because BDNF is particularly useful specifically for the phylogenetic branch in question (i.e., mammals). This interpretation is also consistent with the observation that BDNF is essential for normative development and is lethal in non-conditional full knock-out mammalian models.

Non-neutral positive diversifying evolution sites in the BDNF gene

It has been described that BDNF plays a particular role as a foundational gene for brain development [11]. Despite a significant level of purifying selection shaping the evolutionary history of BDNF (Fig. 1), we observe several novel statistically significant sites under positive episodic diversifying selection across the BDNF gene (see Table 1). Traditionally, the evolution of this variety consists of amino acid diversifying events that may promote phylogenetic adaptation and/or functionality. These results are entirely novel—they have not been previously reported (to the best of our knowledge) and MEME is an established and sensitive method for the analysis of episodic diversifying sites. Thus, the very specific and limited sites within BDNF to exhibit such patterns is a highly promising result from which to further disentangle BDNF's complex functionality and disease linkage. We would encourage biologists to consider these sites as those that may contain important adaptive functions within the BDNF gene. However, where our results fall within the context of a core protein–protein interaction network of required genes for neural cellular diversity and development is yet to be determined. We do note that at least one identified site (238) overlaps with potential posttranslational modifications to the human BDNF peptide (specifically, a disulfide bridge; see UniProt and [29]). This supports the idea that non-neutral positive diversifying sites within BDNF are not spurious and likely reflect specialized, regulatory, or functional capacities that may have yet to be annotated in full. Given that this manuscript is devoted to the analysis of BDNF’s evolution in mammals, we highlight the potential importance of these sites but emphasize that their importance remains a hypothesis that should be tested in well-defined experiments under controlled laboratory conditions.

Discovery of proximal and distal coevolving site-pairs in the BDNF gene

Another novel, and potentially important, series of findings in this manuscript was the presence of numerous sites that exhibit coevolution. In fact, we observe a significant number of coevolving sites within the BDNF gene (see Fig. 2 and Table 2), and these too reflect an entirely novel aspect of BDNF biology that has not previously been reported. Evidence of coevolving sites are not limited to a particular domain (e.g., prodomain vs mature) nor specific motifs. Instead, coevolving pairs seem to be distributed across the entirety of BDNF with, perhaps unsurprisingly, an increased density of interactions early in the prodomain region. However, we also note that there are coevolving sites in the mature NGF domain which are “linked” to early domain sites. Importantly, these relationships may confer strong epistatic interactions shaping the continued evolution of this critically important gene. The new evidence for coevolution may point to the importance of these sites in shaping the early regulatory or main functional (NGF) domain of BDNF. These residues may form important interactions for the functional integrity of BDNF and, importantly, the highly specific pairs which span the BDNF prodomain and its mature region point to a new mechanism by which the BDNF prodomain may have coregulated the mature domain (or vice versa). Alternatively, these coevolving pairs may be part of a network of residues occupying a shifted fitness landscape in order to accommodate new or species-specific functional requirements. The BGM analysis of BDNF found 23 pairs of coevolving sites out of 261 total sites to be statistically significant (with a posterior probability threshold of 0.5).

Potential structural implications of evolving sites

In considering our observation of both diversifying selection and coevolving sites in the BDNF gene, we considered the potential implications this may have at a protein structural level in three-dimensional space (see Fig. 3). While protein structural impacts from evolution remain poorly understood and cannot be completely experimentally disentangled in a confirmatory sense, the implications fall upon our understanding of basic BDNF neurobiology. Here we note that our BGM and FEL analyses implicate the prodomain—the primary topological region of BDNF known for polymorphic variability (e.g., Val66Met and Gln75His) that is often linked to disease [1, 11, 29], and our 3D modeling suggests that two of our coevolving sites appear to be associated with looping structures that could have important yet to be discovered functionality. In this regard, we predict that the evolutionary changes described here are likely to reflect some form of specialization and/or divergence in function and/or interaction partners at different points of BDNF’s evolutionary history in mammals. Thus, further work may unveil yet more novel sites that could provide further insight into the origins of BDNF’s diverse functionality and its role in disease.

Limitations of our computational evolutionary analysis

This analysis focused on BDNF sequences contained in the taxonomic group Mammalia in lieu of examining a more inclusive dataset for BDNF containing sequences from all of Gnathostomata (jawed vertebrates) or extension into invertebrate clades which may contain BDNF or BDNF-like analog genes. Our results are applicable to mammals, which are our intentional taxonomic group of study, but we nonetheless emphasize that our results do not capture the entirety of BDNF’s evolutionary history (e.g., there could be more to learn about BDNF from birds, lizards, fish, and higher-order taxonomic groups which we do not evaluate here). In addition, we do not explore the patterns or mutational processes occurring outside of coding-sequence evolution which include complex structure and dynamics of non-coding regions in the BDNF gene. Therefore, evolutionary temporality is important in the context and interpretation of our results because Mammalia represents only a portion of the long evolutionary history of BDNF. Although we failed to find evidence for recombination in our dataset, species where we may find evidence for recombination may have been precluded from our analysis due to our decision to focus on mammalian BDNF evolution. Further, a limitation of the current analysis is owed to the presence of indel events, especially in the early region of the alignment but which also occur in other spatially distributed regions of the BDNF gene. These indel events are not currently modeled in existing codon substitution models but may represent an additional pathway of evolutionary change. Nonetheless, the prominence of indels in our observations indicates that several regions of BDNF may evolve significantly through indel events across species. Lastly, although there is a risk that the “gappy” nature of the early region of our multiple sequence alignment may be a computational artifact of the alignment procedure, based on all other outputs we believe that our results are reasonably interpreted and have subsequently tolerated these potential effects.

Future directions: understanding the remainder of the neurotrophin family

We hypothesize that the similarities between neurotrophins reflects conserved evolutionary selection for motifs and domains which support common functionality in neurotrophic factors between sites and lineages. While we note significant isotropy in mature peptide sequences for these factors, anisotropic pressures likely influenced the prodomain sequences of neurotrophins leading to alterations in processing, trafficking, regulation, and secretion. As such, we also predict differences in the evolutionary fate of other neurotrophins which also exhibit compartmentalized functionality due to similar alterations within their prodomains (i.e., similar results may be reasonably anticipated in NGF, NT-3, and NT-4).

Conclusion

To sum up, our research modeled the natural evolutionary history of changes in BDNF across >160 mammalian genomes. Conservatively, this analysis spans ~177 million years of evolution—and going deeper could yet reveal more information on the ontogenesis of BDNF and its topological structure (and, consequently, function). Notably, we observed strict purifying selection in the main functional domain of the BDNF gene in mammals and discovered 6 specific sites in our homologous alignment that are under episodic selection in the early regulatory region of BDNF (i.e., the prodomain) and in the terminal region of BDNF. We also make the case for spatial coevolution within this gene, with 23 pair-sites that have evolved together. In sum, these data go above and beyond the common trope that “BDNF is highly conserved” by defining exactly where and how the mammalian BDNF has evolved. Thus, we confirm the widespread belief that the BDNF prodomain is more prone to change than the mature BDNF protein, having important implications for how we think about and consider genetic variation in BDNF and its linkage to disease.

Methods

Data retrieval

For this study, we queried the NCBI Ortholog database via https://www.ncbi.nlm.nih.gov/kis/ortholog/627/?scope=7776. For the purpose of this study, as we are interested in mammalian BDNF evolution, we limited our search to only include species from this taxonomic group (mammals, Mammalia). This returned 162 full gene transcripts and protein sequences. We downloaded all available files: RefSeq protein sequences, RefSeq transcript sequences, Tabular data (CSV, metadata). In Table 3, we provide a table of the species included in this analysis but we also make this accessible via GitHub. Furthermore, we also make these species NCBI accessions (see also Table 3) available for download on GitHub: AnalysisOfOrthologousCollections/BDNF_orthologs.csv at main · aglucaci/AnalysisOfOrthologousCollections · GitHub Tabulation of species included in our analysis, comprising NCBI ortholog gene IDs, symbols, mammalian species, common name, and RefSeq accessions.

Data cleaning

We used the protein sequence and full gene transcripts to derive coding sequences (CDS) (via a custom script, scripts/codons.py). However, this process was met with errors in 20 “PREDICTED” protein sequences, which had invalid characters such as sequences, which have incorrect “X”, or unresolved amino acids and these sequences were subsequently exempt from the analysis. This process removes low-quality protein sequences from analysis which may inflate rates of nonsynonymous change.

Analysis of orthologous collections (AOC): alignment, recombination detection, tree inference, and selection algorithms

The analysis of orthologous collections (AOC) application is designed for comprehensive protein-coding molecular sequence analysis (https://github.com/aglucaci/AnalysisOfOrthologousCollections). It accomplishes this through a series of comparative evolutionary methods. AOC allows for the inclusion of recombination detection, a powerful force in shaping gene evolution and interpreting analytic results. As well, it allows for lineage assignment and annotation. This feature (lineage assignment) allows between-group comparisons of selective pressures. This application currently accepts two input files: a protein sequence unaligned fasta file, and a transcript sequence unaligned fasta file for the same gene. Typically, this can be retrieved from public databases such as NCBI Orthologs. Although other methods of data compilation are also acceptable. In addition, the application is easily modifiable to accept a single CDS input, if that data is available. If protein and transcript files are provided, a custom script “scripts/codons.py” is executed and returns coding sequences where available. Note that this script currently is set to use the standard genetic code, this will need to be modified for alternate codon tables. This script also removes “low-quality” sequences if no match is found, see the above Data cleaning section. Step 1. Alignment. We used the HyPhy [46] codon-aware multiple sequence alignment procedure available at (https://github.com/veg/hyphy-analyses/tree/master/codon-msa). This was performed with a Human BDNF coding sequence NM_001709.5 Homo sapiens brain-derived neurotrophic factor (BDNF), transcript variant 4, mRNA as a reference-based alignment. Our alignment procedure retained 126 unique in-frame sequences. Step 2. Recombination detection. Performed manually via RDP v5 [47], see below, the “Recombination detection” section for additional details. A recombination-free file is placed in the following folder: results/BDNF/Recombinants. For the purpose of this study, we did not detect recombination in our dataset. Step 3. Tree inference and selection analyses. For the recombination-free fasta file, we perform maximum-likelihood phylogenetic inference via IQ-TREE [48]. Next, the recombination-free alignment and an unrooted phylogenetic tree is evaluated through a standard suite of molecular evolutionary methods. This set of selection analyses includes the following but for the sake of brevity, some of these results were not shown (essentially, most were not statistically significant or not meaningful as relevant to the evolutionary results presented here). FEL: locates codon sites with evidence of pervasive positive diversifying or negative selection [44]. BUSTEDS: tests for gene-wide episodic selection [49]. MEME: locates codon sites with evidence of episodic positive diversifying selection [50]. aBSREL: tests if a positive selection has occurred on a proportion of branches [51]. SLAC: performs substitution mapping [44]. BGM: identifies groups of sites that are apparently coevolving [45]. RELAX: compare gene-wide selection pressure between the query clade and background sequences [52]. CFEL: comparison site-by-site selection pressure between query and background sequences [53]. FMM: examines model fit by permitting multiple instantaneous substitutions [54]. Step 4A. Lineage assignment and tree annotation. For the unrooted phylogenetic tree, we perform lineage discovery, via NCBI and the python package ete3 toolkit. Assigning lineages to a K (by default, K = 20) number of taxonomic groups. Here, the aim is to have a broad representation of taxonomic groups, rather than the species being heavily clustered into a single group. As a reasonable approximation, we aim for <40% of species to be assigned to any one particular taxonomic group. Step 4B. We perform tree labeling via the hyphy-analyses/Label-Trees (REF, link) method. Resulting in one annotated tree per lineage designation. For the purpose of this study, we will only consider the following five lineages for additional analyses (Artiodactyla, Carnivora, Chiroptera, Glires, Primates) as they are the most populated lineages. Step 5. Selection analyses on lineages. Here, the recombination-free fasta file and the set of annotated phylogenetic trees (where labeling was performed in Step 4) is provided for analysis with the RELAX and Contrast-FEL methods.

Recombination detection

Manually tested via RDP v5.5 with modified settings as follows: We also included the following algorithms/analyses: RDP [55], GENECONV [56], Chimaera [57], MaxChi [58], BootScan [59] (Primary and Secondary Scan), SiScan [60] (Primary and Secondary Scan), 3Seq [61]. Recombination events are “accepted” in cases where three or more methods are in agreement. We slightly modified default parameters, such that Require topological evidence. Polish breakpoints. Check alignment consistency. Sequences are linear. List events detected by >2 methods. We manually recheck all of the events via “Recheck all identified events with all methods”. We manually accept events detected by >2 methods. The resulting alignment was saved as a distributed alignment (with recombinant regions separated). Recombination was not detected within our Human reference-based alignment. Therefore we used the single recombination-free alignment for analyses. Supplementary Material
  61 in total

1.  Neurodegenerative disease: BDNF copycats.

Authors:  Katie Kingwell
Journal:  Nat Rev Drug Discov       Date:  2010-06       Impact factor: 84.694

Review 2.  Analyzing the mosaic structure of genes.

Authors:  J M Smith
Journal:  J Mol Evol       Date:  1992-02       Impact factor: 2.395

Review 3.  Postsynaptic BDNF-TrkB signaling in synapse maturation, plasticity, and disease.

Authors:  Akira Yoshii; Martha Constantine-Paton
Journal:  Dev Neurobiol       Date:  2010-04       Impact factor: 3.964

4.  Brain-derived neurotrophic factor Val66Met and psychiatric disorders: meta-analysis of case-control studies confirm association to substance-related disorders, eating disorders, and schizophrenia.

Authors:  Mònica Gratacòs; Juan R González; Josep M Mercader; Rafael de Cid; Mikel Urretavizcaya; Xavier Estivill
Journal:  Biol Psychiatry       Date:  2007-01-09       Impact factor: 13.382

Review 5.  Activity-dependent modulation of the BDNF receptor TrkB: mechanisms and implications.

Authors:  Guhan Nagappan; Bai Lu
Journal:  Trends Neurosci       Date:  2005-09       Impact factor: 13.837

Review 6.  BDNF in schizophrenia, depression and corresponding animal models.

Authors:  F Angelucci; S Brenè; A A Mathé
Journal:  Mol Psychiatry       Date:  2005-04       Impact factor: 15.992

7.  Increased BDNF levels and NTRK2 gene association suggest a disruption of BDNF/TrkB signaling in autism.

Authors:  C T Correia; A M Coutinho; A F Sequeira; I G Sousa; L Lourenço Venda; J P Almeida; R L Abreu; C Lobo; T S Miguel; J Conroy; L Cochrane; L Gallagher; M Gill; S Ennis; G G Oliveira; A M Vicente
Journal:  Genes Brain Behav       Date:  2010-08-19       Impact factor: 3.449

Review 8.  New insights into BDNF function in depression and anxiety.

Authors:  Keri Martinowich; Husseini Manji; Bai Lu
Journal:  Nat Neurosci       Date:  2007-09       Impact factor: 24.884

Review 9.  BDNF as a Promising Therapeutic Agent in Parkinson's Disease.

Authors:  Ewelina Palasz; Adrianna Wysocka; Anna Gasiorowska; Malgorzata Chalimoniuk; Wiktor Niewiadomski; Grazyna Niewiadomska
Journal:  Int J Mol Sci       Date:  2020-02-10       Impact factor: 5.923

10.  Extra base hits: Widespread empirical support for instantaneous multiple-nucleotide changes.

Authors:  Alexander G Lucaci; Sadie R Wisotsky; Stephen D Shank; Steven Weaver; Sergei L Kosakovsky Pond
Journal:  PLoS One       Date:  2021-03-12       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.