
Regularization of Ill-Posed Point Neuron Models.

Abstract

Point neuron models with a Heaviside firing rate function can be ill-posed. That is, the initial-condition-to-solution map might become discontinuous in finite time. If a Lipschitz continuous but steep firing rate function is employed, then standard ODE theory implies that such models are well-posed and can thus, approximately, be solved with finite precision arithmetic. We investigate whether the solution of this well-posed model converges to a solution of the ill-posed limit problem as the steepness parameter of the firing rate function tends to infinity. Our argument employs the Arzelà-Ascoli theorem and also yields the existence of a solution of the limit problem. However, we only obtain convergence of a subsequence of the regularized solutions. This is consistent with the fact that models with a Heaviside firing rate function can have several solutions, as we show. Our analysis assumes that the vector-valued limit function v, provided by the Arzelà-Ascoli theorem, is threshold simple: That is, the set containing the times when one or more of the component functions of v equal the threshold value for firing, has zero Lebesgue measure. If this assumption does not hold, we argue that the regularized solutions may not converge to a solution of the limit problem with a Heaviside firing function.


Keywords: Existence; Ill-posed; Point neuron models; Regularization

Year: 2017 PMID: 28707194 PMCID: PMC5509800 DOI: 10.1186/s13408-017-0049-1

Source DB: PubMed Journal: J Math Neurosci Impact factor: 1.300

Introduction

In this paper we analyze some mathematical properties of the following classical point neuron model:

$$\tau_i u_i'(t) = -u_i(t) + \sum_{j=1}^{N} \omega_{ij} S_\beta[u_j(t) - u_\theta] + q_i(t), \quad t \in (0, T], \qquad u_i(0) = u_{\mathrm{init},i}, \tag{1}$$

for $i = 1, 2, \ldots, N$. Here, $u_i(t)$ represents the unknown electrical potential of the $i$th unit in a network of $N$ units. The nonlinear function $S_\beta$ is called the firing rate function, $\beta$ is the steepness parameter of $S_\beta$, $u_\theta$ is the threshold value for firing, $\omega_{ij}$ are the connectivities, $\tau_i$ are membrane time constants and $q_i(t)$ model the external drive/external sources; see, e.g., [1-3] for further details. The system of ODEs (1) is also referred to as a voltage-based model or Hopfield model (due to Hopfield [4]).

By employing electrophysiological properties one can argue that it is appropriate to use a steep sigmoid firing rate function $S_\beta$. For mathematical convenience, however, the Heaviside function $H$ is also often employed; see, e.g., [5-8]. Unfortunately, when $\beta = \infty$ the initial-condition-to-solution map for (1) can become discontinuous in finite time [9]. Such models are thus virtually impossible to solve with finite precision arithmetic [10, 11]. Also, in the steep but Lipschitz continuous firing rate regime, the error amplification can be extreme, even though a minor perturbation of the initial condition does not change which neurons fire. It is important to note that this ill-posed nature of the model is a fundamentally different mathematical property from the possible existence of unstable equilibria, which typically also occur if a firing rate function with moderate steepness is used. See [9] for further details.

The solution of (1) depends on the steepness parameter $\beta$. That is, $u = u_\beta$, and the purpose of this paper is to analyze the limit process $\beta \to \infty$. This investigation is motivated by the fact that the stable numerical solution of an ill-posed problem is very difficult, if not impossible; see, e.g., [10, 11]. Consequently, such models must be regularized to obtain a sequence of well-posed equations which, at least in principle, can be approximately solved by a computer.
Also, steep firing rate functions, or even the Heaviside function, are often used in simulations. It is thus important to explore the limit process $\beta \to \infty$ rigorously.

In Sects. 3 and 4 we use the Arzelà–Ascoli theorem [12-14] to analyze the properties of the sequence $\{u_\beta\}$ of solutions of (1). More specifically, we prove that this sequence has at least one subsequence which converges uniformly to a limit $v$ and that this limit satisfies the integral/Volterra version of (1) with $S_\beta = H$, provided that the following set has zero Lebesgue measure:

$$\{ t \in [0, T] : v_i(t) = u_\theta \text{ for at least one } i \in \{1, 2, \ldots, N\} \}.$$

It is thus sufficient that this set is finite or countable; see, e.g., [13]. Furthermore, in Sect. 7 we argue that, if $v$ does not satisfy this threshold property, then this function will not necessarily solve the limit problem.

According to the Picard–Lindelöf theorem [15-17], (1) has a unique solution, provided that $\beta < \infty$ and that the assumptions presented in the next section hold. In Sect. 5 we show that this uniqueness feature is not necessarily inherited by the limit problem obtained by employing a Heaviside firing rate function. It actually turns out that different subsequences of $\{u_\beta\}$ can converge to different solutions of (1) with $\beta = \infty$. This is explained in Sect. 6, which also contains a result addressing the convergence of the entire sequence $\{u_\beta\}$.

The limit process $\beta \to \infty$, using different techniques, is studied in [18, 19] for the stationary solutions of neural field equations. It has also been observed [20] for the Wilson–Cowan model that this transition is a subtle matter: Using a steep sigmoid firing rate function instead of the Heaviside mapping can lead to significant changes in a Hopf bifurcation point, since ‘the limiting value of the Hopf depends on the choice of the firing rate function’.

If one uses a Heaviside firing rate function in (1), the right-hand sides of these ODEs become discontinuous. A rather general theory for such equations has been developed [21].
In this theory the system of ODEs is replaced by a differential inclusion, in which the right-hand side of the ODE system is substituted by a set-valued function. The construction of this set-valued operator can be accomplished by invoking Filippov regularization/convexification. But this methodology serves a different purpose than the smoothing processes considered in this paper. More specifically, it makes it possible to prove that generalized solutions (Filippov solutions) to the problem exist, but it does not provide a family of well-posed equations suitable for numerical solution.

Smoothing techniques for discontinuous vector fields, which are similar to the regularization method considered in this paper, have been proposed and analyzed for rather general phase spaces [22-24]. Nevertheless, these studies consider qualitative properties of large classes of problems, whereas we focus on a quantitative analysis of a very special system of ODEs.

For the sake of easy notation, we will sometimes write (1) in the form

$$\tau u_\beta'(t) = -u_\beta(t) + \omega S_\beta[u_\beta(t) - u_\theta] + q(t), \quad t \in (0, T], \qquad u_\beta(0) = u_{\mathrm{init}}, \tag{3}$$

where $\tau$ is the diagonal matrix of membrane time constants, $\omega$ is the connectivity matrix and $S_\beta$ is applied componentwise. Note that we, for the sake of simplicity, use the same threshold value $u_\theta$ for all the units in the network; see (4).
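To make the structure of the model concrete, the following is a minimal sketch of the right-hand side of the voltage-based model and a plain forward Euler loop. It is not the authors' code: the sigmoid $S_\beta[x] = (1 + \tanh(\beta x))/2$, the two-unit network, and every parameter value are assumptions chosen only for illustration, and the helper names `firing_rate`, `rhs` and `forward_euler` are hypothetical.

```python
import math


def firing_rate(x, beta):
    """Assumed sigmoid S_beta[x] = (1 + tanh(beta * x)) / 2 (illustrative choice)."""
    return 0.5 * (1.0 + math.tanh(beta * x))


def rhs(u, t, beta, tau, omega, u_theta, q):
    """du/dt for each unit: (-u_i + sum_j omega_ij * S_beta[u_j - u_theta] + q_i) / tau_i."""
    n = len(u)
    rates = [firing_rate(u[j] - u_theta, beta) for j in range(n)]
    drive = q(t)
    return [
        (-u[i] + sum(omega[i][j] * rates[j] for j in range(n)) + drive[i]) / tau[i]
        for i in range(n)
    ]


def forward_euler(u_init, T, steps, **model):
    """Explicit Euler time stepping; returns the state at time T."""
    h = T / steps
    u, t = list(u_init), 0.0
    for _ in range(steps):
        du = rhs(u, t, **model)
        u = [ui + h * dui for ui, dui in zip(u, du)]
        t += h
    return u


# Hypothetical two-unit network; all values below are illustrative assumptions.
model = dict(
    beta=50.0,
    tau=[1.0, 1.0],
    omega=[[0.0, 1.0], [1.0, 0.0]],
    u_theta=0.5,
    q=lambda t: [0.0, 0.0],
)
u_final = forward_euler([0.8, 0.2], T=5.0, steps=5000, **model)
print(u_final)
```

Forward Euler is used here only because it keeps the sketch short; for large $\beta$ the problem is stiff near the threshold, so an adaptive solver with error control would be the more sensible choice in practice.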

Assumptions

Throughout this text we use the standard notation Concerning the sequence of finite steepness firing rate functions, we make the following assumption.

Assumption A

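We assume that $S_\beta$, $\beta = 1, 2, \ldots$,

a) is Lipschitz continuous,
b) satisfies $0 \le S_\beta[x] \le 1$, $x \in \mathbb{R}$,
c) for every pair of positive numbers $(\epsilon, \delta)$ there exists $Q$ such that

$$|S_\beta[x]| < \epsilon \quad \text{for } x < -\delta \text{ and } \beta > Q, \tag{6}$$

$$|1 - S_\beta[x]| < \epsilon \quad \text{for } x > \delta \text{ and } \beta > Q. \tag{7}$$

There are many continuous sigmoidal functions which approximate the Heaviside step function and satisfy Assumption A. For example, the hyperbolic tangent sigmoid

$$S_\beta[x] = S[\beta x], \tag{8}$$

$$S[x] = \tfrac{1}{2}\,(1 + \tanh(x)). \tag{9}$$

More generally, if $S_\beta$ is nondecreasing (for every $\beta$), a) and b) hold and $\{S_\beta\}$ converges pointwise to the Heaviside function, then Assumption A holds. Also, if Assumption A is satisfied and $S_\beta[0] \to H[0]$ as $\beta \to \infty$, then $\{S_\beta\}$ converges pointwise to the Heaviside function.

We will consider a slightly more general version of the model than (3). More specifically, we allow the source term to depend on the steepness parameter, $q = q_\beta$, but in such a way that the following assumption holds.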
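Part (c) of Assumption A can be checked by hand for a concrete sigmoid. The sketch below assumes the hyperbolic tangent form $S_\beta[x] = (1 + \tanh(\beta x))/2$ and uses the identity $S_\beta[-\delta] = 1/(1 + e^{2\beta\delta})$ together with monotonicity and the symmetry $1 - S_\beta[x] = S_\beta[-x]$, so that a single bound at $x = -\delta$ covers both inequalities. The function name `sufficient_q` is hypothetical.

```python
import math


def s_beta(x, beta):
    """Assumed hyperbolic tangent sigmoid, S_beta[x] = (1 + tanh(beta * x)) / 2."""
    return 0.5 * (1.0 + math.tanh(beta * x))


def sufficient_q(eps, delta):
    """Return an integer Q with S_beta[-delta] < eps for every integer beta > Q.

    S_beta[-delta] = 1 / (1 + exp(2 * beta * delta)) < eps is equivalent to
    beta > log(1 / eps - 1) / (2 * delta) when eps < 1/2.
    """
    if eps >= 0.5:
        # For every beta >= 1 we already have S_beta[-delta] < 1/2 <= eps.
        return 0
    bound = math.log(1.0 / eps - 1.0) / (2.0 * delta)
    return max(0, math.floor(bound))


Q = sufficient_q(eps=0.01, delta=0.1)
print(Q, s_beta(-0.1, Q + 1), 1.0 - s_beta(0.1, Q + 1))
```

Because the sigmoid is nondecreasing, the bound at $x = -\delta$ holds for all $x < -\delta$, and by symmetry the same $Q$ serves for $x > \delta$.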

Assumption B

We assume that $q_\beta$, $\beta = 1, 2, \ldots$, is continuous and that conditions (10) and (11) hold.

Allowing the external drive to depend on the steepness parameter makes it easier to construct illuminating examples. However, we note that our theory will also hold for the simpler case when $q$ does not change as $\beta$ increases. In this paper we will assume that Assumptions A and B are satisfied.

Uniformly Bounded and Equicontinuous

In order to apply the Arzelà–Ascoli theorem we must show that constitutes a family of uniformly bounded and equicontinuous functions. (For the sake of simple notation, we will write and , instead of and , for the component functions of and , respectively.) Multiplying with yields that and by integrating Hence, since and we assume that for , where the last inequality follows from (10). This implies that Since the right-hand side of (12) is independent of β and t we conclude that the sequence is uniformly bounded. Next, from the model (3), the triangle inequality, the assumption that and assumption (10) we find that where B̃ is defined in (12). Since is a diagonal matrix with positive entries on the diagonal, this yields that Here the constant K is independent of both β and . Let and be arbitrary. Then, for any time instances , with , the mean value theorem implies that there exists such that and hence This inequality holds for any , . It therefore follows that from which we conclude that is a set of equicontinuous functions The Arzelà–Ascoli theorem now asserts that there is a uniformly convergent subsequence : According to standard ODE theory, is continuous for each . Hence the uniform convergence implies that v is also continuous.
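The integrating-factor step described above can be sketched componentwise as follows. This is a hedged reconstruction, not the paper's exact chain of inequalities: it assumes that (10) provides a bound $|q_i(t)| \le B_q$ that is uniform in $\beta$ and $t$ (the symbol $B_q$ is introduced here for illustration only), and it uses $0 \le S_\beta \le 1$ from Assumption A.

```latex
\begin{align*}
\frac{d}{dt}\Big(e^{t/\tau_i}\, u_i(t)\Big)
  &= \frac{e^{t/\tau_i}}{\tau_i}
     \Big(\sum_{j=1}^{N}\omega_{ij}\, S_\beta[u_j(t)-u_\theta] + q_i(t)\Big), \\
u_i(t)
  &= e^{-t/\tau_i}\, u_i(0)
   + \int_0^t \frac{e^{-(t-s)/\tau_i}}{\tau_i}
     \Big(\sum_{j=1}^{N}\omega_{ij}\, S_\beta[u_j(s)-u_\theta] + q_i(s)\Big)\, ds, \\
|u_i(t)|
  &\le |u_i(0)|
   + \Big(\sum_{j=1}^{N}|\omega_{ij}| + B_q\Big)\big(1 - e^{-t/\tau_i}\big)
  \;\le\; |u_i(0)| + \sum_{j=1}^{N}|\omega_{ij}| + B_q .
\end{align*}
```

The right-hand side of the last line depends neither on $\beta$ nor on $t$, which is the kind of uniform bound the argument needs before turning to equicontinuity.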

Threshold Terminology

As we will see in subsequent sections it depends on v’s threshold properties whether we can prove that v actually solves the limit problem with a Heaviside firing rate function. The following concepts turn out to be useful. For a vector-valued function we define

Definition

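Threshold simple: A measurable vector-valued function $z$ is threshold simple if the set (15) of time instances at which one or more of the component functions of $z$ equal the threshold value for firing has zero Lebesgue measure.

Extra threshold simple: A measurable vector-valued function $z$ is extra threshold simple if the time interval $[0, T]$ can be split, apart from finitely many time instances, into open intervals on which no component function of $z$ equals the threshold value for firing. In words, $z$ is extra threshold simple if there is a finite number of threshold crossings on the time interval $[0, T]$.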

The Limit of the Subsequence

Preparations

We will prove that the limit v in (13) solves the integral form of (3) with , the Heaviside function, provided that v is threshold simple. The inhomogeneous nonlinear Volterra equation associated with (3) reads where etc.; see also (2) and (5). Note that we consider the equations satisfied by the subsequence , see (13). We will analyze the convergence of the entire sequence in Sect. 6. The uniform convergence of to v implies that the left-hand-side and the first integral on the right-hand side of (16) converge to and , respectively, as . Also, due to assumption (11), the third integral on the right-hand side does not require any extra attention. We will thus focus on the second integral on the right-hand side of (16). For and , define the sets where is defined in (14) and v is the limit in (13). Since v is continuous it follows that , , is continuous. Hence, the sets and are Lebesgue measurable. We note that, provided that is small, the set contains the times where at least one of the components of v is close to the threshold value for firing. The following lemma turns out to be crucial for our analysis of the second integral on the right-hand side of (16).

Lemma 4.1

If the limit function v in (13) is threshold simple, then where denotes the Lebesgue measure of the set .

Proof

Since v is the uniform limit of a sequence of continuous functions, v is continuous and hence measurable. If v is threshold simple, then see (15). Let be arbitrary. Assume that or that this limit does not exist. Then such that there is a sequence satisfying By construction, and . Hence, see, e.g., [13] (page 62). Since the sequence is nonincreasing and bounded below, exists. Next, i.e. Hence, which contradicts (19). □
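The content of Lemma 4.1 can be illustrated numerically: for a threshold simple function, the measure of the set of times at which some component lies within $\delta$ of the threshold shrinks as $\delta \to 0$. The sketch below is only an illustration under stated assumptions. The function $v(t) = (t, 2t)$ on $[0, 1]$, the threshold $0.5$, and the midpoint-grid estimate are all hypothetical choices, and `near_threshold_measure` is not a name from the paper.

```python
def near_threshold_measure(v, u_theta, delta, T, n=100_000):
    """Midpoint-grid estimate of the Lebesgue measure of
    {t in [0, T] : |v_k(t) - u_theta| < delta for some component k}."""
    h = T / n
    hits = 0
    for k in range(n):
        t = (k + 0.5) * h
        if any(abs(component - u_theta) < delta for component in v(t)):
            hits += 1
    return hits * h


def v(t):
    """Hypothetical threshold simple function with two components."""
    return (t, 2.0 * t)


# Component 1 equals 0.5 at t = 0.5 and component 2 at t = 0.25, so for small
# delta the set is two disjoint intervals of total length 2*delta + delta.
measures = {
    delta: near_threshold_measure(v, 0.5, delta, T=1.0)
    for delta in (0.1, 0.01, 0.001)
}
for delta, m in measures.items():
    print(delta, m)
```

For this example the measure is $3\delta$, so it tends to zero linearly in $\delta$; a function that sits on the threshold over a whole interval would instead give a measure bounded away from zero.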

Convergence of the Integral

Lemma 4.2

If the limit v in (13) is threshold simple, then Let and be arbitrary and define From (18) we know that there exists such that Choose a δ which satisfies . By part (c) of Assumption A, for this δ and there exists such that (6) and (7) hold. Recall that are the values for the steepness parameter associated with the convergent subsequence in (13). By the uniform convergence of to v, there is a so that From the definition of the set , see (17) and (14), and from (24) and the triangle inequality it follows that From (24)–(26) we find that Also, because of the properties of the Heaviside function, . Consequently, due to (23) and part (c) of Assumption A, see (6) and (7), we find that Hence, for all , where the second last inequality follows from (22), the fact that for and (21). Since and were arbitrary, we conclude that (20) must hold. □

Limit Problem

By employing the uniform convergence (13), the convergence of the integral (20) and assumption (11), we conclude from (16) that the limit function $v$ satisfies

$$\tau v(t) = \tau u_{\mathrm{init}} - \int_0^t v(s)\,ds + \omega \int_0^t H[v(s) - u_\theta]\,ds + \int_0^t q(s)\,ds, \quad t \in [0, T], \tag{27}$$

provided that $v$ is threshold simple. Recall that $v$ is continuous. Consequently, if $v$ is extra threshold simple, then it follows from the fundamental theorem of calculus that $v$ also satisfies the ODEs, except at time instances when one or more of the component functions equal the threshold value for firing:

$$\tau v'(t) = -v(t) + \omega H[v(t) - u_\theta] + q(t) \tag{28}$$

for all $t \in (0, T]$ outside the set defined in (15). The existence issue for point neuron models with a Heaviside firing rate function is summarized in the following theorem.

Theorem 4.3

If the limit v in (13) is threshold simple, then v solves (27). In the case that v is extra threshold simple v also satisfies (28).

In [25] the existence issue for neural field equations with a Heaviside activation function is studied but the analysis is different because a continuum model is considered. We would also like to mention that Theorem 4.3 cannot be regarded as a simple consequence of Carathéodory’s existence theorem [21, 26, 27] because the right-hand-side of (28) is discontinuous with respect to v.

Uniqueness

If $\beta < \infty$, then standard ODE theory [15-17] implies that (3) has a unique solution. Unfortunately, as will be demonstrated below, this desirable property is not necessarily inherited by the infinite steepness limit problem. We will first explain why the uniqueness question is a subtle issue for point neuron models with a Heaviside firing rate function. Thereafter, additional requirements are introduced which ensure the uniqueness of an extra threshold simple solution.

Example: Several Solutions

Let us study the problem

$$v'(t) = -v(t) + \omega H[v(t) - u_\theta], \quad t \in (0, T], \qquad v(0) = u_\theta, \tag{29}$$

where we assume that $0 < u_\theta < \omega$. Note that the ODE in (29) is not required to hold for $t = 0$. Consider the functions

$$v_1(t) = u_\theta e^{-t}, \tag{30}$$

$$v_2(t) = \omega + (u_\theta - \omega) e^{-t}. \tag{31}$$

Since $v_1(t) < u_\theta < v_2(t)$ for $t > 0$, it follows that both $v_1$ and $v_2$ solve (29). Furthermore, with $H[0] = u_\theta / \omega$ we actually obtain a third solution of (29), namely the stationary solution

$$v_3(t) = u_\theta. \tag{32}$$

We conclude that models with a Heaviside firing rate function can have several solutions, and such problems can thus become ill-posed. (In [9] we showed that the initial-condition-to-solution map is not necessarily continuous for such problems and that the error amplification ratio can become very large in the steep but Lipschitz continuous firing rate regime.) Note that switching to the integral form (27) will not resolve the lack of uniqueness issue for the toy example considered in this subsection. We also remark that:

- If we define $H[0] = 1/2$, then neither $v_1$ nor $v_2$ satisfies the ODE in (29) for $t = 0$. (In the case $\omega = 2 u_\theta$, $v_3$ satisfies the ODE in (29) for $t \ge 0$.)
- If we define $H[0] = 0$, then $v_1$, but not $v_2$, satisfies the ODE in (29) also for $t = 0$.
- If we define $H[0] = 1$, then $v_2$, but not $v_1$, satisfies the ODE in (29) also for $t = 0$.
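The non-uniqueness can be checked directly. The sketch below assumes the toy problem has the form $v'(t) = -v(t) + \omega H[v(t) - u_\theta]$, $v(0) = u_\theta$, with $0 < u_\theta < \omega$, and evaluates the ODE residual of three candidate functions at times $t > 0$ using their analytic derivatives. The parameter values and the helper names `heaviside` and `residual` are illustrative assumptions.

```python
import math


def heaviside(x, value_at_zero=0.5):
    """Heaviside function with a configurable value at x = 0."""
    if x > 0.0:
        return 1.0
    if x < 0.0:
        return 0.0
    return value_at_zero


def residual(v, dv, t, omega, u_theta, h0=0.5):
    """Residual v'(t) + v(t) - omega * H[v(t) - u_theta] of the assumed toy ODE."""
    return dv(t) + v(t) - omega * heaviside(v(t) - u_theta, h0)


omega, u_theta = 1.0, 0.4  # illustrative values with 0 < u_theta < omega

# Candidate 1 decays below the threshold, so the Heaviside term stays 0.
v1 = lambda t: u_theta * math.exp(-t)
dv1 = lambda t: -u_theta * math.exp(-t)
# Candidate 2 rises above the threshold, so the Heaviside term stays 1.
v2 = lambda t: omega + (u_theta - omega) * math.exp(-t)
dv2 = lambda t: (omega - u_theta) * math.exp(-t)
# Candidate 3 is stationary and needs H[0] = u_theta / omega.
v3 = lambda t: u_theta
dv3 = lambda t: 0.0

times = [0.1 * k for k in range(1, 31)]
max_res_1 = max(abs(residual(v1, dv1, t, omega, u_theta)) for t in times)
max_res_2 = max(abs(residual(v2, dv2, t, omega, u_theta)) for t in times)
max_res_3 = max(
    abs(residual(v3, dv3, t, omega, u_theta, h0=u_theta / omega)) for t in times
)
print(max_res_1, max_res_2, max_res_3)
```

All three candidates start from the same initial value $u_\theta$, so residuals that vanish up to rounding indicate three distinct solutions of the same initial value problem under the stated assumptions.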

Enforcing Uniqueness

In order to enforce uniqueness we need to impose further restrictions. It turns out that it is sufficient to require that the derivative is continuous from the right and that the ODEs also must be satisfied whenever one, or more, of the component functions equals the threshold value for firing; see (33). Note that the ODEs in (33) also must be satisfied for $t = 0$, in case one of the components of the initial condition equals $u_\theta$.

Definition 1

Right smooth: A vector-valued function $z$ is right smooth if its derivative $z'$ is continuous from the right for all $t \in [0, T)$.

Theorem 5.1

The initial value problem (33) can have at most one solution which is both extra threshold simple and right smooth. Let v and be two solutions of (33) which are both right smooth and extra threshold simple: and where and are disjoint open intervals; see (14) and the definition of extra threshold simple in Sect. 3.1. Then there exist disjoint open intervals such that Let us focus on one of these intervals, . Define and assume that which obviously holds for . Then where Note that, due to (34), equals a constant vector c, with components or 1, except possibly at : Furthermore, from (35) we find that Putting in (36) and invoking (37) and (39) yield and from the right continuity of and d, (36), (37) and (38) we find that Since , , and , see (39), we conclude from (36)–(37) that d satisfies which has the unique solution , . Both and are differentiable on and hence continuous. It follows that, by employing the continuity of v and at time , Since we can repeat the argument on the next interval . It follows by induction that . □ We would like to comment the findings presented in the bullet-points at the end of Sect. 5.1 in view of Theorem 5.1: In order to enforce uniqueness for the solution of (29) we can require that the ODE in (29) also should be satisfied for . Nevertheless, this might force us to define , which differs from the standard definition of the Heaviside function H. More generally, if one has accomplished to compute an extra threshold simple and right smooth function v which satisfies (27), one can attempt to redefine , , such that (33) holds and v is the only solution to this problem. This may imply that cannot be generated by using the composition . Instead one must determine , , . More precisely, for each one gets a linear system of algebraic equations which will have a unique solution if the connectivity matrix is nonsingular. (In this paragraph, are the time instances employed in the definition of extra threshold simple; see Sect. 3.1.)

Convergence of the Entire Sequence

We have seen that point neuron models with a Heaviside firing rate function can have several solutions. One therefore might wonder if different subsequences of $\{u_\beta\}$ can converge to different solutions of the limit problem. In this section we present an example which shows that this can happen, even though the involved sigmoid functions satisfy Assumption A.

Example: Different Subsequences Can Converge to Different Solutions

Let us again consider the initial value problem (29), which we discussed in Sect. 5.1. A finite steepness approximation of this problem, using the notation , reads: where and is, e.g., either the hyperbolic tangent sigmoid function (8)–(9) or Note that converges pointwise, except for , to the Heaviside function H as . In fact, satisfies Assumption A. We consider the case . Therefore (29) has three solutions and , see (30), (31) and (32) in Sect. 5.1. Note that has the property where is a constant which is independent of β. It therefore follows that and no subsequence converges to the third solution . Figure 1 shows numerical solutions of (40) with steepness parameter , using the firing rate function (41) to define . (If one instead employs (8)–(9) in the implementation of , the plots, which are not included, are virtually unchanged.)

Fig. 1

Numerical solutions of (40) computed with Matlab’s ode45 software. In these simulations we used and . The functions and , see (30) and (31), are the solutions of the associated limit problem (29)

We would like to mention that we have not been able to construct an example of this kind for Lipschitz continuous firing rate functions which converge pointwise to the Heaviside function also for $x = 0$.
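The mechanism behind this example can be reproduced with a small experiment. The sketch below does not use the paper's firing rate function; it assumes a hypothetical piecewise linear family whose value at $x = 0$ is 1 for even $\beta$ and 0 for odd $\beta$, applied to the assumed toy problem $v' = -v + \omega S_\beta[v - u_\theta]$, $v(0) = u_\theta$, with $\omega = 2u_\theta$. Each member is Lipschitz continuous, takes values in $[0, 1]$, and equals 0 for $x < -1/\beta$ and 1 for $x > 1/\beta$, so part (c) of Assumption A holds.

```python
def clamp01(y):
    """Clip y to the interval [0, 1]."""
    return min(1.0, max(0.0, y))


def s_parity(x, beta):
    """Hypothetical piecewise linear firing rate function.

    Even beta: ramp on [-1/beta, 0], so S[0] = 1.
    Odd beta:  ramp on [0, 1/beta],  so S[0] = 0.
    """
    shift = 1.0 if beta % 2 == 0 else 0.0
    return clamp01(beta * x + shift)


def solve(beta, omega=1.0, u_theta=0.5, T=1.0, steps=10_000):
    """Forward Euler for v' = -v + omega * S_beta[v - u_theta], v(0) = u_theta."""
    h = T / steps
    v = u_theta
    for _ in range(steps):
        v += h * (-v + omega * s_parity(v - u_theta, beta))
    return v


v_even = solve(beta=100)  # the trajectory leaves the threshold upwards
v_odd = solve(beta=101)   # the trajectory leaves the threshold downwards
print(v_even, v_odd)
```

With these assumptions the even-$\beta$ runs follow $\omega + (u_\theta - \omega)e^{-t}$ and the odd-$\beta$ runs follow $u_\theta e^{-t}$, so the two subsequences approach different solutions of the limit problem and the whole sequence cannot converge.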

Entire Sequence

We have seen that almost everywhere convergence of the sequence of firing rate functions to the Heaviside limit is not sufficient to guarantee that the entire sequence converges to the same solution of the limit problem. Nevertheless, one has the following result.

Theorem 6.1

Let $v$ be the limit function in (13). If the limit of every convergent subsequence of $\{u_\beta\}$ is extra threshold simple, right smooth and satisfies (33), then the entire sequence $\{u_\beta\}$ converges uniformly to $v$.

Proof

Suppose that the entire sequence $\{u_\beta\}$ does not converge uniformly to $v$. Then there is an $\epsilon > 0$ such that, for every positive integer $M$, there must exist $u_{\beta_M}$, $\beta_M > M$, satisfying

$$\| u_{\beta_M} - v \|_\infty \ge \epsilon. \tag{42}$$

Thus, the subsequence $\{u_{\beta_M}\}$ does not converge uniformly to $v$, but constitutes a set of uniformly bounded and equicontinuous functions; see Sect. 3. According to the Arzelà–Ascoli theorem, $\{u_{\beta_M}\}$ therefore possesses a uniformly convergent subsequence with a limit $\tilde{v}$. Due to (42),

$$\| \tilde{v} - v \|_\infty \ge \epsilon. \tag{43}$$

On the other hand, both $v$ and $\tilde{v}$ are limits of subsequences of $\{u_\beta\}$ and are by assumption extra threshold simple, right smooth, and they satisfy (33). Hence, Theorem 5.1 implies that $\tilde{v} = v$, which contradicts (43). We conclude that the entire sequence must converge uniformly to $v$. □

Example: Threshold Advanced Limits

We will now show that threshold advanced limits, i.e. limits which are not threshold simple, may possess some peculiar properties. More precisely, such limits can potentially occur in (13). They do not necessarily satisfy the limit problem obtained by using a Heaviside firing rate function. With source terms which do not depend on the steepness parameter β we have not managed to construct an example with a threshold advanced limit v. If we allow , this can, however, be accomplished as follows. Let where we, for the sake of simplicity, work with the firing rate function (41). Then and we find that solves where It follows that and since, for any , we conclude that Note that but does not solve the limit problem because This argument assumes that . If one instead defines , then v̄ would solve the limit problem. Due to the properties of the firing rate function (41) the source term in (44) becomes discontinuous. This can be avoided by instead using the smooth version (8)–(9) but then the analysis of this example becomes much more involved.

Discussion and Conclusions

If a Heaviside firing rate function is used, the model (1) may not only have several solutions, but the initial-condition-to-solution map for this problem can become discontinuous [9]. It is thus virtually impossible to develop reliable numerical methods which employ finite precision arithmetic for such problems. One can try to overcome this issue by

a) attempting to solve the ill-posed equation with symbolic computations, or
b) regularizing the problem.

To the best of our knowledge, present symbolic techniques are not able to handle strongly nonlinear equations of the kind (1), even when $\beta < \infty$. We therefore analyzed the approach b), using the straightforward regularization technique obtained by replacing the Heaviside firing rate function by a Lipschitz continuous mapping. This yields an equation which is within the scope of the Picard–Lindelöf theorem and standard stability estimates for ODEs. That is, the regularized equation is well-posed and, at least in principle, approximately solvable by numerical methods.

Our results show that the sequence of regularized solutions will have at least one convergent subsequence. The limit, $v$, of this subsequence will satisfy the integral/Volterra form (27) of the limit problem, provided that the set defined in (15) has zero Lebesgue measure. Unfortunately, it seems to be very difficult to impose restrictions which would guarantee that $v$ obeys this threshold property, which we refer to as threshold simple. Also, the example presented in Sect. 7 shows that, if the limit $v$ is not threshold simple, then this function may not solve the associated equation with a Heaviside firing rate function.

One could propose to overcome the difficulties arising when $\beta = \infty$ by always working with finite slope firing rate functions. This would potentially yield a rather robust approach, provided that the entire sequence $\{u_\beta\}$ converges, because increasing a large $\beta$ would still guarantee that $u_\beta$ is close to the unique limit $v$.
However, the fact that different convergent subsequences of $\{u_\beta\}$ can converge to different solutions of the limit problem, as discussed in Sect. 6, suggests that this approach must be applied with great care. In addition, the error amplification in the steep firing rate regime can become extreme [9] and the accurate numerical solution of such models is thus challenging.

What are the practical consequences of our findings? As long as there does not exist very reliable biological information about the size of the steepness parameter $\beta$ and the shape of the firing rate function $S_\beta$, it seems that we have to be content with simulating with various $S_\beta$. If one observes that $u_\beta$ approaches a threshold advanced limit, as $\beta$ increases, or that the entire sequence $\{u_\beta\}$ does not converge, the alarm bell should ring. All simulations with large $\beta$ must use error control methods which guarantee the accuracy of the numerical solution. We must keep in mind that we are trying to solve an almost ill-posed problem.

In neural field equations one employs a continuous variable, e.g., $x$, instead of a discrete index $i$. The sum in (1) is replaced by an integral; see [1, 2, 6]. For each time instance $t$ one therefore does not get a vector $u(t) \in \mathbb{R}^N$, as for the point neuron models analyzed in this paper, but a function $u(x, t)$ of $x$. That is, in neural field equations the object associated with each fixed $t$ belongs to an infinite dimensional space. It is often a subtle task to generalize concepts and proofs from a finite to an infinite dimensional setting. It is thus an open problem whether the techniques and results presented in this paper can be adapted to neural field models.
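The practical advice above, simulating with several steepness values and watching whether the solutions settle down, can be sketched as a simple sweep. Everything in the sketch is an assumption made for illustration: a single unit with $\tau = 1$ and no external drive, the sigmoid $S_\beta[x] = (1 + \tanh(\beta x))/2$, an initial value away from the threshold, and fixed-step forward Euler in place of a solver with error control. The names `solve_path` and `sup_distance` are hypothetical.

```python
import math


def solve_path(beta, u0=0.5, omega=1.0, u_theta=0.3, T=5.0, steps=5000):
    """Forward Euler path for u' = -u + omega * S_beta[u - u_theta], u(0) = u0."""
    h = T / steps
    u, path = u0, [u0]
    for _ in range(steps):
        rate = 0.5 * (1.0 + math.tanh(beta * (u - u_theta)))
        u += h * (-u + omega * rate)
        path.append(u)
    return path


def sup_distance(a, b):
    """Maximum pointwise distance between two paths on the same time grid."""
    return max(abs(x - y) for x, y in zip(a, b))


betas = [10, 20, 40, 80]
paths = [solve_path(b) for b in betas]
# Distance between solutions for consecutive steepness values.
gaps = [sup_distance(p, q) for p, q in zip(paths, paths[1:])]
for (b_lo, b_hi), gap in zip(zip(betas, betas[1:]), gaps):
    print(f"beta {b_lo} -> {b_hi}: sup distance {gap:.3e}")
```

In this benign setting the distances shrink rapidly as $\beta$ grows. Gaps that stall or oscillate between steepness values would be the kind of warning sign described in the text, and a fixed-step scheme like this one would then no longer be trustworthy.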
