Christophe Lecomte1. 1. Southampton Statistical Sciences Research Institute, University of Southampton, Southampton, UK.
Abstract
Models consisting of chains of particles that are coupled to their neighbours appear in many applications in physics or engineering, such as in the study of dynamics of mono-atomic and multi-atomic lattices, the resonances of crystals with impurities and the response of damaged bladed discs. Analytical properties of the dynamic responses of such disturbed chains of identical springs and masses are presented, including when damping is present. Several remarkable properties in the location of the resonances (poles) and anti-resonances (zeros) of the displacements in the frequency domain are presented and proved. In particular, it is shown that there exists an elliptical region in the frequency-disturbance magnitude plane from which zeros are excluded and the discrete values of the frequency and disturbance at which double poles occur are identified. A particular focus is on a local disturbance, such as when a spring or damper is modified at or between the first and last masses. It is demonstrated how, notably through normalization, the techniques and results of the paper apply to a broad category of more complex systems in physics, chemistry and engineering.
Models consisting of chains of particles that are coupled to their neighbours appear in many applications in physics or engineering, such as in the study of dynamics of mono-atomic and multi-atomic lattices, the resonances of crystals with impurities and the response of damaged bladed discs. Analytical properties of the dynamic responses of such disturbed chains of identical springs and masses are presented, including when damping is present. Several remarkable properties in the location of the resonances (poles) and anti-resonances (zeros) of the displacements in the frequency domain are presented and proved. In particular, it is shown that there exists an elliptical region in the frequency-disturbance magnitude plane from which zeros are excluded and the discrete values of the frequency and disturbance at which double poles occur are identified. A particular focus is on a local disturbance, such as when a spring or damper is modified at or between the first and last masses. It is demonstrated how, notably through normalization, the techniques and results of the paper apply to a broad category of more complex systems in physics, chemistry and engineering.
The dynamics of collinear chains of particles has been studied for a long time. Although apparently simple, these systems exhibit remarkable properties. Their interest is not limited to the academic realm as they model the fundamental vibrations of crystals in solid-state physics [1,2], the atomic and molecular dynamics of chains of molecules in physics, chemistry and biology [3-7], as well as the behaviour of real-life objects such as structures with repetitive components or rods and beams that are widely used in engineering. They are directly related to linear algebra in which they are represented by particular matrices, such as tri-diagonal, multi-diagonal, circulant or Toeplitz matrices, as well as to the theory of orthogonal polynomials. They are also often used to study or illustrate theoretical aspects like the effect of disorder on systems [8,9]. Their properties indicate how the systems can behave when used within a controlled system or when some of their components are perturbed or damaged, for example in the case of damaged bladed discs or impurities in crystals [10-15].However simple these systems appear, however important they are and for however long they have been studied, they are still under investigation. The present contribution is notably closely related to recent work on analytical expressions of the eigenvalues, eigenvectors and inverse of tri-diagonal matrices that have two or four of their corner coefficients disturbed [16-21]. The focus here is on the location of not only poles but also zeros of the transfer functions of a disturbed system in the frequency–disturbance magnitude plane. These loci are important system properties as they notably indicate if a system is stable, observable or controllable [22-24]. The loci of zeros also allow the localization of stable and unbiased frequency points in the theory of uncertain dynamic systems [13,25]. Analytical explicit expressions of the transfer functions of the system are derived by making use of Chebyshev polynomials and low-rank updates.After presentation of the system and resulting equations of motion in §2, the document is mainly split into the study of the nominal or undisturbed system, and that of the system with rank-one disturbance. For both, the exact expressions of all transfer functions are first presented in §3. Properties of the nominal system are then presented in §4. This includes the explicit expressions for the locations of poles and zeros of the transfer functions. Similar results and properties are presented in §5 for the disturbed system. Along the way, remarkable properties of the transfer function from the first to last mass are presented. In the case of the nominal system, it is shown in §4c that, besides the known property that this transfer function has no zero, its magnitude is also excluded from a circular region. Similarly, for the disturbed system, it is shown in §5b(i) that no zeros of this transfer function exist in an elliptical region of the real plane defined by the frequency parameter and the disturbance magnitude. Other exclusion and inclusion properties for the loci of poles and zeros are presented, such as the location of multiple poles of the disturbed system in §5a(ii).
Nominal and disturbed spring–mass systems
Considered is a nominal collinear system of alternating springs and masses, as illustrated in figure 1. The first and the last components of the system are springs that are connected to fixed points.
Figure 1.
Illustration of the nominal collinear spring–mass systems for N masses.
Illustration of the nominal collinear spring–mass systems for N masses.
Nominal spring–mass systems
From Newton's second law, the equations of motion of the masses are easily found to be
where k denotes the spring constants, m denotes the masses, and and are, respectively, the displacements of the masses and the external forces as a function of time, t. Note that is defined for conciseness. The Fourier transform gives the frequency domain equations that may be expressed in compact matrix form as with diagonal mass matrix M, tri-diagonal stiffness matrix K, and input and output vectors and , respectively.The nominal system considered here has all its masses, as well as all its spring constants, equal to each other, k=k,j=1,…,N+1 and m=m, j=1,…,N. Based on this assumption, the matrix K is k times the tri-diagonal matrix T, which has 2's on its diagonal and −1's on its two neighbour diagonals, while the matrix M is m times the identity matrix I. It is also assumed that all the components of the force vector have the same time dependency so that one can work with the constant vector, . This corresponds to an impulse force, , where δ(t) is the Dirac delta function. In matrix form, the nominal equations of motion are thereforeThe normalized case k=m=1 covers a wide range of cases. For example, the present case can be expressed with the normalized frequency parameter, λ=ω2(m/k), force vector, , and output vectors, . There is also no restriction to undamped systems, because the generally complex frequency parameter, λ, permits the consideration of damping.
Disturbed spring–mass systems
Some disturbance is allowed to the nominal spring–mass systems. Specifically, a different impedance is considered for the subsystem made of the two extreme (first and last) masses, as illustrated in figure 2. This disturbance might, for example, be an additional spring connecting the two masses, a small mass added on the first mass, or an additional spring connecting the last mass to a fixed point. Any such disturbance is a priori allowed (one does not prescribe the disturbance to necessarily be a combination of springs or masses or even physically possible). In order to conveniently study its effect on the nominal system, it is expressed as the product of a unit disturbance scaled by a scalar, s. In matrix form, this results in the bi-parameter system , where the three normalized matrices are
As in §2a, the normalized vectors are and for λ=ω2(m/k). For simplicity, S is rank-one, i.e. it may be expressed as the outer product of two vectors, .
Figure 2.
Illustration of a regular collinear system disturbed with components added to the pair of first and last masses.
Illustration of a regular collinear system disturbed with components added to the pair of first and last masses.
Normalized spring–mass systems
It was seen in §2a,b that the equations of motion can be written in a normalized form that is equivalent to working with unit mass and stiffness, and with a frequency parameter, λ=ω2. The same normalized systems, (T−λI)x(λ,0)=f and (T−λI−sS)x(λ,s)=f, are studied in the rest of the paper. They support a wide range of situations as illustrated in the following subsections. Note the mnemonic notation, T for tri-diagonal, I for identity and S for square.
First application: normalized damped system
A first normalization that involves damping is now illustrated with the damped benchmark problem schematized in figure 3. The nominal equations of motion are found to be
where c is the additional damping coefficient and ka and ca are the real positive stiffness and damping coefficients of the additional local elements. This results in the nominal matrix system
with complex frequency argument λ=(ω2 m−iωca−ka)/(k+iωc) and normalized force . Similarly, considering the disturbance leads to a non-normalized system such that
Figure 3.
Illustration of a regular damped collinear spring–mass system.
Illustration of a regular damped collinear spring–mass system.Particular case. Equation (2.4) may appear in many contexts. It can, for example, describe the linearized behaviour of nonlinear systems around an equilibrium or solution point. Particular cases of such a situation include that of the linearized discrete form of the sine-Gordon equation that notably appears in the modelling of magnetic flux transmission of Josephson-junction transmission lines [26]. The poles of this linearized sine-Gordon equation around soliton solutions could then provide information about the spectrum of the phonons and the corresponding eigenmodes would be the phonons far away from the solitons [27,28]. Considering equation (2.4) and disturbed spring–mass systems, such analysis could possibly involve forces, damping and disturbance.
Second application: normalized multi-degree-of-freedom system
The second normalization example and benchmark application involve non-normalized springs that are multi-degree-of-freedom (multi-d.f.) components. The particular model considered is schematized in figure 4. Each consecutive pair of the N masses is separated by an equivalent or generalized spring that consists of two masses, m1 and m2, and four springs, k1–k4. The full system has therefore more than N degrees of freedom. The transformation into the normalized form starts with the resolution of the equations of motion inside individual spring components. Looking specifically at the component between the (j)th and (j+1)th masses and at mass j gives
and
Using equations (2.8) and (2.7), the internal variables, and , can be solved in terms of the neighbouring variables, x and x, and substituted in equation (2.9). Basic algebra gives, for identical m1=m2=ms and k1=⋯=k4=ks, and with ωs2=ks/ms,
and
Although it may not be obvious, this equation can easily be written up in the normalized form by two operations: the multiplication of both sides by (3−ω2/ωs2), and the addition and substraction of 5x(ω) from the bracket [x+3x+x]=[x−2x+x]+5x. This results in the desired form (T−λI)x(λ)=fu(λ), where (3−ω2m/ks) is a rational function of ω2 with cubic numerator and linear denominator. For each value of λ equal to a pole or zero of the normalized system, it is thus possible to retrieve the corresponding values of ω2 by trivially solving a cubic polynomial.
Figure 4.
Illustration of a regular collinear system with multi-d.f. springs.
Illustration of a regular collinear system with multi-d.f. springs.Some care must now be taken with the right-hand side, i.e. with the normalized vector, fu(λ). Its scalar magnitude can indeed be zero or infinite for discrete values of ω2 that cancel the quadratic numerator or linear denominator of u(λ). Possible simultaneous cancelling of the numerator and denominator must be carefully considered as well as the a priori possible cases where a pole or zero of the system (T−λC) is neutralized, respectively, by a zero or pole of the magnitude, u(λ). With these considerations, the zeros and roots of the normalized system and of the scalar function u provide all those of the non-normalized system. Note that in the case of the first benchmark, i.e. in equation (2.5), the frequency-dependent scaling function 1/(k+iωc) can be neither zero nor infinite as long as the real k and c are strictly positive. All zeros and poles of the non-normalized and normalized systems (for a constant force) are therefore identical.For the current multi-d.f. benchmark, special attention must also be paid to the additional internal d.f. In order to avoid missing internal modes, the poles of the internal systems—in the present case, the roots of the polynomial —must be analysed. Finally, the study of the transfer functions and zeros of the normalized system provides direct information only for the main masses of the whole original system. More information is however available. For example, equations such as (2.10) and (2.11) provide all the system responses from those at the main masses. These expressions can thus be used to carry out the work of identification of zeros at the internal masses. By reciprocity, the unit transfer functions from an internal mass to a main mass can also be treated in a dual manner.Other cases. Under the same umbrella as that of the current benchmark, multi-d.f. systems such as in figure 4 allow one to deal directly with other systems such as the one illustrated in figure 5, where a modified stiffness component (the 1 d.f. subsystem circled by a dashed line) only appears every other spring. In this case, this is simply done by using k1=k and m1=m. The case of diatomic or multi-atomic lattice vibrations [2] is also covered by this benchmark, for both alternating masses and alternating springs. Periodic composite materials such as those described in Silva [29] and Andrianov et al. [30] can be put in the same framework. Furthermore, the present approach can be directly applied to condense macro cells of a homogeneous chain, when considering a so-called multi-field approach of vibrations as in Vasiliev et al. [31]. This can be illustrated by the case of figure 4 with m1=m2=m, k1=k2=k3=k and k4=0.
Figure 5.
Illustration of a colinear spring–mass systems with alternating regular and one-degree-of-freedom springs.
Illustration of a colinear spring–mass systems with alternating regular and one-degree-of-freedom springs.
Third application: bubbles vibrating in an acoustic field
The third application concerns the case of gas bubbles in water vibrating in an acoustic field. The dynamics of such bubbles is a very complex and important field of study, as its behaviour is highly nonlinear for large radial changes [32] so that it can lead to locally extreme conditions of velocity, pressure and temperature. The study of the linear vibrations of the bubble surfaces is also very important as their acoustic response, when submitted to an acoustic field, can be used as a tool to identify and count individual bubbles from a gas leakage [33]. In the presence of several bubbles, there is interaction and resonance because of coupling through radiated acoustic pressure from one bubble to the other. This effect can be modelled by this theory for the bubble vibrations in their linear regime.The corresponding normalization is demonstrated here by starting from the coupled linearized equations of motion of two identical bubbles [34-36]
and
where r denotes the variation of the radius R of the jth bubble compared with its equilibrium value, R0, i.e. R(t)=R0+r(t); α=R0/d is the ratio of this equilibrium and the distance, d, between the two bubbles; pac is a force acting on the system that is proportional to the incident acoustic field in which the bubbles are residing; and ω0 is the natural Minnaert's frequency of resonance of the bubbles. The factors δ are coupled damping coefficients that are here chosen by symmetry, such that δ12=δ21=δ and δ22=δ11=δa.The first step to normalization is the premultiplication of the system, expressed in matrix form, by the inverse of the matrix . In the frequency domain, this gives
where the frequency-dependent equivalent stiffness and mass of the bubbles are and . The division of both sides of this equation by κ then provides the normalized equation with λ=ω2μ/κ and right-hand side f=[1 1]pac/[κ(1+α)].All analysis in this paper can therefore be directly applied to analyse the vibration of gas bubbles in liquid. In particular, all the results about the transfer functions, g1, from the first to the last mass correspond to the transfer functions between the two bubbles in the present case, N=2. Note that even though the boundary conditions of the bubbles might not formally be equivalent rigid attachments, this model can perfectly be used, without any approximation or error, to analyse the two bubbles. One advantage of the theoretical model of this paper is that otherwise difficult to measure properties of the bubble–fluid system can be evaluated: by matching the predicted loci of the resonances and zeros to their evaluations obtained in a well-controlled experimental set-up, the actual values of parameters such as damping can be appropriately tuned. In the context of identification of gas leaks from pipes, natural methane seeps, or from undersea carbon capture and storage facilities [33], one can also push the analysis further to assess how the cross-bubble coupling affects the acoustic scattering from an ensemble of bubbles and how the consideration or non-consideration of this coupling impacts the measurements of the number of bubbles and quantity of leaking gas. The vibrational behaviour of pairs of bubbles with regard to that of single bubbles is also important in shock wave lithotripsy [37] as the source image of a bubble close to the surface of a kidney stone is exposed to a symmetric source image.General applicability. As illustrated in the previous sections and in figure 6, the work presented here can generally be applied and extended to various fields and the d.f. are not restricted to displacements. They may, for example, be rotations of a beam or values of the acoustic pressure at different points of a waveguide. Note that figure 6c corresponds to a circular chain with damage whose transfer functions have been studied in [13], section 5, appendix D, starting from this theory.
Figure 6.
Schematic of some fields of application of this work: (a) chain of molecules in a single-walled nano-tube; (b) vibrating diatomic lattice with alternating springs or alternating masses; (c) vibration of a bladed disc and its simplified model as a circular vibrating system; and (d) acoustically excited gas bubble in liquid.
Schematic of some fields of application of this work: (a) chain of molecules in a single-walled nano-tube; (b) vibrating diatomic lattice with alternating springs or alternating masses; (c) vibration of a bladed disc and its simplified model as a circular vibrating system; and (d) acoustically excited gas bubble in liquid.
Exact expressions of the nominal and disturbed transfer functions
Considered in this section are the responses or transfer functions between a unit force applied at the kth mass and the displacement of the jth mass. Such transfer functions are denoted
where e is a unit vector with only non-zero coefficient at position j, in the general, disturbed, case while the nominal transfer functions in the particular case of absence of disturbance are
The transfer functions for general input and output vectors, f and c, are available from linear combinations of those as . Both the nominal and disturbed transfer functions exhibit remarkable characteristics that are presented further in the paper. Exact expressions of the transfer functions are first derived in §3b,c, by making use of their link with Chebyshev polynomials which is first explained in §3a.
Transfer functions and Chebyshev recurrence relations
The nominal transfer functions have been studied at least since the time of Rayleigh. They have interesting properties by themselves because of the structure of T, which, for a fixed value of N and k, and even with (fixed) non-zero disturbance, has the inherent recurrence relation of Chebyshev polynomials, presented here in the general case,
Such recurrence relations are standard in the generation of orthogonal polynomials (e.g. [38]). They are in particular the recurrence relations for the Chebyshev polynomials of second kind, U(x), which are defined by
together with the initial functions, U0(x)=1 and U1(x)=2x. An alternative trigonometric form of these polynomials is [38], eqns (22.3.16)
Note that equation (3.3) differs only from equation (3.4) by a shifting and scaling of the argument (x=1−λ/2). This is the same kind of operation as the shift used to obtain the shifted Chebyshev polynomials (e.g. [39], p. 179). The recurrence relations and the definitions of Chebyshev polynomials allow easy derivation of the transfer functions, as presented in the next sections.
Explicit expression of the nominal transfer functions
Explicit expressions of nominal transfer functions are summarized in the following lemma.
Lemma 3.1
Consider the tri-diagonal dynamic system
where
T
is the tri-diagonal matrix of dimension
I
is the identity matrix of dimension
N
and
e
is the .Then, the components, g(λ,0), of the response, (λ)=[g1,(λ,0)…g(λ,0)]T, are
and
where the function
U(x) is the Chebyshev polynomial of second kind of order
j (U0(x)=1,U1(x)=2x,U(x)=2xU(x)−U(x), for
j=1,…). By symmetry, g(λ,0)=g(λ,0) and
g(λ,0)=g(λ,0) for all λ,N,j,k.
Proof.
For s=0 and fixed values of N and k, one uses the notation x(λ)=g(λ,0) for compactness. The recurrence relations above and below row k of equation (3.6) give
and
Considering first 2≤k≤N−1, the first and last rows of equation (3.6) may be written x2(λ)=2(1−λ/2)x1(λ) and x(λ)=2(1−λ/2)x(λ). Along the recurrence relations, they give
and
and, in particular, the expressions of x1 and x in terms of x, x1(λ)=x(λ)[U(1−λ/2)]−1 and x(λ)=x(λ)[U(1−λ/2)]−1. Substituting these in equations (3.11) and (3.12), at j=k−1 and j=k+1, respectively, and carrying the results in the kth row of equation (3.6), gives
Grouping the two first terms in brackets on the right-hand side of this equation and using the recurrence relation of Chebyshev polynomials results in
Combining the two terms in brackets on the right-hand side of this equation, considering the numerator, and using the following relation k−1 times (with q=k,k−1,…,2 and x=1−λ/2):
and then the recurrence relation of Chebyshev polynomials a last time, gives
which can then be combined with equations (3.11) and (3.12) to give the expected results for k=2,…,N−1. Note that equation (3.15) can be obtained by solving for 2x in the two occurrences of equation (3.4) obtained for j=N−q and q−1, respectively.For k=1, the first row of equation (3.6) and equation (3.12) (which also holds for k=1) gives x(λ)=1/(U(1−λ/2)) and the expected result. The same approach holds for k=N. The two symmetry properties can be readily verified from equations (3.7) and (3.8). □It will be shown in §4 not only that exact expressions of the poles and zeros of the transfer functions can be evaluated from the expressions of the transfer functions but also that their properties can advantageously be studied directly.Explicit rational expressions of the transfer functions obtained by making use of lemma 3.1 are provided in table 1. While poles and zeros can be evaluated from the expressions of the numerator and denominator polynomials, this becomes unfeasible or unpractical for large dimension systems. Explicit expressions of the poles and zeros will be provided in §4a,b and will clarify some of their properties. A particular case of the transfer functions from first to last masses that will be further studied is highlighted in the following corollary.
Table 1.
Samples of the nominal transfer functions from lemma 3.1 for the normalized system.
Dim. N
input k
output j
transfer function gj,k(λ,0)
poles
zeros
1
1
1
1/(−λ+2)
2
—
2
1
1
(−λ+2)/(λ2−4λ+3)
1,3
2
1
2
1/(λ2−4λ+3)
1,3
—
3
1
1
(λ2−4λ+3)/(−λ3+6λ2−10λ+4)
0.5858,2,3.4142
1,3
1
2
(−λ+2)/(−λ3+6λ2−10λ+4)
0.5858,2,3.4142
2
1
3
1/(−λ3+6λ2−10λ+4)
0.5858,2,3.4142
—
2
2
(λ2−4λ+4)/(−λ3+6λ2−10λ+4)
0.5858,2,3.4142
2,2
Samples of the nominal transfer functions from lemma 3.1 for the normalized system.
Corollary 3.2
Equations (3.7) and (3.8) collapse to
g(λ,0)=g1,(λ,0)= U/U, g(λ,0)=g(λ,0)=U/U, g1,1(λ,0)=g(λ,0)=U/U, and
g(λ,0)=g1,(λ,0)=1/U
where, for brevity,
U
denotes the value
U(1−λ/2).
Explicit expression of the disturbed transfer functions
The following theorem provides the disturbed transfer functions for any values of s, N and k.
Theorem 3.3
Consider the dynamic system (T−λI−sS)g(λ,s)=e, where
T
is the tri-diagonal matrix of dimension N with all twos on its diagonal and all negative ones on its neighbour diagonals,
I
is the identity matrix of dimension N,
S=slsTr
is a rank one disturbance matrix, such that
sr=sr1e1+se
and
sl=sl1e1+se, and
e
denotes the jth unit vector. Denote α=ss
for n,m=1,N.The components, g(λ,s), of the response vector,
g(λ,s)=[g1,(λ,s) … g(λ,s)]T, are
for j=1,…,k, and
for j=k,…,N, and, in both cases, Q(λ,s)=U−s[(α11+α)U+(α1+α)].Here, for brevity, U
denotes the value U(1−λ/2) where U(x) is the Chebyshev polynomial of second kind of order j (U0(x)=1, U1(x)=2x, U(x)=2xU(x)−U(x), for j=1,…).The disturbed transfer function may be expressed as a rank-one update of the nominal transfer function (details of this may be found in [13,25]), which gives
The nominal transfer functions are available from lemma 3.1. For the case 1≤j≤k≤N, one has g(λ,0)=UU/U and the relations of corollary 3.2. Substituting these expressions in equation (3.19) gives equation (3.17). Equation (3.18) is obtained similarly. □Particular cases appear. If j≤k, one sees from expression (3.17) that the coefficient of α11 disappears if j=1, while those of α or α1 disappear if k=N or j=k, respectively. Similarly, for j≥k, α11, α or α1 disappear if, respectively, k=1, j=N or j=k.
Properties of the nominal transfer functions
The expressions of lemma 3.1 are completely general for any regular collinear system that has identical point elements and connectors between two neighbour elements, independently of their physical nature or the real or complex value of their parameters.The availability of the exact expression of the responses, particularly in terms of Chebyshev polynomials, allows the identification of properties of the nominal system. Some of these are now presented and then illustrated on the damped benchmark and multi-d.f. problem introduced in §2c(i),(ii). A particular focus is on the position of poles and zeros.
Location of nominal poles
For the poles of the nominal system, one has the following result, which can be, and has been, derived by various methods. Notably, the poles are the eigenvalues of the tri-diagonal matrix T and they are the roots of a shifted Chebyshev polynomial [38], eqn 22.16.5.
Corollary 4.1
Using the same notations as in lemma 3.1, for a given value of
N, the poles of any transfer function,
g(λ,0)=cT(T−λI)−1f, share the same poles which are the roots of the shifted Chebyshev polynomial of second kind and order
N, U(1−λ/2). In particular, these roots,
are real and all located within the finite range 0<λ<4.Trivially, from lemma 3.1, since all g(λ,0) are rational polynomials with denominator [U(1−λ/2)]. The roots, , of U(.) can easily be obtained from equation (3.5). □The fact that all poles are known and real can be used to derive properties of any particular system that may be expressed as the nominal system. This is illustrated on two applications.
Damped benchmark
In the case of the damped benchmark problem, the Chebyshev polynomials have argument
Inverting this relation, the characteristic equation ω2 m−i(λc+ca)ω−(λk+ka)=0 follows. There are therefore two possible values of ω corresponding to any value of λ,
As all poles, in the λ domain, are in the open real interval 0<λ<4 and since (λk+ka)m>0, their corresponding values ω1,2(λ) in the ω domain have a strictly positive imaginary component as long as c or ca>0. The critical damping, c or ca, can be identified by the value that separates regions where the argument of the square root is positive or negative. When the argument is negative, the ω-eigenvalue is overdamped, i.e. purely imaginary so that the corresponding modal component of an impulse response is non-oscillatory in nature. In general, the critical damping values vary from one eigenvalue to another. This is illustrated by the root loci of figure 7 for N=5 and varying values of the additional viscous parameter, ca.
Figure 7.
Loci of the (a) five pairs of roots and (b) three pairs of zeros of the damped benchmark for N=5, k=0.7, m=1.3, c=0.4, ka=0.5 and ca varying from 0 to 2.15. The circles, crosses and squares indicate the locations, respectively, for ca=0 and the two smallest critical values of the poles, ca=1.7837 and 2.0980, while the stars indicate the location of the zeros at their first critical value, also at ca=2.0980.
Loci of the (a) five pairs of roots and (b) three pairs of zeros of the damped benchmark for N=5, k=0.7, m=1.3, c=0.4, ka=0.5 and ca varying from 0 to 2.15. The circles, crosses and squares indicate the locations, respectively, for ca=0 and the two smallest critical values of the poles, ca=1.7837 and 2.0980, while the stars indicate the location of the zeros at their first critical value, also at ca=2.0980.The poles for ca=0 are ω1,2(λ)={±0.7261+0.0412i,±0.9484+0.1538i,±1.1691+0.3077i,±1.3368+0.4615i,±1.4368+0.5742i}, and the corresponding critical values are ca={1.7837,2.0980,2.3432,2.4770,2.5302}.
Multi-degree-of-freedom benchmark
Turning to the particular multi-d.f. benchmark, there is a total of 3N+2 masses for N main masses. For N=3 and 11 masses, there are three roots of the normalized system, for j=1,…,N, i.e. λ=0.5858,2.0000,3.4142. Then, solving for ω2 provides nine root values of ω2. The consideration of the additional roots of the polynomial , as discussed in §3b, provides a supplement of two poles, , that correspond to λ=5. The last possibility of resonance cancels the denominator of the right-hand side of equation (T−λI)x=fu at ω2=3ωs2. For particular values of m=2, ms=3 and ks=0.7, this corresponds to a degenerate equation with . There is no additional resonance as expected, and the whole set of 11 poles is .
Location of nominal zeros
Contrarily to the poles that are identical for any transfer function, the location of zeros varies from one transfer function to the other depending on the particular input and output vectors. The number of zeros itself may vary from as much as N−1 to as little as zero. Using lemma 3.1, some properties of the zeros of the mass to mass transfer functions, g(λ,0), are now derived.
Corollary 4.2
Using the same notations as in lemma 3.1, for a given value of , , are the union of the roots of two shifted Chebyshev polynomials of second kind, U(1−λ/2) and
U(1−λ/2), of respective orders
and
. These zeros,
and
are real and located within the finite range 0<λ<4.Trivially from the lemma and the evaluation of the roots of the two Chebyshev polynomials at the numerator of the transfer functions. □The existence and location of the zeros appear to be slightly more elusive than those of the poles. This is particularly true when general input and output vectors, f and c, are selected. Even if such vectors were real, there would be no insurance that the zeros of the transfer function g(λ,0)=cT(T−λI)−1f would be real. This might seem a contradiction as this transfer function would be a linear combination of real polynomials with real poles and zeros, but is simply explained by the fact that any given real polynomial of order N−1 or less may be generated as a real linear combination of the numerator of the nominal transfer functions , and that polynomials with real coefficients may have complex roots.The general case does not preclude the existence of properties in particular cases. In corollary 4.2, for unit input and output vectors, one has full information on the existence and location of zeros. This includes the following results.
Corollary 4.3
Corollary (4.2) shows that the zeros of the nominal transfer functions
g
have double multiplicity when
j−1=N−k. Higher multiplicities are impossible.
Corollary 4.4
Corollary (4.2) shows that there is only one finite zero of the transfer function between the first to the penultimate masses of the system. Furthermore, this zero is simply at frequency λ=2.As for the case of poles, results of corollary 4.2 can be applied to any system that may be expressed in the form of the nominal system. This allows one to draw conclusions on the location of zeros for a wide range of problems. For example, in the case of the damped benchmark problem, one can use the inverted relation (4.3) and the discussion that follows to prove that all zeros of this problem are complex, with a strictly positive imaginary component as long as c or ca>0.Additional information may be obtained from Chebyshev polynomial properties.In the case of the damped benchmark for N=5 and considering the transfer function from the second to the third masses, g2,3, there are three a priori zeros, which are plotted in figure 7, as m1=1 and m=2. One can, however, note that each pair of zeros coincides with and therefore cancels a pair of poles. This corresponds, in the case of the first zero, to the value (j=4)/(N+1), which equals (j=2)/(m+1)=2/3. The cancelling of the poles by a zero in the transfer function between the second and the third masses for N=5 is a general property that is independent of the parameter values, k,c,ka,…. The critical values at which the zeros have or not a real part are defined by the same equation in terms of λ as that for the critical values of the poles.While all zeros of all transfer functions from one to another mass coincide with poles in the case N=5, its other transfer functions g2,2,g3,3,g4,4,g2,4,g4,2 still exhibit actual zeros because of their double multiplicity. The case N=5 is somewhat particular, as, in general, not all zeros of unit transfer functions coincide with poles. For example, there is no such coincidence for N=4.Considering the multi-d.f. benchmark, again for N=3 and a total of 11 masses, all the zeros of the unit transfer functions between the third, sixth and ninth masses are provided by corollary 4.2. The only possible cases where zeros of unit transfer functions between the main masses may exist correspond to non-zero values of m1 and m. This can only happen when j or k=2, i.e. when the second main mass, i.e. the sixth mass, is involved, at the location . This zero is cancelled by a pole for g1,2 and g2,3, so that the only actual zero with double multiplicity is only present in the transfer function g2,2 at the sixth mass. The additional possibility of zeros between the three main masses has to be considered in the case where the numerator of cancels on the right-hand side of (T−λI)x=fu. For m=2, ms=3 and ks=0.7, this happens at the locations where λ=5. These are actual zeros, as λ=5 is not a root of the normalized system, and the denominator of does not cancel at these zeros.
Magnitude of the nominal transfer functions
The focus of this section is on the particular transfer function from the first to the last masses. As it is the inverse of a Chebyshev polynomial, as presented in corollary 3.2, its properties may be obtained directly from properties of these polynomials. In particular, the following well-known equioscillation property leads to the fact that, in the range of real parameter λ∈[0,4], the transfer function is outside an ellipse.
Lemma 4.5
The scaled Chebyshev polynomial of second kind, , equioscillates
j+1 times between −1 and 1 in the range
x∈[−1,1]: (j+1) local extrema in this range and their values alternate between −1 and 1 for increasing values of
x. The extrema occur at
and the last extremum is a maximum, .In the range x∈[−1,1], θ=acos(x) is real and decreases from π to 0. From equation (3.5), one sees that the scaled function is , whose alternating extrema −1 and 1 are reached at (j+1)θ=(j+1/2)π,(j−1/2)π,…,1/2π. □The scaled Chebyshev polynomials of second kind are presented in figure 8 for the few first orders. The reader intrigued by the ‘white curves’ in this figure is referred to an ‘asin–acos’ version of the plots where the curves are straight lines and to Ortiz & Rivlin [40], where this phenomenon is explained. Such properties of the scaled Chebyshev polynomials translate into properties of the transfer functions. Notably, lemma 4.5 allows one to identify properties of the magnitude of the g1,(λ,0) transfer function as shown in the below corollary of the lemma and corollary 3.2.
Figure 8.
Illustration of scaled Chebyshev polynomials for j=0,…,4 in (a), and j=0,…,29 in (b), as a function of x=1−λ/2. The points corresponding to the loci of the last roots are indicated by squares in (a).
Illustration of scaled Chebyshev polynomials for j=0,…,4 in (a), and j=0,…,29 in (b), as a function of x=1−λ/2. The points corresponding to the loci of the last roots are indicated by squares in (a).
Corollary 4.6
Using the same notations as in lemma 3.1, for any real value λ∈[0,4], the absolute value of the transfer function,
from the first to the last degree of freedom, is larger than
. In other words, in the real plane, all points (λ,2g(λ,0)) are outside a circle of radius 2 centred at (2,0). The transfer function is tangent to this exclusion region at the points
. Also, this transfer function has no finite zero.Although lemma 4.5 is trivial and has been previously noted, as incidentally in [41], property 4.6 of the transfer function appears to not have been identified before. The exclusion zone of the magnitude of the transfer function g is illustrated in figure 9 for N=7. For any other system dimension, N, there will be N+1 tangency points between the same exclusion zone and the transfer function. They are to be put in the light of those of the scaled Chebyshev polynomial to the constant lines 1 and −1 illustrated in figure 8. Their location at equiangles of the exclusion zone in the λ–2g plane is evident from their expression in theorem 4.6.
Figure 9.
Illustration of the exclusion zone for the values of the nominal transfer function g for N=7. No values (λ,2g(λ)) can lie within the exclusion circle and g(λ) is tangent to this zone at eight points marked by squares.
Illustration of the exclusion zone for the values of the nominal transfer function g for N=7. No values (λ,2g(λ)) can lie within the exclusion circle and g(λ) is tangent to this zone at eight points marked by squares.
Properties of the disturbed transfer functions
The properties of the disturbed transfer functions may be derived from their exact expressions given in theorem 3.3. As previously, the focus is now on the location of poles and zeros.
Location of disturbed poles
The locations of poles of the disturbed transfer functions vary depending on the particular disturbance vectors d and d but, for any choice of these and a given magnitude, s, they are identical for all transfer functions. They are characterized by a necessary zero denominator Q(λ,s), Q(λ,s)=U(1−λ/2)−s[(α11+α)U(1−λ/2)+(α1+α)]=0 and are therefore the zeros of a polynomial of order N in λ. An alternative point of view is to consider Q(λ,s) to be a function of s (at a fixed value of λ). Two cases may happen: either both U(1−λ/2) and the coefficient of s are zero, in which case any value of s corresponds to a pole, or there is just one pole which is the single root of the polynomial of order 1 in s. The first case might only happen at a finite number of values of λ as U(1−λ/2) and [(α11+α)U(1−λ/2)+(α1+α)] are polynomials in λ of order N and N−1, respectively, which therefore have only N and N−1 zeros each. An example of the first case is given below. In the second case, the poles are simply defined by the roots, as a rational function of λ. This is expressed in the following theorem, whose proof is trivial.
Theorem 5.1
Using the notations and assumptions of theorem 3.3, for any vectors
c,
f
and values of α's, all poles of the transfer function
cT(T−λI−sS)−1f
are identified by either the function
for any value of λ, or any value of s at the discrete values of λ, if any, that cancel both the numerator and denominator of expression (5.1).The following particular case, which includes additional springs or dampers connecting the first and last massses, is remarkable.
Theorem 5.2
Using the notations and assumptions of theorem 3.3, if s=−sr
or s=−s
then the poles,
, of the transfer function g(λ,s)=cT(T−λI−sS)−1f
are equal to
for any value of s and
for any value of λ. The ‘floor’ value fl((N−1)/2) is (N−1)/2 if it is an integer, or N/2−1, otherwise. Note that the value of
could be expressed as a ratio of Chebyshev polynomials of second kind with half ordersIn either case of equation s=−s or s=−s, one has (α11+α)=−(α1+α). The denominator Q(λ,s) of the transfer functions with unit input and output vectors described in theorem 3.3 is therefore equal to
It is a polynomial of strict order N in λ and therefore has exactly N roots which are the poles of the transfer functions. By definition of the Chebyshev polynomials of second kind, where θ=acos(1−λ/2) so that
As λ is a function of , the poles can be identified by the unique values of θ in the interval [0,π] that cancel the right-hand side of equation (5.6). Particular attention is paid to the values θ=0 and π as they also cancel the denominator . Looking at the numerator, the sinusoidal terms can be developed as [38], eqns(4.3.24/35)
and
which shows that
The cosine function is zero for all values . Those, however, only correspond to distinct real zeros, , of Q for the fl((N+1)/2) values, k=0,…,fl((N−1)/2). This is due to the cyclicity of and because the case k=N/2 for even N cancels both the numerator and the denominator in equation (5.9). In the latter case, L'Hospital's rule gives
This shows two things, when N=2k is even: on the one hand, is not a pole for all values of s but, on the other hand, the pair is always a pole—this is a particular case of equation (5.1) for any value N.Now, looking at the term of equation (5.9) in brackets, in the interval 0<θ<π, each of its roots corresponds to a pole since in the selected interval. If both sinusoidal terms in the brackets were zero at the same time, one would have θ=2πk/(N+1) for k=1,2,…,fl(N/2) at the same time as , i.e. simultaneously to [38], eqn(4.3.35)
The sine term cannot cancel as its argument is 0<θ/2<π/2. The cosine term cancels if its argument (N/2)θ=πkN/(N+1) is equal to π/2+jπ where j=kN/(N+1)−1/2 is an integer. This is however impossible since kN/(N+1)−1 and kN/(N+1)=k−k/(N+1)>k−1/2 since k≤fl(N/2). Therefore, the term in brackets in equation (5.9) can only cancel in the interval 0<θ<π when
The only values of θ that cancel the sine terms and have not yet been considered are θ=0 and also θ=π if N is odd. These values however also cancel the denominator in equation (5.9) and the actual non-zero values of Q(λ,s) are found to be the following by L'Hospital's rule:
and
In both cases, it is generally non-zero, except at s=[(N+1)/(N−1)]/tr(S), which also corresponds to equation (5.1). All poles have thus been identified. The transfer function with general vectors c and f has the same poles as those with unit vectors. □Several general properties of the location of the poles can be derived. These include locations where a pole will always be present and the opposite of regions from which poles are excluded. The location of multiple poles and the real or complex character of the poles are also properties important to know in control. A sample of such properties are proved in §5a(i)–(iii). While these results can be directly applied in the situation of any systems that can be normalized, they can also be extended by following strategies similar to those used in the proofs.
Fixed location of poles for any system dimension
The following corollary of theorem 5.2 identifies fixed pole locations for any system dimension.
Corollary 5.3
Using the notations and assumptions of theorem 3.3, if
s=−sr
or
s=−sl
then(i) for any even
N=2k, k=1,2,…, the pair
corresponds to a pole;(ii) for any odd
N=2k+1, k=0,1,…, the pair (λ=2,s=0) corresponds to a pole.Furthermore, in either case, these poles are the only ones that are independent of the particular, respectively, even or odd values of .That the pair is a pole for any even dimension N=2k was noted in the proof of theorem 5.2 and it can easily be checked from equation (5.3).If N=2k+1 is odd, then corollary 4.1 shows that λ=2 is a nominal pole, i.e. it is a pole at s=0. Alternatively, two situations can be considered in the odd N case, using theorem 5.2. First, if N=4 m+1 for m=0,1,2,…, equation (5.2) with k=2 m shows that is a pole for any value of s, therefore also for s=0. Second, if N=4 m+3 for m=0,1,2,…, equation (5.3) shows that .The unicity of the invariant points for varying even or odd N is proved by examining different values of N. First, for odd N, there is only one possible pole for N=1, i.e. , for any s. This is a different value from those for N=3.Second, for even N and referring to figure 10, the poles for N=2 are along two lines in the real λ,s plane: ‘L1’ such that for any s and ‘L2’ such that s=1/tr(S)(3−λ). (Note that, in order to show the latter, one can apply and to and . This gives and . Therefore, U1/2(x)/ U−1/2(x)= 1+2x and U1/2(1−λ/2)/U−1/2(1−λ/2)=(3−λ).) Now, considering these two lines, one may consider the poles for N=4. These happen along three lines: the vertical constant lines, at and for any s that intersect L2 at the two points p1=(λ=0.3820,s=2.6180/tr(S)) and p2=(λ=2.6180,s=0.3820/tr(S)). The line L1 intersects the curve of equation (5.3) for N=4 at the point as acos(1−λ/2)=π/3 at λ=1. There is a single fourth intersection between the poles at N=2 and N=4, that is, the fixed point (λ=4,s=−1/tr(S)). In order to prove this, the expressions of are matched for N=2 and N=4, which gives
and
where the trigonometric identity was used to generate the third and the fourth identities. Note that the passage from the first to the second equation through multiplying both sides of the first equation by is only valid if this term is non-zero. The fourth equation can be expressed as , i.e. λ−λ2/4=0. This results in two possible common poles: the actual fixed point, p4=(λ=4,s=−1/tr(S)), and the possibility λ=0 that has to be discarded as it cancels the denominators of equation (5.15), while L'Hospital's rule shows that its right- and left-hand sides differ. One can easily verify that the three points p1 to p3 are not poles in the case N=6. □
Figure 10.
Location of poles in the case of corollary 5.3, for N=2,4 and tr(S)=2.4. The point p4 is the only pole common to all even values of N.
Location of poles in the case of corollary 5.3, for N=2,4 and tr(S)=2.4. The point p4 is the only pole common to all even values of N.
Location of multiple poles
The frequencies and values of the magnitude disturbance that lead to multiple eigenvalues are important information. In the context of theorem 5.2, the poles can only have multiplicity two at most. The location of the double eigenvalues as well as some of their properties can easily be obtained from the theorem as detailed below.
Corollary 5.4
Using the notations and assumptions of theorem 3.3, if
s=−sr
or
s=−sl
then(i) the poles have at most multiplicity two;(ii) there are fl((N+1)/2) poles,
for
k=0,…,fl((N−1)/2), with double multiplicity that are described by equations (5.2) and (5.3);(iii) the double multiplicity poles can only be along the two branches of 1/(tr(S)(1−λ/2)) for λ in the open interval 0<λ<4.The two first theses are trivial corollaries of theorem 5.2. In order to prove the third thesis, the expressions of are developed. This gives
and, using the identities and ,
Therefore, . Finally, equation (5.2) shows that lies in the open interval. □Damped benchmark.The identification of the location of the multiple poles is illustrated in the case of the damped system application of §3a. Starting from the non-normalized system, equation (2.6), corollary 5.4 can be directly applied provided that the frequency parameter, ω, and the disturbance magnitude of interest, s′=s(k+iωc), are properly normalized. For example, if N=5, there are only fl((N+1)/2)=3 poles with double multiplicity at locations for j=0,…,2 with , , and . Explicitly, the multiple poles are at the (λ,s) locations (0.2679,1.1547/tr(S)), and (3.7321,−1.1547/tr(S)). These are consistent with the identified property that and it is clear that the pole is the only possible multiple pole with infinite value of s for any dimension N.Two simple operations provide the location of poles in the (ω,s′) plane: the solution of the following quadratic equation in ω, and the further scaling and shifting, s′=s(k+iωc), with the particular solutions :
For the particular values k=0.7, m=1.3, c=0.4, ka=0.5 and ca=0.3, this results in the four finite poles, (ω,s)=(±0.7102+0.1566i,(0.7360±0.3280i)/tr(S)),(±1.3852+0.6895i,−(0.4898±0.6398i)/tr(S)) and in the two poles at ω=±1.1325+0.4231i and infinite s. The existence of these double poles is illustrated in the maps of figure 11.
Figure 11.
Illustration of multiple pole location. Log10 of the absolute value of the transfer function of equation (2.6) for N=5. All 10 poles are single in general cases such as for s′=0.4066+0.2367i in (a) while there is a double pole, which is indicated by the added white box, at ω=0.7102+0.1566i for s′=0.3066+0.1367i in (b). The parameters are k=0.7, m=1.3, c=0.4, ka=0.5, ca=0.3, arbitrary random input and output vectors, and c, sr(1)=1.5, sr(N)=−1.5, sl(1)=0.8, sl(N)=−0.8, so that tr(S)=2.4.
Illustration of multiple pole location. Log10 of the absolute value of the transfer function of equation (2.6) for N=5. All 10 poles are single in general cases such as for s′=0.4066+0.2367i in (a) while there is a double pole, which is indicated by the added white box, at ω=0.7102+0.1566i for s′=0.3066+0.1367i in (b). The parameters are k=0.7, m=1.3, c=0.4, ka=0.5, ca=0.3, arbitrary random input and output vectors, and c, sr(1)=1.5, sr(N)=−1.5, sl(1)=0.8, sl(N)=−0.8, so that tr(S)=2.4.
Real and complex character of the poles
In the context of theorem 5.2, if the argument λ is real, the value of is also real. This might not appear obvious because the arccosine of a real number may be complex. A formal proof of this statement is therefore given and it is incidentally shown that the poles are antisymmetric about λ=2 when N is odd. It might be worth noting that there is no (anti-)symmetry of the location of the poles when N is even.
Lemma 5.5
Using the same notations and assumptions as in theorem 5.2, if λ is real, i.e. if
imag(λ)=0, the function
is real, and
where
x=1−λ/2. Incidentally, if λ is real,
is antisymmetric about λ=2 if
N
is odd, i.e.
if .An explicit expression of the complex value acos(x) in terms of the logarithm can be derived as follows. One denotes θ=acos(x)=θ+iθ where θ and θ are real. As is real its imaginary component may only be zero if either θ=kπ for an integer k or θ=0. The latter corresponds to acos(x) real and values of λ that are in the interval 0≤λ≤4. It will be treated later. The former case corresponds to θ complex, real regions |x|>1 and intervals λ<0 and λ>4. As the cosine is a cyclic function, there are only two distinct cases, θ=0 or θ=π, to consider, which, respectively, give or . These are quadratic equations in e: one has either e2−2xe+1=0 or e2+2xe+1=0 whose roots are, respectively, and . One may only consider one of the two roots in each case as either gives the same ratio of sines in equation (5.12). One chooses the plus sign if x>1, λ<0, θ=0 and the minus sign if x<−1, λ>4, θ=π. The imaginary part θ of acos(x) is obtained by taking the logarithm of the expressions of e. Thus,
Evaluating the sines of equation (5.12), gives, for λ<0, . Similarly, by splitting the terms of the exponents, one obtains for λ>4,
Equation (5.21) is obtained straightforwardly for λ<0 from theorem 5.2 and the fact that (which can be easily verified). Equation (5.22) is similarly obtained for λ>4 by additionally enumerating the four possible values, i,−1,−i,1, of e depending on the modulo m=N mod 4. The expressions are real since |x|>1.The proof for the interior interval −1≤x≤1, i.e. for the case θ=0, follows a similar path. One has and . Solving the quadratic equation in ei gives , of which a single sign, say negative, needs to be considered and, thus,
One arrives at equation (5.21) again by using and at equation (5.22) by enumerating the values of (−1)( for all values of the modulo N mod 4. The expressions are real since is real for real argument. The antisymmetry is directly verifiable by equation (5.22) and considering that x=±(λ−2) at λ=2−(±1)(λ−2). □The fact that a function f(λ) is real when λ is does not automatically imply that the reciprocal is true. It is, however, the case for as proved below.
Corollary 5.6
Using the same notations and assumptions as in theorem 5.2, if str(S) is real, then all the poles of the transfer function
g(λ,s) occur at real values of λ.The proof proceeds by identifying N real poles for any real value of s tr(S). As g(λ,s) is a rational function of λ with denominator of order N for any fixed s, complex poles are then impossible.The first fl((N+1)/2) real poles are independent of s and defined by equation (5.2). The remaining fl(N/2) real poles for real s tr(S) are counted by studying the expression of given in equation (5.3). This is done by showing that is made of fl(N/2) branches that are defined on separate real regions of the frequency parameter λ and such that each has values that vary continuously from to .Looking first at the function of θ, , in the interval 0<θ<π over which is real, one now shows that its fl(N/2−1) poles alternate with its zeros , j=1,…,fl(N/2): consider a particular pole, , for some integer j∈{1,…,fl(N/2−1)}. One may express it as so that the non-integer j(−) equals j(−)=j(N+1)/(N−1). Now, j(−)>j because (N−1)/(N+1)<1 as long as N>−1. One also has that j(−)Consequently, the value of the (λ-)function varies continuously from to when it goes from any pole to the next in the interval 0<λ<4 and there is a sign change at each pole. The sign change of the value of at the poles arises because the poles are single. Similarly, the fact that the zeros are single implies that each branch has to cut the s=0 axis only once. The signs at the left and right of each pole may be determined by checking that the value of the rational function at λ=0 is positive. As the function reaches first a zero when λ increases, the function is negative at values of λ slightly smaller than a pole. There are fl(N/2−1)−1 branches of the function that each vary continuously from to between each of its poles.One can look again at the poles of the transfer function g(λ,s) and summarize those that have already been identified in the range of real values 0≤λ≤4. From theorem 5.2, one knows that all poles are on the branches defined by real and at where , k=0,…,fl((N−1)/2) for any value of s. One already knows that any chosen real value s tr(S) corresponds to a pole if λ is equal either to one of the fl((N−1)/2)+1 real values or to one of the fl(N/2−1)−1 continuous branches going from one to another pole of . Considering the multiplicity of multiple poles, this corresponds to N−2 poles.As the denominator Q(λ,s) is a polynomial of order N in λ for any value of s, there are only N poles of the transfer function, g(λ,s). Its two last poles are below the smallest and above the largest poles of : the value of this real continuous function is when it reaches its first pole from below. There is a first branch that spans at least all negative values of s tr(S) and all positive values smaller than or equal to its value (N+1)/(N−1) at θ=0 (otherwise, there would be an excess of poles of g(λ,s) for s tr(S)<(N+1)/(N−1)). Similarly, the last branch spans at least all positive values of s tr(S). The value of the first branch must therefore be larger than (N+1)/(N−1) for all λ<0. From equation (5.1) of theorem 5.1, one knows that, as a ratio of polynomials with order of the numerator larger than the order of the denominator, the absolute value of is infinite when λ is infinite. The asymptote of the first (real and continuous) branch is therefore infinity, . Similarly, . Two additional continuous branches going from to have been identified. For any value of s tr(S), N corresponding distinct (counting multiplicities) real values of λ have therefore also been identified such that the pairs correspond to a pole and there are no other poles. □
Location of disturbed zeros
The zeros of the disturbed transfer functions vary both for different values of the disturbance parameters (S and s) and for different input and output vectors. In the case of unit input and output vectors, the exact position of zeros may be found as the zeros of the numerator in the transfer function expressions of theorem 3.3.
Theorem 5.7
Using the notations and assumptions of theorem 3.3, the zeros of the transfer function g(λ,s) cancel the polynomial
if j≤k, and the following one, if j≥k,The location of zeros depends in a more or less intricate way on the particular value of S. One may identify interesting properties in particular cases.
Exclusion region for zeros
The following transfer function from the first to last masses is remarkable:
Its zeros are indeed excluded from a region centred at λ=2 for any real argument λ. This property is illustrated in the colourmap of figure 12. The colours in each horizontal line of the colourmap represent the magnitude of the disturbed transfer function for a particular value of the disturbance magnitude, s. As illustrated in the figure, the theorems of location of zeros and poles allow one to predict the exact number of real poles and zeros for various values of s.
Figure 12.
Illustration of the location of zeros of g(λ,s) and their exclusion zone. Here, N=11, s(1)=0.3, s(N)=1.7, s(1)=−0.5, s(N)=0.2. The colour map in (b) represents the magnitude of the transfer function for various real values of the disturbance magnitude s. The location of the (real) zeros matches their prediction from corollary 5.8 as presented in (a) where one can note that, as expected, no zeros are present in the exclusion zone. Two particular transfer functions are plotted in (c) and (d), respectively, for s=1 and 1.6. The predicted zeros and poles at real values of λ can be counted in these transfer functions: there are five zeros and 11 poles in the first case, and nine zeros and five poles in the second case.
Illustration of the location of zeros of g(λ,s) and their exclusion zone. Here, N=11, s(1)=0.3, s(N)=1.7, s(1)=−0.5, s(N)=0.2. The colour map in (b) represents the magnitude of the transfer function for various real values of the disturbance magnitude s. The location of the (real) zeros matches their prediction from corollary 5.8 as presented in (a) where one can note that, as expected, no zeros are present in the exclusion zone. Two particular transfer functions are plotted in (c) and (d), respectively, for s=1 and 1.6. The predicted zeros and poles at real values of λ can be counted in these transfer functions: there are five zeros and 11 poles in the first case, and nine zeros and five poles in the second case.
Corollary 5.8
Using the notations and assumptions of theorem 3.3, there is no zero of the transfer function
g(λ,s) within a circle (1/2λ−1)2+(sα)2=1 of the real (λ,sα) plane. The curves of zeros are tangent to the circle at pairs of values,
for
m=1,…,N−1.As the zeros of the transfer function cancel the numerator 1+sαU(1− λ/2), they happen at values of λ and sα such that sα=f(λ), where f(λ) is defined as f(λ)=−[U(1−λ/2)]−1. If λ is real, so is the value of the Chebyshev polynomial U(1−λ/2). One knows from lemma 4.5 that equioscillates between −1 and 1 in the range x∈[−1,1]. One therefore has that in the range 0≤λ≤4. As f(λ) is the value of sα corresponding to a pole for a value λ, the first part of the proof is complete. The location of the tangency points is found from the location of extrema given in the equioscillation lemma 4.5. □
Real and complex character of the zeros
From theorem 5.7 and corollary 5.8, it is clear that not all zeros of the transfer function between the first and the last masses occur at real λ, even if the disturbance sα is real. One can, however, derive properties of the imaginary part of λ at the zeros from the exact expression (5.29). The next result gives exact expression of this imaginary component when the real part of λ is equal to two and N is even. It also shows that there is no zero of this transfer function with real sα when real(λ)=2 and N is odd. In this case, all zeros correspond to purely imaginary values of sα. Incidentally, this shows the corollary that, if sα is a complex number with non-zero real and imaginary parts, there are no zeros of the transfer function with real(λ)=2.
Theorem 5.9
Using the notations and assumptions of theorem 3.3, if g(λ,s)=0 and real(λ)=2, then
where both x=(1−λ/2)/i=−1/2imag(λ) and
are real. The hyperbolic sinusoidal functions have their usual definition
and
.All values of λ may be written λ=2−2ix for some real x. Theorem 3.3 and consequent equation (5.29) give that the zeros occur when sα=−[U(1−λ/2)]−1=−[U(ix)]−1. Sinusoidal expression (3.5) of the Chebyshev polynomials, , necessitates θ=acos(ix)=θ+iθ, where θ and θ denote real numbers. As
is imaginary, has to cancel, which means that θ=π/2 if 0≤θ≤π is chosen without loss of generality. Substituting this value gives the quadratic equation in e, e2+2x(e)−1=0, whose only possible solution is the positive so that . Substituting in equation (3.5) and using and e=i completes the proof with little algebra. □The results of theorem 5.9 are illustrated in figure 13 for a dimension varying from N=2 to 12. The fact that a zero occurs at s=−1/α for N=2 and any value of λ can easily be verified in the expression of the theorem for m=2. Also notable is that zeros occur only at real values of s for even N and at purely imaginary s for odd N. It is relatively easy to derive additional information about the locations where zeros are absent. For example, the following corollary, whose proof is trivial considering theorem 5.9, is another exclusion result.
Figure 13.
Illustration of the location of zeros of g(λ,s) when real(λ)=2, as predicted in theorem 5.9. Depending on the dimension N, specifically on the modulo m=N mod 4, the zeros occur only at purely real or purely imaginary values of s. Here, α=−0.85. The plots are sorted on the modulo, m= (a) 1, (b) 2, (c) 3 and (d) 0. There is no zero for N=1 and s=−1/α=1.1765 for N=2.
Illustration of the location of zeros of g(λ,s) when real(λ)=2, as predicted in theorem 5.9. Depending on the dimension N, specifically on the modulo m=N mod 4, the zeros occur only at purely real or purely imaginary values of s. Here, α=−0.85. The plots are sorted on the modulo, m= (a) 1, (b) 2, (c) 3 and (d) 0. There is no zero for N=1 and s=−1/α=1.1765 for N=2.
Corollary 5.10
Using the notations and assumptions of theorem 3.3, if the pair (λ,s) corresponds to a zero of the disturbed transfer function
g(λ,s), i.e. if
g(λ,s)=0 and if
sα
is a complex number with non-zero real and imaginary parts, then the real part of the λ parameter cannot be equal to 2. In other words, if
g(λ,s)=0, the concomitance of the three following conditions is impossible: real(sα)≠0, imag(sα)≠0, and real(λ)=2.
Applications of the properties of the disturbed systems
Besides the applications already discussed, two fields in which the current results and approaches about disturbed systems could particularly be used in future work are those of vibration localization [42,43] and mode veering [44-46].At least two directions may be followed, regarding the analysis of vibration localization, namely the consideration either of the eigenmodes of the disturbed system or of its forced responses. Based on the assessment of localization criteria in these two sets of vectors, such localization could be matched to properties of the systems and disturbance.The question of mode veering can be addressed from an analytical point of view in the context of particular disturbances considered in this paper. More specifically, as all possible multiple poles have been identified in the case considered in corollary 5.4, the conditions for mode veering or mode crossing are readily available. Mode crossing happens when the mode loci for a particular system and disturbance type go through one of these multiple poles, while mode veering happens when these loci pass closely but not through these particular poles.
Conclusion
The results presented in this paper about the location of zeros and poles of nominal and disturbed chains of particles are useful at several levels.First, they can be applied directly to actual physical and engineering systems with repeated particles, bubbles and substructures, possibly after adequate normalization similar to that demonstrated here for damped and multi-d.f. systems and bubble dynamics. The exclusion results ensure that no eigenvalues, zeros or values of the transfer functions can be in given regions of the frequency and disturbance parameters. Such insurance may thus allow one to design structures that are safer and better tolerant to uncertainties and variance in their properties. The identification of the actual locations of zeros, eigenvalues, multiple eigenvalues and pole-zero cancellations of the systems has potential utility in several fields, particularly in, for example, those of control and propagation of uncertainty.Second, the various techniques used in the paper can be applied and extended to other situations in order to generate further theoretical results. The added or disturbed components can, for example, be considered at other locations than at the two last particles. Three kinds of disturbance may then also be distinguished: the low-rank one as discussed here, the multi-rank case where the disturbance is a sum of several outer products and the more general case where several disturbance magnitudes, s1,s2,…, and their associated matrices, S1,S2,…, are considered. The location of zeros of transfer functions and their properties can also be studied with different, possibly arbitrary, input and output vectors.Third, the exact expressions and properties described here can be used as a basis for fundamental characterization and treatment of general dynamic systems. For example, it was explained in the paper that systems with different alternating mass values can be treated as a normalized system. A question is therefore whether there exists a fundamental set of spring–mass systems to which general systems can be reduced. Such a task of categorization also necessitates the consideration of various boundary conditions for the extremities of the spring–mass systems.Finally, the exact results presented here provide problem cases that can be used as reference benchmarks for various algorithmic techniques to evaluate zeros and roots. For example, approaches to evaluate multiple eigenvalues can be validated by verifying that they can locate the double eigenvalues of the damped benchmark discussed in the paper.