Keivan Esfarjani1,2,3. 1. Department of Mechanical and Aerospace Engineering, University of Virginia, Charlottesville, VA 22904, USA. 2. Department of Materials Science and Engineering, University of Virginia, Charlottesville, VA 22904, USA. 3. Department of Physics, University of Virginia, Charlottesville, VA 22904, USA.
Abstract
We consider the problem of heat transport by vibrational modes between Langevin thermostats connected by a central device. The latter is anharmonic and can be subject to large temperature difference and thus be out of equilibrium. We develop a classical formalism based on the equation of motion method, the fluctuation-dissipation theorem and the Novikov theorem to describe heat flow in a multi-terminal geometry. We show that it is imperative to include a quartic term in the potential energy to insure stability and to properly describe thermal expansion. The latter also contributes to leading order in the thermal resistance, while the usually adopted cubic term appears in the second order. This formalism paves the way for accurate modeling of thermal transport across interfaces in highly non-equilibrium situations beyond perturbation theory.
We consider the problem of heat transport by vibrational modes between Langevin thermostats connected by a central device. The latter is anharmonic and can be subject to large temperature difference and thus be out of equilibrium. We develop a classical formalism based on the equation of motion method, the fluctuation-dissipation theorem and the Novikov theorem to describe heat flow in a multi-terminal geometry. We show that it is imperative to include a quartic term in the potential energy to insure stability and to properly describe thermal expansion. The latter also contributes to leading order in the thermal resistance, while the usually adopted cubic term appears in the second order. This formalism paves the way for accurate modeling of thermal transport across interfaces in highly non-equilibrium situations beyond perturbation theory.
Transport theories of non-interacting quantum systems based on the Keldysh formalism, which treats non-equilibrium flow of charge or heat carriers in a one-dimensional (1D) geometry have been developed in the past. To the best of our knowledge, the first such development was performed in 1971 by Caroli et al. [1] where a Green’s function formalism was used to describe dynamics of electrons in a 1D crystal. Following the seminal work of Caroli et al., many other groups worked on similar formalisms and proved a formula for the transmission through the system, now widely used for both non-interacting electrons and phonons. The equilibrium version of it, namely , where G and are, respectively, the retarded Green’s function and the escape rates to the leads, was established by Meir and Wingreen [2] and in a similar form by Pastawski [3] in 1991. This formula holds for a non-interacting (or harmonic, in the case of phonons) system near equilibrium, meaning the chemical potential or temperature gradients are to be infinitesimally small. These assumptions might not always be realistic, especially in small (mesoscopic) systems subject to temperature differences over fractions of a micrometer, and a formulation for non-equilibrium situations and interacting systems is preferable for the sake of testing the domain of validity of the equilibrium formulas and more accurate description in the case of large interactions and large driving fields.The general and practical problem of interest is the description of the heat transport across interfaces between different materials; however, to keep it more general, we adopt a multi-probe geometry where the system in which scatterings occur is connected to multiple energy reservoirs, which impose their temperature, and cause the flow of heat carriers (see Figure 1). This model is used for mesoscopic systems where the carrier mean free path could be on the order of the system length, implying Ohm’s law of addition of resistances in series does not necessarily hold, and coherence can play an important role. The geometry of the reservoirs is fundamentally one-dimensional (1D), and if there is translational symmetry perpendicular to the current flow, one can use Bloch’s theorem to decouple the 3D system into many non-interacting 1D systems, each labeled by a quantum number, which is the transverse momentum. So in what follows, we assume such decoupling has been performed and we will be dealing with strictly 1D semi-infinite harmonic leads, although the central device is arbitrary in shape and structure and maybe connected to multiple 1D probes, assumed to be Langevin thermostats and assumed to be isothermal.
Figure 1
(Top) a general “molecular” multiprobe geometry where the device defined by atoms within the ellipse are connected to 4 reservoirs imposing a temperature or measuring a current. (Bottom) the 2-probe geometry. Atoms in each lead are connected to a harmonic thermostat at fixed temperature and . In the Buttiker probe, also called the self-consistent reservoirs geometry, to measure the “local” temperature, each layer of the central device may also be weakly connected to a fictitious thermostat at a temperature to be (self-consistently) determined so that the net heat flow from that probe to the device is zero.
For simplicity, we will use a classical description. A generalization to the quantum case will be inferred at the end. So in our classical treatment, the frequency is just frequency of vibrational modes and not the energy of phonons. This classical formalism avoids fancier mathematics involving commutation relations, and concepts such as time-ordered or contour-ordered Green’s functions. It will only involve “retarded” or causal Green’s functions, which help us solve a differential equation in the frequency domain. In the high-temperature limit both theories lead in principle to the same results. To model interfacial thermal resistance, typical considered geometries will be identical to a non-equilibrium molecular dynamics (NEMD) setup where the two ends of the system are attached to two thermostats at different temperatures, and one is interested in measuring the established heat flow as a result of the temperature difference (see Figure 1).The non-equilibrium anharmonic phonon problem has been addressed in the past by Mingo [4] and separately by Wang [5] in 2006. They used the many-body perturbation approach of non-equilibrium quantum systems based on the Keldysh formalism, (also called the Non-Equilbrium Green’s Function or NEGF method) and derived a lowest-order approximation for the transmission function. Dai and Tian [6] recently implemented this rigorous formulation to calculate the effect of cubic anharmonicity on the phonon transmission function through an ideal interface, applied to Si/Ge and Al/Al with two different masses. A similar model, which explicitly incorporates transverse momentum dependence, was also developed recently by Guo et al. [7], based on previous work by Luisier [8]. Polanco has a recent review of these methods based on NEGF [9]. These NEGF-based models, although fully quantum mechanical, do not include anharmonicity beyond cubic order nor any thermal expansion effects. Other calculations of the transmission in the non-equilibrium regime [10] based on the Green’s function method, have been based on the self-consistent reservoirs (also called Buttiker probe method), which was first proposed by Bolsterli et al. in 1970 [11]. In this method, as shown in Figure 1, every layer is connected, with a very weak coupling, to a fictitious probe at a given temperature with which it becomes into equilibrium. The probe temperature, which is also the assigned temperature of that layer, is obtained from the constraint that the net heat current from the system to the added fictitious probe should be zero. Note that the system could be out of equilibrium, so that a temperature is not really well-defined at a given layer. This shortcoming is also present in NEMD simulations where the assigned “temperature” of a layer is just the average kinetic energy of atoms in that layer, although there is no evidence of local thermal equilibrium. So even though non-equilibrium effects are included through this Buttiker probe method, it does not include anharmonicity. We should mention at this point that there is numerical evidence of absence of equipartition in the vibrational modes near the interface [12,13], implying that a definition of local temperature is not really justified near an interface. In another work, based on the MD simulation, which fully includes anharmonicity, Saaskilahti et al. [14] extended the harmonic formulation of the transmission function based on the actual MD trajectories by Chalopin et al. [15,16], to include anharmonic corrections. In the actual calculation, arguing that the anharmonic part of the current is usually small, they only used its harmonic formula but with velocities and positions coming from the full anharmonic atomic trajectories, in order to deduce the interfacial thermal conductance. The advantage of this approach over NEMD is that the heat current and thermal conductance can be decomposed in the frequency domain. Approaches based on MD trajectories, while including full anharmonicity, suffer from noise and would require a large number of simulations in order to perform proper ensemble averaging, whereas many-body approaches might be inaccurate if a perturbative expansion in powers of anharmonicity is used, but otherwise do not suffer from noise and treat ensemble averaging analytically. In this work, we try to overcome these limitations by adopting a non-perturbative many-body approach by fully including in the current the effect of anharmonic terms introduced in the Hamiltonian, without using Buttiker probes. Furthermore, using a classical method to derive an expression for the heat current, we argue that to leading order, it is necessary to include quartic terms in the Hamiltonian, in order to properly describe both the thermal expansion and the dominant temperature dependence effect in the heat current.
2. Dynamics
We start by defining our model and the assumptions. A multiprobe geometry is assumed as shown in Figure 1 in which a central region, also called the “device” is connected to many semi-infinite one-dimensional (1D) leads, playing the role of thermostats imposing a temperature at the boundaries of the system. We will not be concerned with temperature drops in the thermostats, which are assumed to be harmonic and follow Langevin dynamics. If needed, parts of the leads can be incorporated in the device region to illustrate the temperature drops near the interface. The device (D) Hamiltonian is a fourth-degree polynomial in powers of atomic displacements, and it is harmonically coupled to the lead is represented by Equation (2):
The dynamical variable refers to the displacement of atom i about the zero-temperature equilibrium position (the force on atom is zero for ). The leads labeled by are semi-infinite harmonic chains. After the standard change of variables to , and toand we arrive at the following equation of motion for atoms in the central region:
Note we have used capitalized Greek letters () for mass-rescaled force constants, and lower-case Greek letters () for the bare potential energy derivatives. The letter refers to the leads, and the dynamical variable can be thought of as an array containing the displacements of all the atoms in the central region (also called device), as the force constant matrix between such atoms, and as the force constant matrix connecting atoms of the lead to atoms in the device. Finally, is the anharmonic part of the force, which, for now, we keep as a for brevity. The dynamics of atoms in the lead is of Langevin type [17], where a set of identical coupled harmonic oscillators are subject to damping and noise . The equation of motion for atoms with displacements is as follows:
where the superscript T stands for transpose. Here again all atomic degrees of freedom are stored in the array . The force constants can be thought of as effective FCs at the temperature of interest, so that we do not need to introduce anharmonicity in the leads, which merely play the role of absorbing phonons from the device and reinjecting thermalized phonons into the device. We will proceed by eliminating the lead variables in Equation (3) using the Green’s function method. To this end, we start by taking the Fourier transform of the above two equations according to:
In the frequency domain, the equations of motion (3) and (4) become:
Note that the frequency-domain variables are represented with capitalized letters, and the transient part of the solution will be omitted as we are interested in the steady-state solution. Now that the differential equations are transformed to algebraic ones, one can easily proceed to eliminate the lead degrees of freedom in the main equation of motion by using Green’s functions. Let be the retarded (causal) Green’s function associated with the lead . The positivity of the damping factor insures causality. The solution to Equation (7) after the transients have decayed to zero, can be written as:
where
In Equation (6) we need , which we can obtain from Equation (8):
Here , and . Likewise, defining the retarded Green’s function of the central region as , we can write the solution to the central region as:The function is traditionally called the self-energy of lead , and shows the effect of this lead on the spectrum of the device, which is given by the poles of G. Its real part provides a correction to the eigenvalue spectrum , and its imaginary part, divided by , gives the inverse lifetime of an excitation of the central region caused by interactions with the lead. It is the rate at which the excitation leaks into the lead. Omitting the transient contribution of the initial conditions, this is the solution to the equations of motion, which depend on the stochastic functions .
3. Physical Observables
The equations of motion we derived are deterministic for every realization of the random forces. To simulate the real thermodynamical behavior of the baths, so that a temperature can be assigned to them, one needs to perform an “ensemble” average, denoted by over all realizations of the forces subject to the constraint imposed by the fluctuation–dissipation (FD) theorem [17]:The noise is white and different sites of leads are uncorrelated with each other. Physical observables are then obtained after an ensemble average is performed over forces. This is where irreversibility is introduced in this deterministic formalism, as a result of which, entropy is generated in the device. The latter can be understood as the log of the distribution function of the device as the random forces are varied within the constraints imposed by the FD theorem.
3.1. Entropy Generation Rate
An important quantity of interest is the entropy generation rate in the device, which can be expressed as: . This quantity results from the interaction between the device degrees of freedom and the random noise due to Langevin thermostats. It involves the thermostats temperature and the incoming currents which will be discussed next.
3.2. Heat Current
The main quantity of interest is the heat current. The heat from lead can be defined as the net rate at which energy is flowing into the device from that lead. It is the work performed from lead on the device’s degrees of freedom per unit time, which is the product of the velocity degrees of freedom of the device times the force from lead acting on them: , where the trace is taken over the device degrees of freedom. Substituting the expressions for x and from Equations (10) and (11) we can obtain the final expression of the current in the frequency domain. Since the current depends on the stochastic functions , we will take its ensemble average to find the response of the system to an applied temperature difference. We will start by taking the Fourier transform of and call it . Note due to time integration, we must have , where is the time integration window. Finally the static DC current is given by:
where we used the notation for twice the imaginary part of the lead self-energy. The DC response is found by taking the limit. Note that because we are interested in the DC response, only diagonal terms of correlations () are needed here. Thus the calculation of the heat current is reduced to the calculation of the two correlation functions and the so-called lead self-energy , followed by a frequency integration. Note that this current from lead is the sum of two terms: the first one, proportional to , is the work per unit time of the stochastic forces on the device, while the second one , proportional to a displacement autocorrelation, is the work of the lead dampers trying to reabsorb some of the excess energy injected from the device in order to re-establish thermal equilibrium in the lead.The average denoted by is an average over the stochastic forces in the thermostats that have white noise characteristics. When it is performed over the device degrees of freedom, the leads being at different temperatures, it becomes a non-equilibrium average and can only be calculated using the equation of motion (11) and the statistical properties of the Langevin thermostats, i.e., the fluctuation–dissipation theorem. For anharmonic interactions involving higher powers of displacements in A, the calculation of current will lead to a hierarchy of equations, each containing higher powers of displacements, and has so far been computed using different approximations [4,5,18].
4. Constraints
Before proceeding to the calculation of the correlation functions, we recall two constraints that the heat current needs to satisfy. The first is the detailed balance relation, which states that if all leads are at the same temperature T, the net current should be identically zero for all leads . The second constraint is that of current conservation, which in steady state , and under no additional heat generation in the device, reduces to . This is also known as the Kirchhoff’s law in the context of electrical circuits. Note that AC components of the current need not satisfy this constraint as they reflect the information on transient currents during the relaxation process, and depend on the heat capacity of the system. Any physically correct description of transport should exactly satisfy these two constraints.
5. Thermal Expansion
As the temperature of a system is raised, there can be thermal expansion due anharmonicity. The equilibrium position of the atoms is shifted, and this will also cause a change in the force constants as bond lengths have changed. To take these effects into account, while simplifying the notations, we will slightly modify the formalism as follows: the displacement variable X is changed to or which has zero average by construction. The resulting nonlinear equations satisfied by are derived by taking the average of the equation of motion (11) or equivalently setting the average force on each atom to zero (see Appendix B for more details).
The resulting equations will depend on correlations such as , and higher powers. Accordingly, the potential energy derivatives will be evaluated at the zero of Y, and will be denoted with a bar sign on top of them ( etc…). While the variable X satisfies the equation of motion:
the new variable Y satisfies:Next we will linearize the forces with respect to y:
where we have set to define the thermal expansion (see Appendix B). As we will show, the effect of temperature will be to renormalize the FCs, not only through thermal expansion but also due the thermal fluctuations as we will show using the non-equilibrium mean-field approximation (NEMF) also detailed in Section 6. Next, we define a renormalized Green’s function using renormalized force constants as
With this GF, the displacements Y satisfy
Note one can add any constant to the force constant in the above GF, provided is also added to the anharmonic force A. We will make use of this freedom in the next section to further simplify the formalism.
6. Force Constant Renormalization
Given the form of the above equations, we can add to A and add to so that now the renormalized GF becomes:
while the renormalized anharmonic force now becomes in the right hand side of Equation (16). This renormalization of harmonic force constants captures a major part of anharmonicity (because ), and is in spirit very similar to the lowest-order self-consistent phonon theory, which in the past has been applied to equilibrium systems. The advantage of this renormalization is that, as we will see, the lowest anharmonic correction in disappears by construction since , leading to the smallest variance and higher moments of .We will refer to this choice of the reference GF as the Non-equilibrium mean-field approximation (NEMF). With this choice, the equation of motion for Y becomes:
The Feynman diagram associated with the new Green’s function is shown in Figure 2.
Figure 2
Feynman diagram associated with represented by thick dashed lines. The phonon Green’s function is shown with thin solid lines, and the quartic vertex with the solid square. The thick line with opposite arrows represents the displacement autocorrelation . Internal frequency is integrated over.
7. Displacement-Noise Correlations
One can see from Equation (13) that the calculation of the heat current requires the calculation of the displacement-noise correlation and displacement autocorrelations C. We proceed to the calculation of these quantities first within the harmonic approximation, and then in the presence of anharmonic forces of the form .Let us start with the noise autocorrelation, which will appear in the calculation of displacement-noise correlation . In the frequency domain, using the fluctuation–dissipation theorem Equation (12), one can derive (see Appendix C.2):
where, for brevity, we have replaced the “occupation factor” by , and represents the integration time which goes to infinity and cancels the in the expression for the current . With a Factor of ℏ in the denominator of , the latter would be a real Bose-Einstein occupation factor taken to the classical limit.In the case of a white noise, we show in Appendix C.2 that the result will not depend on the thermostat damping parameter , and thus we adopt this type of noise for the thermostats.Next, we need to calculate displacement-noise correlations: . Since is a velocity, is the power exerted by the random force on the device and therefore can be interpreted as the heat injected per unit time and unit frequency (mode) from lead α into the device.In the harmonic case (), using the equation motion (18), this expression is simplified to:
For non-zero anharmonicity, the three GFs are different, and adopting as the reference GF, we have the NEMF approximation to the displacement-noise correlation function as:To go one step further and include the effect of anharmonicity in , we will use the Novikov–Furutsu–Donsker identity [19] (for a proof also see Appendix D), which states that for any functional of the white noise , we have
It has the advantage of lowering the powers of in f. Using this theorem, we have: .From the equation of motion Equation (18) and the chain rule, we find
so that finally,Note that, by construction, the first term involving is identically zero. This justifies the choice of rather than as the reference Green’s function.In the language of many-body theory, this is the expanded form of Dyson’s equation involving thermal average of powers of the anharmonic force derivatives. This exact result cannot be calculated, because the average of the inverse of powers of Y cannot be calculated exactly, and instead one may add the power series term by term provided the sum is convergent.This is one of the main results of this paper, providing a more accurate and likely convergent expression for the displacement-noise correlations, and leading to a simple calculation of heat currents if cubic terms are to be neglected (using the NEMF reference Green’s function ).The next-order correction consists in adding the effect of anharmonicity to second order, i.e., writing the displacement-noise correlation function as:
with and . We can note that the major part of the quartic anharmonicity has been removed, and the expansion is in powers of which is centered at zero and has therefore the smallest higher moments. When raised to the second power in the above formula for , cubic and quartic terms become decoupled if we neglect averages of terms of odd power in Y which are expected to be small if non-zero. At low-temperatures or weak anharmonicity, the dominant contribution to the second-order terms can be written as:
An explicit form of this equation is provided in the following section and in Appendix C.3 Equation (A17).
8. Displacement Autocorrelations
Finally, the last correlation function needed is . Note the autocorrelation matrix C is Hermitian. The function can be interpreted as the (non-equilibrium) number of excitations of frequency present in the device due to its contact with the leads. If all lead temperatures are equal, we recover the equilibrium occupation times the total DOS, , which is the total (equilibrium) number of excitations in the device. In the heat current, this term appears as and can be interpreted as the heat current going from (because of the negative sign) the device into this lead as is the escape rate into the lead .Similar to the treatment of , we will start with the equation of motion, Equation (18), and the FD theorem, Equation (19), to write C as:The first term is the NEMF contribution and is reduced to a known result, with now the renormalized Green’s function being used instead of the standard harmonic one (G or ), in order to include thermal effects to some extent:
The NEMF approximation, which consists in using and neglecting the contributions of anharmonicity included in , is very similar to the harmonic approximation. Within this approximation, where and , the transmission becomes , very similar to the harmonic result (see Appendix E), but with the GF substituted by the renormalized , and includes the effect of (quartic) anharmonicity to the lowest order.Linear terms in lead to terms similar to , which has already been discussed. The only remaining difficulty is with the terms, which do not explicitly contain any noise term, but have higher powers of displacements. Such terms can only be calculated approximately as the use of the equations of motion will involve higher powers of displacements. To have a second-order approximation consistent with that used for in Equation (26), we have to use:It can be shown that this approximation, taken to a self-consistent level satisfies current conservation, meaning , however the spectral components of the current: do not necessarily lead to zero when summed over all leads. Including for completeness both the cubic and quartic components of the anharmonic forces to second order, the self-consistent set of equations to be solved with the reference Green’s function are:
The equations for and C can also be represented using Feynman diagrams as shown in Figure A1 and Figure A2 in Appendix C.3 and Appendix C.4. More explicit forms of these equations are also reproduced in this appendix as Equations (A17) and (A18).
Figure A1
Feynman diagrams associated with up to second order in anharmonic forces. Dashed lines represent the phonon Green’s function , the circle represents , the triangle represents the third-order vertex and the square the fourth-order vertex . The thick line with opposite arrows represents the displacement autocorrelation . Frequency must be conserved at each vertex.
Figure A2
Feynman diagrams associated with up to the second order in anharmonic forces. Conventions are the same as in Figure A1.
Starting inputs for C and could be their NEMF values in the right-hand sides of the above equations, and the latter can be solved iteratively until convergent. Note that while the term requires , the terms C require itself, but these equations contain only second powers of and . Once iterations converge, the obtained C and functions can then be inserted in Equation (13) to compute the heat currents from each lead.These equations would be the same as the ones obtained from the many-body non-equilibrium Keldysh formalism with the difference that the “occupation factors” are classical ones, instead of Bose–Einstein functions. In this sense, they can directly be compared to results from classical non-equilibrium MD simulations, which are exact in anharmonicity but have inherent statistical noise in them.Another interesting feature to note is the increase in the overall conductance with the temperature (if is held small). This is in agreement with previous MD simulations [13,14].
9. Conclusions
To summarize, we developed a self-consistent approximation for transport in anharmonic systems out of equilibrium in the high-temperature (classical) regime. There is therefore no factors of ℏ in the formalism and is to be interpreted as frequency only, not energy. Although the set of derived equations for the current Equation (13) the equation of motion Equation (18), and the Equations (24) and (27) defining the correlation functions, were formally exact, one has to develop approximations to solve the Dyson’s Equation (24) and the Equation (27) defining C. One, because the anharmonic force is an infinite Taylor expansion and is usually truncated, and two, because its derivative appears in the denominator of Equation (24), which cannot be exactly inverted. In this work, we truncated the Taylor expansion of up to quartic terms and only included up to second powers of and in Equations (24) and (27).We showed that thermal expansion needs to be included using both cubic and quartic terms (to avoid any divergence) and it has the effect of renormalizing FCs as T is increased. The reference GF to work with, , has two corrections: one due to thermal expansion implying changes in bond length and strength ( instead of ), and the other due to thermal fluctuations about the average position (, which usually involves the quartic term and the autocorrelation C), similar in spirit to the self-consistent phonon theory, except that one is not at thermal equilibrium. This is the leading-order anharmonic correction, and cubic anharmonicity contributes to second-order correction terms as shown in the self-consistent equations (29).Non-equilibrium averages were possible to calculate with the use of the fluctuation–dissipation theorem (Equation (19)), the equations of motion (Equation (18)) and the NFD theorem (Equation (22)).An alternative approach to investigate non-equilibrium effects at and near interfaces would be to perform a non-equilibrium molecular dynamics simulation (NEMD) of the system attached to thermostats at different temperatures and sample the atomic trajectories in the phase space to find the distribution functions and the position averages—this, however, has inherent noise in it.One way to extract the effective force constants is to fit from the knowledge of the forces on atoms and their positions in each MD snapshot, the forces to a linear model in order to extract the effective (non-equilibrium) harmonic force constants . The remainder can then be defined as the anharmonic force: , and the present results may be used. Despite the approximations used in this work, the advantage of this formalism over MD simulations, which includes anharmonicity to all orders, is that it is analytical and therefore fast and free of simulation noise, although reaching self-consistency can be challenging for some model systems. It would be desirable to make a comparison of the results with NEMD to validate these approximations for a given system. The accuracy also relies on the force field and strength of higher-order terms: sources of divergence would be in the denominator of Equation (24) if , signaling resonances, in which case the Taylor expansion in Equation (24) is not appropriate.Applications to nanoscale systems will appear in future publications.