Evolution of Energy Equipartition and the Boltzmann Distribution Law.

Introduction

            This animation shows un-ambiguously that the evolution of the Boltzmann-like energy equipartition and final exponential energy distribution stems from the dynamics of collisions.

            In this document, I first give the standard macroscopic Boltzmann treatment of the energy distribution of atoms of different mass.  Then I describe just what and how the program computes its several different types of plots.  Finally I show the mathematics of the velocity changes when two atoms of different mass and arbitrary velocities collide.  Then I compute the average atom energies after collisions from those before collisions.

 

Standard Thermal Physics Treatment

            The Boltzmann probability law for atom kinetic energy, E, is:

                                           (1)

where N0 is a normalization constant, m is the mass of the atom, k is Boltzmann's constant, v is the speed of the atom, and T is absolute temperature.  It turns out, for the case of hard spheres and for real atoms, such as the noble gases that behave like hard spheres, that the E dependence of equation 1 is wrong (see equation 26 with a=1/2 for 1 dimension and a=3/2 for 3 dimensions) for both 1 and 3 dimensions.   For the important physical case of 3 dimensions the animations show that the appropriate equation for energy distribution is

 as given in the above reference.

What the Animation Program Computes.

1. Energy Distribution N(E)

            The program uses the results for velocity changes that are given in the next section to compute the kinetic energies of all of the atoms at each time increment.  These energies are used to compute a bin number, b

where Emax is several times larger than the average energy of all the atoms and nbins is the number of energy bins that we use.

When an atom has bin number b, an integer array component, iEb, is incremented by an the integer 1.  After all of the atoms have been polled, iE contains the distribution of number of atoms Vs atom energy, N(E).  But, since we use 80 bins and we are limited to only about 1200 atoms, that would result, if N(E) is constant, in an average bin population of 15.  The variance of 15 is about 4 and this would result in very erratic N(E).  What we really want to know is the relative average occupancy of the energy bins. To obtain the final relative occupancy of the bins, it is acceptable to keep incrementing iEb after each time interval.  If we do this for a very large number of time increments, the initial transient changes of N(E) will be "averaged out".  It is equivalent to taking many "snapshots" of the atomic energy distribution and averaging the results for each bin.  Then what is plotted as a histogram is

             

where t is the snapshot number and T is much greater than one. 

            The animation makes separate histograms for red and blue atoms.  In addition, a least squares fit of an exponential function (the Boltzmann distribution) to the data in the histogram is made.  The relative occupancy number (RON) equation is:

where <E> is the average energy of the entire atom ensemble just as expected for a Boltzmann distribution.  The animation shows beyond doubt that the RON equation has the same form and constants for red and blue atoms regardless of their masses, numbers and initial energies.

Figure 1: Diagram for d=2 dimensions of the animation after evolution was reasonably complete with an average energy <E>=3.0.  Note that the green and purple exponential plots are a good fit to the red and blue atom numbers per bin histogram with correlation coefficients of 1.0.  Also note that the exponential coefficients were very close to -1/3.0 as expected for a Boltzmann distribution.

2. The Average Energy Bin Occupancy Time t(E)

            The bin occupancy at low energy is expected to be longer than that at high energy and that asymmetry feeds into the exponential distribution of N(E).  To compute the bin occupancy time, for each atom pair, c and d, that achieve the scatter condition, we record the time at which this occurs, t0c and t0d.  Then, at the next time, t1, that the scatter condition occurs, we take these differences t1c-t0c and t1d-t0d and increment a bin-based array by these numbers for each atom of the scattering pair.  Since we want to have the average time between scatterings, we also increment a similar bin-based integer array by 1 for each of the pair. To get the average we use the fraction

When a least squares fit is done, this results in another exponential of the following form with respect to energy:

 

where t0 is the average occupancy time at E=0 and <E> is the average energy of the entire ensemble of atoms.

Figure 2: Diagram of the animation showing, in orange, the relative energy level occupation times Vs E. The smooth orange curve is a least squares fit to the occupation time.  The coefficient is -1/(twice the square root of <E>) which in this case would be -1/3.5.

3. The Average Energy Change per Scatter

            Although it is not included as a program option, I have separately computed the average energy change as a function of initial energies.  The qualitative result is with initial energies above about 2<E> the energy change is negative while below initial energies of 2<E> the energy change is positive.

 

Figure 3: Diagram showing, in orange, the average energy change per scatter event. The zero of the energy change is the x axis of the plot as shown on the left.  <E>=3 is clearly marked.  At initial energy levels above about 2<E> the energy change is negative while at energy levels below about 2<E> the energy change is positive.  The energy change could probably be fit to a gaussian of the same form as that for the average occupation time.

3. The Energy History of 11 Atoms

            In order to get a better understanding of how the Boltzmann exponential distribution evolves, I chose 11 atoms with initial energy near the top of the initial energy spectrum and followed their energy history for 600 time increments.  The result is shown in Figure 4.

Figure 4: Energy histories of 11 atoms.  The traces representing the energies are colored by colors from the visible spectrum.  It is seen that the variance of the energies is quite large.  However, what is important to notice is that the occupancy time, represented by vertical trace excursions, is longer at low energies than high energies. This difference gives rise to the energy-dependent occupation time plot shown in Figure 2.  A plot, similar to Figure 2, of atoms per energy bin is also shown in orange.

Mathematics of Hard Sphere Scattering

Atom-Atom Collisions of Different Mass and Velocity

            Here we will consider spherical atoms which have the different masses, m1 and m2, and diameters, D1 and D2.  The centers of the spheres will be labeled (x1,y1,z1) and (x2,y2,z2).  Upon collision, the momentum transferred between the spheres will always be along the unit vector between their centers:

                                                       (2)

where

is the distance between centers.  Since the animation is illustrated in only 2 dimensions, the collision analysis will assume a containing box that is large in the x and y dimensions but very thin in the z dimension. The following vector mathematics is correct for either 2 or three dimensions.

The expression for the final momenta in terms of the initial momenta is:

                                                                            (3)

where the apostrophe on the left side of the equations indicates the final velocities.  We know that the energies are conserved so

                                                                           (4)

The directions of the change in momenta are along the vector of centers, u, and the values of the changes of momenta must be equal and opposite.

                                                                   (5)

where M has units of mass and is still to be determined.  Using equation 5 in equation 3:

                                                                         (6)

Now we can use equation 6 in equation 4 to solve for the value of Mdv.

  (7)

where the large dot stands for the dot product and equation 7 simplifies to:

                          (8)

We can now make the identification:

                                                                          (9)

where M is known as the "reduced mass".

Equations 6 and 8 are a complete solution for the final momenta. The final velocities are computed by dividing both sides of equations 6 by their respective masses:

                                           (10)

Suppose m2>m1.  Then we see that the magnitude of the speed added to molecule 1 will be larger than the magnitude of  the speed removed from molecule 2.  We can easily see from equation 7, even though the averages of the dot products are zero,  that, on average, the collision results in an increased kinetic energy for atom 1 and a decreased kinetic energy for molecule 2 because of the mass term in the denominators.  

Summary of Results for Average Energies After Collision

After a collision with initial energies E10 and E20 we obtain the following results:

(11)

 

When the above result is averaged over all angles between u and vi that actually lead to a collision i.e. u.(v2-v1)<0 we obtain:

                   (12)

                (13)

 

 

 

Summary of Average Energy Results for m1=m2=m

 

When m1=m2=m so that M=m the results for average E, <E> , are easily seen from equations 12 and 13 to be:

                                                                                         (14)

With a little more difficulty the results for the variances of E, <DE>,  are given by the following equations:

                                       (15)

Discussion of Scattering for Equal Mass Atoms.

So, what really happens when the masses are equal, is that the energies redistribute themselves into a gaussian-like pattern with the gaussian width greater when the energy differences are greater.  Of course, the minimum outcome for any energy is always zero, so the distribution tends to become biased toward its E=0 end. 

 

If we start with mono-energetic atoms, E0, so that Etotal=2E0, then the first variance will always be 0.250E0.  A fraction of these scattered atoms will have energy 2E0 and an exactly equal fraction will have zero energy.  The next scattering, when with atoms with energies E0, can result in energies from zero energy up to 3E0.  When the next scattering is between 2 atoms that have energy of 2E0 the final energy can be as large as 4E0 but that is a very unlikely event.  Of course, since neither is moving, atoms with zero energy don't scatter with other atoms of zero energy so these latter remain at zero energy.  In fact the number of scatterings per second depends on the energy of the atoms, so this makes  all the lower energy atoms less likely to scatter with similar low energy atoms.

We could write the following approximate differential equation for the rate of scattering of atoms

where n is the atom density, vr is some average relative speed, and s is the collision cross section.  Thus the rate of change of energies via scattering depends on the sum of the velocities of the two atoms involved.  This is one of the mechanisms that I think would cancel out the usual tendency for the  atoms to form a symmetric gaussian distribution centered at the average energy.  The other mechanism is that, while any energy higher than the average energy is possible, the minimum energy is always zero so a lot of atoms tend to accumulate near the zero end of the number Vs energy distribution.

            We can also say that there are few high energy atoms because they scatter quite often so that they usually get their energies degraded by scattering with lower energy atoms and become "thermalized".