BOSE-EINSTIEN DISTRIBUTION
Suppose we have a number of energy levels, labeled by index

, each level having energy

and containing a total of

particles. Suppose each level contains

distinct sublevels, all of which have the same energy, and which are
distinguishable. For example, two particles may have different momenta,
in which case they are distinguishable from each other, yet they can
still have the same energy. The value of

associated with level

is called the "degeneracy" of that energy level. Any number of bosons can occupy the same sublevel.
Let

be the number of ways of distributing

particles among the

sublevels of an energy level. There is only one way of distributing

particles with one sublevel, therefore

. It is easy to see that there are

ways of distributing

particles in two sublevels which we will write as:

With a little thought (see
Notes below) it can be seen that the number of ways of distributing

particles in three sublevels is

so that

where we have used the following
theorem involving
binomial coefficients:

Continuing this process, we can see that

is just a binomial coefficient (See
Notes below)

For example, the population numbers for two particles in three
sublevels are 200, 110, 101, 020, 011, or 002 for a total of six which
equals 4!/(2!2!). The number of ways that a set of occupation numbers

can be realized is the product of the ways that each individual energy level can be populated:

where the approximation assumes that

.
Following the same procedure used in deriving the
Maxwell–Boltzmann statistics, we wish to find the set of

for which
W
is maximised, subject to the constraint that there be a fixed total
number of particles, and a fixed total energy. The maxima of

and

occur at the value of

and, since it is easier to accomplish mathematically, we will maximise
the latter function instead. We constrain our solution using
Lagrange multipliers forming the function:

Using the

approximation and using
Stirling's approximation for the factorials

gives

Where
K is the sum of a number of terms which are not functions of the
ni. Taking the derivative with respect to

, and setting the result to zero and solving for

, yields the Bose–Einstein population numbers:

By a process similar to that outlined in the
Maxwell-Boltzmann statistics article, it can be seen that:

which, using Boltzmann's famous relationship

becomes a statement of the
second law of thermodynamics at constant volume, and it follows that

and

where
S is the
entropy,
μ is the
chemical potential,
k is
Boltzmann's constant and
T is the
temperature, so that finally:

Note that the above formula is sometimes written:

A derivation of the Maxwell–Boltzmann distribution
Suppose we have a container with a huge number of very small
identical particles. Although the particles are identical, we still
identify them by drawing numbers on them in the way lottery balls are
being labelled with numbers and even colors.
All of those tiny particles are moving inside that container in all
directions with great speed. Because the particles are speeding around,
they do possess some energy. The Maxwell–Boltzmann distribution is a
mathematical function that speaks about how many particles in the
container have a certain energy.
It can be so that many particles have the same amount of energy
εi. The number of particles with the same energy
εi is
Ni. The number of particles possessing another energy
εj is
Nj. In physical speech this statement is lavishly inflated into something complicated which states that those many particles
Ni with the same energy amount
εi, all occupy a so called "energy level"
i .
The concept of energy level is used to graphically/mathematically
describe and analyse the properties of particles and events experienced
by them. Physicists take into consideration the ways particles arrange
themself and thus there is more than one way of occupying an energy
level and that's the reason why the particles were tagged like lottery
ball, to know the intentions of each one of them.
To begin with, let's ignore the degeneracy problem: assume that there is only one single way to put
Ni particles into energy level
i .
What follows next is a bit of combinatorial thinking which has little
to do in accurately describing the reservoir of particles.
The number of different ways of performing an ordered selection of one single object from
N objects is obviously
N. The number of different ways of selecting two objects from
N objects, in a particular order, is thus
N(
N − 1) and that of selecting
n objects in a particular order is seen to be
N!/(
N −
n)!. The number of ways of selecting 2 objects from
N objects without regard to order is
N(
N − 1) divided by the number of ways 2 objects can be ordered, which is 2!. It can be seen that the number of ways of selecting
n objects from
N objects without regard to order is the binomial coefficient:
N!/(
n!(
N −
n)!). If we now have a set of boxes labelled
a, b, c, d, e, ..., k, then the number of ways of selecting
Na objects from a total of
N objects and placing them in box
a, then selecting
Nb objects from the remaining
N −
Na objects and placing them in box
b, then selecting
Nc objects from the remaining
N −
Na −
Nb objects and placing them in box
c, and continuing until no object is left outside is

and because not even a single object is to be left outside the boxes, implies that the sum made of the terms
Na, Nb, Nc, Nd, Ne, ..., Nk must equal
N, thus the term
(N - Na - Nb - Nc - ... - Nl - Nk)! in the relation above evaluates to
0! which makes possible to write down that relation as

Now going back to the degeneracy problem which characterize the reservoir of particles. If the
i-th box has a "degeneracy" of
gi, that is, it has
gi "sub-boxes", such that any way of filling the
i-th
box where the number in the sub-boxes is changed is a distinct way of
filling the box, then the number of ways of filling the
i-th box must be increased by the number of ways of distributing the
Ni objects in the
gi "sub-boxes". The number of ways of placing
Ni distinguishable objects in
gi "sub-boxes" is

. Thus the number of ways
W that a total of
N particles can be classified into energy levels according to their energies, while each level
i having
gi distinct states such that the
i-th level accommodates
Ni particles is:

This is the form for
W first derived by
Boltzmann. Boltzmann's fundamental equation

relates the thermodynamic
entropy S to the number of microstates
W, where
k is the
Boltzmann constant. It was pointed out by
Gibbs however, that the above expression for
W does not yield an extensive entropy, and is therefore faulty. This problem is known as the
Gibbs paradox The problem is that the particles considered by the above equation are not
indistinguishable. In other words, for two particles (
A and
B)
in two energy sublevels the population represented by [A,B] is
considered distinct from the population [B,A] while for
indistinguishable particles, they are not. If we carry out the argument
for indistinguishable particles, we are led to the
Bose-Einstein expression for
W:

Both the Maxwell-Boltzmann distribution and the Bose-Einstein
distribution are only valid for temperatures well above absolute zero,
implying that

. The Maxwell-Boltzmann distribution also requires low density, implying that

. Under these conditions, we may use Stirling's approximation for the factorial:

to write:

Using the fact that

for

we can again use Stirlings approximation to write:

This is essentially a division by
N! of Boltzmann's original expression for
W, and this correction is referred to as
correct Boltzmann counting.
We wish to find the
Ni for which the function
W is maximized, while considering the constraint that there is a fixed number of particles

and a fixed energy

in the container. The maxima of
W and
ln(W) are achieved by the same values of
Ni
and, since it is easier to accomplish mathematically, we will maximize
the latter function instead. We constrain our solution using
Lagrange multipliers forming the function:

![\ln W=\ln\left[\prod\limits_{i=1}^{n}\frac{g_i^{N_i}}{N_i!}\right] \approx \sum\limits_{i=1}^n\left(N_i\ln g_i-N_i\ln N_i + N_i\right)](https://lh3.googleusercontent.com/blogger_img_proxy/AEn0k_tCpoEIRqfFqHbIFwLzmMDQ-jpnm8gL2SoCGTF59h83ka8ILMYJc5g7mAgqYmLYaW7uHH_zAwxB0TXSliVfdpSES9gMKqjeF4mF0Hyi3yv9nKxNCojUEERTNOxrUIwbDgz8czqDAW2PYRCplsRS3r8KRPDjG3PkTn-ETcQ=s0-d)
Finally

In order to maximize the expression above we apply
Fermat's theorem (stationary points), according to which local extrema, if exist, must be at critical points (partial derivatives vanish):

By solving the equations above (

) we arrive to an expression for
Ni:

Substituting this expression for
Ni into the equation for
ln W and assuming that

yields:

or, differentiating and rearranging:

Boltzmann realized that this is just an expression of the
second law of thermodynamics. Identifying
dE as the internal energy, the second law of thermodynamics states that for variation only in entropy (
S) and particle number (
N):

where
T is the
temperature and μ is the
chemical potential. Boltzmann's famous equation

is the realization that the entropy is proportional to
ln W with the constant of proportionality being
Boltzmann's constant. It follows immediately that
β = 1 / kT and
α = − μ / kT so that the populations may now be written:

Note that the above formula is sometimes written:

where
z = exp(μ / kT) is the absolute
activity.
Alternatively, we may use the fact that

to obtain the population numbers as

where
Z is the
partition function defined by:

Consider a system of particles with allowed energy levels

. Let

be the number of allowed states at energy

, and let

be the actual number of particles at energy

. The
values

and

are fixed, and the values

are random according
to the particular arrangement of electrons. Two important physical
parameters are the total number of particles,

, and the
total energy of the system,

.
We impose the following hypotheses:
Given a specified

, then the
conditional probability that a state at energy level

is occupied is

since exactly

out of

states
are filled; this is a consequence of hypotheses
1 and
3. By hypothesis
2, the probability

that
the electron distribution is specified by

is given by the ratio of

, the number of such arrangements, to

, the
total number of possible arrangements. By the total probability theorem:
Thus,

is obtained as a weighted average of the
occupancy distribution conditioned on the various allowed values of

.
Hypothesis
4 is valid as long as the number of energy levels,
states and particles is sufficiently large. This hypothesis permits us to
approximate the weighted average formula (
1.5) for the
(unconditional) occupancy distribution very well with the term corresponding
to the most likely arrangement:

. Thus, we need only determine the most likely arrangement and
study its properties in order to accurately characterize the system in
general.
Our task now is to compute

, and to find

which maximizes it. The number of ways to place

indistinguishable
particles in

states (hypothesis
3), with no more than one
particle in a single state (hypothesis
1), is:
and therefore:
To simplify the algebra to follow, instead of maximizing

, we maximize

; since the logarithmic
function

is strictly monotonic increasing, the
maxima of
W and

occur at the same point

. Thus:
We now invoke Stirling's approximation formula: for large
n,

, where the approximation is valid in the sense that the
ratio of both sides approaches 1 as

. Thus,
We apply this approximation:
We now maximize (
1.10) subject to the constraints that

and

are fixed, and that

. We impose

and

using the methods of Lagrange multipliers: find

and

such that the function

:
is maximized. We first locate critical points where

for all
i are

,

. The last two simply return the constraints imposed by

and

. To compute

, first note that

. Thus:
Note that

involves only

, and not other

's; the significance of the introduction of the Lagrange multipliers is that
the parameters

and

that appear in (
1.12) are
the same for all indices i; specifically, it is important to emphasize
that
they do not depend on the energy level 
.
Letting

denote the value of

at which

, we obtain:
and thus:
for some
constants
which do not depend on 
.
As a final remark, we often write

instead of

, thus suppressing the discrete nature of the allowed energy
levels. This corresponds to either approximating the range of discrete
levels with a continuum, considering the small energy difference between the
allowed levels, or to extending the results of this discrete analysis to the
case where a continuum of energy levels in indeed allowed.