Mutual Information and Channel Capacity

Definition:

In the setting of the previous lecture, given two jointly distributed Finite Random Variables $\QTR{Large}{X}$ $\QTR{Large}{:}$ and $\QTR{Large}{Y}$ $\QTR{Large}{:}$ their Mutual Information is defined as follows:

There is no minus sign!
If $\QTR{Large}{X\ }$ and $\QTR{Large}{Y\ }$ are Independent since There is no Mutual Information, Example 2 from the previous section.
For a noiseless Channel

Since

MATH

and essentially the same calculation for . All Information is Mutual.

_____________________________________________________________________________________________

Theorem:

Proofs:

These are all simple variants of the definition, the calculation in the second bullet above and the material in the previous lecture. For example,

MATH

_____________________________________________________________________________________

One might read , the Mutual Information as:

the average information about the character received $\QTR{Large}{\ }$ after the transmission noise has been removed.
the average information about the character sent $\QTR{Large}{\ }$ after the Bayesian noise has been removed.
The information in after a copy of the joint information has been removed, the Mutual Information getting counted twice

_____________________________________________________________________________________

The Extreme Cases, Yet Again:

We have Random Variables $\QTR{Large}{T\ \ }$ and and {A,B,..,Z}

The two special cases to be considered are :

, error free transmission.

and

, All of the information is in what is transmitted.
for all $\QTR{Large}{t}$ and $\QTR{Large}{r}$ , total noise

, ,

Since $\QTR{Large}{R}$ and $\QTR{Large}{T}$ are independent.there is no mutual information.

________________________________________________________________________________________

Definition:

$\ \ \hspace{1in}$ for a given channel , the Channel Capacity, $\QTR{Large}{C\ }$ is defined by the formula

For the example of a Binary Symmetric Channel, since and is constant. The maximum is achieved when is a maximum (see below)

Exercise (Due March 7) : Compute the Channel Capacity for a Binary Symmetric Channel in terms of $\QTR{Large}{p}$ ?

_______________________________________________________________________________________________________

Theorem:

If the values of each row of a Channel Matrix , $\QTR{Large}{M}$ , are a permutation of the values in any other row then for any $\QTR{Large}{i.\ }$ In particular

if a Channel is Symmetric then is independent of the input probability vector
If the values of each column of a Channel Matrix , $\QTR{Large}{M}$ , are a permutation of the values in any other column then
If a Channel is Symmetric:

The Channel Capacity for any $\QTR{Large}{i}$

.

Since for any