Image Compression (PCA)

In the old age where a diskette (ha! hipsters can relate) is equivalent to today’s USB in terms of usefulness, Storage was once a problem, and in finding a way to solve this, they have come across what is currently known as image compression, which is still being used today (hello JPEG). In this activity, we show how to compress images using Principal Components Analysis (PCA).

Basically what PCA does is that it converts a set of correlated signals into orthogonal functions whose sums (with their respective coefficients) can reconstruct any of the signals. (I do not want to indulge in here. :PPP) If we have a set of functions, f1, f2, f3, …. , fn, then we can obtain a set of orthogonal functions W1, W2, W3… Wm such that:

$f_i = \sum\limits_{n=1} ^{m}a_n,_i W_n$

Where $a_n,_i$ is the coefficient for the function $W_n$ . The functions $W_n$ are called the principal components. However, each $W_n$ has a different percent contribution to the reconstruction of the image. In Scilab, $W_n$ and $a_n$ can be obtained using a single line of code. . If X is the matrix containing the signals (each column being an individual signal), the principal components and the coefficients can be obtained by:

$[lambda, facpr, comprinc] = pca(X);$

where facpr is the matrix containing the principal components and comprinc the matrix containing the coefficients. Lambda gives the percent contribution of each principal components to the signals.

In image compression, what we do is that we cut our image into $NxN$ parts. Then we convert each part to a single matrix with size $1 x N^2$ . Each $1 x N^2$ matrix will be one signal. Then we concatenate all the signals into one matrix which will be input to the pca. For this activity, we use $N = 10$ .

There will be $N^2 = 100$ principal components. What we will then do is try to reconstruct the images using the principal components. Because each principal component have different percent contribution, we will investigate what will happen to the reconstruction if we use only n principal components (0<n<100).

We use a picture of a cat (well its a tiger but…. still a cat).

A cat.

In the figure below, we show the difference of the original picture (in grayscale) to the reconstruction images.

The topleft figure is the grayscale image of the cat. The top center is for n=1, top right is for n = 5, bottom left is n = 10, bottom center is n = 50 and bottom right is n = 100.

From the figure above, we can see that the higher the number of principal components used, the better the quality. However, it can be observed that for n = 50 and n = 100, the images do not vary much. This is due to the percent contribution of the principal components.

In the next figure, we show what the principal components look like.

The principal components are each 10 x 10 part of this picture.

We show another example. This one came from the anime Fate zero, where the protagonist is a female swordsman…..well you can search it if you like.

The number of principal components are the same as the cat figure.

The author gives himself a score of 9/10 for this work. He acknowledges Mr. Tingzon for the help. 🙂

Sources:

Wikipedia

Dr. Maricor Soriano

Wynn186

Scientific thinking

Image Compression (PCA)

Leave a comment Cancel reply

Share this:

Related

Leave a comment Cancel reply