Learn Fourier Transform | Image Processing with OpenCV

Definition

The Fourier transform (FT) is a fundamental mathematical tool used in image processing to analyze the frequency components of an image.

It allows us to transform an image from the spatial domain (where pixel values are represented directly) to the frequency domain (where we analyze patterns and structures based on their frequency). This is useful for tasks like image filtering, edge detection, and noise reduction.

First, we need to convert the image to grayscale:

gray_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

Note

We used COLOR_BGR2GRAY because images are primarily read in BGR format, which is the reverse of RGB.

To compute the 2D Fourier transform:

dft = np.fft.fft2(image)
dft_shift = np.fft.fftshift(dft)

Here, fft2() converts the image from the spatial domain to the frequency domain, and fftshift() moves low-frequency components to the center.

To visualize the magnitude spectrum:

magnitude_spectrum = 20 * np.log(np.abs(dft_shift))

Since Fourier transform outputs complex numbers, we take the absolute values (np.abs()) for a meaningful visualization.

The np.log function enhances visibility, as raw magnitude values vary greatly in scale.

Task

Swipe to start coding

You are given an image:

Convert image to grayscale and store in gray_image variable;
Apply Fourier transform to the gray_image and stote in dft variable;
Make zero frequency shift to center and store the result in dft_shift variable;
Calculate a magnitude spectrum and store in magnitude_spectrum variable.

Solution

Everything was clear?

Thanks for your feedback!

Section 2. Chapter 2

single

Ask AI

Ask anything or try one of the suggested questions to begin our chat

Suggested prompts:

Can you explain why we use the frequency domain for image processing?

What are some practical applications of the magnitude spectrum?

How do I interpret the magnitude spectrum visually?

Awesome!

Completion rate improved to 3.45

Swipe to show menu