Project 4: Daniela Fajardo

Part A

Overview:

For this part of the project, we learned how to use homographies in order to create projections of scenes and change perspectives of objects.

Part 1: Shoot the pictures

Mosaic 1 Images: East Asian Library

Mosaic 2 Images: BWW

Mosaic 3 Images: Pretty House I Saw While Biking

Mosaic 4 Images: East Asian Books

Part 2: Recover Homographies

In order to warp the images into alignment to be able to create the mosaics, I needed to recover the parameters of the transformation which in this case is a homography. In order to do this, I took two sets of points, pts_a and pts_b. For explanation purposes, let pts_a = [x, y].T and pts_b = [x’, y’].T.

We will solve for the homography that brings pts_a into pts_b with the following logic.

Part 3: Warp the images

Now we can use the calculated homographies to take an image and change its perspective to match a certain shape (be it another image or take a part of this image and rectify it).

My steps to transform im1 into im2 (image or window over image) were:

Calculate homography H.
Apply H to the corners of im2 or the window we want to capture.
Create a bounding box for the destination image.
Create a grid of the destination image and shift it.
Find source coordinates of the grid I just created.
For each of the image channels, use griddata to fill in the corresponding pixels.

Here are some examples of rectification using homographies:

Example 1: iPad

Example 2: Charger

Part 4: Blend the images into a Mosaic

Now we make homographies interesting. The idea is to take the previously taken pictures of a scene and put them together into a mosaic. For the mosaics, I warped all images into one of the images, padded them as necessary so they have the same size, and then used alpha blending. I used scipy.ndimage.distance_transform_edt to get the distance of each of the pixels of the images from the center, and blended such pixels with a weight of 1 when in the center of the image. The weight of the pixels in the new image decreases as the pixels move away from the center of the image they belong to.

Here are some examples of Mosaics I created:

Mosaic 1 Images: East Asian Library

This was my first attempt. This is a mosaic made out of 3 images. I decided to take various pictures closer together so I had more points of overlap and could understand the results more easily.

Mosaic 2 Images: BWW

Outside mosaic made of 4 images. Alignment is difficult around items with a lot of details. Learned that with more correspondences we get clearer mosaics.

Mosaic 3 Images: A Pretty House I saw while biking

Just for fun. 2 images.

Mosaic 4 Images: East Asian Books

For fun. 2 images.

Part B

Overview:

For this part of the project, we learned how to use corners to do feature matching and produce correspondences between pictures in order to then build mosaics.

Corner Feature Detection

I used the Harris Corner Detection method to identify unique feature points within each image.

Harris Corners for Mosaic 1

Harris Corners for Mosaic 2

Harris Corners for Mosaic 3

Adaptive Non-Maximal Suppression (ANMS)

To avoid clustering of corner points, I applied Adaptive Non-Maximal Suppression (ANMS) to select a spread of high-quality corners. For each point, I determined a "suppression radius" that measured the distance to the nearest stronger corner. I then iterated through each point, comparing its strength to those around it, and discarded any that didn’t fall among the strongest within a specified radius. This process produced a well-distributed set of corners across the image.

Corners after ANMS for Mosaic 1

Corners after ANMS for Mosaic 2

Corners after ANMS for Mosaic 3

Feature Descriptor Extraction

For each corner, I extracted a descriptor that captures the surrounding image structure. This involved isolating a window around each corner, downsampling it for stability, and applying Gaussian blurring to reduce noise. I then normalized each descriptor for consistent brightness and contrast, making it more reliable for matching.

Feature Matching

With the descriptors extracted, I matched features between images using the Nearest-Neighbor method. For each descriptor in one image, I calculated the sum of squared differences with all descriptors in the other image, identifying the closest ones. To make the matches more reliable, I applied Lowe's ratio test, which checks that the closest match is noticeably better than the second-closest by a threshold (I chose it to be 0.4). This step helped filter out weaker matches, making the overall process more accurate.

Feature Matches Mosaic 1

Feature Matches Mosaic 2

Feature Matches Mosaic 3

Homography Calculation with RANSAC

To calculate the transformation matrix (homography) between the images, I used RANSAC to deal with outliers in the matches. The process involves randomly selecting small sets of matched feature points, computing a homography for each set, and seeing how well that transformation aligns the other points. By repeating this several times, RANSAC identifies the homography that fits the most points accurately, which allows me to warp one image so it aligns correctly with the other.

Creating the Mosaics

I proceeded as in part A with merging and blending.

Results

Mosaic 1

Manual

Automatic

Mosaic 2

Manual

Automatic

Mosaic 3

Manual

Automatic

Coolest thing I've learned?

I thought the coolest thing I learned was how to use transformations for rectification. I found very interesting the story about the art history phd that had been trying to figure out the floor of a painting for years, and thinking that through mathematics and computer science we can do this in a matter of hours intrigues me a lot, and makes me realize the importance of computer science and mathematics through different industries. Another very important thing that I learned is the importance of taking good pictures. Throughout the project I realized that if the pictures were not taken properly, some caviats rose when creating my results. This was interesting for me because it showed the connection between coding and algorithms and the human touch. It showed me that the human factor can actually ruin the result of an algorithm, and so I had to be more careful with my human area of error. The most difficult part of this project was to get the warping to work. Once I was able to do this, the project moved more swiftly.