Image-Stitching-using-SIFT-feature-detection-algorithm

The goal of this task is to experiment with image stitching methods. Two images may have the same background but different foreground. For example, a moving person may be moving in the scene. The two images must be stitched into one image eliminating foreground objects that move in the scene.

In this project there are 2 tasks performed:

Background Stitching: Two images with same background, but different foregrounds are provided. The task is to stitch the two images given and eliminate the foreground objects
Image Panorama: A set of 4 images are given which needs to be stitched together to create one panoramic image

TASK 1:

SIFT (Scale Invariant Feature Transform):

SIFT is a feature detection algorithm used to detect and describe the local features of an image. These features derived and Scale and Rotation Invariant. In this project the in-built function in Opencv is used to detect the features of the image. kp1, des1 = sift.detectAndCompute(img_1,None) kp1 – stores the orientation and coordinates of the descriptor des1 – stores the bin values (128 bins) / the feature vector

Nearest Neighbor:

This algorithm is generally used to solve optimization problems in a given set, of finding the point in a set to another given point. In this project, once the image features are detected for both the images this concept is used for matching the key descriptors between the two images. This algorithm is based on the magnitude of SSD (Sum of Squared Differences) of the feature vectors of the two descriptors selected in two different images. So comparing one feature vector of the first image with all the feature vectors of the second image, the least SSD (or the nearest descriptor) is taken as the match. To further improve the quality matches, the concept of ratio threshold is used, where the SSD values of the closest neighbor(A) and second closest neighbor(B) is selected and their ratio(A/B) is calculated. This ratio defines how distinct the match between the descriptors are. If the ratio is high then it means they are not so unique and vice versa. Using this algorithm the perfect matches are calculated.

Homography matrix:

The Homography matrix relates the transformation between two image planes. This matrix defines the action of image rectification, image registration, or computation of camera motion—rotation and translation—between two images. This matrix can be calculated using a minimum number of 4 key descriptors matches between two images. The inbuilt function in Opencv is used. cv2.findHomography(ptB, ptA, cv2.RANSAC, 5.0) ptB, ptA – represents the matched descriptors index

RANSAC (Random sample consensus) algorithm:

It is an iterative method to estimate parameters of a mathematical model from a set of observed data that contains outliers, when outliers are to be accorded no influence on the values of the estimates, thus making it an outlier detection method. In the cases where, more than 4 matching descriptors are generated, the RANSAC algorithm is used to determine the best set of 4 matching descriptors to determine the homography matrix. This method is passed as an argument to the function call of cv2.findHomography. RANSAC selects the best set of 4 matching descriptors, by finding the matches that gives maximum number of inliers (or least outliers) of points.

TASK 2

In task 2, since there are multiple images to be stitched the sequence in the order of stitching needs to be predicted before image stitching. In this case, the process is done by selecting one among the given image as the base image and then consecutively step by step the images are stitched. The sequential order of the stitching is selected based on the overlap matrix that is determined by the number of matching descriptors of 2 images.

This matrix is N*N matrix (N is the total number of give images), where Nth row elements represents the matching of Nth image with respect to every other images. The value in every element of the matrix is determined by the percentage of overlap between the two images, where the two images represent the element’s coordinates. [I,j] in matrix represents the value of overlap of the ith image with the jth image. The overlap value is the ratio of number of matching descriptors of ith, jth image(Mmatch ) and the total number of descriptors in ith image(Imatch).

R = Mmatch/Imatch

So by setting a threshold of 0.2 to R (20% overlap between two images), the matrix is determined. From the matrix to determine the base image the index of the row with maximum number of non-zero elements is taken. To find the sequence of stitching from the selected row of the matrix, sort the elements in descending order.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
images		images
LICENSE		LICENSE
README.md		README.md
t1.py		t1.py
t2.py		t2.py
t2_overlap_img1.txt		t2_overlap_img1.txt
t2_overlap_img2.txt		t2_overlap_img2.txt
task1.png		task1.png
task2_img1.png		task2_img1.png
task2_img2.png		task2_img2.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image-Stitching-using-SIFT-feature-detection-algorithm

TASK 1:

SIFT (Scale Invariant Feature Transform):

Nearest Neighbor:

Homography matrix:

RANSAC (Random sample consensus) algorithm:

TASK 2

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Image-Stitching-using-SIFT-feature-detection-algorithm

TASK 1:

SIFT (Scale Invariant Feature Transform):

Nearest Neighbor:

Homography matrix:

RANSAC (Random sample consensus) algorithm:

TASK 2

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages