Project1: Colorizing the Prokudin-Gorskii photo collection
Overview
The goal of this project is to take the digitized Prokudin-Gorskii glass plate images and, using modern image processing techniques, automatically produce a color image with as few visual artifacts as possible.
Algorithm Implementation
Aligning Method
For smaller images, I use the blue channel as a baseline, iteratively enumerate shifts([-15, 15]) in other channels, and compare the shifted images with the baseline image to identify the smallest difference as the result. Initially, I used the L2 norm difference between two images as the metric for evaluating differences. Although this metric can successfully match some images, it lacks adaptability and sensitivity to differences, because the values in the RGB channels can vary significantly and cannot be directly calculated at the pixel level. Subsequently, I utilized the Sobel operator to calculate the gradient information of both images. I chose the 3*3 Sobel kernel size to get finer details of gradient. The difference in gradient information between the two images serves as the evaluation metric. This method is advantageous because the gradient direction of the same images tends to be similar, allowing for better alignment of the images.
Speed up Method
However, for larger images, the time it takes to calculate differences for a single image becomes excessively long, making the time spent on enumerating shifts 31*31 times unacceptable. Because the distance moved at low resolution is longer than at high resolution, I recursively reduce the image size by half and calculate the offset at the lower resolution, doubling the offset at the end of each recursion. By reducing the search range to [-2, 2], for 4K resolution images, the recursion depth does not exceed 8. This algorithm significantly reduces the computational load while maintaining the same excellent performance in results.
Result Gallery
data:image/s3,"s3://crabby-images/22b73/22b73ee33484ac22779f3a1cd4547ac3a650d8f3" alt="First Image"
cathedral g_shift: [5, 2], r_shift: [12, 3]
data:image/s3,"s3://crabby-images/c3c7d/c3c7de8a5853edf7eaaa6df1b32f61a85679e8d0" alt="Second Image"
church g_shift: [25, 3], r_shift: [58, -4]
data:image/s3,"s3://crabby-images/aadf9/aadf9fdd79eed00945ca26729ab073801300c325" alt="First Image"
emir g_shift: [49, 24], r_shift: [106, 42]
data:image/s3,"s3://crabby-images/75575/75575fcebe025b5879b306ee8337f0826315f8a5" alt="Second Image"
harvesters g_shift: [56, 11], r_shift: [118, 10]
data:image/s3,"s3://crabby-images/648da/648dab8539971075966fb8035b95c9e916813cde" alt="First Image"
icon g_shift: [39, 16], r_shift: [88, 23]
data:image/s3,"s3://crabby-images/40faf/40faf986d2d2ddcba57a40a7164d619199e72799" alt="Second Image"
lady g_shift: [57, 9], r_shift: [121, 13]
data:image/s3,"s3://crabby-images/3d616/3d6164fe7cacd89ef3ad4fe34048e232f63c89ad" alt="First Image"
melons g_shift: [77, 5], r_shift: [176, 14]
data:image/s3,"s3://crabby-images/7ae70/7ae70fdaace9e6740a9e0b70407085cb0a1a9ca6" alt="Second Image"
monastery g_shift: [-3, 2], r_shift: [3, 2]
data:image/s3,"s3://crabby-images/f1daa/f1daa79517850e431c419692caef4d2deff1cd40" alt="First Image"
onion_church g_shift: [50, 28], r_shift: [108, 34]
data:image/s3,"s3://crabby-images/20c77/20c77eedbd8af392ad4d47171c0c4758718ca515" alt="Second Image"
sculpture g_shift: [33, -11], r_shift: [140, -27]
data:image/s3,"s3://crabby-images/9e987/9e987d9e916557f1dfa84d6532b06bc667faeee5" alt="First Image"
self_portrait g_shift: [81, 31], r_shift: [175, 37]
data:image/s3,"s3://crabby-images/ca435/ca4355f5c8ab8aa2334d38f7003d0bc088d7af79" alt="Second Image"
three_generations g_shift: [59, 15], r_shift: [115, 12]
data:image/s3,"s3://crabby-images/af751/af7514403122f4c76e894bd5fe06e3e75caeafb8" alt="First Image"
tobolsk g_shift: [3, 2], r_shift: [6, 3]
data:image/s3,"s3://crabby-images/0f2b5/0f2b57f81a1d299bfcc00d705f1df396acbf402f" alt="Second Image"
train g_shift: [41, 0], r_shift: [84, 28]
few examples of my own choosing
data:image/s3,"s3://crabby-images/9044e/9044e9a72a2886fdc0649865430cf2ce64d0e266" alt="First Image"
Adobe building in a grassy field, trees in background
data:image/s3,"s3://crabby-images/91c15/91c154373613f242389b14581f9b5e9b519b43ba" alt="Second Image"
Lugano
Bells & Whistles
Better features
As the description in “Vanilla Method” part above, I implemented a novel aligning method using Sobel operator to calculate the image gradient.
data:image/s3,"s3://crabby-images/02b57/02b5749df931cf24be89da89763f6e9c51e65fc3" alt="First Image"
L2 norm aligning
data:image/s3,"s3://crabby-images/aadf9/aadf9fdd79eed00945ca26729ab073801300c325" alt="Second Image"
Sobel operator aligning
Crop the images
I designed an algorithm to trim the excess edges of an image. First, the size of the image is reduced by 5% with the aim of cutting off the white parts at the edges. Next, Canny edge detection is applied separately to each of the RGB channels. This involves calculating the coordinates of the edge pixels that are closest to the frame. The largest common area across all channels is taken as the final result for the crop.
data:image/s3,"s3://crabby-images/aadf9/aadf9fdd79eed00945ca26729ab073801300c325" alt="First Image"
L2 norm aligning
data:image/s3,"s3://crabby-images/385f4/385f414d7794386ae7a97cdd48ecb61dc81b7873" alt="Second Image"
Sobel operator aligning