Project2: Fun with Filters and Frequencies!
Part 1: Fun with Filters
Finite Difference Operator
In this section, I first applied differential filters in the x and y directions separately and visualized them accordingly. For the gradient magnitude, I computed $\sqrt{dx^2+dy^2}$ for each pixel. Then, I set a threshold of 100 to binarize the high-gradient edge areas. The resulting image clearly showed the contours, but some overly high-frequency information caused noise.
data:image/s3,"s3://crabby-images/afa5d/afa5dff353a6f4b37e252e3b540748628654157e" alt="First Image"
x direction gradient
data:image/s3,"s3://crabby-images/e8668/e8668aa2fea400155318e5cca72718dddc6e065f" alt="Second Image"
y direction gradient
data:image/s3,"s3://crabby-images/19de7/19de70254a2ca90cf976ddf488d3497571bdd230" alt="First Image"
all direction gradient
data:image/s3,"s3://crabby-images/9ab64/9ab64e0169c4af2ff28adbf6f4d9ce6a0155c740" alt="Second Image"
threshhold image
Derivative of Gaussian (DoG) Filter
Therefore, I applied a low-pass filter with a Gaussian kernel of (12, 2) to the image, followed by binarization using a threshold of 15. The threshold was lowered because the gradient decreases after the low-pass filtering. However, this helps filter out a lot of noise, resulting in smoother and more continuous contours.
data:image/s3,"s3://crabby-images/f1632/f16322b367a8cf45ac18947b64b0d6a06db3aebb" alt="First Image"
blurred image gradient
data:image/s3,"s3://crabby-images/b8349/b8349064f594b78be17f5e26a0f88ed1c140c899" alt="Second Image"
blurred image threshhold
Part 2: Fun with Frequencies!
Image “Sharpening”
Blur the image using a Gaussian filter: $I_{blurred}=I * G$, where $I$ is the original image, $G$ is the Gaussian filter.
The result of a sharpened image:
$$I_{sharpened}=(1+\alpha)\cdot I-\alpha\cdot(I*G)$$You can control the sharpening intensity of the image by adjusting the value of $\alpha$. Here, I have chosen values of 0.5, 1.5, and 3.
data:image/s3,"s3://crabby-images/65cc6/65cc6e50ba6624414af2b271ff9838025ed6a1a5" alt="First Image"
raw image
data:image/s3,"s3://crabby-images/7878b/7878b2e9db249d2df2fb384c35d735c9ef3de3e2" alt="Second Image"
$\alpha=0.5$
data:image/s3,"s3://crabby-images/f0c52/f0c5217def6674c7d79bc197d88d39959451d403" alt="First Image"
$\alpha=1.5$
data:image/s3,"s3://crabby-images/89aa2/89aa20cbe60278166c5d51065f472b479646cded" alt="Second Image"
$\alpha=3$
Another cases
data:image/s3,"s3://crabby-images/96052/96052ab526def4443ec46e0f8cc639e553f5badd" alt="First Image"
raw image
data:image/s3,"s3://crabby-images/da554/da554b6b9e0ea897eeccb7a206ca2f4e98ecd119" alt="Second Image"
$\alpha=3$
data:image/s3,"s3://crabby-images/ee459/ee459ef399200ad9276c9b7bebff21cf028bea73" alt="First Image"
raw image
data:image/s3,"s3://crabby-images/9683f/9683f3f0f0354aa09f2972a236cea3d1dcccf5a6" alt="Second Image"
$\alpha=3$
Hybrid Images
To make the image blending more realistic, we first need to align the feature points of the two images. To achieve the effect where different content is seen at varying distances, we should separate the low-frequency and high-frequency components of each image. When viewed from a distance, the low-frequency part of the image is visible, while the high-frequency part is visible up close.
data:image/s3,"s3://crabby-images/d40fb/d40fb48e411726d11747358a4ee0a5dfdaf54b44" alt="First Image"
aligned derek
data:image/s3,"s3://crabby-images/164c7/164c75b7c825b57e3f350a443bd8c97735335f94" alt="Second Image"
aligned nutmeg
data:image/s3,"s3://crabby-images/fff33/fff333a572317848c9c0e9e027306401365989f3" alt="First Image"
low-pass derek
data:image/s3,"s3://crabby-images/9d5b0/9d5b0c60a58ced15be3bc17ea85441ef48919467" alt="Second Image"
high-pass nutmeg
data:image/s3,"s3://crabby-images/f4080/f408015a354cb9d1bb464ec86539eb05500e2b48" alt="Image description"
hybrid image
data:image/s3,"s3://crabby-images/8cd5a/8cd5a7df96609f5d64cdbe057719b13171b4811f" alt="Image description"
image spectrum
Another 2 cases (with one failure)
data:image/s3,"s3://crabby-images/c38ed/c38ed41f3cc6cd1d6ee0b170851557787f3dc2c0" alt="First Image"
aligned Berkeley logo
data:image/s3,"s3://crabby-images/30f66/30f66970a264e5ee6e3a8c4baef55eefdefd8288" alt="Second Image"
aligned Stanford logo
data:image/s3,"s3://crabby-images/dd1ed/dd1ed0e76fc0a3a3427abeee706630d03faac86d" alt="Image description"
hybrid image
data:image/s3,"s3://crabby-images/1e2f0/1e2f0ba029e0667d82a5c6177cf28b0f3b975e17" alt="First Image"
aligned cat
data:image/s3,"s3://crabby-images/d6ecd/d6ecdf64ee3360fc5d3e2991911e2bd1123b2435" alt="Second Image"
aligned dog
data:image/s3,"s3://crabby-images/f0080/f0080c20942718d9aab254b5821a84cac8785c66" alt="Image description"
hybrid image
However, this is a failure case because too much of the cat’s high-frequency information was lost, making it difficult to recognize the cat from a distance. The hybrid parameters need to be adjusted based on this case.
Gaussian and Laplacian Stacks
Gaussian and Laplacian Stacks are multi-scale image representations commonly used in image processing and blending techniques.
Gaussian Stack: This is a set of progressively blurred versions of an image, created by applying Gaussian filters with increasing levels of smoothing. Each level in the stack captures the image at a lower resolution, preserving only low-frequency information.
Laplacian Stack: Derived from the Gaussian stack, the Laplacian stack highlights the high-frequency information (edges and fine details) by subtracting consecutive levels of the Gaussian stack. Each layer in the Laplacian stack represents the details lost between the blurred levels, allowing precise control over image blending and sharpening.
Here, I selected a Gaussian low-pass filter with a kernel size of 31 and a sigma of 7, using a six-level pyramid.
data:image/s3,"s3://crabby-images/4564c/4564c7fb9471b277ad4a397c42fdfda015204f93" alt="Image description"
gaussian pyramid
data:image/s3,"s3://crabby-images/df45c/df45c978cd8f03894eb21a92255b0a881f929201" alt="Image description"
laplacian pyramid
Multiresolution Blending (a.k.a. the oraple!)
To blend the two images more smoothly, I used a Laplacian stack combined with a progressively blurred mask. Here, I applied a Gaussian kernel with a size of 11 and a sigma of 3 in the Laplacian pyramid, and for the blurred mask, I used a Gaussian kernel with a size of 51 and a sigma of 11.
data:image/s3,"s3://crabby-images/d9220/d9220f40a4f0626f441ab06b3b2056d0111dd858" alt="Image description"
oraple
data:image/s3,"s3://crabby-images/6bdd1/6bdd185485532ff7b966d94310a64badae53b421" alt="Image description"
Column1, 2: the Laplacian stack for apple and orange. Column3: gradually blurred mask
Another 2 cases (with one irregular mask)
data:image/s3,"s3://crabby-images/8b19f/8b19fc5ad55153f77dfbe2bebba1d89e6ecc7741" alt="First Image"
Biden
data:image/s3,"s3://crabby-images/6c92c/6c92cbbf554fdbafa564a24f85cd85da456cf585" alt="Second Image"
Trump
data:image/s3,"s3://crabby-images/c04a8/c04a89d1feb3525bad0ede218c5040d98510fa3b" alt="Image description"
splined image
data:image/s3,"s3://crabby-images/50792/507928a5a1c0ea8f6bd064907190320eac9519f8" alt="First Image"
face
data:image/s3,"s3://crabby-images/32ae7/32ae707dd9ea47f4874e84b705e5d31709349c26" alt="Second Image"
palm
data:image/s3,"s3://crabby-images/c2464/c24644f2935284d06d7a30c5c3e5d109d3195795" alt="First Image"
mask
data:image/s3,"s3://crabby-images/4e1a6/4e1a68355df19969f6da9a547d27bb29bebe5bd3" alt="Second Image"
splined image