CSCI1430 : Project 1 Hybrid Image

yunzhang

The goal of this assignment is to create hybrid images whose interpretation can be changed as a function of thew viewing distance.

Algorithm

Image alignment
Laplacian pyramids creation

imresize()

Image Hybriding

Extra Credit

We use color to enhance the visual effect of result images. We experiemented with following approaches:

Construct a gaussian pyramid and its corresponding laplacian pyramid for each channel of two input images, do the blending as described above for RGB channels separately and combine them tegether to form the final results.
Convert from RGB space to CIE L*a*b color space first, then create the hybrid images only on the intensity channel. We combine the other two color channels from the low frequency image with the hybrid intensity channel to recreate a color image in L*a*b channel and convert back to RGB space.

The images we created by these two approaches looks basically the same.

Results

For each pairs of images, we run our algorithm on different settings of scalar, cutoff and number of pyramid level, and show only best compositing results here. Please click the thumbnails to view images in original sizes.

Cat & Dog ( salar=0.7, cutoff=5, N=8 )

Marylin & Einstein ( scalar=0.6, cutoff=3, N=8 )

Derek & Cat ( scalar=0.6, cutoff=3, N=8 )

Discussion

As per Oliva et al., there are two primary mechanisms can be exploited to create compelling hybrid images. The first is maximizing the correlation between edges in the two scales so that they blend. The second resides in the fact that the remaining edges that do not correlate with other edges across scales can be perceived as noise. Based on these principles, we can further enhance our results in follow ways:

Do better alignment: We can specify more corresponding points so that a more complex image warp marix can be developed to align some strong edges in two inputs. For example, in Marylin-Einstein test, we can stretch one of the faces to make sure their eyes, eyebrows, noses and mouth align well, then other weak edges will be treated as noises when we look at the image.
Do local blending: Our algorithm so far does the blending merely based on a global cutoff of frequency. We can devise a smarter algorithm which performs the cutoff for each local regions. Doing this will give us an overall better decision boundary. Taking Marylin-Einstein test as instance again, one reason our result looks way from perfect is that Einstein's beard, which is not supposed to appear in the low frequnecy bands, still dominate our visual perception when we stand away from the image. For such regions, we may want to set the cutoff to favor more to low frequency information from Marylin's image.