SIFT image alignment tutorial#

SIFT (Scale-Invariant Feature Transform) is an algorithm developped by David Lowe in 1999. It is a worldwide reference for image alignment and object recognition. The robustness of this method enables to detect features at different scales, angles and illumination of a scene.

Silx provides an implementation of SIFT in OpenCL, meaning that it can run on Graphics Processing Units and Central Processing Units as well. Interest points are detected in the image, then data structures called descriptors are built to be characteristic of the scene, so that two different images of the same scene have similar descriptors. They are robust to transformations like translation, rotation, rescaling and illumination change, which make SIFT interesting for image stitching.

In the fist stage, descriptors are computed from the input images. Then, they are compared to determine the geometric transformation to apply in order to align the images. This implementation can run on most graphic cards and CPU, making it usable on many setups. OpenCL processes are handled from Python with PyOpenCL, a module to access OpenCL parallel computation API.

This tutuorial explains the three subsequent steps:

  • keypoint extraction

  • Keypoint matching

  • image alignment

All the tutorial has been made using the Jupyter notebook.

[1]:
import time

start_time = time.time()
%pylab nbagg
Populating the interactive namespace from numpy and matplotlib
[2]:
# display test image
import silx

print("Silx version %s" % silx.version)

from PIL import Image
from silx.test.utils import utilstest

path = utilstest.getfile("lena.png")
image = numpy.asarray(Image.open(path))
fig, ax = subplots()
ax.imshow(image)
Silx version 0.7.0-dev0
[2]:
<matplotlib.image.AxesImage at 0x7fa76a2fa9e8>
[3]:
# Initialization of the sift object is time consuming: it compiles all the code.
import os

# set to 1 to see the compilation going on
os.environ["PYOPENCL_COMPILER_OUTPUT"] = "0"
# switch to "GPU" to "CPU" to enable fail-save version.
devicetype = "GPU"
from silx.image import sift

%time sift_ocl = sift.SiftPlan(template=image, devicetype=devicetype)

print("Device used for calculation: ", sift_ocl.ctx.devices[0].name)
CPU times: user 680 ms, sys: 236 ms, total: 916 ms
Wall time: 919 ms
Device used for calculation:  TITAN V
[4]:
print("Time for calculating the keypoints on one image of size %sx%s" % image.shape[:2])
%time keypoints = sift_ocl(image)
print("Number of keypoints: %s" % len(keypoints))
print("Keypoint content:")
print(keypoints.dtype)
print(
    "x: %.3f \t y: %.3f \t sigma: %.3f \t angle: %.3f"
    % (keypoints[-1].x, keypoints[-1].y, keypoints[-1].scale, keypoints[-1].angle)
)
print("descriptor:")
print(keypoints[-1].desc)
Time for calculating the keypoints on one image of size 512x512
CPU times: user 20 ms, sys: 4 ms, total: 24 ms
Wall time: 23.6 ms
Number of keypoints: 411
Keypoint content:
(numpy.record, [('x', '<f4'), ('y', '<f4'), ('scale', '<f4'), ('angle', '<f4'), ('desc', 'u1', (128,))])
x: 275.483       y: 302.585      sigma: 36.518   angle: -0.194
descriptor:
[ 11   5   0   1   5  20  22   8  88  20   3   0   0   4  40 120  41   9
  13  52  32  36  15  81   1   8  14  25  89  84   7   1  12   0   0   0
  22  94  29   9 120  32   0   0   1  21  43  69  81  20   0   0  22 120
  43  49  48 120  13   2  16  79  17   3  24   6   0   0  30  76  16   9
 120  64   7   5   5  10   7  38  64  75  36  37  38  54   5   8 109 120
   9   1   2   4  12  21  39  22   0   0  18  19   5   4 120 120  10   5
   1   0   0   0  27  42  44  52  37  20   6   2  24  10   3   2   7  42
  81  25]
[5]:
# Overlay keypoints on the image:
fig, ax = subplots()
ax.imshow(image)
ax.plot(keypoints[:].x, keypoints[:].y, ".g")