Blog Logo
Research Scientist @ Foundation AI
Image Source:
· · ·

Basics to Computer Vision Series


· · ·

What is Computer Vision?

Computer vision is concerned with the automatic extraction, analysis and understanding of useful information from a single image or a sequence of images such as a video.

What is Computational Photography?

Computational photography refers to analysis, manipulation and synthesis of images using numerical algorithms. It combines methodologies from image processing, computer vision, computer graphics and photography.

Applications of Computer Vision

  • OCR and Face Recognition
  • Object Recognition
  • Special Effects and 3D Modeling
  • Smart Cars and Sports
  • Vision based computer interactions
  • Security and Medical Imaging

Why is Computer Vision Hard?

In order to understand why computer vision is hard, one has to familiarize themselves with the difference between measurements of metrics of an image and the perceptions that we draw from them. Essentially if one looks at the image below, it would seem that the boxes A and B are of different shade (essentially box A seems darker than box B).

Fig. 1 - Difference in Perception

But, in reality if we place a grayscale intesity matcher for comparison of the block shades, it is seen that the two intensities are the same as seen in the image below.

Fig. 2 - Uniformity of Measurement

Another classic example showing the way perception differs based on image manipulation can be seen below.

Fig. 3 - Ball in a Box - Shadow Manipulation

The shadow manipulation demo by Kersten Labs shows an apt example of how brain changes perception on slight changes in visual input to match the accepted norms.

It is these intricate details and variations among them that make the problem of computer vision a challenging one.


Introduction to Computer Vision - Udacity

· · ·