Understanding Computer Vision and Image Processing

Slide Note

Introduction to the fields of computer vision and image processing, exploring their differences and how they intertwine. Computer vision focuses on processing images for computer use, while image processing enhances images for human consumption. Topics include image analysis, restoration, enhancement, and compression. Both categories have fuzzy boundaries but play crucial roles in various applications such as manufacturing systems, medical imaging, and satellite analysis.

shreya Follow

Uploaded on Jul 17, 2024 | 1 Views

Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript

Image Processing 1

Introduction to Computer Vision and Image Processing Computer imaging can be separated into two primary categories: 1. Computer Vision. 2. Image Processing In computer vision application the processed images output for use by a computer, whereas in image processing applications the output images are for human consumption). 2

These two categories are not totally separate and distinct. The boundaries that separate the two are fuzzy, but this definition allows us to explore the differences between the two and to explore the difference between the two and to understand how they fit together Computer Vision Image Processing 3

Computer Vision Computer vision computer imaging where the application doses not involve a human being in visual loop. One of the major topics within this field of computer vision is image analysis. Image Analysis: involves the examination of the image data to facilitate solving vision problem. Computer vision systems are used in many and various types of environments, such as: Manufacturing Systems , Medical Community, DNA analysis, Satellites Orbiting, Infrared Imaging 4

Image Processing Computer Image processing is computer imaging where application involves a human being in the visual loop. In other words the image are to be examined and a acted upon by people. The major topics within the field of image processing include: Image restoration. Image enhancement. Image compression 5

Image Restoration Is the process of taking an image with some known, or estimated degradation, and restoring it to its original appearance. Image restoration is often used in the field of photography or publishing where an image was somehow degraded but needs to be improved before it can be printed 6

Image Enhancemnet Involves taking an image and improving it visually, typically by taking advantages of human Visual Systems responses. One of the simplest enhancement techniques is to simply stretch the contrast of an image. Enhancement methods tend to be problem specific. For example, a method that is used to enhance satellite images may not suitable for enhancing medical images. Although enhancement and restoration are similar in aim, to make an image look better. They differ in how they approach the problem. Restoration method attempt to model the distortion to the image and reverse the degradation, where enhancement methods use knowledge of the human visual systems responses to improve an image visually. 7

b. Image enhancement by a. image with poor contrast contrast stretching Image Enhancement 8

Image Compression Involves reducing the typically massive amount of data needed to represent an image. This done by eliminating data that are visually unnecessary and by taking advantage of the redundancy that is inherent in most images. Image data can be reduced 10 to 50 times, and motion image data (video) can be reduced by factors of 100 or even 200. 9

a. Image before compression b. Image after compression (92) KB (6.59)KB Image Compression. 10

Computer Imaging Systems Computer imaging systems are comprised of two primary components types, hardware and software. The hardware components can be divided into image acquiring sub system (computer, scanner, and camera) and display devices (monitor, printer).The software allows us to manipulate the image and perform any desired processing on the image data. 11

Digitization The process of transforming a standard analog signal (continuous) into digital signal (discrete) is called digitization. Analog signals use a continuous variety of amplitude values whereas digital signal takes a limited set of distinct values at consistently spaced spots in the time. Analog signals are continuous in nature, whereas digital signals are discrete. 12

Analog vs digital signal 13

What is a digital image? We usually operate on digital (discrete) images: Sample the 2D space on a regular grid Quantize each sample (round to nearest integer) If our samples are apart, we can write this as: f[i ,j] = Quantize{ f(i , j ) } The image can now be represented as a matrix of integer values

The image can now be accessed as a two-dimension array of data The digital image is 2D- array as: I(0,0) I(0,1) ..I(0,N-1) I(1,0) I(1,1) ..I(1,N-1) I(N-1,0) I(N-1,1) . ..I(N-1,N-1) In image matrix, the image size is (NXN) [matrix dimension] then: Ng= 2 m ..(1) Where Ng denotes the number of gray levels m is the no. of bits contains in digital image matrix. Example :If we have (6 bit) in 128 X 128 image .Find the no. of gray levels to represent it ,then find the no. of bit in this image? Solution: Ng= 26=64 Gray Level Nb= 128 * 128* 6= 9.8304 * 104 bit 15

The Human Visual System The Human Visual System (HVS) has two primary components: Eye. Brian. * The structure that we know the most about is the image receiving sensors (the human eye). * The brain can be thought as being an information processing unit analogous to the computer in our computer imaging system. These two are connected by the optic nerve, which is really a bundle of nerves that contains the path ways for visual information to travel from the receiving. 16

Image Resolution The resolution has to do with ability to separate two adjacent pixels as being separate, and then we can say that we can resolve the two. The concept of resolution is closely tied to the concepts of spatial frequency. Spatial frequency concept, frequency refers to how rapidly the signal is changing in space, and the signal has two values for brightness-0 and maximum. If we use this signal for one line (row) of an image and then repeat the line down the entire image, we get an image of vertical stripes. If we increase this frequency the strips get closer and closer together, until they finally blend together 17

Image resolution 18

Image brightness Adaption In image we observe many brightness levels and the vision system can adapt to a wide range. If the mean value of the pixels inside the image is around Zero gray level then the brightness is low and the images dark but for mean value near the 255 then the image is light. If fewer gray levels are used, we observe false contours bogus lines resulting from gradually changing light intensity not being accurately represented. 19

Image Representation We have seen that the human visual system (HVS) receives an input image as a collection of spatially distributed light energy; this is form is called an optical image. Optical images are the type we deal with every day cameras captures them, monitors display them, and we see them [we know that these optical images are represented as video information in the form of analog electrical signals and have seen how these are sampled to generate the digital image I(r , c). The digital image I (r, c) is represented as a two- dimensional array of data, where each pixel value corresponds to the brightness of the image at the point (r, c). in linear algebra terms , a two- dimensional array like our image model I( r, c ) is referred to as a matrix , and one row ( or column) is called a vector 20

Binary Image Binary images are the simplest type of images and can take on two values, typically black and white, or 0 and 1 . A binary image is referred to as a 1 bit/pixel image because it takes only 1 binary digit to represent each pixel. These types of images are most frequently in computer vision application where the only information required for the task is general shapes, or outlines information. For example, to position a robotics gripper to grasp ( ) an object or in optical character recognition (OCR). Binary images are often created from gray-scale images via a threshold value is turned white ( 1 ), and those below it are turned black ( 0 ). 21

Gray Scale Image Gray _scale images are referred to as monochrome, or one-color image. They contain brightness information only brightness information only, no color information. The number of different brightness level available. The typical image contains 8 bit/ pixel (data, which allows us to have (0-255) different brightness (gray) levels. The 8 bit representation is typically due to the fact that the byte, which corresponds to 8-bit of data, is the standard small unit in the world of digital computer. 22

Color Image Color image can be modeled as three band monochrome image data, where each band of the data corresponds to a different color. . The actual information stored in the digital image data is brightness information in each spectral band. When the image is displayed, the corresponding brightness information is displayed on the screen by picture elements that emit light energy corresponding to that particular color. 23

Multispectral Images Multispectral images typically contain information outside the normal human perceptual range. This may include infrared ultraviolet ,X-ray, acoustic or radar data. Source of these types of image include satellite systems underwater sonar systems and medical diagnostics imaging systems. 24

Multispectral Images A multispectral image is one that captures image data within specific wavelength ranges across the electromagnetic spectrum. The wavelengths may be separated by filters or detected via the use of instruments that are sensitive to particular wavelengths, including light from frequencies beyond the visible light range, i.e. infrared and ultra- violet. Spectral imaging can allow extraction of additional information the human eye fails to capture with its visible receptors for red, green and blue. It was originally developed for military target identification 25

electromagnetic spectrum 26

Overview of Image Properties An image is composed of an array of dots, more commonly referred to as pixels (short for picture elements). One generally refers to a computer image's dimensions in terms of pixels; this is also often (though slightly imprecisely) known as its resolution. Some common image sizes are 640 480, 800 600, and 1024 768 pixels, which also happen to be common dimensions for computer displays. 27

In addition to horizontal and vertical dimensions, an image is characterized by depth. The deeper the image, the more colors (or shades of gray) it can have. Pixel depths are measured in bits, the tiniest units of computer storage; - a 1-bit image can represent two colors (often, though not necessarily, black and white). - a 2-bit image four colors. - an 8-bit image 256 colors, and so on. 28

To calculate the raw size of the image data before any compression takes place, one needs only to know that 8 bits make a byte. Thus a 320 240, 24-bit image has 76,800 pixels, each of which is 3 bytes deep, so its total uncompressed size is 230,400 bytes. 29

1-bit Images Each pixel is stored as a single bit (0 or 1), so also referred to as binary image. Such an image is monochrome image since it contains no color. Next Figure shows a 1-bit monochrome image (called Lena by multimedia scientists - this is a standard image used to illustrate many algorithms). also called a 1-bit 30

8-bit Gray-level Images Each pixel has a gray-value between 0 and 255. Each pixel is represented by a single byte; e.g., a dark pixel might have a value of 10, and a bright one might be 230. Bitmap: The two-dimensional array of pixel values that represents the graphics/image data. Image resolution: refers to the number of pixels in a digital image (higher resolution always yields better quality). - Fairly high resolution for such an image might be 1600 * 1200, whereas lower resolution might be 640 * 480. 32

Bitmap (raster) versus vector images Vector graphics are computer graphics images that are defined in terms of points on a Cartesian plane, which are connected by lines and curves to form polygons and other shapes. Vector graphics have the unique advantage over raster graphics in that the points, lines, and curves may be scaled up or down to any resolution with no aliasing. The points determine the direction of the vector path; each path may have various properties including values for stroke color, shape, curve, thickness, and fill. 33

Frame Buffer 34

Multimedia Presentation Each pixel is usually stored as a byte (a value between 0 to 255), so a 640 480 grayscale image requires 300 kB of storage (640 480 = 307, 200). Next Fig shows the Lena image again, but this time in grayscale. 35

Dithering which is a means of mixing pixels of the available colors together appearance of other colors (though generally at the cost of some sharpness) . When an image is printed, the basic strategy of dithering is used, which trades intensity resolution for spatial resolution to provide ability to print multi-level images on 2-level (1- bit) printers. to give the 37

An ordered dither consists of turning on the printer output bit for a pixel if the intensity level is greater than the particular matrix element just at that pixel position. Next Fig. (a) shows a grayscale image of "Lena". The ordered-dither version is shown as (b), with a detail of Lena's right eye in (c). 38

Dithering examples 40

The image on the left is the original. The middle one is like taking those pristine files and dramatically reducing their bit depth. It s pretty hard to recognize the image. The one on the right has been reduced the same amount, but this time with dither applied. 41

Color Image Data Types The most common data types for graphics and image file formats ; 24-bit color and 8-bit color. Some formats are hardware/operating system platforms, while others are cross-platform formats. Even if some formats are not cross-platform, there are conversion applications translate formats from one system to another. Most image formats incorporate some variation of compression technique due to the large storage size of image files. Compression techniques can be classified into either lossless or lossy. restricted to particular that will recognize and 42

8-bit Color Images Many systems can make use of 8 bits of color information (the so-called 256 colors ) in producing a screen image. Such image files use the concept of a lookup table to store color information. - Basically, the image stores not color, but instead just a set of bytes, each of which is actually an index into a table with 3-byte values that specify the color for a pixel with that lookup table index. 43

Color Look-up Tables (LUTs) The idea used in 8-bit color images is to store only the index, or code value, for each pixel. Then, e.g., if a pixel stores the value 25, the meaning is to go to row 25 in a color look-up table (LUT). 44

24-bit Color Images In a color 24-bit image, each pixel is represented by three bytes, usually representing RGB. - This format supports 256* 256* 256 possible combined colors, or a total of 16,777,216 possible colors. - However such flexibility does result in a storage penalty: A 640*480 24-bit color image would require 921.6 kB of compression. Full-color photographs may contain an almost infinite range of color values. Dithering is the most common means of reducing the color range of images down to the 256 (or fewer) colors seen in 8-bit GIF images. storage without any 46

Chapter Two Image Analysis 47

Image Analysis Image analysis involves manipulating the image data to determine exactly the information necessary to help solve a computer imaging problem. This analysis is typically part of a larger process, is iterative in nature and allows us to answer application specific equations: Do we need color information? Do we need to transform the image data into the frequency domain? Do we need to segment the image to find object information? What are the important features of the image? Image analysis is primarily data reduction process. As we have seen, images contain enormous amount of data, typically on the order hundreds of kilobytes or even megabytes. Often much of this information is not necessary to solve a specific computer imaging problem, so primary part of the image analysis task is to determine exactly what information is necessary. Image analysis is used both computer vision and image processing. But it is different. 48

System Model The image analysis process can be broken down into three primary stages: 1- Preprocessing. 2- Data Reduction. 3- Features Analysis. Preprocessing: Is used to remove noise and eliminate irrelevant, visually unnecessary information. Noise is unwanted information that can result from the image acquisition process, other preprocessing steps might include: -Gray level or spatial quantization (reducing the number of bits per pixel or the image size). -Finding regions of interest for further processing. 49

Data Reduction Involves either reducing the data in the spatial domain or transforming it into another domain called the frequency domain, and then extraction features for the analysis process. 50

Understanding Computer Vision and Image Processing

Download Presentation

Presentation Transcript

Related

More Related Content