This page is a part of the Computer Vision Wiki. The wiki is devoted to computer vision, especially low level computer vision, digital image analysis, and applications. The exposition is geared towards software developers, especially beginners. The wiki contains discussions, mathematics, algorithms, code snippets, source code, and compiled software. Everybody is welcome to contribute with this in mind - all links are no-follow.

Main Page

From Computer Vision Wiki

Revision as of 15:04, 21 October 2007; view current revision
←Older revision | Newer revision→
Jump to: navigation, search
Welcome To ComputerVisionWiki.org
A software developer's resource of computer vision methods from Intelligent Perception.

Computer Vision For Beginners: A Developer’s Platform

The current image analysis and computer vision technology is a very large collection of disparate “tools” in the form of “toolboxes”, “cookbooks”, or code libraries. It follows the following outdated manual paradigm:

Image analysis tools include “edge detection”, “thresholding”, “segmentation”, “Fourier transform”, “wavelets”, "the Laplacian of the Gaussian", and on and on, all drown in a sea of "image processing" tools. It takes long training and experience in this area to put these pieces together to create image analysis software. The methods are mathematically advanced at a level that goes well beyond what is covered in a typical undergraduate degree in computer science: Fourier and wavelet transforms, partial differential equations, probability and statistics, discrete topology and geometry, etc.

Over the last few years, the fast growth of digital imaging has created the demand for image analysis systems that are both cheap and versatile. Platforms for such systems should allow the software developer to concentrate on the user’s needs and not on custom development of mathematical algorithms. Our goal is to take care of the "Mathematical tools" part above so that the developer would face this:

Of course we'll be able to handle only the fundamentals: objects in the image, their locations, measurements, their topology, etc. It is what may be called the low level computer vision. This data will allow the developer to concentrate on high level computer vision: what these objects represent in the context of his project.

For that we have our software developer's kit (SDK). If you also want to understand how everything works, this wiki gives you a unique chance. We have complete and detailed expositions and source code.

In fact, this wiki is self-contained...


Contents

--> Homology in 2D UC

--> Image Sequences UC

--> Color Images UC


"UC" means "under construction".

"<<" indicates dependency (and so does a lower position - to some degree).

">>" indicates suggested reading.

"-->" means that you can jump to that article.


Slides: A Short Introduction

Personal tools