<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Computer Vision For Dummies &#187; image search</title>
	<atom:link href="http://inperc.com/blog2/index.php/category/image-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://inperc.com/blog2</link>
	<description>Computer vision and image analysis for newcomers</description>
	<lastBuildDate>Fri, 18 Jun 2010 00:00:31 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Image-to-image search: a case study</title>
		<link>http://inperc.com/blog2/2010/04/05/image-to-image-search-a-case-study/</link>
		<comments>http://inperc.com/blog2/2010/04/05/image-to-image-search-a-case-study/#comments</comments>
		<pubDate>Tue, 06 Apr 2010 02:35:34 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[computer vision/machine vision/AI]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[updates]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/?p=400</guid>
		<description><![CDATA[This study was conducted in 2009 for a company that is “working in the online social media sector and are looking for an accurate image analysis solution that allows us to compare a reference photo to a large dataset of photos to determine if the reference photo is duplicated in the larger dataset.”
The full title [...]]]></description>
			<content:encoded><![CDATA[<p>This study was conducted in 2009 for a company that is “working in the online social media sector and are looking for an accurate image analysis solution that allows us to compare a reference photo to a large dataset of photos to determine if the reference photo is duplicated in the larger dataset.”</p>
<p>The full title of the report is <a class="external text" title="http://inperc.com/files/Image-to-image_search.pdf" rel="nofollow" href="http://inperc.com/files/Image-to-image_search.pdf">&#8220;Image-to-image search with Pixcavator (PxSearch): a case study&#8221;</a>. It was written by Dr. Ash Pahwa and myself and is presented here with minor modifications.</p>
<p>The first version of PxSearch was created in 2007. Using that version, initially the search results with the collection had 4-5 good hits (i.e., the transformed version of the original) at the top and then some bad hits. Some of the good matches weren&#8217;t even visible. After the upgrades, the results became 10 out of 10 or close. This improvement made this, more extensive, study possible. The results are OK, even though the collections are still very small. The company eventually went with another vendor, it’s still an interesting document to browse through.</p>
<p>Since 2009, there has been no work going on but, hopefully, this project will be one of the <a title="Computational science training: 2010 projects" href="/wiki/index.php?title=Computational_science_training:_2010_projects">summer projects</a> for the REU site.</p>
<p>Incidentally, I don’t like the term “reverse image search” popularized by TinEye. If the image search that we are used to at Google etc is “direct image search” (text-to-image) then the “reverse image search” is supposed to search for text based on images. Not only this isn’t what we are talking about, but also the problem hasn’t been even remotely solved (see this pathetic list: <a title="Visual image search engines" href="/wiki/index.php?title=Visual_image_search_engines">Visual image search engines</a>). This is the reason I prefer “image-to-image search” to describe this application.</p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2010/04/05/image-to-image-search-a-case-study/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Riya shut down. Big surprise!</title>
		<link>http://inperc.com/blog2/2009/08/15/riya-shut-down-big-surprise/</link>
		<comments>http://inperc.com/blog2/2009/08/15/riya-shut-down-big-surprise/#comments</comments>
		<pubDate>Sat, 15 Aug 2009 13:04:57 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[computer vision/machine vision/AI]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[news]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/2009/08/15/riya-shut-down-big-surprise/</guid>
		<description><![CDATA[TechCrunch deadpools Riya. Certainly, no surprise to me. Their technology was never been impressive (posts are here). Like.com remains but does not seem to be going anywhere&#8230;
At least TechCrunch announced this death after promoting Riya for 3 years. Others have died or will die more quietly.
]]></description>
			<content:encoded><![CDATA[<p>TechCrunch <a href="http://www.techcrunch.com/2009/08/14/a-sad-day-goodbye-riya/">deadpools</a> Riya. Certainly, no surprise to me. Their <a href="http://inperc.com/wiki/index.php?title=Visual_image_search_engines#Riya">technology</a> was never been impressive (posts are <a href="http://inperc.com/blog2/index.php?s=Riya">here</a>). Like.com remains but does not seem to be going anywhere&#8230;</p>
<p>At least TechCrunch announced this death after promoting Riya for 3 years. Others have died or will die more quietly.</p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2009/08/15/riya-shut-down-big-surprise/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Similar Image Search reviewed</title>
		<link>http://inperc.com/blog2/2009/04/29/google-similar-image-search-reviewed/</link>
		<comments>http://inperc.com/blog2/2009/04/29/google-similar-image-search-reviewed/#comments</comments>
		<pubDate>Wed, 29 Apr 2009 00:23:23 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[image search]]></category>
		<category><![CDATA[reviews]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/2009/04/29/google-similar-image-search-reviewed/</guid>
		<description><![CDATA[I was about to review the newly released Google Similar Image Search when I ran across this one. The verdict: not so good.
The guy does not seem to realize though that Microsoft released its own similarity search a few months before. I am not judging because I missed it myself when it came out. It [...]]]></description>
			<content:encoded><![CDATA[<p>I was about to review the newly released Google Similar Image Search when I ran across <a href="http://www.synapticacentral.com/content/content-based-image-retrieval-google-and-similar-image-search">this one</a>. The verdict: <strong>not so good</strong>.</p>
<p>The guy does not seem to realize though that Microsoft released its own similarity search a few months before. I am not judging because I missed it myself when it came out. It would be interesting to test and see which one is better (or not as bad). One point in favor of Microsoft is that Google didn’t index all images.</p>
<p>UPDATE: Another good revew at <a href="http://richmarr.wordpress.com/2009/04/21/google-image-similarity-first-impressions/">Rich Marr&#8217;s Tech Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2009/04/29/google-similar-image-search-reviewed/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Performance evaluation of image search: precision and recall (continued)</title>
		<link>http://inperc.com/blog2/2009/03/13/performance-evaluation-of-image-search-precision-and-recall-continued/</link>
		<comments>http://inperc.com/blog2/2009/03/13/performance-evaluation-of-image-search-precision-and-recall-continued/#comments</comments>
		<pubDate>Fri, 13 Mar 2009 13:55:46 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[image search]]></category>
		<category><![CDATA[mathematics]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/2009/03/13/performance-evaluation-of-image-search-precision-and-recall-continued/</guid>
		<description><![CDATA[In the previous post:
Recall = Number of retrieved images that are also relevant / Total number of relevant images.
Precision = Number of retrieved images that are also relevant / Total number of retrieved images.
In Pixcavator Search, the matches are simply ordered based on their distance from the query (just like Google). Then we need choose [...]]]></description>
			<content:encoded><![CDATA[<p>In the <a href="http://inperc.com/blog2/2009/03/04/performance-evaluation-of-image-search-precision-and-recall/">previous post</a>:</p>
<p>Recall = Number of retrieved images that are also relevant / Total number of relevant images.</p>
<p>Precision = Number of retrieved images that are also relevant / Total number of retrieved images.</p>
<p>In <a href="http://www.inperc.com/wiki/index.php?title=Pixcavator_image_search">Pixcavator Search</a>, the matches are simply ordered based on their distance from the query (just like Google). Then we need choose a cut-off. In the example considered last time, the cut-off was implicitly “all that fit in one page”. This is a reasonable standard for user oriented applications. For experimentation and testing, however, we may want to use the distance instead. For example, below I may choose the cut-off distance of 80: all images within 80 from the query are declared matches and retrieved, the rest are not. Then recall = 7/8, precision = 7/14.</p>
<p><img style="width: 617px; height: 709px" height="709" src="http://inperc.com/wiki/images/a/a0/PxSearch-screenshot.jpg" width="617" /></p>
<p>The choice was made based on the examination of the search results for this particular image in an attempt to include as many as possible of “good” matches and, at the same time, to exclude as many as possible of the “bad” matches. More experimentation showed that 80 works OK for other queries as well. In general, however, this is not to be expected.</p>
<p>My conclusion is that the main drawback of precision and recall as a measure quality of the algorithm is that it requires a cut-off to separate the retrieved images from the rest. Then, the evaluation results depend on this choice. In fact, this measure ends up to be a measure of the quality of the query image, not the algorithm.</p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2009/03/13/performance-evaluation-of-image-search-precision-and-recall-continued/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Is Microsoft to release a visual image search engine?</title>
		<link>http://inperc.com/blog2/2009/03/06/is-microsoft-to-release-a-visual-image-search-engine/</link>
		<comments>http://inperc.com/blog2/2009/03/06/is-microsoft-to-release-a-visual-image-search-engine/#comments</comments>
		<pubDate>Fri, 06 Mar 2009 15:22:51 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[image search]]></category>
		<category><![CDATA[news]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/2009/03/06/is-microsoft-to-release-a-visual-image-search-engine/</guid>
		<description><![CDATA[
Technology News asks: &#8220;Will Microsoft&#8217;s Kumo Bring New Visual Dimension to Search?&#8221; and answers: “Microsoft seems to be amping up visual search capabilities in its upcoming Kumo search engine, if leaked screenshots are any indication.”
Well, they aren’t (see for yourself here). And neither is the leaked email.
The reporter seems to have been swayed by the [...]]]></description>
			<content:encoded><![CDATA[<p><img style="width: 576px; height: 324px" height="324" src="http://kara.allthingsd.com/files/2009/03/kumo.jpg" width="576" /></p>
<p>Technology News asks: &#8220;<a href="http://www.technewsworld.com/story/Will-Microsofts-Kumo-Bring-New-Visual-Dimension-to-Search-66366.html">Will Microsoft&#8217;s Kumo Bring New Visual Dimension to Search?</a>&#8221; and answers: “Microsoft seems to be amping up visual search capabilities in its upcoming Kumo search engine, if leaked screenshots are any indication.”</p>
<p>Well, they aren’t (see for yourself <a href="http://kara.allthingsd.com/20090302/a-sneak-peek-look-at-microsofts-new-kumo/">here</a>). And neither is the leaked email.</p>
<p>The reporter seems to have been swayed by the CEO of <a href="http://inperc.com/wiki/index.php?title=Visual_image_search_engines#Imprezzeo">Imprezzeo</a>, a company offering their own image-to-image search engine.</p>
<p>Microsoft is certainly capable of doing that, e.g., <a href="http://inperc.com/wiki/index.php?title=Visual_image_search_engines#Lincoln_from_MS_Research">Lincoln</a>. Incidentally, Imprezzeo’s site has only a video demo.</p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2009/03/06/is-microsoft-to-release-a-visual-image-search-engine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Performance evaluation of image search: precision and recall</title>
		<link>http://inperc.com/blog2/2009/03/04/performance-evaluation-of-image-search-precision-and-recall/</link>
		<comments>http://inperc.com/blog2/2009/03/04/performance-evaluation-of-image-search-precision-and-recall/#comments</comments>
		<pubDate>Wed, 04 Mar 2009 16:00:25 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[computer vision/machine vision/AI]]></category>
		<category><![CDATA[image search]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/2009/03/04/performance-evaluation-of-image-search-precision-and-recall/</guid>
		<description><![CDATA[In a recent post I discussed Pixcavator Search, prototype image-to-image software. As it searches for images based on similarity, a potential application of this software is a search for copyrighted images. With numerous application of this type out there, the discussion of performance evaluation is usually absent.
Let’s consider the performance evaluation commonly used in information [...]]]></description>
			<content:encoded><![CDATA[<p>In a <a href="http://inperc.com/blog2/2009/01/25/pixcavator-image-search-11/">recent post</a> I discussed Pixcavator Search, prototype image-to-image software. As it searches for images based on similarity, a potential application of this software is a search for copyrighted images. With numerous <a href="http://inperc.com/wiki/index.php?title=Visual_image_search_engines">application of this type</a> out there, the discussion of performance evaluation is usually absent.</p>
<p>Let’s consider the performance evaluation commonly used in information retrieval. Below, we just replace “document” with “image”.</p>
<p>For a given query image, we assume the following:</p>
<ul>
<li>There are M relevant images in the collection = correct matches.</li>
<li>When the query is executed, N images are retrieved.</li>
<li>Out of those only R are relevant.</li>
</ul>
<p><img src="http://inperc.com/wiki/images/a/a2/Precision-recall.jpg" /> </p>
<p>Then the following two measurements quantify the quality of the search:</p>
<p><em>Recall</em> = R / M = Number of retrieved images that are also relevant / Total number of relevant images.</p>
<p><em>Precision</em> = R / N = Number of retrieved images that are also relevant / Total number of retrieved images.</p>
<p>The recall is the answer to the question: How close am I to getting <em>all</em> good matches? The precision is the answer to the question: How close am I to getting <em>only</em> good matches?</p>
<p>In the example below, lenaC.jpg is the query and the program searched for its modified versions (7 modified versions of the original: stretched, rotated, noised, etc). Then N = 27, M = 8, R = 7, so recall = 7/8 and precision = 7/27.</p>
<p><img style="width: 617px; height: 709px" height="709" src="http://inperc.com/wiki/images/a/a0/PxSearch-screenshot.jpg" width="617" /></p>
<p>Ideally, the value of recall and precision should each be equal to 1. However, in reality they go in the opposite directions. When the query is broad, the recall is high, but precision is low. When the query is restrictive, the precision is high and recall is low. We have all experienced this effect searching Google, Yahoo, etc.</p>
<p>But what does “restrictive” mean in the image search context? The query is an image, so what is a “restrictive image”?</p>
<p>Turns out that, for this particular algorithm, image with strong, distinctive features are best. This is certainly very vague. More specifically, these are images with large, high contrast objects. They should also have enough of these objects. For example, one large object, or several medium ones, or the whole image filled with dots. The last one isn’t something you’d call an image of good quality but it is very stable under noise, blur, and other transformations.</p>
<p>I’ll write some more on this. Meanwhile, this is the <a href="http://en.wikipedia.org/wiki/Precision_and_recall">Wikipedia article about precision and recall</a>. The article provides a probabilistic interpretation of precision and recall.</p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2009/03/04/performance-evaluation-of-image-search-precision-and-recall/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Object recognition demo from Numenta</title>
		<link>http://inperc.com/blog2/2009/02/16/object-recognition-demo-from-numenta/</link>
		<comments>http://inperc.com/blog2/2009/02/16/object-recognition-demo-from-numenta/#comments</comments>
		<pubDate>Mon, 16 Feb 2009 18:01:19 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[computer vision/machine vision/AI]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[rants]]></category>
		<category><![CDATA[reviews]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/2009/02/16/object-recognition-demo-from-numenta/</guid>
		<description><![CDATA[The link to this demo was sent to me by Ricardo Niederberger Cabral (thanks!). The demo program is called Vision4 and was created by Numenta. This is its main point:
This program demonstrates some capabilities of Numenta&#8217;s Hierarchical Temporal Memory (HTM) technology applied to visual object recognition. .. The HTM network contained in this demo has [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://www.numenta.com/about-numenta/technology/vision4-demo.php">link</a> to this demo was sent to me by Ricardo Niederberger Cabral (thanks!). The demo program is called Vision4 and was created by Numenta. This is its main point:</p>
<blockquote><p>This program demonstrates some capabilities of Numenta&#8217;s Hierarchical Temporal Memory (HTM) technology applied to visual object recognition. .. The HTM network contained in this demo has been trained to recognize four types of objects: cell phones, sailboats, cows, and rubber ducks.</p></blockquote>
<p>Every image is given four ratings. Each represents how much the image resembles one of the four types.</p>
<p>As you can see, the goal is modest and there are no <a href="http://inperc.com/blog2/2009/02/10/image-search-engines-keep-launching-milabra/">unsubstantiated claims</a> of how this is ready to be applied in real life (and don’t get me started on academic publications!). This is refreshing. The program is also fun to play with. You can load your own images, you can add noise, blur etc to the images and see the effect on the recognition. The recognition results are often good and when they aren’t, it’s still interesting.</p>
<p>For serious purposes, it is unclear where this is going though.</p>
<p>It’s fine with me that there are only four categories – just one would be enough to test the concept. It does not bother me when a face is rated high in the cow category and another face high in the duck category. My main complaint is the instability of recognition under image transformations. For example, after turning “sailboat” a few degrees it became “cell phone”. A few degrees more and it becomes mixed &#8211; half “cow” (first image below). Adding noise, occlusion, etc has similar effect (second image).</p>
<p><img style="width: 287px; height: 199px" height="199" src="http://inperc.com/wiki/images/0/0c/Numenta_screenshot_1.jpg" width="287" /><img style="width: 282px; height: 197px" height="197" src="http://inperc.com/wiki/images/5/5f/Numenta_screenshot_2.jpg" width="282" /></p>
<p>Certainly, one does not expect rotations to affect image recognition. Meanwhile, a mixed recognition is a failed recognition and should be presented as such.</p>
<p>I am certainly biased here. I don’t believe in “build[ing] machines that work on principles used by the brain”. I don’t believe in trying to imitate brain and <a href="http://inperc.com/blog2/index.php?s=brain+inspired">I’ve written a few times about that</a>. Traditionally, a scientist tries to understand nature by observing it, analyzing it, etc. Instead, it is suggested to try to understand nature by first understanding how the brain understands it? Seems like a roundabout to me, bordering on a vicious circle. I also have serious reservations about the use of <a href="http://inperc.com/wiki/index.php?title=Machine_learning_in_computer_vision">machine learning in computer vision</a>.</p>
<p>Annoying bug: every time I start it, the program would turn on my webcam and it would keep it on even after I shut it down.</p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2009/02/16/object-recognition-demo-from-numenta/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Image search engines keep launching: Milabra</title>
		<link>http://inperc.com/blog2/2009/02/10/image-search-engines-keep-launching-milabra/</link>
		<comments>http://inperc.com/blog2/2009/02/10/image-search-engines-keep-launching-milabra/#comments</comments>
		<pubDate>Tue, 10 Feb 2009 02:26:43 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[computer vision/machine vision/AI]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[rants]]></category>
		<category><![CDATA[reviews]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/2009/02/10/image-search-engines-keep-launching-milabra/</guid>
		<description><![CDATA[TechCrunch is happy to do PR for another visual search company: Milabra.
Milabra claims that it can categorize images, “from puppies to porn”:
…when searching through a library of images for dogs, Milabra doesn’t need to constantly compare each image with its database of known ‘dog’ images &#8211; instead, it can look for traits that it has learned to [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.techcrunch.com/2009/02/02/milabra-b2b-image-recognition-service-learns-to-find-anything-from-puppies-to-porn/">TechCrunch</a> is happy to do PR for another visual search company: Milabra.</p>
<p>Milabra claims that it can categorize images, “from puppies to porn”:</p>
<blockquote><p><em>…when searching through a library of images for dogs, Milabra doesn’t need to constantly compare each image with its database of known ‘dog’ images &#8211; instead, it can look for traits that it has learned to associate with “doggyness”…</em></p></blockquote>
<p>The two examples in the demo are “beach” and “dog”. You upload an image with people on the beach, click “Search” and you get a page of beach photos&#8230; Wait, you don’t get to upload anything – this is just a video! So, there is no way to test their claims. Unfortunately, this is <a href="http://inperc.com/wiki/index.php?title=Visual_image_search_engines">not unusual</a> in this area and in computer vision in general.</p>
<p>If your software can recognize a puppy in an image (95% of the time as you claim), it should be easy for you to demonstrate this ability. Create a little web application (or desktop, I don’t care) that allows me to upload my own image which is then identified as “puppy” (or “tree”, or “street”, I don&#8217;t care). There is no such program. Why not? The answer is obvious.</p>
<p>In response to some skepticism, this is what one of the founders wrote:</p>
<blockquote><p><em>&#8230;if you think that this cannot be done, then you are completely clueless: object classifiers have been made for more than 10 years now at leading CS labs around the world.</em></p></blockquote>
<p>That reminds me of the episode of <em>Seinfeld</em> when Kramer decides to build <strong>levels</strong> in his apartment:</p>
<blockquote><p><em>KRAMER: It&#8217;s a simple job. Why, you don&#8217;t think I can?</em></p>
<p><em>JERRY: Oh, no. It&#8217;s not that I don&#8217;t think you can. I know that you can&#8217;t, and I&#8217;m positive that you won&#8217;t.</em></p></blockquote>
<p>This is Millabra’s team:</p>
<ul>
<li>MBA</li>
<li>MS in Biological Engineering and PhD in neuroscience</li>
<li>MS in Computer Science and Ph.D. in Biophysics</li>
<li>Professional Project Manager</li>
<li>Expert in computer networking, user interface design</li>
</ul>
<blockquote><p><em>JERRY: I don&#8217;t see it happening.</em></p></blockquote>
<p>And what about TechCrunch? Same story again and again since I started to keep track a couple of years ago: they publish an enthusiastic report about a company doing image analysis/search/recognition, and then silence. The company slips into obscurity and there is no follow-up, nothing. These people never learn…</p>
<p>The people who do seem to learn, slowly, are the investors: Riya (like.com) $20 million or more, Polar Rose $5 million, Milabra $1.4 million. Or maybe this is just the effect of the economic downturn?</p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2009/02/10/image-search-engines-keep-launching-milabra/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pixcavator Image Search 1.1</title>
		<link>http://inperc.com/blog2/2009/01/25/pixcavator-image-search-11/</link>
		<comments>http://inperc.com/blog2/2009/01/25/pixcavator-image-search-11/#comments</comments>
		<pubDate>Sun, 25 Jan 2009 19:49:36 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[image search]]></category>
		<category><![CDATA[software releases]]></category>
		<category><![CDATA[updates]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/2009/01/25/pixcavator-image-search-11/</guid>
		<description><![CDATA[This has been an on-and-off project for almost two years (version 1.0 described here). The purpose is simple: find images similar to a given image. Since it is not even well understood what images are similar, the progress in this area of “image-to-image” search (aka “visual image search”) is very slow&#8211;. So, instead, we focus [...]]]></description>
			<content:encoded><![CDATA[<p>This has been an on-and-off project for almost two years (version 1.0 described <a href="http://inperc.com/wiki/index.php?title=Image-to-image_search">here</a>). The purpose is simple: find images similar to a given image. Since it is not even well understood what images are similar, the progress in this area of “image-to-image” search (aka “visual image search”) is very slow&#8211;. So, instead, we focus on the goal of <strong>finding modified versions of the original</strong>. This release is a way to report a limited success we have achieved.The executable PxSearch.exe is accompanied by a small collection of images (download <a href="http://inperc.com/files/PxSearch.zip">here</a>, 7.2 MB). The system consists of the following modules:</p>
<ul>
<li>the collection of images that can be extended; </li>
<li>the database containing “signatures” of images, images’ origins, and other data; </li>
<li>the image analysis unit (produces the signatures); </li>
<li>the matching unit (matches the signatures); </li>
<li>user interface (uploads an image, searches for similar images in the collection, displays the matches as a list);</li>
</ul>
<p>For every image to be added, first the image is converted to <em>grayscale</em> and then shrunk so that the larger dimension is 150. Then several of its <em>secondary versions</em> are created, analyzed, and added to the collection and their data is added to the database, total of 8:</p>
<ol>
<li>original </li>
<li>rotation, 5 degrees </li>
<li>rotation, 45 degrees </li>
<li>Gaussian blur </li>
<li>salt and pepper noise </li>
<li>stretch, 5% </li>
<li>shrink, 5% </li>
<li>crop from all sides, 5%</li>
</ol>
<p>The entry in the <em>database</em> for each image contains the information about its origin:</p>
<ul>
<li>date and time, </li>
<li>the filename of the original image, </li>
<li>the way the image was produced from the original (shrinking, rotation, etc), </li>
<li>the signature of the image.</li>
</ul>
<p>A <em>signature</em> is a sequence of 126 integers which is the output of image analysis: it is essentially the distribution of sizes of objects found in the image (the data comes form the same source as for <a href="http://inperc.com/wiki/index.php?title=Image_analysis">Pixcavator</a>).</p>
<p>Suppose the signature of the two images are {An} and {Bn}. Move along these sequences and compute the absolute value of the differences of n-th entries. The result is a <em>distance formula</em> as the “weighted 1-norm metric”:</p>
<p><em>D = Σ Cn |An &#8211; Bn|.</em></p>
<p><img style="width: 617px; height: 709px;" src="http://inperc.com/wiki/images/a/a0/PxSearch-screenshot.jpg" alt="" width="617" height="709" /></p>
<p>A search is deemed successful if most of the versions of the query image are at the top of the list. This is the case for images that are “good” in the sense that they have clear pattern (based on shapes not color). However, this standard is hard to <a href="http://en.wikipedia.org/wiki/Precision_and_recall">quantify</a> as it is dependent on the collection. Since the collection I used for testing was small (4500 images), I had to find a way to evaluate the quality of searches that is independent of the size of the collection, as much as possible. So, the quality score for a given image was</p>
<p><em>(average distance to its 7 versions) / (average distance to all images) * 100.</em></p>
<p>There are many interesting question to study based on this data and I will report further.</p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2009/01/25/pixcavator-image-search-11/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Image-to-image search: Gazopa</title>
		<link>http://inperc.com/blog2/2009/01/12/image-to-image-search-gazopa/</link>
		<comments>http://inperc.com/blog2/2009/01/12/image-to-image-search-gazopa/#comments</comments>
		<pubDate>Mon, 12 Jan 2009 05:19:17 +0000</pubDate>
		<dc:creator>Peter</dc:creator>
				<category><![CDATA[computer vision/machine vision/AI]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[reviews]]></category>

		<guid isPermaLink="false">http://inperc.com/blog2/2009/01/12/image-to-image-search-gazopa/</guid>
		<description><![CDATA[Gazopa is a new visual search engine that is “a venture project inside Hitachi”.
I tried its Facebook application. I uploaded a few standard images and a few test images of my own and ran Gazopa. Some of the matches were awful while others were sort of meaningful. See for yourselves. The first match is displayed under [...]]]></description>
			<content:encoded><![CDATA[<p>Gazopa is a new visual search engine that is “a venture project inside Hitachi”.</p>
<p>I tried its <a href="http://apps.facebook.com/gazopa_book/">Facebook application</a>. I uploaded a few standard images and a few test images of my own and ran Gazopa. <strong>Some of the matches were awful while others were sort of meaningful.</strong> See for yourselves. The first match is displayed under the target image.</p>
<p><img src="http://inperc.com/wiki/images/b/b9/Gazopa1.jpg" /></p>
<p><img src="http://inperc.com/wiki/images/4/48/Gazopa2.jpg" /></p>
<p><img src="http://inperc.com/wiki/images/d/d9/Gazopa3.jpg" /></p>
<p><img src="http://inperc.com/wiki/images/0/04/Gazopa4.jpg" align="right" /></p>
<p>Gazopa also found a cropped copy of the “cameraman”, but not a rotated copy. The inability to handle rotations is a common problem with almost all <a href="http://inperc.com/wiki/index.php?title=Visual_image_search_engines">visual search engines</a>. Pixcavator Image Search can handle rotations with ease (read about it <a href="http://inperc.com/wiki/index.php?title=Image-to-image_search">here</a> or wait for the last version &#8211; to be released soon).</p>
<p>As far as the underlying technology, the site says that “GazoPa enables users to search for a similar image from characteristics such as a color or a shape extracted from an image itself” and nothing more. So, even what they consider similar is unknown. </p>
]]></content:encoded>
			<wfw:commentRss>http://inperc.com/blog2/2009/01/12/image-to-image-search-gazopa/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
