<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Elusive Information</title>
	<atom:link href="http://www.helenemartin.com/2010-06-elusive-information/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.helenemartin.com/2010-06-elusive-information/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=elusive-information</link>
	<description>You don&#039;t understand something until you&#039;ve taught a teenager to teach a computer to do it.</description>
	<lastBuildDate>Wed, 25 Jan 2012 03:01:53 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Hélène Martin</title>
		<link>http://www.helenemartin.com/2010-06-elusive-information/comment-page-1/#comment-242</link>
		<dc:creator>Hélène Martin</dc:creator>
		<pubDate>Sun, 20 Jun 2010 17:42:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.helenemartin.com/?p=337#comment-242</guid>
		<description>Insightful points.  You&#039;re absolutely right that there&#039;s a very real tradeoff between speed and usefulness.  I&#039;d be willing to wait five minutes for truly excellent results but that would be a suicide move on Google&#039;s part.

I agree that my first query was overly ambiguous... except Google &quot;sees&quot; me search for class materials day after day and saves my search history.  It&#039;d be nice if some of that could come into play.  Similarly, it knows when I&#039;ve clicked on a link and immediately left it or if I&#039;ve been shown a particular result before... then again, maybe I just think using that information would make my experience better but it actually wouldn&#039;t.  I&#039;m sure people much smarter than me have been thinking about these problems for a long time!  Google did have the up/down arrows for a bit which would have been great if only it weren&#039;t so easy to abuse.</description>
		<content:encoded><![CDATA[<p>Insightful points.  You&#8217;re absolutely right that there&#8217;s a very real tradeoff between speed and usefulness.  I&#8217;d be willing to wait five minutes for truly excellent results but that would be a suicide move on Google&#8217;s part.</p>
<p>I agree that my first query was overly ambiguous&#8230; except Google &#8220;sees&#8221; me search for class materials day after day and saves my search history.  It&#8217;d be nice if some of that could come into play.  Similarly, it knows when I&#8217;ve clicked on a link and immediately left it or if I&#8217;ve been shown a particular result before&#8230; then again, maybe I just think using that information would make my experience better but it actually wouldn&#8217;t.  I&#8217;m sure people much smarter than me have been thinking about these problems for a long time!  Google did have the up/down arrows for a bit which would have been great if only it weren&#8217;t so easy to abuse.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Clint Tseng</title>
		<link>http://www.helenemartin.com/2010-06-elusive-information/comment-page-1/#comment-240</link>
		<dc:creator>Clint Tseng</dc:creator>
		<pubDate>Sun, 20 Jun 2010 17:12:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.helenemartin.com/?p=337#comment-240</guid>
		<description>The problem here is not so much that search engines and their writers don&#039;t know how to make them more intelligent, it&#039;s that they don&#039;t know how to do so in an efficient manner. Mapreduce works well for finding keywords and creating a basic graph of what keywords point to what and in what order, because that&#039;s all static information that can be very quickly precalculated. Holistically parsing a document for meaning is an entirely different process that doesn&#039;t lend itself nearly as well to scale and parallelization.

And, on the other side of the pipe, 3-8 word queries generally don&#039;t give enough context to search engines for them to figure out which of two possible meanings you want. Even a human wouldn&#039;t necessarily be able to extract your intent out of your first query.

Sadly, RDF(a) won&#039;t really solve this problem, as it&#039;s really just about more keywords.

(PS &quot;while loop exercises&quot; seems to yield decent results)</description>
		<content:encoded><![CDATA[<p>The problem here is not so much that search engines and their writers don&#8217;t know how to make them more intelligent, it&#8217;s that they don&#8217;t know how to do so in an efficient manner. Mapreduce works well for finding keywords and creating a basic graph of what keywords point to what and in what order, because that&#8217;s all static information that can be very quickly precalculated. Holistically parsing a document for meaning is an entirely different process that doesn&#8217;t lend itself nearly as well to scale and parallelization.</p>
<p>And, on the other side of the pipe, 3-8 word queries generally don&#8217;t give enough context to search engines for them to figure out which of two possible meanings you want. Even a human wouldn&#8217;t necessarily be able to extract your intent out of your first query.</p>
<p>Sadly, RDF(a) won&#8217;t really solve this problem, as it&#8217;s really just about more keywords.</p>
<p>(PS &#8220;while loop exercises&#8221; seems to yield decent results)</p>
]]></content:encoded>
	</item>
</channel>
</rss>

