<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>Lucene Nutch vertical search consulting</title>
	<atom:link href="http://www.supermind.org/feed" rel="self" type="application/rss+xml" />
	<link>http://www.supermind.org</link>
	<description>Lucene Nutch vertical search consulting</description>
	<pubDate>Tue, 02 Sep 2008 17:58:13 +0000</pubDate>
	<generator>http://wordpress.org/?v=abc</generator>
	<language>en</language>
			<item>
		<title>100% height iframes</title>
		<link>http://www.supermind.org/blog/526/100-height-iframes</link>
		<comments>http://www.supermind.org/blog/526/100-height-iframes#comments</comments>
		<pubDate>Sat, 30 Aug 2008 11:51:13 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/526/100-height-iframes</guid>
		<description><![CDATA[http://brondsema.net/blog/index.php/2007/06/06/100_height_iframe was a solution that worked for me after trying several out.
]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/526/100-height-iframes/feed</wfw:commentRss>
		</item>
		<item>
		<title>Using Hadoop IPC/RPC for distributed applications</title>
		<link>http://www.supermind.org/blog/520/using-hadoop-ipcrpc-for-distributed-applications</link>
		<comments>http://www.supermind.org/blog/520/using-hadoop-ipcrpc-for-distributed-applications#comments</comments>
		<pubDate>Mon, 02 Jun 2008 18:59:15 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[Lucene / Nutch]]></category>

		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/520</guid>
		<description><![CDATA[Hadoop is growing to be a pretty large framework - release 0.17.0 has 483 classes!
Previously, I&#8217;d written about Hadoop SequenceFile. SequenceFile is part of the org.apache.hadoop.io package, the other notable useful classes in that package being ArrayFile and MapFile which are persistent array and dictionary data structures respectively.
About Hadoop IPC
Here, I&#8217;m going to introduce the [...]]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/520/using-hadoop-ipcrpc-for-distributed-applications/feed</wfw:commentRss>
		</item>
		<item>
		<title>TREC 2007 Million Queries Track</title>
		<link>http://www.supermind.org/blog/515/trec-2007-million-queries-track</link>
		<comments>http://www.supermind.org/blog/515/trec-2007-million-queries-track#comments</comments>
		<pubDate>Sat, 10 May 2008 15:25:56 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/515</guid>
		<description><![CDATA[Just read about the IBM Haifa Team&#8217;s experiences in tweaking Lucene relevance for TREC.
via Jeff&#8217;s Search Engine Caffè
]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/515/trec-2007-million-queries-track/feed</wfw:commentRss>
		</item>
		<item>
		<title>Lucene Tutorial.com</title>
		<link>http://www.supermind.org/blog/514/lucene-tutorialcom</link>
		<comments>http://www.supermind.org/blog/514/lucene-tutorialcom#comments</comments>
		<pubDate>Fri, 25 Apr 2008 12:15:48 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/514</guid>
		<description><![CDATA[I&#8217;ve been maintaining a website dedicated to introducing Lucene to beginners. 
Check it out here: http://www.lucenetutorial.com
Feedback is always welcome, including topics you&#8217;d like to see written on.
]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/514/lucene-tutorialcom/feed</wfw:commentRss>
		</item>
		<item>
		<title>A Collection of JVM Options</title>
		<link>http://www.supermind.org/blog/513/a-collection-of-jvm-options</link>
		<comments>http://www.supermind.org/blog/513/a-collection-of-jvm-options#comments</comments>
		<pubDate>Thu, 24 Apr 2008 19:52:18 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/513</guid>
		<description><![CDATA[Just found this collection of JVM options which might prove handy one day.
]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/513/a-collection-of-jvm-options/feed</wfw:commentRss>
		</item>
		<item>
		<title>Limiting system cache size in Windows Server 2003</title>
		<link>http://www.supermind.org/blog/512/limiting-system-cache-size-in-windows-server-2003</link>
		<comments>http://www.supermind.org/blog/512/limiting-system-cache-size-in-windows-server-2003#comments</comments>
		<pubDate>Thu, 24 Apr 2008 12:08:37 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/512</guid>
		<description><![CDATA[On a consulting gig, I was recently asked to investigate a strange problem with a Lucene server on Windows Server 2003. 
The Lucene index was periodically refreshed by running a new instance of the app, then killing the old one via &#8220;taskkill&#8221;. Worked fine, except the available memory displayed by Task Manager somehow steadily decreased [...]]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/512/limiting-system-cache-size-in-windows-server-2003/feed</wfw:commentRss>
		</item>
		<item>
		<title>Is Nutch appropriate for aggregation-type vertical search?</title>
		<link>http://www.supermind.org/blog/507/is-nutch-appropriate-for-aggregation-type-vertical-search</link>
		<comments>http://www.supermind.org/blog/507/is-nutch-appropriate-for-aggregation-type-vertical-search#comments</comments>
		<pubDate>Mon, 24 Sep 2007 16:32:42 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[Lucene / Nutch]]></category>

		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/507</guid>
		<description><![CDATA[I get pinged all the time by people who tell me they want to build a vertical search engine with Nutch. The part I can&#8217;t figure out, though, is why Nutch?
What&#8217;s vertical anyway?
So let&#8217;s start from basics. Vertical search engines typically fall into 2 categories: 

Whole-web search engines which selectively crawl the Internet for webpages [...]]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/507/is-nutch-appropriate-for-aggregation-type-vertical-search/feed</wfw:commentRss>
		</item>
		<item>
		<title>Fuzzy string matching</title>
		<link>http://www.supermind.org/blog/430/fuzzy-string-matching</link>
		<comments>http://www.supermind.org/blog/430/fuzzy-string-matching#comments</comments>
		<pubDate>Wed, 03 Jan 2007 13:47:13 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/430</guid>
		<description><![CDATA[I&#8217;ve been recently peripherally involved in a project which attempts to perform a fuzzy match on names in a MySQL database. With &#8220;Homethinking&#8221;:http://www.homethinking.com, we had to do something similar regarding matching for realtor and brokerage names. Its also related to some of the Lucene consulting I&#8217;ve been involved with.
Its an interesting problem. There&#8217;s an article [...]]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/430/fuzzy-string-matching/feed</wfw:commentRss>
		</item>
		<item>
		<title>Exploring Hadoop SequenceFile</title>
		<link>http://www.supermind.org/blog/428/exploring-hadoop-sequencefile</link>
		<comments>http://www.supermind.org/blog/428/exploring-hadoop-sequencefile#comments</comments>
		<pubDate>Tue, 02 Jan 2007 23:14:13 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[Lucene / Nutch]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/428</guid>
		<description><![CDATA[Hadoop&#8217;s SequenceFile is at the heart of  the Hadoop io package. Both MapFile (disk-backed Map) and ArrayFile (disk-backed Array) are built on top of SequenceFile. 
So what exactly is SequenceFile? Its class javadoc tells us: Support for flat files of binary key/value pairs.- not very helpful. 
Let&#8217;s dig through the code and find out [...]]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/428/exploring-hadoop-sequencefile/feed</wfw:commentRss>
		</item>
		<item>
		<title>MySQL Falcon open-sourced</title>
		<link>http://www.supermind.org/blog/427/mysql-falcon-open-sourced</link>
		<comments>http://www.supermind.org/blog/427/mysql-falcon-open-sourced#comments</comments>
		<pubDate>Tue, 02 Jan 2007 22:49:33 +0000</pubDate>
		<dc:creator>Kelvin</dc:creator>
		
		<category><![CDATA[programming]]></category>

		<guid isPermaLink="false">http://www.supermind.org/blog/427</guid>
		<description><![CDATA[Just read that &#8220;MySQL Falcon storage engine&#8221;:http://www.mysql.org/doc/refman/5.1/en/se-falcon.html has been &#8220;open-sourced&#8221;:http://it.slashdot.org/article.pl?sid=07/01/02/209227.
http://mike.kruckenberg.com/archives/2006/04/jim_starkey_int.html has a really good, concise brief on Falcon and what it does.
]]></description>
		<wfw:commentRss>http://www.supermind.org/blog/427/mysql-falcon-open-sourced/feed</wfw:commentRss>
		</item>
	</channel>
</rss>
