<?xml version="1.0" encoding="utf-8"?><!-- generator="WordPress/abc" -->
<rss version="0.92">
<channel>
	<title>Lucene Nutch vertical search consulting</title>
	<link>http://www.supermind.org</link>
	<description>Lucene Nutch vertical search consulting</description>
	<lastBuildDate>Tue, 02 Sep 2008 17:58:13 +0000</lastBuildDate>
	<docs>http://backend.userland.com/rss092</docs>
	<language>en</language>
	
	<item>
		<title>100% height iframes</title>
		<description><![CDATA[http://brondsema.net/blog/index.php/2007/06/06/100_height_iframe was a solution that worked for me after trying several out.
]]></description>
		<link>http://www.supermind.org/blog/526/100-height-iframes</link>
			</item>
	<item>
		<title>Using Hadoop IPC/RPC for distributed applications</title>
		<description><![CDATA[Hadoop is growing to be a pretty large framework - release 0.17.0 has 483 classes!
Previously, I&#8217;d written about Hadoop SequenceFile. SequenceFile is part of the org.apache.hadoop.io package, the other notable useful classes in that package being ArrayFile and MapFile which are persistent array and dictionary data structures respectively.
About Hadoop IPC
Here, I&#8217;m going to introduce the [...]]]></description>
		<link>http://www.supermind.org/blog/520/using-hadoop-ipcrpc-for-distributed-applications</link>
			</item>
	<item>
		<title>TREC 2007 Million Queries Track</title>
		<description><![CDATA[Just read about the IBM Haifa Team&#8217;s experiences in tweaking Lucene relevance for TREC.
via Jeff&#8217;s Search Engine Caffè
]]></description>
		<link>http://www.supermind.org/blog/515/trec-2007-million-queries-track</link>
			</item>
	<item>
		<title>Lucene Tutorial.com</title>
		<description><![CDATA[I&#8217;ve been maintaining a website dedicated to introducing Lucene to beginners. 
Check it out here: http://www.lucenetutorial.com
Feedback is always welcome, including topics you&#8217;d like to see written on.
]]></description>
		<link>http://www.supermind.org/blog/514/lucene-tutorialcom</link>
			</item>
	<item>
		<title>A Collection of JVM Options</title>
		<description><![CDATA[Just found this collection of JVM options which might prove handy one day.
]]></description>
		<link>http://www.supermind.org/blog/513/a-collection-of-jvm-options</link>
			</item>
	<item>
		<title>Limiting system cache size in Windows Server 2003</title>
		<description><![CDATA[On a consulting gig, I was recently asked to investigate a strange problem with a Lucene server on Windows Server 2003. 
The Lucene index was periodically refreshed by running a new instance of the app, then killing the old one via &#8220;taskkill&#8221;. Worked fine, except the available memory displayed by Task Manager somehow steadily decreased [...]]]></description>
		<link>http://www.supermind.org/blog/512/limiting-system-cache-size-in-windows-server-2003</link>
			</item>
	<item>
		<title>Is Nutch appropriate for aggregation-type vertical search?</title>
		<description><![CDATA[I get pinged all the time by people who tell me they want to build a vertical search engine with Nutch. The part I can&#8217;t figure out, though, is why Nutch?
What&#8217;s vertical anyway?
So let&#8217;s start from basics. Vertical search engines typically fall into 2 categories: 

Whole-web search engines which selectively crawl the Internet for webpages [...]]]></description>
		<link>http://www.supermind.org/blog/507/is-nutch-appropriate-for-aggregation-type-vertical-search</link>
			</item>
	<item>
		<title>Fuzzy string matching</title>
		<description><![CDATA[I&#8217;ve been recently peripherally involved in a project which attempts to perform a fuzzy match on names in a MySQL database. With &#8220;Homethinking&#8221;:http://www.homethinking.com, we had to do something similar regarding matching for realtor and brokerage names. Its also related to some of the Lucene consulting I&#8217;ve been involved with.
Its an interesting problem. There&#8217;s an article [...]]]></description>
		<link>http://www.supermind.org/blog/430/fuzzy-string-matching</link>
			</item>
	<item>
		<title>Exploring Hadoop SequenceFile</title>
		<description><![CDATA[Hadoop&#8217;s SequenceFile is at the heart of  the Hadoop io package. Both MapFile (disk-backed Map) and ArrayFile (disk-backed Array) are built on top of SequenceFile. 
So what exactly is SequenceFile? Its class javadoc tells us: Support for flat files of binary key/value pairs.- not very helpful. 
Let&#8217;s dig through the code and find out [...]]]></description>
		<link>http://www.supermind.org/blog/428/exploring-hadoop-sequencefile</link>
			</item>
	<item>
		<title>MySQL Falcon open-sourced</title>
		<description><![CDATA[Just read that &#8220;MySQL Falcon storage engine&#8221;:http://www.mysql.org/doc/refman/5.1/en/se-falcon.html has been &#8220;open-sourced&#8221;:http://it.slashdot.org/article.pl?sid=07/01/02/209227.
http://mike.kruckenberg.com/archives/2006/04/jim_starkey_int.html has a really good, concise brief on Falcon and what it does.
]]></description>
		<link>http://www.supermind.org/blog/427/mysql-falcon-open-sourced</link>
			</item>
</channel>
</rss>
