Thoughts on Lucene, Solr, Nutch and vertical search 

Two simple optimizations to DP algorithm for calculating Levenstein edit distance

Posted by Kelvin on 01 Jul 2009 | Tagged as: programming

Levenstein/edit distance is a most often calculated using a dynamic programming (DP) algorithm.
The algorithm goes like this:

1. given 2 strings, s and t
2. instantiate d, an m x n matrix where m = length of s + 1 and n = length of t + 1
3. for each char in s
4. for each char in [...]

Trie-based approximate autocomplete implementation with support for ranks and synonyms

Posted by Kelvin on 01 Jul 2009 | Tagged as: programming


100% height iframes

Posted by Kelvin on 30 Aug 2008 | Tagged as: programming


Using Hadoop IPC/RPC for distributed applications

Posted by Kelvin on 02 Jun 2008 | Tagged as: Lucene / Nutch, programming


TREC 2007 Million Queries Track

Posted by Kelvin on 10 May 2008 | Tagged as: programming


Lucene Tutorial.com

Posted by Kelvin on 25 Apr 2008 | Tagged as: programming


A Collection of JVM Options

Posted by Kelvin on 24 Apr 2008 | Tagged as: programming


Limiting system cache size in Windows Server 2003

Posted by Kelvin on 24 Apr 2008 | Tagged as: programming


Is Nutch appropriate for aggregation-type vertical search?

Posted by Kelvin on 24 Sep 2007 | Tagged as: Lucene / Nutch, programming


Fuzzy string matching

Posted by Kelvin on 03 Jan 2007 | Tagged as: programming


Next Page »

07/04/08 | Kelvin Tan | Lucene Vertical Search Consultant