Thursday, March 17, 2005

Google Code

Google has launched Google Code, Google's place for Open Source software.
The featured projects are selected applications that use a Google tool or API.
There are also links to some cool developer tools on the

Wednesday, March 16, 2005


Terrier (TERabyte RetrIEveR) is open source software from the University of Glasgow, released under the Mozilla Public License, for the rapid development of Web, intranet and desktop search engines. More generally, it is a modular platform for the rapid development of large-scale Information Retrieval applications, providing indexing and retrieval functionality. It is written in Java.

A9 Open Search

A9 has launched Open Search to syndicate vertical search results using RSS.
Could it be used as the user interface of my experimental search engines?
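
To get a feel for what syndicating search results over RSS might involve, here is a minimal sketch in Python that wraps a list of hits in a plain RSS 2.0 feed. The function name `results_to_rss`, the `title`/`link`/`snippet` keys and the `http://example.org/search` URL are all made up for illustration, and the sketch deliberately leaves out any extension elements that A9's Open Search format defines on top of RSS.

```python
import xml.etree.ElementTree as ET
from urllib.parse import quote_plus


def results_to_rss(query, results, base_url="http://example.org/search"):
    """Serialize search hits as a plain RSS 2.0 feed (hypothetical helper).

    `results` is a list of dicts with the keys "title", "link" and "snippet";
    this shape is an assumption for the sketch, not part of any specification.
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")

    # RSS 2.0 requires title, link and description on the channel.
    ET.SubElement(channel, "title").text = "Search results for: " + query
    ET.SubElement(channel, "link").text = base_url + "?q=" + quote_plus(query)
    ET.SubElement(channel, "description").text = (
        "Results returned by an experimental search engine"
    )

    # One <item> per hit, in ranked order.
    for hit in results:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = hit["title"]
        ET.SubElement(item, "link").text = hit["link"]
        ET.SubElement(item, "description").text = hit["snippet"]

    # ElementTree escapes characters such as "&" during serialization.
    return ET.tostring(rss, encoding="unicode")


if __name__ == "__main__":
    hits = [
        {
            "title": "Example result",
            "link": "http://example.org/doc/1",
            "snippet": "A short summary of the matching document.",
        }
    ]
    print(results_to_rss("information retrieval", hits))
```

If something like this works, any feed reader could act as a bare-bones front end: the engine only has to produce the feed, and the reader takes care of displaying the ranked items.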

Thursday, March 10, 2005


SIGMOD Online now offers an RSS feed with the news items posted on

"Learning in Web Search"

Wednesday, March 09, 2005


"MALLET is an integrated collection of Java code useful for statistical natural language processing, document classification, clustering, information extraction, and other machine learning applications to text".
It seems that MALLET has superseded the widely used Rainbow toolkit. Its most important new feature is probably the implementation of finite-state Conditional Random Fields. However, the current version of MALLET does not yet include Support Vector Machines.

Sunday, March 06, 2005


I have started using Bloglines to read blog feeds that I am interested in.