siren song

December 18, 2006 by Doug Cutting

Nutch developer Sami Siren seems to be diving into Hadoop, with his second post, this time examining the underutilized record facility. I’m hoping that, once we get a particular bug fixed, we’ll start using records for lots of Hadoop’s internals. Some fun cases will be replacing things like the source for IntWritable with something as simple as:

class IntWritable { int value; }

Hadoop’s made the news!

November 22, 2006 by Doug Cutting

I just spotted a complementary article about Hadoop, Lucene & Nutch.

objectivity, again

July 3, 2006 by Doug Cutting

Battelle’s blog has elicited a good discussion of search engine objectivity. I discussed this issue a while ago. One comment led to a good article (pdf) on the topic.

travel plans

April 24, 2006 by Doug Cutting

Next Thursday, I’ll be in San Francisco for the Nutch Meeting.

I’ll be in Helsinki for most of July, hosted by Wray Buntine, attending the International Workshop on Intelligent Information Access there July 6-8, among other things.

I’ll probably also attend the Open Source Information Retrieval workshop at SIGIR in August.

closing the loop

March 16, 2006 by Doug Cutting

A blast from my past. When I wrote that code I didn’t think anyone would ever read it, more less try to run it. I wrote the paper so that I could visit Barcelona, and thought that fleshing it out with code would impress the reviewers. It seemed to work. Barcelona was incredible. Folks danced to a crazy band playing on the plaza outside the cathedral after Easter mass. Traditional Catalan music, I guess. The “Flying Norwegians” kept me out all night at strange, unmarked clubs. Two men were fighting at 4am on the Ramblas. One hit the other over the head and he fell down, covered in blood. I thought he was dead until he rose up from the ambulance stretcher, swinging his fists, screaming, and ran off into the night. The culinary academy’s menu was only in Catalan, every dish a delectable surprise. Now someone’s actually read the code!