In a previous blog
I wrote about Nutch's MapReduce implementation, for distributed processing of massive data sets. This, and the closely related Nutch Distributed File System (renamed Hadoop Distributed File System), have now been moved into a standalone project called Hadoop.
According to Doug Cutting, who created Hadoop (as well as Lucene and Nutch), the name comes from:
The name my kid...
on Feb 8, 2006
Doug Cutting has done it again. The creator of Lucene and Nutch has implemented (with Mike Cafarella and others) a distributed platform for high volume data processing called MapReduce.
MapReduce is the brainchild of Google and is very well documented by Jeffrey Dean and Sanjay Ghemawat in their paper MapReduce: Simplified Data Processing on Large Clusters. In essence, it allows...
on Sep 25, 2005
This article in Infoworld: http://www.infoworld.com/article/05/07/20/HNmeshnetworks_1.html?source=NLC-TB2005-07-20 reports that 15 competing proposals will be whittled down to create the new IEEE 802.11s specification.
What's so great about a mesh network topology? A mesh network is a network in which the routing of messages is performed as a decentralized, cooperative process involving many peer...
on Jul 21, 2005
One of the promises of J2EE or Web Services is to allow individual "component" to be discovered and reused to form new business functions. But in reality, things are usually a bit more complicated that this.
In addition to matching the functional requirements of the component, one would need to match the non-functional requirements before a component can be reused. For example:
on Aug 30, 2004
Some of you may know me as the host of the cajo project. In fact, the topic of my blog entry today is that thanks to java.net; there are a lot more of you than I thought!
I was just informed about the logger project; it allows java.net project owners to view access statistics for their projects. What I found out astonished me. So much so, that I thought it important to share this discovery with...
on Aug 19, 2004
The server side has published an article by N. Alex Rupp on how to use OpenEJB for testing your EJB applications.
Advanced EJB Testing with OpenEJB
David Blevin's and I founded OpenEJB some five years ago to create a fast and lightweight EJB container system that could be easily embedded in any application (its even been used in handheld devices). I stayed with the project for the first couple of...
on Jun 11, 2004
Philip Brittan blogs about his various articles dealing with Security Strategies.
State is the second worst thing in distributed computing.
No state is the worst.
on Dec 16, 2003
Somnifugi JMS is an implementation of the Java Messaging Service built on top of Doug Lea's Channels. This JMS implementation runs inside a single JVM, quickly delivering messages between java Threads. A few years back, I created Somnifugi JMS to speed up a project where the architects had gone overboard with messaging. I used Somnifugi JMS to prototype and test the next project, and left it in...
on Sep 11, 2003
(After the flurry of blogging last week at JavaOne, I've had to concentrate on work for a while. But I have a lot of notes from the conference that I'll continue to write about for a while.)
My introduction to collaboration software was a talk by Pavel Curtis at a USENIX conference in January 1995. He talked about his research project, Jupiter, at Xerox PARC. That talk spoiled me ... none of...
on Jun 18, 2003
So, I just attended Edge East
2004, a Web Services Conference here in scenic Boston. Full
disclosure: I'm not exactly the world's biggest fan of the Web Services. My
ten cents going in: they solve a very complex problem in a very complex
way. And thus, they are to be feared. If you can, by any means,
solve a simpler problem in a simpler way, you'll be happier. If you can,
by any means, solve...
on Feb 26, 2004