Tom White
Tom White is a committer on the Apache Hadoop project, and a member of the Lucene Project Management Committee. He works as an independent consultant specializing in Hadoop and distributed computing. He has been writing Java full time since 1996, and writing about Java since 2003 for O'Reilly, java.net and IBM's developerWorks. Outside programming Tom enjoys making his daughters laugh, and watching 1930s Hollywood films.
Articles
In the second part of this look at the Nutch web indexing and search engine, Tom White looks at how to perform searches on the index generated in part one's crawl, and shows how to integrate Nutch's search capabilities with your applications through direct Java calls to its API or via the...
Do you need your own search engine, when the world already has Google? Quite possibly so: you may belong to an organization with enough of its own contents that you want to manage and run your own search engine--and know how it works. Nutch is an open source search engine written in Java. In this...
All modern search engines attempt to detect and correct spelling errors in users' search queries. This article shows you one way of adding a "did you mean" suggestion facility to your own search applications using the Lucene Spell Checker.
Parallel computing allows some programs to run faster by dividing them up into smaller pieces and running these pieces on multiple processors. ComputeFarm is an open source Java framework for developing and running parallel programs.



