Yahoo are claiming that they now have an index for their search engine which covers 20.8 billion pages. This is larger than the previously known largest claim by Google of 11 billion.
Getting started with JDO
Despite the ongoing development and momentum building with EJB3.0 and it’s radically simplified persistence approach (similar to Hibernate), the JDO spec is also still alive and kicking as another alternative.
Sun have an article on their site giving an overview and quick introduction to JDO along with code examples.
Luke – Lucene Index Toolbox
Luke is a development and debugging tool that helps you work with Lucene generated indexes. The tool gives you a GUI frontend to allow you to browse indexes generated by Lucene and run queries against the index.
The site has a Java Web Start enabled version that you can easily downloads and install automatically including all necessary jars.
Building a search facility including spellchecking with Lucene
Java.net have a good article on their site showing how to add spell checking to a search engine capability using Lucene.
This article starts with a quick overview on how to use Lucene to implement a search using Lucene and then expands this concept to show how to also add in a spell check feature (similar to Google).