Check out http://www.elasticsearch.org/
<http://www.elasticsearch.org/>Not what you are doing, but possibly a
helpful bit of the pie.
Also, Solr integrates Tika and Lucene pretty nicely any more. No Hbase yet,
but it isn't hard to add that.
On Mon, Feb 28, 2011 at 1:01 PM, Mark Kerzner <[EMAIL PROTECTED]> wrote:
> I am working on an open-source project that would be using
> Hadoop/HDFS/HBase/Tika/Lucene and would make all files on a hard drive
> searchable. Like Nutch, only applied to hard drives, and like Google
> Search, only I want to output information about every file found. Not a big
> difference though.
> I am looking for an advice on the following
> 1. Have you heard of a similar project?
> 2. What license should I use? I am thinking of Apache V2.0, because it
> relies on other Apache V2.0 projects;
> 3. Any other advice?
> Thank you. Sincerely,