-Re: Lucene instead of HFiles?
Otis Gospodnetic 2012-10-06, 02:21
On Fri, Oct 5, 2012 at 2:36 AM, Adrien Mogenet <[EMAIL PROTECTED]> wrote:
> "Don't bother trying this in production" ;-)
> 1. Are you sure lookup by key are faster ?
No clue. But I also didn't say it's faster, just fast. :)
> 2. Updating Lucene files in a lock-free maneer and ensuring good
> concurrency can be a bit tricky
AFAIK Lucene files are immutable. Updates are delete and add.
Deletes are flags like tombstone markers in HBase.
> 3. AFAIK, Lucene files don't fit in HDFS and thus another distributed
> storage is required. Katta does not look as powerful as Hadoop.
Katta and Hadoop are two different tools, though. From what I recall,
Katta simply used HDFS for storing indices, but would push them
elsewhere for searching purposes.
Search Analytics - http://sematext.com/search-analytics/index.html
Performance Monitoring - http://sematext.com/spm/index.html
> On Fri, Oct 5, 2012 at 5:34 AM, Otis Gospodnetic
> <[EMAIL PROTECTED]> wrote:
>> Has anyone attempted using Lucene instead of HFiles (see
>> https://twitter.com/otisg/status/254047978174701568 )?
>> Is that a completely crazy, bad, would-never-work,
>> don't-bother-trying-this-at-home, it's-too-late-go-to-sleep idea? Or
>> Search Analytics - http://sematext.com/search-analytics/index.html
>> Performance Monitoring - http://sematext.com/spm/index.html
> Adrien Mogenet