Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> 0.96 Replication to Elasticsearch

Copy link to this message
Re: 0.96 Replication to Elasticsearch
Why not copy the approach (+ some code) from
https://github.com/NGDATA/hbase-indexer ?

Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support * http://sematext.com/
On Wed, Jan 15, 2014 at 9:03 PM, Pradeep Gollakota <[EMAIL PROTECTED]>wrote:

> Hi All,
> I have a use case where I need to replicate data from HBase into
> Elasticsearch. I've found two implementations of an HBase River that
> accomplishes this.
> One uses timestamps to do a timerange scan of the table (since last sync)
> and replicates data across. For many reasons this is not desirable.
> The other hooks into the HBase replication mechanism to get update from
> WALEdits. However, it was written against 0.94 and we're running 0.96. I'm
> trying to update/rewrite the river, but I just don't know where to start.
> Can anyone give me some guidance for writing a custom HBase replicator?
> Thanks,
> Pradeep
> P.S: For the short term, we're probably going to start using the first even
> with it's downsides, but we'll need to migrate off that quickly.