Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - 0.96 Replication to Elasticsearch


Copy link to this message
-
Re: 0.96 Replication to Elasticsearch
Otis Gospodnetic 2014-01-16, 04:09
Why not copy the approach (+ some code) from
https://github.com/NGDATA/hbase-indexer ?

Otis
--
Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support * http://sematext.com/
On Wed, Jan 15, 2014 at 9:03 PM, Pradeep Gollakota <[EMAIL PROTECTED]>wrote:

> Hi All,
>
> I have a use case where I need to replicate data from HBase into
> Elasticsearch. I've found two implementations of an HBase River that
> accomplishes this.
>
> One uses timestamps to do a timerange scan of the table (since last sync)
> and replicates data across. For many reasons this is not desirable.
>
> The other hooks into the HBase replication mechanism to get update from
> WALEdits. However, it was written against 0.94 and we're running 0.96. I'm
> trying to update/rewrite the river, but I just don't know where to start.
> Can anyone give me some guidance for writing a custom HBase replicator?
>
> Thanks,
> Pradeep
>
> P.S: For the short term, we're probably going to start using the first even
> with it's downsides, but we'll need to migrate off that quickly.
>