HBase >> mail # user >> 0.96 Replication to Elasticsearch


Re: 0.96 Replication to Elasticsearch
Why not copy the approach (+ some code) from
https://github.com/NGDATA/hbase-indexer ?

Otis
--
Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support * http://sematext.com/
On Wed, Jan 15, 2014 at 9:03 PM, Pradeep Gollakota <[EMAIL PROTECTED]> wrote:

> Hi All,
>
> I have a use case where I need to replicate data from HBase into
> Elasticsearch. I've found two implementations of an HBase River that
> accomplish this.
>
> One uses timestamps to do a timerange scan of the table (since last sync)
> and replicates data across. For many reasons this is not desirable.
>
> The other hooks into the HBase replication mechanism to get updates from
> WALEdits. However, it was written against 0.94 and we're running 0.96. I'm
> trying to update/rewrite the river, but I just don't know where to start.
> Can anyone give me some guidance for writing a custom HBase replicator?
>
> Thanks,
> Pradeep
>
> P.S. For the short term, we're probably going to start using the first, even
> with its downsides, but we'll need to migrate off that quickly.
>
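
For readers following the thread: the timestamp-based approach described in the quoted message (scan everything modified since the last sync, then copy it across) can be sketched roughly as below. This is only an in-memory illustration under stated assumptions, not the code of either river. `FakeTable`, `scan_timerange`, and `sync_once` are hypothetical names, a plain dict stands in for the Elasticsearch index, and the half-open range mirrors how HBase time-range scans treat the minimum as inclusive and the maximum as exclusive. The quoted message does not say which drawbacks the poster had in mind; the sketch shows one commonly cited limitation, namely that deletes never reach the index.

```python
from dataclasses import dataclass, field


@dataclass
class FakeTable:
    """Hypothetical stand-in for an HBase table: row key -> (timestamp, value)."""
    cells: dict = field(default_factory=dict)

    def put(self, row, value, ts):
        self.cells[row] = (ts, value)

    def delete(self, row):
        # A delete removes the row, so a later plain scan no longer returns it.
        self.cells.pop(row, None)

    def scan_timerange(self, min_ts, max_ts):
        """Return rows whose timestamp falls in [min_ts, max_ts)."""
        return {r: v for r, (ts, v) in self.cells.items() if min_ts <= ts < max_ts}


def sync_once(table, index, last_sync, now):
    """Copy rows modified in [last_sync, now) into the index; return the new checkpoint."""
    index.update(table.scan_timerange(last_sync, now))
    return now


table, index = FakeTable(), {}
table.put("row1", "a", ts=10)
table.put("row2", "b", ts=20)
checkpoint = sync_once(table, index, last_sync=0, now=30)

table.put("row1", "a2", ts=40)   # update is picked up by the next sync
table.delete("row2")             # delete is invisible to a time-range scan
checkpoint = sync_once(table, index, last_sync=checkpoint, now=50)

print(index)
```

After the second sync the index holds the updated `row1` but still contains the deleted `row2`. A replication-based approach that consumes WALEdits, as the second river does, would see the delete as an edit in its own right.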