Why not copy the approach (+ some code) from
Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support * http://sematext.com/
On Wed, Jan 15, 2014 at 9:03 PM, Pradeep Gollakota <[EMAIL PROTECTED]> wrote:
> Hi All,
> I have a use case where I need to replicate data from HBase into
> Elasticsearch. I've found two implementations of an HBase River that
> accomplish this.
> One uses timestamps to do a timerange scan of the table (since last sync)
> and replicates data across. For many reasons this is not desirable.
> The other hooks into the HBase replication mechanism to get updates from
> WALEdits. However, it was written against 0.94 and we're running 0.96. I'm
> trying to update/rewrite the river, but I just don't know where to start.
> Can anyone give me some guidance for writing a custom HBase replicator?
> P.S. For the short term, we're probably going to start using the first one,
> even with its downsides, but we'll need to migrate off that quickly.
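
For reference, the timestamp-based approach mentioned in the quoted message (scan everything written since the last sync, then copy it across) can be sketched roughly as below. This is only an illustration under stated assumptions, not the code of either river: the names Cell, TimeRangeSync and sync are hypothetical, the "table" is a plain Python list standing in for HBase, and the "index" is a dict standing in for Elasticsearch. The half-open interval [last_sync, now) follows the same convention as an HBase time-range scan.

```python
from dataclasses import dataclass, field


@dataclass
class Cell:
    """Hypothetical stand-in for one stored value in the source table."""
    row: str
    value: str
    timestamp: int  # write time in milliseconds


@dataclass
class TimeRangeSync:
    """Hypothetical poller that copies recently written cells into an index."""
    last_sync: int = 0
    index: dict = field(default_factory=dict)  # stand-in for the search index

    def sync(self, table, now):
        """Copy cells with last_sync <= timestamp < now; return how many were copied."""
        changed = [c for c in table if self.last_sync <= c.timestamp < now]
        # Apply in timestamp order so the newest write to a row wins.
        for cell in sorted(changed, key=lambda c: c.timestamp):
            self.index[cell.row] = cell.value
        # The next pass starts where this one stopped.
        self.last_sync = now
        return len(changed)


if __name__ == "__main__":
    table = [Cell("r1", "a", 100), Cell("r2", "b", 200)]
    syncer = TimeRangeSync()
    syncer.sync(table, now=150)   # picks up r1 only
    table.append(Cell("r1", "c", 300))
    syncer.sync(table, now=400)   # picks up r2 and the newer r1
    print(syncer.index)
```

Note that in this sketch a row removed from the table between two passes is never removed from the index, because the poller only sees cells that still exist. That is one of the commonly cited weaknesses of polling by timestamp.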