HBase >> mail # user >> Hbase delta load


Re: Hbase delta load
I think you may need to provide just a bit more information about your
use case. Could you define a bit more 'delta' and 'data matching'?

In a sense, every bulk load is a delta: a set of changes, applied as a
batch of inserts and updates to a larger table.

We could consider HBase's existing multiversioning mechanisms to be a
simple "data matching functionality": existence testing by
coordinate*. I know that is not what you mean, but I don't know
precisely what you do mean.

* - coordinate: { row, column family, qualifier, timestamp }
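
To make "existence testing by coordinate" concrete, here is a minimal sketch. It does not use the HBase client API: the table is modelled as a plain in-memory dict, and the names Coordinate and split_delta are hypothetical, chosen only for illustration. It assumes a delta is a batch of (coordinate, value) pairs and that "matching" means nothing more than "a cell already exists at that exact coordinate".

```python
from typing import Dict, Iterable, List, NamedTuple, Tuple


class Coordinate(NamedTuple):
    """Hypothetical stand-in for an HBase cell coordinate (not an HBase type)."""
    row: str
    family: str
    qualifier: str
    timestamp: int


Cell = Tuple[Coordinate, bytes]


def split_delta(
    existing: Dict[Coordinate, bytes], delta: Iterable[Cell]
) -> Tuple[List[Cell], List[Cell]]:
    """Partition a delta batch by whether a cell already exists at each coordinate."""
    new_cells: List[Cell] = []
    matched: List[Cell] = []
    for coord, value in delta:
        # Existence test: is there already a cell at this exact coordinate?
        if coord in existing:
            matched.append((coord, value))
        else:
            new_cells.append((coord, value))
    return new_cells, matched


# Toy "table" holding one cell (assumed data, for illustration only).
table = {Coordinate("row1", "cf", "name", 1): b"alice"}

# A daily delta: one cell already present, one new version, one new row.
delta = [
    (Coordinate("row1", "cf", "name", 1), b"alice"),
    (Coordinate("row1", "cf", "name", 2), b"alicia"),
    (Coordinate("row2", "cf", "name", 2), b"bob"),
]

new_cells, matched = split_delta(table, delta)
```

Note that the second delta entry counts as new even though the row and qualifier match, because the timestamp differs. That is the multiversioning behaviour: a new timestamp is a new version, not an overwrite. Anything richer, such as fuzzy or MDM-style record matching, is outside what this sketch covers.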

On 3/21/13, Jignesh Patel <[EMAIL PROTECTED]> wrote:
> We have a requirement to support data matching while loading deltas to
> HBase.
> I see there is a utility to support bulk loading.
> http://hbase.apache.org/book/arch.bulk.load.html
>
> But is there any way to support daily delta loading?
> Is there any open source MDM software which can be integrated with HBase?
>
> Does HBase have any data matching functionality?
>
> -Jignesh
>