Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Consistent Backup strategy

Copy link to this message
Re: Consistent Backup strategy
We're doing a version of that at Salesforce (we have our own M/R jobs, but the principle is the same).
Soon we'll run the backup M/R job over a snapshot for performance reasons, but even then the principle is the same.

Specifically we're keeping 48h worth of life data in HBase itself (TTL=48h, MIN_VERSIONS=1, KEEP_DELETED_CELLS=true), and run the jobs as of 2h in the past (rounded to an exact hour boundary), every night.

I think it's time I write an updated blog post. We plan to eventually open source the tools we've written.
-- Lars

 From: Timo Schaepe <[EMAIL PROTECTED]>
Sent: Monday, December 23, 2013 10:53 AM
Subject: Consistent Backup strategy

Hey guys,

we are searching for a consistent backup strategy with the export tool. Is this article still up-to-date and I can use it?


Thanks for answers.