HBase >> mail # user >> Can manually remove HFiles (similar to bulk import, but bulk remove)?


Re: Can manually remove HFiles (similar to bulk import, but bulk remove)?
You could set your TTLs and trigger a major compaction ...
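
A minimal sketch of the TTL route, assuming a hypothetical table `mytable` with column family `cf`. It only builds the HBase shell commands as strings (a dry run) and executes nothing. The helper name `ttl_purge_commands` is made up for illustration, and depending on the HBase version the table may need to be disabled before `alter` is accepted.

```python
def ttl_purge_commands(table, family, ttl_seconds):
    """Return the HBase shell commands for a TTL-based purge.

    Dry run only: the commands are returned as strings, not executed.
    `table` and `family` are placeholders supplied by the caller.
    """
    if ttl_seconds <= 0:
        raise ValueError("ttl_seconds must be positive")
    return [
        # Set the time-to-live (in seconds) on the column family.
        f"alter '{table}', {{NAME => '{family}', TTL => {ttl_seconds}}}",
        # Request a major compaction, which rewrites the store files
        # and leaves out cells older than the TTL.
        f"major_compact '{table}'",
    ]


if __name__ == "__main__":
    # Hypothetical values: keep one week of data in mytable:cf.
    for cmd in ttl_purge_commands("mytable", "cf", 7 * 24 * 3600):
        print(cmd)
```

The printed lines can be pasted into the HBase shell after checking them against `help 'alter'` and `help 'major_compact'` for the version in use.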

Or (this is pretty advanced) you can probably do it without taking down
the region servers by:
1) closing the region in the HBase shell
2) deleting the HFile in the shell
3) reopening the region in the HBase shell
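
The three steps above can be sketched as a dry-run plan. This is an illustration under stated assumptions, not a tested procedure: the `StoreFile` record and `removal_plan` helper are hypothetical, the region name and paths are placeholders, and file modification time is used only as a rough proxy for "older than", since a file rewritten by a compaction has a recent modification time but can still hold old cells. The command names (`close_region`, `assign`, `hadoop fs -rm`) should be checked against the shell help of the version in use.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class StoreFile:
    path: str       # full HDFS path of the HFile (placeholder value)
    mtime_ms: int   # modification time, milliseconds since the epoch


def removal_plan(region_name: str,
                 store_files: List[StoreFile],
                 cutoff_ms: int) -> List[Tuple[str, str]]:
    """Build (shell, command) pairs for close / delete / reopen.

    Dry run only: nothing is executed. Returns an empty plan when no
    file is older than the cutoff, so the region is never closed
    without a reason.
    """
    old = [f for f in store_files if f.mtime_ms < cutoff_ms]
    if not old:
        return []
    plan = [("hbase", f"close_region '{region_name}'")]         # step 1
    plan += [("hdfs", f"hadoop fs -rm {f.path}") for f in old]  # step 2
    plan.append(("hbase", f"assign '{region_name}'"))           # step 3
    return plan


if __name__ == "__main__":
    files = [
        StoreFile("/hbase/t/r1/cf/a", 100),
        StoreFile("/hbase/t/r1/cf/b", 300),
    ]
    for shell, cmd in removal_plan("t,,1.r1.", files, cutoff_ms=200):
        print(f"[{shell}] {cmd}")
```

Keeping the plan as data rather than running it directly leaves room to review each command before touching a live region.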

Jon.

On Mon, Jul 9, 2012 at 12:41 PM, Alex Baranau <[EMAIL PROTECTED]> wrote:

> Heh, this is what I want to avoid actually: restarting RSs.
>
> Alex Baranau
> ------
> Sematext :: http://blog.sematext.com/ :: Solr - Lucene - Hadoop - HBase
>
> On Mon, Jul 9, 2012 at 3:38 PM, Amandeep Khurana <[EMAIL PROTECTED]> wrote:
>
> > I _think_ you should be able to do it and be just fine but you'll need to
> > shut down the region servers before you remove and start them back up after
> > you are done. Someone else closer to the internals can confirm/deny this.
> >
> >
> > On Monday, July 9, 2012 at 12:36 PM, Alex Baranau wrote:
> >
> > > Hello,
> > >
> > > I wonder, for purging old data, if I'm OK with "remove all StoreFiles which
> > > are older than ..." way, can I do that? To me it seems like this can be a
> > > very effective way to remove old data, similar to fast bulk import
> > > functionality, but for deletion.
> > >
> > > Thank you,
> > >
> > > Alex Baranau
> > > ------
> > > Sematext :: http://blog.sematext.com/ :: Solr - Lucene - Hadoop - HBase
> > >
> > >
> >
> >
> >
>

--
// Jonathan Hsieh (shay)
// Software Engineer, Cloudera
// [EMAIL PROTECTED]