HBase >> mail # user >> Fixing badly distributed table manually.


Re: Fixing badly distributed table manually.
Hello,

I also found this fairly recent script here which can be used with Gnuplot
to get a visual representation of data distribution across nodes:

http://bobcopeland.com/blog/2012/04/graphing-hbase-splits/

Again, my JRuby skills are non-existent, so I just ran the script blindly
against HBase 0.92, which results in:

NoMethodError: private method `load' called for #<Java::OrgApacheHadoopHbase::ServerName:0x4f5264db>
    main at region_hist.rb:23
    call at org/jruby/RubyProc.java:270
    call at org/jruby/RubyProc.java:220
    each at file:/usr/lib/hbase/lib/jruby-complete-1.6.5.jar!/builtin/java/java.util.rb:7
    main at region_hist.rb:19
  (root) at region_hist.rb:37

Maybe it has to do with the author's remark:

[Note: I've been advised (thanks ntelford!) that HServerInfo is gone in
newer releases and you now need to get HServerLoad via
ClusterStatus.getLoad(server_name).]

What are the changes that need to be made to the script to get it to run on
HBase 0.92?

Thank you,

/David
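
A minimal sketch of what the author's remark implies, not a tested patch for region_hist.rb. The NoMethodError above is consistent with the script calling `load` on each server object: on a ServerName there is no getLoad, so the call falls through to Ruby's private Kernel#load. The sketch asks the ClusterStatus for the load instead. The helper name region_sizes_by_server is hypothetical, and the assumption that getRegionsLoad may return a map rather than a plain collection should be checked against the HBase version in use.

```ruby
# Sketch only: collect per-server region store-file sizes (MB) from a
# ClusterStatus-like object. Under JRuby with HBase 0.92 on the classpath,
# `status` would presumably come from
#   HBaseAdmin.new(HBaseConfiguration.create).getClusterStatus
# (assumption: not verified against the original region_hist.rb).
def region_sizes_by_server(status)
  sizes = {}
  status.getServers.each do |server_name|
    # The load is looked up on the ClusterStatus, keyed by the server name,
    # instead of being read off the server object itself.
    server_load = status.getLoad(server_name)
    regions = server_load.getRegionsLoad
    # Assumption: if this is a map keyed by region name, use only its values.
    regions = regions.values if regions.respond_to?(:values)
    sizes[server_name.getHostname] = regions.map { |r| r.getStorefileSizeMB }
  end
  sizes
end
```

The resulting hash (hostname to list of region sizes) could then be written out in whatever format the Gnuplot step of the blog post expects.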
On Tue, Sep 4, 2012 at 11:42 PM, David Koch <[EMAIL PROTECTED]> wrote:

> Hello,
>
> Thank you for your replies. We are using CDH4 HBase 0.92. Good call on the
> web interface. The port is blocked so I never really got a chance to test
> it. As far as manual re-balancing is concerned I will check the book.
>
> /David
>
>
> On Tue, Sep 4, 2012 at 5:34 PM, Guillaume Gardey <[EMAIL PROTECTED]> wrote:
>
>> Hello,
>>
>> > a) What is the easiest way to get an overview of how a table is distributed
>> > across regions of a cluster? I guess I could search .META. but I haven't
>> > figured out how to use filters from shell.
>> > b) What constitutes a "badly distributed" table and how can I re-balance
>> > manually?
>> > c) Is b) needed at all? I know that HBase does its balancing automatically
>> > behind the scenes.
>>
>> I have found that
>> http://bobcopeland.com/blog/2012/04/graphing-hbase-splits/ is a good
>> source of information and tools for inspecting how regions are balanced
>> across the cluster.
>>
>> > As for a) I tried running this script:
>> >
>> > https://github.com/Mendeley/hbase-scripts/blob/master/list_regions.rb
>> >
>> > like so:
>> >
>> > hbase org.jruby.Main ./list_regions.rb <_my_table>
>> >
>> > but I get
>> >
>> > ArgumentError: wrong number of arguments (1 for 2)
>> >  (root) at ./list_regions.rb:60
>> >
>> > If someone more proficient notices an obvious fix, I'd be glad to hear
>> > about it.
>>
>> Concerning https://github.com/Mendeley/hbase-scripts , I am afraid that
>> this repository is no longer maintained and was written for old
>> releases of HBase (CDH2, I believe). There is no plan to upgrade it to
>> newer releases.
>>
>> Cheers
>> ---
>> Guillaume