|
|
Ajay Bhosle 2012-11-21, 07:19
Hi,
I am inserting some data in hbase which is getting hot spotted in a particular server. The format of the row key is (0 or 1)|[timestamp]_[sequence]. Basically I want to add log information to hbase and search the records based on range of dates.
Can someone suggest any configuration changes or any ideas on how the row key should be design. I do not want to specify the splits while creating table.
Thanks
Ajay The information contained in this electronic message (email) and any attachments to this email are intended for the exclusive use of the addressee(s) and access to this email by any one else is unauthorised. The email may contain proprietary, confidential or privileged information or information relating to Reliance Group. If you are not the intended recipient, please notify the sender by telephone, fax, or return email and delete this communication and any attachments thereto, immediately from your computer. Any dissemination, distribution, or copying of this communication and the attachments thereto (in whole or part), in any manner, is strictly prohibited and actionable at law. The recipient acknowledges that emails are susceptible to alteration and their integrity can not be guaranteed and that Company does not guarantee that any e-mail is virus-free and accept no liability for any damage caused by any virus transmitted by this email.
+
Ajay Bhosle 2012-11-21, 07:19
-
Re: Region hot spotting
ramkrishna vasudevan 2012-11-21, 08:19
Hotspotting is bound to happen until the region starts splitting and gets assigned to diff region servers.
Regards Ram
On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle <[EMAIL PROTECTED]>wrote:
> Hi, > > > > I am inserting some data in hbase which is getting hot spotted in a > particular server. The format of the row key is (0 or > 1)|[timestamp]_[sequence]. Basically I want to add log information to > hbase > and search the records based on range of dates. > > > > Can someone suggest any configuration changes or any ideas on how the row > key should be design. I do not want to specify the splits while creating > table. > > > > Thanks > > Ajay > > > > > The information contained in this electronic message (email) and any > attachments to this email are intended for the exclusive use of the > addressee(s) and access to this email by any one else is unauthorised. The > email may contain proprietary, confidential or privileged information or > information relating to Reliance Group. If you are not the intended > recipient, please notify the sender by telephone, fax, or return email and > delete this communication and any attachments thereto, immediately from > your computer. Any dissemination, distribution, or copying of this > communication and the attachments thereto (in whole or part), in any > manner, is strictly prohibited and actionable at law. The recipient > acknowledges that emails are susceptible to alteration and their integrity > can not be guaranteed and that Company does not guarantee that any e-mail > is virus-free and accept no liability for any damage caused by any virus > transmitted by this email. >
+
ramkrishna vasudevan 2012-11-21, 08:19
-
Re: Region hot spotting
Mohammad Tariq 2012-11-21, 08:29
Hello Ajay, You can use 'salting' if you don't want to presplit your table. You might this link useful : http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/HTH Regards, Mohammad Tariq On Wed, Nov 21, 2012 at 1:49 PM, ramkrishna vasudevan < [EMAIL PROTECTED]> wrote: > Hotspotting is bound to happen until the region starts splitting and gets > assigned to diff region servers. > > Regards > Ram > > On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle > <[EMAIL PROTECTED]>wrote: > > > Hi, > > > > > > > > I am inserting some data in hbase which is getting hot spotted in a > > particular server. The format of the row key is (0 or > > 1)|[timestamp]_[sequence]. Basically I want to add log information to > > hbase > > and search the records based on range of dates. > > > > > > > > Can someone suggest any configuration changes or any ideas on how the row > > key should be design. I do not want to specify the splits while creating > > table. > > > > > > > > Thanks > > > > Ajay > > > > > > > > > > The information contained in this electronic message (email) and any > > attachments to this email are intended for the exclusive use of the > > addressee(s) and access to this email by any one else is unauthorised. > The > > email may contain proprietary, confidential or privileged information or > > information relating to Reliance Group. If you are not the intended > > recipient, please notify the sender by telephone, fax, or return email > and > > delete this communication and any attachments thereto, immediately from > > your computer. Any dissemination, distribution, or copying of this > > communication and the attachments thereto (in whole or part), in any > > manner, is strictly prohibited and actionable at law. The recipient > > acknowledges that emails are susceptible to alteration and their > integrity > > can not be guaranteed and that Company does not guarantee that any e-mail > > is virus-free and accept no liability for any damage caused by any virus > > transmitted by this email. > > >
+
Mohammad Tariq 2012-11-21, 08:29
-
Re: Region hot spotting
ramkrishna vasudevan 2012-11-21, 08:44
Hi This link is pretty much useful. But still there too it says if you dont pre split you need to wait for the salting to help you from hotspotting till the region gets splitted. Mohammad just pointing this to say the usefulness of presplitting definitely your's is a good pointer to Ajay. :) Regards Ram On Wed, Nov 21, 2012 at 1:59 PM, Mohammad Tariq <[EMAIL PROTECTED]> wrote: > Hello Ajay, > > You can use 'salting' if you don't want to presplit your table. You might > this link useful : > > http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/> > HTH > > Regards, > Mohammad Tariq > > > > On Wed, Nov 21, 2012 at 1:49 PM, ramkrishna vasudevan < > [EMAIL PROTECTED]> wrote: > > > Hotspotting is bound to happen until the region starts splitting and gets > > assigned to diff region servers. > > > > Regards > > Ram > > > > On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle > > <[EMAIL PROTECTED]>wrote: > > > > > Hi, > > > > > > > > > > > > I am inserting some data in hbase which is getting hot spotted in a > > > particular server. The format of the row key is (0 or > > > 1)|[timestamp]_[sequence]. Basically I want to add log information to > > > hbase > > > and search the records based on range of dates. > > > > > > > > > > > > Can someone suggest any configuration changes or any ideas on how the > row > > > key should be design. I do not want to specify the splits while > creating > > > table. > > > > > > > > > > > > Thanks > > > > > > Ajay > > > > > > > > > > > > > > > The information contained in this electronic message (email) and any > > > attachments to this email are intended for the exclusive use of the > > > addressee(s) and access to this email by any one else is unauthorised. > > The > > > email may contain proprietary, confidential or privileged information > or > > > information relating to Reliance Group. If you are not the intended > > > recipient, please notify the sender by telephone, fax, or return email > > and > > > delete this communication and any attachments thereto, immediately from > > > your computer. Any dissemination, distribution, or copying of this > > > communication and the attachments thereto (in whole or part), in any > > > manner, is strictly prohibited and actionable at law. The recipient > > > acknowledges that emails are susceptible to alteration and their > > integrity > > > can not be guaranteed and that Company does not guarantee that any > e-mail > > > is virus-free and accept no liability for any damage caused by any > virus > > > transmitted by this email. > > > > > >
+
ramkrishna vasudevan 2012-11-21, 08:44
-
RE: Region hot spotting
Ajay Bhosle 2012-11-21, 11:46
Thanks for your comments, I am already prefixing the timestamp with integer in range of 1..10, also the hbase.hregion.max.filesize is defined as 256 MB. Still it is hot spotting. Thanks Ajay -----Original Message----- From: ramkrishna vasudevan [mailto:[EMAIL PROTECTED]] Sent: Wednesday, November 21, 2012 2:14 PM To: [EMAIL PROTECTED] Subject: Re: Region hot spotting Hi This link is pretty much useful. But still there too it says if you dont pre split you need to wait for the salting to help you from hotspotting till the region gets splitted. Mohammad just pointing this to say the usefulness of presplitting definitely your's is a good pointer to Ajay. :) Regards Ram On Wed, Nov 21, 2012 at 1:59 PM, Mohammad Tariq <[EMAIL PROTECTED]> wrote: > Hello Ajay, > > You can use 'salting' if you don't want to presplit your table. You might > this link useful : > > http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/ > > HTH > > Regards, > Mohammad Tariq > > > > On Wed, Nov 21, 2012 at 1:49 PM, ramkrishna vasudevan < > [EMAIL PROTECTED]> wrote: > > > Hotspotting is bound to happen until the region starts splitting and gets > > assigned to diff region servers. > > > > Regards > > Ram > > > > On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle > > <[EMAIL PROTECTED]>wrote: > > > > > Hi, > > > > > > > > > > > > I am inserting some data in hbase which is getting hot spotted in a > > > particular server. The format of the row key is (0 or > > > 1)|[timestamp]_[sequence]. Basically I want to add log information to > > > hbase > > > and search the records based on range of dates. > > > > > > > > > > > > Can someone suggest any configuration changes or any ideas on how the > row > > > key should be design. I do not want to specify the splits while > creating > > > table. > > > > > > > > > > > > Thanks > > > > > > Ajay > > > > > > > > > > > > > > > The information contained in this electronic message (email) and any > > > attachments to this email are intended for the exclusive use of the > > > addressee(s) and access to this email by any one else is unauthorised. > > The > > > email may contain proprietary, confidential or privileged information > or > > > information relating to Reliance Group. If you are not the intended > > > recipient, please notify the sender by telephone, fax, or return email > > and > > > delete this communication and any attachments thereto, immediately from > > > your computer. Any dissemination, distribution, or copying of this > > > communication and the attachments thereto (in whole or part), in any > > > manner, is strictly prohibited and actionable at law. The recipient > > > acknowledges that emails are susceptible to alteration and their > > integrity > > > can not be guaranteed and that Company does not guarantee that any > e-mail > > > is virus-free and accept no liability for any damage caused by any > virus > > > transmitted by this email. > > > > > > The information contained in this electronic message (email) and any attachments to this email are intended for the exclusive use of the addressee(s) and access to this email by any one else is unauthorised. The email may contain proprietary, confidential or privileged information or information relating to Reliance Group. If you are not the intended recipient, please notify the sender by telephone, fax, or return email and delete this communication and any attachments thereto, immediately from your computer. Any dissemination, distribution, or copying of this communication and the attachments thereto (in whole or part), in any manner, is strictly prohibited and actionable at law. The recipient acknowledges that emails are susceptible to alteration and their integrity can not be guaranteed and that Company does not guarantee that any e-mail is virus-free and accept no liability for any damage caused by any virus transmitted by this email.
+
Ajay Bhosle 2012-11-21, 11:46
-
Re: Region hot spotting
Mohammad Tariq 2012-11-21, 17:43
Hello again, Just a small query. Is your data getting streamed continuously or are you making use of bulk load. In case of latter, I don't find any harm in creating pre-splitted tables. Regards, Mohammad Tariq On Wed, Nov 21, 2012 at 2:14 PM, ramkrishna vasudevan < [EMAIL PROTECTED]> wrote: > Hi > This link is pretty much useful. But still there too it says if you dont > pre split you need to wait for the salting to help you from hotspotting > till the region gets splitted. > > Mohammad just pointing this to say the usefulness of presplitting > definitely your's is a good pointer to Ajay. :) > > Regards > Ram > > On Wed, Nov 21, 2012 at 1:59 PM, Mohammad Tariq <[EMAIL PROTECTED]> > wrote: > > > Hello Ajay, > > > > You can use 'salting' if you don't want to presplit your table. You > might > > this link useful : > > > > > http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/> > > > HTH > > > > Regards, > > Mohammad Tariq > > > > > > > > On Wed, Nov 21, 2012 at 1:49 PM, ramkrishna vasudevan < > > [EMAIL PROTECTED]> wrote: > > > > > Hotspotting is bound to happen until the region starts splitting and > gets > > > assigned to diff region servers. > > > > > > Regards > > > Ram > > > > > > On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle > > > <[EMAIL PROTECTED]>wrote: > > > > > > > Hi, > > > > > > > > > > > > > > > > I am inserting some data in hbase which is getting hot spotted in a > > > > particular server. The format of the row key is (0 or > > > > 1)|[timestamp]_[sequence]. Basically I want to add log information > to > > > > hbase > > > > and search the records based on range of dates. > > > > > > > > > > > > > > > > Can someone suggest any configuration changes or any ideas on how the > > row > > > > key should be design. I do not want to specify the splits while > > creating > > > > table. > > > > > > > > > > > > > > > > Thanks > > > > > > > > Ajay > > > > > > > > > > > > > > > > > > > > The information contained in this electronic message (email) and any > > > > attachments to this email are intended for the exclusive use of the > > > > addressee(s) and access to this email by any one else is > unauthorised. > > > The > > > > email may contain proprietary, confidential or privileged information > > or > > > > information relating to Reliance Group. If you are not the intended > > > > recipient, please notify the sender by telephone, fax, or return > email > > > and > > > > delete this communication and any attachments thereto, immediately > from > > > > your computer. Any dissemination, distribution, or copying of this > > > > communication and the attachments thereto (in whole or part), in any > > > > manner, is strictly prohibited and actionable at law. The recipient > > > > acknowledges that emails are susceptible to alteration and their > > > integrity > > > > can not be guaranteed and that Company does not guarantee that any > > e-mail > > > > is virus-free and accept no liability for any damage caused by any > > virus > > > > transmitted by this email. > > > > > > > > > >
+
Mohammad Tariq 2012-11-21, 17:43
-
Re: Region hot spotting
Michael Segel 2012-11-21, 21:21
Salting is not a good idea and I don't know why people suggest it. Case in point.... you want to fetch a single row/record back. Because the salt is arbitrary, you will need to send N number of get()s one for each salt value. Doing a simple one way hash of the data, even appending the data, would be a better idea. Note that if you are worried about key length, you can do the hash and take the first N bytes and then append the row to it. It should give you enough of a spread of hash values that you don't have to worry about hotspots. HTH -Mike On Nov 21, 2012, at 2:44 AM, ramkrishna vasudevan <[EMAIL PROTECTED]> wrote: > Hi > This link is pretty much useful. But still there too it says if you dont > pre split you need to wait for the salting to help you from hotspotting > till the region gets splitted. > > Mohammad just pointing this to say the usefulness of presplitting > definitely your's is a good pointer to Ajay. :) > > Regards > Ram > > On Wed, Nov 21, 2012 at 1:59 PM, Mohammad Tariq <[EMAIL PROTECTED]> wrote: > >> Hello Ajay, >> >> You can use 'salting' if you don't want to presplit your table. You might >> this link useful : >> >> http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/>> >> HTH >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Nov 21, 2012 at 1:49 PM, ramkrishna vasudevan < >> [EMAIL PROTECTED]> wrote: >> >>> Hotspotting is bound to happen until the region starts splitting and gets >>> assigned to diff region servers. >>> >>> Regards >>> Ram >>> >>> On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle >>> <[EMAIL PROTECTED]>wrote: >>> >>>> Hi, >>>> >>>> >>>> >>>> I am inserting some data in hbase which is getting hot spotted in a >>>> particular server. The format of the row key is (0 or >>>> 1)|[timestamp]_[sequence]. Basically I want to add log information to >>>> hbase >>>> and search the records based on range of dates. >>>> >>>> >>>> >>>> Can someone suggest any configuration changes or any ideas on how the >> row >>>> key should be design. I do not want to specify the splits while >> creating >>>> table. >>>> >>>> >>>> >>>> Thanks >>>> >>>> Ajay >>>> >>>> >>>> >>>> >>>> The information contained in this electronic message (email) and any >>>> attachments to this email are intended for the exclusive use of the >>>> addressee(s) and access to this email by any one else is unauthorised. >>> The >>>> email may contain proprietary, confidential or privileged information >> or >>>> information relating to Reliance Group. If you are not the intended >>>> recipient, please notify the sender by telephone, fax, or return email >>> and >>>> delete this communication and any attachments thereto, immediately from >>>> your computer. Any dissemination, distribution, or copying of this >>>> communication and the attachments thereto (in whole or part), in any >>>> manner, is strictly prohibited and actionable at law. The recipient >>>> acknowledges that emails are susceptible to alteration and their >>> integrity >>>> can not be guaranteed and that Company does not guarantee that any >> e-mail >>>> is virus-free and accept no liability for any damage caused by any >> virus >>>> transmitted by this email. >>>> >>> >>
+
Michael Segel 2012-11-21, 21:21
-
RE: Region hot spotting
Ajay Bhosle 2012-11-22, 13:28
We are not fetching single row back. Hashing really helped, the data is now almost equally split between the servers. Thanks a lot. -Ajay -----Original Message----- From: Michael Segel [mailto:[EMAIL PROTECTED]] Sent: Thursday, November 22, 2012 2:52 AM To: [EMAIL PROTECTED] Subject: Re: Region hot spotting Salting is not a good idea and I don't know why people suggest it. Case in point.... you want to fetch a single row/record back. Because the salt is arbitrary, you will need to send N number of get()s one for each salt value. Doing a simple one way hash of the data, even appending the data, would be a better idea. Note that if you are worried about key length, you can do the hash and take the first N bytes and then append the row to it. It should give you enough of a spread of hash values that you don't have to worry about hotspots. HTH -Mike On Nov 21, 2012, at 2:44 AM, ramkrishna vasudevan <[EMAIL PROTECTED]> wrote: > Hi > This link is pretty much useful. But still there too it says if you dont > pre split you need to wait for the salting to help you from hotspotting > till the region gets splitted. > > Mohammad just pointing this to say the usefulness of presplitting > definitely your's is a good pointer to Ajay. :) > > Regards > Ram > > On Wed, Nov 21, 2012 at 1:59 PM, Mohammad Tariq <[EMAIL PROTECTED]> wrote: > >> Hello Ajay, >> >> You can use 'salting' if you don't want to presplit your table. You might >> this link useful : >> >> http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/ >> >> HTH >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Nov 21, 2012 at 1:49 PM, ramkrishna vasudevan < >> [EMAIL PROTECTED]> wrote: >> >>> Hotspotting is bound to happen until the region starts splitting and gets >>> assigned to diff region servers. >>> >>> Regards >>> Ram >>> >>> On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle >>> <[EMAIL PROTECTED]>wrote: >>> >>>> Hi, >>>> >>>> >>>> >>>> I am inserting some data in hbase which is getting hot spotted in a >>>> particular server. The format of the row key is (0 or >>>> 1)|[timestamp]_[sequence]. Basically I want to add log information to >>>> hbase >>>> and search the records based on range of dates. >>>> >>>> >>>> >>>> Can someone suggest any configuration changes or any ideas on how the >> row >>>> key should be design. I do not want to specify the splits while >> creating >>>> table. >>>> >>>> >>>> >>>> Thanks >>>> >>>> Ajay >>>> >>>> >>>> >>>> >>>> The information contained in this electronic message (email) and any >>>> attachments to this email are intended for the exclusive use of the >>>> addressee(s) and access to this email by any one else is unauthorised. >>> The >>>> email may contain proprietary, confidential or privileged information >> or >>>> information relating to Reliance Group. If you are not the intended >>>> recipient, please notify the sender by telephone, fax, or return email >>> and >>>> delete this communication and any attachments thereto, immediately from >>>> your computer. Any dissemination, distribution, or copying of this >>>> communication and the attachments thereto (in whole or part), in any >>>> manner, is strictly prohibited and actionable at law. The recipient >>>> acknowledges that emails are susceptible to alteration and their >>> integrity >>>> can not be guaranteed and that Company does not guarantee that any >> e-mail >>>> is virus-free and accept no liability for any damage caused by any >> virus >>>> transmitted by this email. >>>> >>> >> The information contained in this electronic message (email) and any attachments to this email are intended for the exclusive use of the addressee(s) and access to this email by any one else is unauthorised. The email may contain proprietary, confidential or privileged information or information relating to Reliance Group. If you are not the intended recipient, please notify the sender by telephone, fax, or return email and delete this communication and any attachments thereto, immediately from your computer. Any dissemination, distribution, or copying of this communication and the attachments thereto (in whole or part), in any manner, is strictly prohibited and actionable at law. The recipient acknowledges that emails are susceptible to alteration and their integrity can not be guaranteed and that Company does not guarantee that any e-mail is virus-free and accept no liability for any damage caused by any virus transmitted by this email.
+
Ajay Bhosle 2012-11-22, 13:28
-
Re: Region hot spotting
Mohammad Tariq 2012-11-22, 08:06
Good point Mike. Regards, Mohammad Tariq On Thu, Nov 22, 2012 at 2:51 AM, Michael Segel <[EMAIL PROTECTED]>wrote: > Salting is not a good idea and I don't know why people suggest it. > > Case in point.... you want to fetch a single row/record back. Because the > salt is arbitrary, you will need to send N number of get()s one for each > salt value. > Doing a simple one way hash of the data, even appending the data, would be > a better idea. Note that if you are worried about key length, you can do > the hash and take the first N bytes and then append the row to it. It > should give you enough of a spread of hash values that you don't have to > worry about hotspots. > > HTH > > -Mike > > On Nov 21, 2012, at 2:44 AM, ramkrishna vasudevan < > [EMAIL PROTECTED]> wrote: > > > Hi > > This link is pretty much useful. But still there too it says if you dont > > pre split you need to wait for the salting to help you from hotspotting > > till the region gets splitted. > > > > Mohammad just pointing this to say the usefulness of presplitting > > definitely your's is a good pointer to Ajay. :) > > > > Regards > > Ram > > > > On Wed, Nov 21, 2012 at 1:59 PM, Mohammad Tariq <[EMAIL PROTECTED]> > wrote: > > > >> Hello Ajay, > >> > >> You can use 'salting' if you don't want to presplit your table. You > might > >> this link useful : > >> > >> > http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-despite-writing-records-with-sequential-keys/> >> > >> HTH > >> > >> Regards, > >> Mohammad Tariq > >> > >> > >> > >> On Wed, Nov 21, 2012 at 1:49 PM, ramkrishna vasudevan < > >> [EMAIL PROTECTED]> wrote: > >> > >>> Hotspotting is bound to happen until the region starts splitting and > gets > >>> assigned to diff region servers. > >>> > >>> Regards > >>> Ram > >>> > >>> On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle > >>> <[EMAIL PROTECTED]>wrote: > >>> > >>>> Hi, > >>>> > >>>> > >>>> > >>>> I am inserting some data in hbase which is getting hot spotted in a > >>>> particular server. The format of the row key is (0 or > >>>> 1)|[timestamp]_[sequence]. Basically I want to add log information to > >>>> hbase > >>>> and search the records based on range of dates. > >>>> > >>>> > >>>> > >>>> Can someone suggest any configuration changes or any ideas on how the > >> row > >>>> key should be design. I do not want to specify the splits while > >> creating > >>>> table. > >>>> > >>>> > >>>> > >>>> Thanks > >>>> > >>>> Ajay > >>>> > >>>> > >>>> > >>>> > >>>> The information contained in this electronic message (email) and any > >>>> attachments to this email are intended for the exclusive use of the > >>>> addressee(s) and access to this email by any one else is unauthorised. > >>> The > >>>> email may contain proprietary, confidential or privileged information > >> or > >>>> information relating to Reliance Group. If you are not the intended > >>>> recipient, please notify the sender by telephone, fax, or return email > >>> and > >>>> delete this communication and any attachments thereto, immediately > from > >>>> your computer. Any dissemination, distribution, or copying of this > >>>> communication and the attachments thereto (in whole or part), in any > >>>> manner, is strictly prohibited and actionable at law. The recipient > >>>> acknowledges that emails are susceptible to alteration and their > >>> integrity > >>>> can not be guaranteed and that Company does not guarantee that any > >> e-mail > >>>> is virus-free and accept no liability for any damage caused by any > >> virus > >>>> transmitted by this email. > >>>> > >>> > >> > >
+
Mohammad Tariq 2012-11-22, 08:06
-
Re: Region hot spotting
Suraj Varma 2012-11-21, 17:33
Ajay: Why would you not want to specify splits while creating table? If your 0-10 prefix is at random ... why not pre-split with that? Without presplitting, as Ram says, you cannot avoid region hotspotting until table starts automatic splits. --S On Wed, Nov 21, 2012 at 3:46 AM, Ajay Bhosle <[EMAIL PROTECTED]> wrote: > Thanks for your comments, > > I am already prefixing the timestamp with integer in range of 1..10, also > the hbase.hregion.max.filesize is defined as 256 MB. Still it is hot > spotting. > > Thanks > Ajay > > -----Original Message----- > From: ramkrishna vasudevan [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, November 21, 2012 2:14 PM > To: [EMAIL PROTECTED] > Subject: Re: Region hot spotting > > Hi > This link is pretty much useful. But still there too it says if you dont > pre split you need to wait for the salting to help you from hotspotting > till the region gets splitted. > > Mohammad just pointing this to say the usefulness of presplitting > definitely your's is a good pointer to Ajay. :) > > Regards > Ram > > On Wed, Nov 21, 2012 at 1:59 PM, Mohammad Tariq <[EMAIL PROTECTED]> wrote: > >> Hello Ajay, >> >> You can use 'salting' if you don't want to presplit your table. You might >> this link useful : >> >> > http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspotting-d> espite-writing-records-with-sequential-keys/ >> >> HTH >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Nov 21, 2012 at 1:49 PM, ramkrishna vasudevan < >> [EMAIL PROTECTED]> wrote: >> >> > Hotspotting is bound to happen until the region starts splitting and > gets >> > assigned to diff region servers. >> > >> > Regards >> > Ram >> > >> > On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle >> > <[EMAIL PROTECTED]>wrote: >> > >> > > Hi, >> > > >> > > >> > > >> > > I am inserting some data in hbase which is getting hot spotted in a >> > > particular server. The format of the row key is (0 or >> > > 1)|[timestamp]_[sequence]. Basically I want to add log information to >> > > hbase >> > > and search the records based on range of dates. >> > > >> > > >> > > >> > > Can someone suggest any configuration changes or any ideas on how the >> row >> > > key should be design. I do not want to specify the splits while >> creating >> > > table. >> > > >> > > >> > > >> > > Thanks >> > > >> > > Ajay >> > > >> > > >> > > >> > > >> > > The information contained in this electronic message (email) and any >> > > attachments to this email are intended for the exclusive use of the >> > > addressee(s) and access to this email by any one else is unauthorised. >> > The >> > > email may contain proprietary, confidential or privileged information >> or >> > > information relating to Reliance Group. If you are not the intended >> > > recipient, please notify the sender by telephone, fax, or return email >> > and >> > > delete this communication and any attachments thereto, immediately > from >> > > your computer. Any dissemination, distribution, or copying of this >> > > communication and the attachments thereto (in whole or part), in any >> > > manner, is strictly prohibited and actionable at law. The recipient >> > > acknowledges that emails are susceptible to alteration and their >> > integrity >> > > can not be guaranteed and that Company does not guarantee that any >> e-mail >> > > is virus-free and accept no liability for any damage caused by any >> virus >> > > transmitted by this email. >> > > >> > >> > > > > > The information contained in this electronic message (email) and any attachments to this email are intended for the exclusive use of the addressee(s) and access to this email by any one else is unauthorised. The email may contain proprietary, confidential or privileged information or information relating to Reliance Group. If you are not the intended recipient, please notify the sender by telephone, fax, or return email and delete this communication and any attachments thereto, immediately from your computer. Any dissemination, distribution, or copying of this communication and the attachments thereto (in whole or part), in any manner, is strictly prohibited and actionable at law. The recipient acknowledges that emails are susceptible to alteration and their integrity can not be guaranteed and that Company does not guarantee that any e-mail is virus-free and accept no liability for any damage caused by any virus transmitted by this email.
+
Suraj Varma 2012-11-21, 17:33
-
Re: Region hot spotting
Doug Meil 2012-11-21, 18:52
Hi there- If he's using monotonically increasing keys the pre splits won't help because the same region is going to get all the writes. http://hbase.apache.org/book.html#rowkey.designOn 11/21/12 12:33 PM, "Suraj Varma" <[EMAIL PROTECTED]> wrote: >Ajay: >Why would you not want to specify splits while creating table? If your >0-10 prefix is at random ... why not pre-split with that? > >Without presplitting, as Ram says, you cannot avoid region hotspotting >until table starts automatic splits. >--S > >On Wed, Nov 21, 2012 at 3:46 AM, Ajay Bhosle ><[EMAIL PROTECTED]> wrote: >> Thanks for your comments, >> >> I am already prefixing the timestamp with integer in range of 1..10, >>also >> the hbase.hregion.max.filesize is defined as 256 MB. Still it is hot >> spotting. >> >> Thanks >> Ajay >> >> -----Original Message----- >> From: ramkrishna vasudevan [mailto:[EMAIL PROTECTED]] >> Sent: Wednesday, November 21, 2012 2:14 PM >> To: [EMAIL PROTECTED] >> Subject: Re: Region hot spotting >> >> Hi >> This link is pretty much useful. But still there too it says if you >>dont >> pre split you need to wait for the salting to help you from hotspotting >> till the region gets splitted. >> >> Mohammad just pointing this to say the usefulness of presplitting >> definitely your's is a good pointer to Ajay. :) >> >> Regards >> Ram >> >> On Wed, Nov 21, 2012 at 1:59 PM, Mohammad Tariq <[EMAIL PROTECTED]> >>wrote: >> >>> Hello Ajay, >>> >>> You can use 'salting' if you don't want to presplit your table. You >>>might >>> this link useful : >>> >>> >> >> http://blog.sematext.com/2012/04/09/hbasewd-avoid-regionserver-hotspottin>>g-d >> espite-writing-records-with-sequential-keys/ >>> >>> HTH >>> >>> Regards, >>> Mohammad Tariq >>> >>> >>> >>> On Wed, Nov 21, 2012 at 1:49 PM, ramkrishna vasudevan < >>> [EMAIL PROTECTED]> wrote: >>> >>> > Hotspotting is bound to happen until the region starts splitting and >> gets >>> > assigned to diff region servers. >>> > >>> > Regards >>> > Ram >>> > >>> > On Wed, Nov 21, 2012 at 12:49 PM, Ajay Bhosle >>> > <[EMAIL PROTECTED]>wrote: >>> > >>> > > Hi, >>> > > >>> > > >>> > > >>> > > I am inserting some data in hbase which is getting hot spotted in a >>> > > particular server. The format of the row key is (0 or >>> > > 1)|[timestamp]_[sequence]. Basically I want to add log >>>information to >>> > > hbase >>> > > and search the records based on range of dates. >>> > > >>> > > >>> > > >>> > > Can someone suggest any configuration changes or any ideas on how >>>the >>> row >>> > > key should be design. I do not want to specify the splits while >>> creating >>> > > table. >>> > > >>> > > >>> > > >>> > > Thanks >>> > > >>> > > Ajay >>> > > >>> > > >>> > > >>> > > >>> > > The information contained in this electronic message (email) and >>>any >>> > > attachments to this email are intended for the exclusive use of the >>> > > addressee(s) and access to this email by any one else is >>>unauthorised. >>> > The >>> > > email may contain proprietary, confidential or privileged >>>information >>> or >>> > > information relating to Reliance Group. If you are not the intended >>> > > recipient, please notify the sender by telephone, fax, or return >>>email >>> > and >>> > > delete this communication and any attachments thereto, immediately >> from >>> > > your computer. Any dissemination, distribution, or copying of this >>> > > communication and the attachments thereto (in whole or part), in >>>any >>> > > manner, is strictly prohibited and actionable at law. The recipient >>> > > acknowledges that emails are susceptible to alteration and their >>> > integrity >>> > > can not be guaranteed and that Company does not guarantee that any >>> e-mail >>> > > is virus-free and accept no liability for any damage caused by any >>> virus >>> > > transmitted by this email. >>> > > >>> > >>> >> >> >> >> >> The information contained in this electronic message (email) and any
+
Doug Meil 2012-11-21, 18:52
|
|