-
Schema Updates: what do you do today?
Ian Varley 2012-04-09, 16:08
All:

I'm doing a little research into various ways to apply schema modifications to an HBase cluster. Anybody care to share with the list what you currently do? E.g.

- Connect via the HBase shell and manually issue commands ("create", "disable", "alter", etc.)
- Write one-off scripts that do the above
- Write tools that read from a static schema definition and then apply changes to a cluster (e.g. using HBaseAdmin)

etc. My supposition is that some additional tooling in this area, to consolidate stuff everybody already does on their own, might be helpful. In light of recent discussions on the dev list about various ways to alter the schema on a running cluster, it seems like this area is still a bit of a "wild west" in the HBase community, both in how HBase works and in what people do in practice.

What do you do today for schema changes, and what would you like to do in an ideal world?

Thanks,
Ian
-
Re: Schema Updates: what do you do today?
Andrew Purtell 2012-04-09, 16:56
Manual schema changes via one-off shell scripts.
What I would like to do is write code that gets the HTD, checks if all of the schema structure and features are as they should be, and, if not, makes the necessary modifications without taking the table offline. (I typically write code like that which does offlining first. In practice, it creates the table if it is missing in some test environment, later it is disabled.) It could be possible to update HTD and HCD attributes without offlining, possibly even to add CFs. I wouldn't expect all admin actions could be accomplished without offlining.

Best regards,

- Andy

Problems worthy of attack prove their worth by hitting back. - Piet Hein (via Tom White)
-
Re: Schema Updates: what do you do today?
Ian Varley 2012-04-09, 19:39
Thanks, Andy. Yeah, a tool that compares a schema definition with a running cluster, and gives you a way to apply changes (without offlining, where possible), would be pretty sweet.
Anybody else think so? Or, do you have tools you've already written for this? Seems like a common need (we also need that, and have started tools for it internally).

Ian
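
To make the "compare a schema definition with a running cluster" idea concrete, here is a minimal sketch in Python. It is not taken from any tool mentioned in this thread and does not talk to HBase: both the desired and the actual schema are assumed to be plain dicts of the form {table: {family: {attribute: value}}}, and the names plan_changes, Change, and ONLINE_SAFE are hypothetical. Which changes can be applied without offlining is also only an assumption here (the thread says it "could be possible"), so that policy is a parameter.

```python
from dataclasses import dataclass

# Hypothetical policy: kinds of change we ASSUME can be applied online.
# Adjust this to whatever your HBase version actually supports.
ONLINE_SAFE = {"create_table", "modify_family"}


@dataclass(frozen=True)
class Change:
    kind: str            # "create_table", "add_family", or "modify_family"
    table: str
    family: str = ""
    needs_offline: bool = False


def plan_changes(desired, actual, online_safe=ONLINE_SAFE):
    """Compare two {table: {family: {attr: value}}} dicts and list changes."""
    changes = []
    for table, families in sorted(desired.items()):
        if table not in actual:
            # Missing table: create it with all of its families at once.
            changes.append(Change("create_table", table))
            continue
        for family, attrs in sorted(families.items()):
            current = actual[table].get(family)
            if current is None:
                kind = "add_family"
            elif any(current.get(k) != v for k, v in attrs.items()):
                kind = "modify_family"
            else:
                continue  # family already matches the definition
            changes.append(Change(kind, table, family, kind not in online_safe))
    return changes


# Illustrative data only: table and family names are made up.
desired = {
    "events": {"d": {"VERSIONS": 1, "COMPRESSION": "GZ"}, "m": {"VERSIONS": 3}},
    "users": {"info": {"VERSIONS": 1}},
}
actual = {"events": {"d": {"VERSIONS": 3, "COMPRESSION": "GZ"}}}

for change in plan_changes(desired, actual):
    print(change)
```

The sketch only plans changes; applying them (via the shell or HBaseAdmin) is left out. It also never drops tables or families that exist on the cluster but are absent from the definition, which seems like the safer default for a first version.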
-
Re: Schema Updates: what do you do today?
Alex Baranau 2012-04-10, 03:23
I think I saw one effort to create a nice tool for doing that a long time ago... Aha, here it is: https://github.com/larsgeorge/hbase-schema-manager. Might be outdated. Lars?

As for us, we make changes really rarely (we usually have one table with one column family in it), so one-off shell scripts work well.

Alex Baranau
------
Sematext :: http://blog.sematext.com/ :: Solr - Lucene - Hadoop - HBase
-
Re: Schema Updates: what do you do today?
arnaud but 2012-04-10, 08:13
+1, this kind of tool is very nice.
-
Re: Schema Updates: what do you do today?
Jean-Daniel Cryans 2012-04-12, 17:00
Still doing it old school unless I have to alter a bunch of tables for the same reason, like enabling replication.

J-D