|
陈加俊
2011-01-07, 12:30
Stack
2011-01-07, 21:32
Andrey Stepachev
2011-01-08, 19:36
M. C. Srivas
2011-01-08, 20:10
Stack
2011-01-09, 04:56
Andrey Stepachev
2011-01-09, 20:29
Stack
2011-01-10, 19:02
Andrey Stepachev
2011-01-10, 23:02
Tost
2011-01-10, 23:59
|
-
How to rename table's family name陈加俊 2011-01-07, 12:30
Hi everyone!
How to rename the table's family name. I created the table and it's families , and insert many data into it, but I want to rename one family name now premise is not lost data .
-
Re: How to rename table's family nameStack 2011-01-07, 21:32
On Fri, Jan 7, 2011 at 4:30 AM, 陈加俊 <[EMAIL PROTECTED]> wrote:
> Hi everyone! > > How to rename the table's family name. > > I created the table and it's families , and insert many data into it, but I > want to rename one family name now premise is not lost data . > We do not have a mechanism to do this 陈加俊. Currently at least, the CF name is part of each entry persisted to the filesystem so a rename would mean a rewrite of all data. Perhaps we should consider detaching CF name from whats stored? St.Ack
-
Re: How to rename table's family nameAndrey Stepachev 2011-01-08, 19:36
2011/1/8 Stack <[EMAIL PROTECTED]>
> > > Perhaps we should consider > detaching CF name from whats stored? > Yes! Are there any jira? I'll vote for it. > > St.Ack >
-
Re: How to rename table's family nameM. C. Srivas 2011-01-08, 20:10
In general. there's need for a loose "schema" to allow not only renames of
columns and column-families, but efficient delete of entire columns or CFs. (eg, mark this C as deleted in the "schema" and remove it during the next major compaction). But implementing the master-coordination for this (for instance, all RS's should delete 'atomically') will be interesting .... On Sat, Jan 8, 2011 at 11:36 AM, Andrey Stepachev <[EMAIL PROTECTED]> wrote: > 2011/1/8 Stack <[EMAIL PROTECTED]> > > > > > > > > Perhaps we should consider > > detaching CF name from whats stored? > > > > Yes! Are there any jira? I'll vote for it. > > > > > > St.Ack > > >
-
Re: How to rename table's family nameStack 2011-01-09, 04:56
Thanks for the below M. C. I like this delete suggestion. Plan is
in 0.92 or 0.94 moving schema out of .META. up into zk. We're aiming for online schema editing w/o having to take a table offline. When schema changes, regionserves are notified and take approriate action. To respond to Andrey, you seem to be asking about the old issue HBASE-68. It hasn't seen much attention of late. Its more about saving space rather than facilitating rename (though the latter is a legit case). St.Ack On Sat, Jan 8, 2011 at 12:10 PM, M. C. Srivas <[EMAIL PROTECTED]> wrote: > In general. there's need for a loose "schema" to allow not only renames of > columns and column-families, but efficient delete of entire columns or CFs. > (eg, mark this C as deleted in the "schema" and remove it during the next > major compaction). But implementing the master-coordination for this (for > instance, all RS's should delete 'atomically') will be interesting .... > > > > On Sat, Jan 8, 2011 at 11:36 AM, Andrey Stepachev <[EMAIL PROTECTED]> wrote: > >> 2011/1/8 Stack <[EMAIL PROTECTED]> >> >> > >> > >> >> > Perhaps we should consider >> > detaching CF name from whats stored? >> > >> >> Yes! Are there any jira? I'll vote for it. >> >> >> > >> > St.Ack >> > >> >
-
Re: How to rename table's family nameAndrey Stepachev 2011-01-09, 20:29
2011/1/9 Stack <[EMAIL PROTECTED]>
> > To respond to Andrey, you seem to be asking about the old issue > HBASE-68. It hasn't seen much attention of late. Its more about > saving space rather than facilitating rename (though the latter is a > legit case). > Thanks. But saving space is not an issue. Really renaming with going offline (rename table) or reload data (rename cf) is issue (especially in case of live data migration on version upgrade on production cluster). > > St.Ack > > > On Sat, Jan 8, 2011 at 12:10 PM, M. C. Srivas <[EMAIL PROTECTED]> wrote: > > In general. there's need for a loose "schema" to allow not only renames > of > > columns and column-families, but efficient delete of entire columns or > CFs. > > (eg, mark this C as deleted in the "schema" and remove it during the next > > major compaction). But implementing the master-coordination for this (for > > instance, all RS's should delete 'atomically') will be interesting .... > > > > > > > > On Sat, Jan 8, 2011 at 11:36 AM, Andrey Stepachev <[EMAIL PROTECTED]> > wrote: > > > >> 2011/1/8 Stack <[EMAIL PROTECTED]> > >> > >> > > >> > > >> > >> > Perhaps we should consider > >> > detaching CF name from whats stored? > >> > > >> > >> Yes! Are there any jira? I'll vote for it. > >> > >> > >> > > >> > St.Ack > >> > > >> > > >
-
Re: How to rename table's family nameStack 2011-01-10, 19:02
Want to make a new issue then Andrey? Thanks. We could knock out
hbase-68 at same time. When would we do the replacement of code with actual columnfamily name? Over on client or on server before result is sent the client? St.Ack On Sun, Jan 9, 2011 at 12:29 PM, Andrey Stepachev <[EMAIL PROTECTED]> wrote: > 2011/1/9 Stack <[EMAIL PROTECTED]> > >> >> To respond to Andrey, you seem to be asking about the old issue >> HBASE-68. It hasn't seen much attention of late. Its more about >> saving space rather than facilitating rename (though the latter is a >> legit case). >> > > Thanks. But saving space is not an issue. Really renaming with going > offline (rename table) or reload data (rename cf) is issue (especially in > case of live data migration on version upgrade on production cluster). > > >> >> St.Ack >> >> >> On Sat, Jan 8, 2011 at 12:10 PM, M. C. Srivas <[EMAIL PROTECTED]> wrote: >> > In general. there's need for a loose "schema" to allow not only renames >> of >> > columns and column-families, but efficient delete of entire columns or >> CFs. >> > (eg, mark this C as deleted in the "schema" and remove it during the next >> > major compaction). But implementing the master-coordination for this (for >> > instance, all RS's should delete 'atomically') will be interesting .... >> > >> > >> > >> > On Sat, Jan 8, 2011 at 11:36 AM, Andrey Stepachev <[EMAIL PROTECTED]> >> wrote: >> > >> >> 2011/1/8 Stack <[EMAIL PROTECTED]> >> >> >> >> > >> >> > >> >> >> >> > Perhaps we should consider >> >> > detaching CF name from whats stored? >> >> > >> >> >> >> Yes! Are there any jira? I'll vote for it. >> >> >> >> >> >> > >> >> > St.Ack >> >> > >> >> >> > >> >
-
Re: How to rename table's family nameAndrey Stepachev 2011-01-10, 23:02
Stack, I don't think, that I know good solution, but I see some possible way
how this can be implemented. I think, that rename should be atomic (in case of schema in zookeeper, it is possible). So, the application developer should take care on how application should handle this rename. And because it will live in zookeeper application can listen for changes and take needed actions. If we stilll store r/cf/q/v tuples in hfile we can consider simple logic: 1. all cf should have uuids to identify renames like a->b->a 2. region server sees cf rename and begin rename all results from not converted hfiles with old cf 2. if hfile converted, no renames needed at all 3. actual keyvalues rewrite performed on compaction If some denormalization will be implemented (like don't store cf in each keyvalue in hfile at all) then nothing needed, cf name always reconstructed on Result instantination. To exclude possibility of some races, like a->b->a, we must store history of all renames to help RS identify was or not given cf renamed. 2011/1/10 Stack <[EMAIL PROTECTED]> > Want to make a new issue then Andrey? Thanks. We could knock out > hbase-68 at same time. When would we do the replacement of code with > actual columnfamily name? Over on client or on server before result > is sent the client? > St.Ack > > On Sun, Jan 9, 2011 at 12:29 PM, Andrey Stepachev <[EMAIL PROTECTED]> > wrote: > > 2011/1/9 Stack <[EMAIL PROTECTED]> > > > >> > >> To respond to Andrey, you seem to be asking about the old issue > >> HBASE-68. It hasn't seen much attention of late. Its more about > >> saving space rather than facilitating rename (though the latter is a > >> legit case). > >> > > > > Thanks. But saving space is not an issue. Really renaming with going > > offline (rename table) or reload data (rename cf) is issue (especially in > > case of live data migration on version upgrade on production cluster). > > > > > >> > >> St.Ack > >> > >> > >> On Sat, Jan 8, 2011 at 12:10 PM, M. C. Srivas <[EMAIL PROTECTED]> > wrote: > >> > In general. there's need for a loose "schema" to allow not only > renames > >> of > >> > columns and column-families, but efficient delete of entire columns or > >> CFs. > >> > (eg, mark this C as deleted in the "schema" and remove it during the > next > >> > major compaction). But implementing the master-coordination for this > (for > >> > instance, all RS's should delete 'atomically') will be interesting > .... > >> > > >> > > >> > > >> > On Sat, Jan 8, 2011 at 11:36 AM, Andrey Stepachev <[EMAIL PROTECTED]> > >> wrote: > >> > > >> >> 2011/1/8 Stack <[EMAIL PROTECTED]> > >> >> > >> >> > > >> >> > > >> >> > >> >> > Perhaps we should consider > >> >> > detaching CF name from whats stored? > >> >> > > >> >> > >> >> Yes! Are there any jira? I'll vote for it. > >> >> > >> >> > >> >> > > >> >> > St.Ack > >> >> > > >> >> > >> > > >> > > >
-
Re: How to rename table's family nameTost 2011-01-10, 23:59
I couldn't find the way.
family cannot be changed without deleting and recreating it. 2011/1/11 Andrey Stepachev <[EMAIL PROTECTED]> > Stack, I don't think, that I know good solution, but I see some possible > way > how > this can be implemented. > > I think, that rename should be atomic (in case of schema in zookeeper, it > is > possible). > So, the application developer should take care on how application should > handle this rename. And because it will live in zookeeper application can > listen for changes and take needed actions. > > If we stilll store r/cf/q/v tuples in hfile we can consider simple logic: > 1. all cf should have uuids to identify renames like a->b->a > 2. region server sees cf rename and begin rename all results from not > converted > hfiles with old cf > 2. if hfile converted, no renames needed at all > 3. actual keyvalues rewrite performed on compaction > > If some denormalization will be implemented (like don't store cf in each > keyvalue in hfile at all) > then nothing needed, cf name always reconstructed on Result instantination. > > To exclude possibility of some races, like a->b->a, we must store history > of > all renames to > help RS identify was or not given cf renamed. > > 2011/1/10 Stack <[EMAIL PROTECTED]> > > > Want to make a new issue then Andrey? Thanks. We could knock out > > hbase-68 at same time. When would we do the replacement of code with > > actual columnfamily name? Over on client or on server before result > > is sent the client? > > St.Ack > > > > On Sun, Jan 9, 2011 at 12:29 PM, Andrey Stepachev <[EMAIL PROTECTED]> > > wrote: > > > 2011/1/9 Stack <[EMAIL PROTECTED]> > > > > > >> > > >> To respond to Andrey, you seem to be asking about the old issue > > >> HBASE-68. It hasn't seen much attention of late. Its more about > > >> saving space rather than facilitating rename (though the latter is a > > >> legit case). > > >> > > > > > > Thanks. But saving space is not an issue. Really renaming with going > > > offline (rename table) or reload data (rename cf) is issue (especially > in > > > case of live data migration on version upgrade on production cluster). > > > > > > > > >> > > >> St.Ack > > >> > > >> > > >> On Sat, Jan 8, 2011 at 12:10 PM, M. C. Srivas <[EMAIL PROTECTED]> > > wrote: > > >> > In general. there's need for a loose "schema" to allow not only > > renames > > >> of > > >> > columns and column-families, but efficient delete of entire columns > or > > >> CFs. > > >> > (eg, mark this C as deleted in the "schema" and remove it during the > > next > > >> > major compaction). But implementing the master-coordination for this > > (for > > >> > instance, all RS's should delete 'atomically') will be interesting > > .... > > >> > > > >> > > > >> > > > >> > On Sat, Jan 8, 2011 at 11:36 AM, Andrey Stepachev <[EMAIL PROTECTED] > > > > >> wrote: > > >> > > > >> >> 2011/1/8 Stack <[EMAIL PROTECTED]> > > >> >> > > >> >> > > > >> >> > > > >> >> > > >> >> > Perhaps we should consider > > >> >> > detaching CF name from whats stored? > > >> >> > > > >> >> > > >> >> Yes! Are there any jira? I'll vote for it. > > >> >> > > >> >> > > >> >> > > > >> >> > St.Ack > > >> >> > > > >> >> > > >> > > > >> > > > > > > |