|
Lord Khan Han
2011-12-09, 21:50
Jean-Daniel Cryans
2011-12-09, 21:53
Lord Khan Han
2011-12-09, 21:56
Jean-Daniel Cryans
2011-12-09, 21:58
Lord Khan Han
2011-12-09, 22:00
Lord Khan Han
2011-12-09, 22:02
Jean-Daniel Cryans
2011-12-09, 22:02
Lord Khan Han
2011-12-09, 22:05
lars hofhansl
2011-12-09, 21:58
Lord Khan Han
2011-12-09, 22:07
Lord Khan Han
2011-12-09, 22:11
Lord Khan Han
2011-12-09, 22:27
lars hofhansl
2011-12-09, 22:42
Lord Khan Han
2011-12-09, 22:45
lars hofhansl
2011-12-09, 23:39
Lord Khan Han
2011-12-10, 00:05
lars hofhansl
2011-12-10, 00:19
Lord Khan Han
2011-12-10, 07:53
Lars George
2011-12-10, 11:03
Lord Khan Han
2011-12-10, 16:40
Lord Khan Han
2011-12-10, 18:09
lars hofhansl
2011-12-11, 01:07
Lord Khan Han
2011-12-13, 13:15
Stack
2011-12-14, 06:00
|
-
Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-09, 21:50
Hi ,
We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this we make a separate cluster with same config and installed CDH3u2. We exported our hbase table from cdh3b4 cluster and import it to the new cdh3u2 cluster. Table is LZO and both cluster config is same. After import finished hbase table size doubled!! even its configured to use LZO. We changed table to snappy import again and same result. Table size multiplied x 2 in new cdh3u2 cluster. We didnt find why ? Is there any ideas for this ? thanks Khan +
Lord Khan Han 2011-12-09, 21:50
-
Re: Hbase export / import Why doubling the Table Size ?Jean-Daniel Cryans 2011-12-09, 21:53
How are you measuring the size? hadoop dfs -dus /hbase or only that
table's folder? J-D On Fri, Dec 9, 2011 at 1:50 PM, Lord Khan Han <[EMAIL PROTECTED]> wrote: > Hi , > > We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this > we make a separate cluster with same config and installed CDH3u2. > > We exported our hbase table from cdh3b4 cluster and import it to the > new cdh3u2 cluster. Table is LZO and both cluster config is same. > > After import finished hbase table size doubled!! even its configured > to use LZO. We changed table to snappy import again and same result. > Table size multiplied x 2 in new cdh3u2 cluster. > > We didnt find why ? Is there any ideas for this ? > > thanks > > Khan +
Jean-Daniel Cryans 2011-12-09, 21:53
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-09, 21:56
yes exactly. hadoop dfs -du /hbase its gives us all the table sizes...
funny thing table with lzo size bigger than the exported FILE size also.. really strange... On Fri, Dec 9, 2011 at 11:53 PM, Jean-Daniel Cryans <[EMAIL PROTECTED]>wrote: > How are you measuring the size? hadoop dfs -dus /hbase or only that > table's folder? > > J-D > > On Fri, Dec 9, 2011 at 1:50 PM, Lord Khan Han <[EMAIL PROTECTED]> > wrote: > > Hi , > > > > We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this > > we make a separate cluster with same config and installed CDH3u2. > > > > We exported our hbase table from cdh3b4 cluster and import it to the > > new cdh3u2 cluster. Table is LZO and both cluster config is same. > > > > After import finished hbase table size doubled!! even its configured > > to use LZO. We changed table to snappy import again and same result. > > Table size multiplied x 2 in new cdh3u2 cluster. > > > > We didnt find why ? Is there any ideas for this ? > > > > thanks > > > > Khan > +
Lord Khan Han 2011-12-09, 21:56
-
Re: Hbase export / import Why doubling the Table Size ?Jean-Daniel Cryans 2011-12-09, 21:58
The region servers store their write-ahead logs in /hbase/.logs and
they are archived .oldlogs, you are probably measuring that too. J-D On Fri, Dec 9, 2011 at 1:56 PM, Lord Khan Han <[EMAIL PROTECTED]> wrote: > yes exactly. hadoop dfs -du /hbase its gives us all the table sizes... > funny thing table with lzo size bigger than the exported FILE size also.. > really strange... > > > > > > On Fri, Dec 9, 2011 at 11:53 PM, Jean-Daniel Cryans <[EMAIL PROTECTED]>wrote: > >> How are you measuring the size? hadoop dfs -dus /hbase or only that >> table's folder? >> >> J-D >> >> On Fri, Dec 9, 2011 at 1:50 PM, Lord Khan Han <[EMAIL PROTECTED]> >> wrote: >> > Hi , >> > >> > We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this >> > we make a separate cluster with same config and installed CDH3u2. >> > >> > We exported our hbase table from cdh3b4 cluster and import it to the >> > new cdh3u2 cluster. Table is LZO and both cluster config is same. >> > >> > After import finished hbase table size doubled!! even its configured >> > to use LZO. We changed table to snappy import again and same result. >> > Table size multiplied x 2 in new cdh3u2 cluster. >> > >> > We didnt find why ? Is there any ideas for this ? >> > >> > thanks >> > >> > Khan >> +
Jean-Daniel Cryans 2011-12-09, 21:58
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-09, 22:00
is this logs files inside the tables directory ?
On Fri, Dec 9, 2011 at 11:58 PM, Jean-Daniel Cryans <[EMAIL PROTECTED]>wrote: > The region servers store their write-ahead logs in /hbase/.logs and > they are archived .oldlogs, you are probably measuring that too. > > J-D > > On Fri, Dec 9, 2011 at 1:56 PM, Lord Khan Han <[EMAIL PROTECTED]> > wrote: > > yes exactly. hadoop dfs -du /hbase its gives us all the table sizes... > > funny thing table with lzo size bigger than the exported FILE size > also.. > > really strange... > > > > > > > > > > > > On Fri, Dec 9, 2011 at 11:53 PM, Jean-Daniel Cryans <[EMAIL PROTECTED] > >wrote: > > > >> How are you measuring the size? hadoop dfs -dus /hbase or only that > >> table's folder? > >> > >> J-D > >> > >> On Fri, Dec 9, 2011 at 1:50 PM, Lord Khan Han <[EMAIL PROTECTED]> > >> wrote: > >> > Hi , > >> > > >> > We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this > >> > we make a separate cluster with same config and installed CDH3u2. > >> > > >> > We exported our hbase table from cdh3b4 cluster and import it to the > >> > new cdh3u2 cluster. Table is LZO and both cluster config is same. > >> > > >> > After import finished hbase table size doubled!! even its configured > >> > to use LZO. We changed table to snappy import again and same result. > >> > Table size multiplied x 2 in new cdh3u2 cluster. > >> > > >> > We didnt find why ? Is there any ideas for this ? > >> > > >> > thanks > >> > > >> > Khan > >> > +
Lord Khan Han 2011-12-09, 22:00
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-09, 22:02
there is only .logs and .oldlogs in the /hbase but I am looking the
/hbase/table1 size. On Sat, Dec 10, 2011 at 12:00 AM, Lord Khan Han <[EMAIL PROTECTED]>wrote: > is this logs files inside the tables directory ? > > > > On Fri, Dec 9, 2011 at 11:58 PM, Jean-Daniel Cryans <[EMAIL PROTECTED]>wrote: > >> The region servers store their write-ahead logs in /hbase/.logs and >> they are archived .oldlogs, you are probably measuring that too. >> >> J-D >> >> On Fri, Dec 9, 2011 at 1:56 PM, Lord Khan Han <[EMAIL PROTECTED]> >> wrote: >> > yes exactly. hadoop dfs -du /hbase its gives us all the table >> sizes... >> > funny thing table with lzo size bigger than the exported FILE size >> also.. >> > really strange... >> > >> > >> > >> > >> > >> > On Fri, Dec 9, 2011 at 11:53 PM, Jean-Daniel Cryans < >> [EMAIL PROTECTED]>wrote: >> > >> >> How are you measuring the size? hadoop dfs -dus /hbase or only that >> >> table's folder? >> >> >> >> J-D >> >> >> >> On Fri, Dec 9, 2011 at 1:50 PM, Lord Khan Han <[EMAIL PROTECTED] >> > >> >> wrote: >> >> > Hi , >> >> > >> >> > We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this >> >> > we make a separate cluster with same config and installed CDH3u2. >> >> > >> >> > We exported our hbase table from cdh3b4 cluster and import it to >> the >> >> > new cdh3u2 cluster. Table is LZO and both cluster config is same. >> >> > >> >> > After import finished hbase table size doubled!! even its configured >> >> > to use LZO. We changed table to snappy import again and same >> result. >> >> > Table size multiplied x 2 in new cdh3u2 cluster. >> >> > >> >> > We didnt find why ? Is there any ideas for this ? >> >> > >> >> > thanks >> >> > >> >> > Khan >> >> >> > > +
Lord Khan Han 2011-12-09, 22:02
-
Re: Hbase export / import Why doubling the Table Size ?Jean-Daniel Cryans 2011-12-09, 22:02
No like I wrote they are at /hbase/.logs
J-D On Fri, Dec 9, 2011 at 2:00 PM, Lord Khan Han <[EMAIL PROTECTED]> wrote: > is this logs files inside the tables directory ? > > > > On Fri, Dec 9, 2011 at 11:58 PM, Jean-Daniel Cryans <[EMAIL PROTECTED]>wrote: > >> The region servers store their write-ahead logs in /hbase/.logs and >> they are archived .oldlogs, you are probably measuring that too. >> >> J-D >> >> On Fri, Dec 9, 2011 at 1:56 PM, Lord Khan Han <[EMAIL PROTECTED]> >> wrote: >> > yes exactly. hadoop dfs -du /hbase its gives us all the table sizes... >> > funny thing table with lzo size bigger than the exported FILE size >> also.. >> > really strange... >> > >> > >> > >> > >> > >> > On Fri, Dec 9, 2011 at 11:53 PM, Jean-Daniel Cryans <[EMAIL PROTECTED] >> >wrote: >> > >> >> How are you measuring the size? hadoop dfs -dus /hbase or only that >> >> table's folder? >> >> >> >> J-D >> >> >> >> On Fri, Dec 9, 2011 at 1:50 PM, Lord Khan Han <[EMAIL PROTECTED]> >> >> wrote: >> >> > Hi , >> >> > >> >> > We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this >> >> > we make a separate cluster with same config and installed CDH3u2. >> >> > >> >> > We exported our hbase table from cdh3b4 cluster and import it to the >> >> > new cdh3u2 cluster. Table is LZO and both cluster config is same. >> >> > >> >> > After import finished hbase table size doubled!! even its configured >> >> > to use LZO. We changed table to snappy import again and same result. >> >> > Table size multiplied x 2 in new cdh3u2 cluster. >> >> > >> >> > We didnt find why ? Is there any ideas for this ? >> >> > >> >> > thanks >> >> > >> >> > Khan >> >> >> +
Jean-Daniel Cryans 2011-12-09, 22:02
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-09, 22:05
i am looking with hadoop dfs -du /hbase (not using -dus) so in the
hbase directory I can see all directories size. On Sat, Dec 10, 2011 at 12:02 AM, Jean-Daniel Cryans <[EMAIL PROTECTED]>wrote: > No like I wrote they are at /hbase/.logs > > J-D > > On Fri, Dec 9, 2011 at 2:00 PM, Lord Khan Han <[EMAIL PROTECTED]> > wrote: > > is this logs files inside the tables directory ? > > > > > > > > On Fri, Dec 9, 2011 at 11:58 PM, Jean-Daniel Cryans <[EMAIL PROTECTED] > >wrote: > > > >> The region servers store their write-ahead logs in /hbase/.logs and > >> they are archived .oldlogs, you are probably measuring that too. > >> > >> J-D > >> > >> On Fri, Dec 9, 2011 at 1:56 PM, Lord Khan Han <[EMAIL PROTECTED]> > >> wrote: > >> > yes exactly. hadoop dfs -du /hbase its gives us all the table > sizes... > >> > funny thing table with lzo size bigger than the exported FILE size > >> also.. > >> > really strange... > >> > > >> > > >> > > >> > > >> > > >> > On Fri, Dec 9, 2011 at 11:53 PM, Jean-Daniel Cryans < > [EMAIL PROTECTED] > >> >wrote: > >> > > >> >> How are you measuring the size? hadoop dfs -dus /hbase or only that > >> >> table's folder? > >> >> > >> >> J-D > >> >> > >> >> On Fri, Dec 9, 2011 at 1:50 PM, Lord Khan Han < > [EMAIL PROTECTED]> > >> >> wrote: > >> >> > Hi , > >> >> > > >> >> > We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing > this > >> >> > we make a separate cluster with same config and installed CDH3u2. > >> >> > > >> >> > We exported our hbase table from cdh3b4 cluster and import it to > the > >> >> > new cdh3u2 cluster. Table is LZO and both cluster config is same. > >> >> > > >> >> > After import finished hbase table size doubled!! even its > configured > >> >> > to use LZO. We changed table to snappy import again and same > result. > >> >> > Table size multiplied x 2 in new cdh3u2 cluster. > >> >> > > >> >> > We didnt find why ? Is there any ideas for this ? > >> >> > > >> >> > thanks > >> >> > > >> >> > Khan > >> >> > >> > +
Lord Khan Han 2011-12-09, 22:05
-
Re: Hbase export / import Why doubling the Table Size ?lars hofhansl 2011-12-09, 21:58
Can you try flushing and compacting the table? How did you measure the size?
Both can be done from the shell using the 'flush' and 'major_compact' commands, resp. ----- Original Message ----- From: Lord Khan Han <[EMAIL PROTECTED]> To: [EMAIL PROTECTED] Cc: Sent: Friday, December 9, 2011 1:50 PM Subject: Hbase export / import Why doubling the Table Size ? Hi , We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this we make a separate cluster with same config and installed CDH3u2. We exported our hbase table from cdh3b4 cluster and import it to the new cdh3u2 cluster. Table is LZO and both cluster config is same. After import finished hbase table size doubled!! even its configured to use LZO. We changed table to snappy import again and same result. Table size multiplied x 2 in new cdh3u2 cluster. We didnt find why ? Is there any ideas for this ? thanks Khan +
lars hofhansl 2011-12-09, 21:58
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-09, 22:07
I made major_compact but not flush... will do now with flush..
On Fri, Dec 9, 2011 at 11:58 PM, lars hofhansl <[EMAIL PROTECTED]> wrote: > Can you try flushing and compacting the table? How did you measure the > size? > > Both can be done from the shell using the 'flush' and 'major_compact' > commands, resp. > > > > ----- Original Message ----- > From: Lord Khan Han <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED] > Cc: > Sent: Friday, December 9, 2011 1:50 PM > Subject: Hbase export / import Why doubling the Table Size ? > > Hi , > > We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this > we make a separate cluster with same config and installed CDH3u2. > > We exported our hbase table from cdh3b4 cluster and import it to the > new cdh3u2 cluster. Table is LZO and both cluster config is same. > > After import finished hbase table size doubled!! even its configured > to use LZO. We changed table to snappy import again and same result. > Table size multiplied x 2 in new cdh3u2 cluster. > > We didnt find why ? Is there any ideas for this ? > > thanks > > Khan > > +
Lord Khan Han 2011-12-09, 22:07
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-09, 22:11
Now flushed and compacting again..
one more clue: I tested to import CDH3B4 (same as exported cluster) with lzo.. all is okay.. table size is same.. than I upgrade to cdh3u2 table also is ok and same size.. But when I try to import in cdh3u2 this size doubling happens.. On Sat, Dec 10, 2011 at 12:07 AM, Lord Khan Han <[EMAIL PROTECTED]>wrote: > I made major_compact but not flush... will do now with flush.. > > > > On Fri, Dec 9, 2011 at 11:58 PM, lars hofhansl <[EMAIL PROTECTED]>wrote: > >> Can you try flushing and compacting the table? How did you measure the >> size? >> >> Both can be done from the shell using the 'flush' and 'major_compact' >> commands, resp. >> >> >> >> ----- Original Message ----- >> From: Lord Khan Han <[EMAIL PROTECTED]> >> To: [EMAIL PROTECTED] >> Cc: >> Sent: Friday, December 9, 2011 1:50 PM >> Subject: Hbase export / import Why doubling the Table Size ? >> >> Hi , >> >> We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this >> we make a separate cluster with same config and installed CDH3u2. >> >> We exported our hbase table from cdh3b4 cluster and import it to the >> new cdh3u2 cluster. Table is LZO and both cluster config is same. >> >> After import finished hbase table size doubled!! even its configured >> to use LZO. We changed table to snappy import again and same result. >> Table size multiplied x 2 in new cdh3u2 cluster. >> >> We didnt find why ? Is there any ideas for this ? >> >> thanks >> >> Khan >> >> > +
Lord Khan Han 2011-12-09, 22:11
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-09, 22:27
I flush ed and major_compact ed .. nothing changed... i am stuck this
last two days...:( any idea? On Sat, Dec 10, 2011 at 12:11 AM, Lord Khan Han <[EMAIL PROTECTED]>wrote: > Now flushed and compacting again.. > > one more clue: > > I tested to import CDH3B4 (same as exported cluster) with lzo.. all is > okay.. table size is same.. > than I upgrade to cdh3u2 table also is ok and same size.. > > But when I try to import in cdh3u2 this size doubling happens.. > > > > > On Sat, Dec 10, 2011 at 12:07 AM, Lord Khan Han <[EMAIL PROTECTED]>wrote: > >> I made major_compact but not flush... will do now with flush.. >> >> >> >> On Fri, Dec 9, 2011 at 11:58 PM, lars hofhansl <[EMAIL PROTECTED]>wrote: >> >>> Can you try flushing and compacting the table? How did you measure the >>> size? >>> >>> Both can be done from the shell using the 'flush' and 'major_compact' >>> commands, resp. >>> >>> >>> >>> ----- Original Message ----- >>> From: Lord Khan Han <[EMAIL PROTECTED]> >>> To: [EMAIL PROTECTED] >>> Cc: >>> Sent: Friday, December 9, 2011 1:50 PM >>> Subject: Hbase export / import Why doubling the Table Size ? >>> >>> Hi , >>> >>> We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this >>> we make a separate cluster with same config and installed CDH3u2. >>> >>> We exported our hbase table from cdh3b4 cluster and import it to the >>> new cdh3u2 cluster. Table is LZO and both cluster config is same. >>> >>> After import finished hbase table size doubled!! even its configured >>> to use LZO. We changed table to snappy import again and same result. >>> Table size multiplied x 2 in new cdh3u2 cluster. >>> >>> We didnt find why ? Is there any ideas for this ? >>> >>> thanks >>> >>> Khan >>> >>> >> > +
Lord Khan Han 2011-12-09, 22:27
-
Re: Hbase export / import Why doubling the Table Size ?lars hofhansl 2011-12-09, 22:42
What happens when you export/import into the same (CDH3B4) cluster using a new table name?
Does the size double as well? ----- Original Message ----- From: Lord Khan Han <[EMAIL PROTECTED]> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> Cc: Sent: Friday, December 9, 2011 2:27 PM Subject: Re: Hbase export / import Why doubling the Table Size ? I flush ed and major_compact ed .. nothing changed... i am stuck this last two days...:( any idea? On Sat, Dec 10, 2011 at 12:11 AM, Lord Khan Han <[EMAIL PROTECTED]>wrote: > Now flushed and compacting again.. > > one more clue: > > I tested to import CDH3B4 (same as exported cluster) with lzo.. all is > okay.. table size is same.. > than I upgrade to cdh3u2 table also is ok and same size.. > > But when I try to import in cdh3u2 this size doubling happens.. > > > > > On Sat, Dec 10, 2011 at 12:07 AM, Lord Khan Han <[EMAIL PROTECTED]>wrote: > >> I made major_compact but not flush... will do now with flush.. >> >> >> >> On Fri, Dec 9, 2011 at 11:58 PM, lars hofhansl <[EMAIL PROTECTED]>wrote: >> >>> Can you try flushing and compacting the table? How did you measure the >>> size? >>> >>> Both can be done from the shell using the 'flush' and 'major_compact' >>> commands, resp. >>> >>> >>> >>> ----- Original Message ----- >>> From: Lord Khan Han <[EMAIL PROTECTED]> >>> To: [EMAIL PROTECTED] >>> Cc: >>> Sent: Friday, December 9, 2011 1:50 PM >>> Subject: Hbase export / import Why doubling the Table Size ? >>> >>> Hi , >>> >>> We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this >>> we make a separate cluster with same config and installed CDH3u2. >>> >>> We exported our hbase table from cdh3b4 cluster and import it to the >>> new cdh3u2 cluster. Table is LZO and both cluster config is same. >>> >>> After import finished hbase table size doubled!! even its configured >>> to use LZO. We changed table to snappy import again and same result. >>> Table size multiplied x 2 in new cdh3u2 cluster. >>> >>> We didnt find why ? Is there any ideas for this ? >>> >>> thanks >>> >>> Khan >>> >>> >> > +
lars hofhansl 2011-12-09, 22:42
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-09, 22:45
in same configured cluster (carbon copy) when I made import there is no
increas on size.. same size.. problem in the cdh3u2.. On Sat, Dec 10, 2011 at 12:42 AM, lars hofhansl <[EMAIL PROTECTED]> wrote: > What happens when you export/import into the same (CDH3B4) cluster using a > new table name? > Does the size double as well? > > > > ----- Original Message ----- > From: Lord Khan Han <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > Cc: > Sent: Friday, December 9, 2011 2:27 PM > Subject: Re: Hbase export / import Why doubling the Table Size ? > > I flush ed and major_compact ed .. nothing changed... i am stuck this > last two days...:( any idea? > > > On Sat, Dec 10, 2011 at 12:11 AM, Lord Khan Han <[EMAIL PROTECTED] > >wrote: > > > Now flushed and compacting again.. > > > > one more clue: > > > > I tested to import CDH3B4 (same as exported cluster) with lzo.. all is > > okay.. table size is same.. > > than I upgrade to cdh3u2 table also is ok and same size.. > > > > But when I try to import in cdh3u2 this size doubling happens.. > > > > > > > > > > On Sat, Dec 10, 2011 at 12:07 AM, Lord Khan Han <[EMAIL PROTECTED] > >wrote: > > > >> I made major_compact but not flush... will do now with flush.. > >> > >> > >> > >> On Fri, Dec 9, 2011 at 11:58 PM, lars hofhansl <[EMAIL PROTECTED] > >wrote: > >> > >>> Can you try flushing and compacting the table? How did you measure the > >>> size? > >>> > >>> Both can be done from the shell using the 'flush' and 'major_compact' > >>> commands, resp. > >>> > >>> > >>> > >>> ----- Original Message ----- > >>> From: Lord Khan Han <[EMAIL PROTECTED]> > >>> To: [EMAIL PROTECTED] > >>> Cc: > >>> Sent: Friday, December 9, 2011 1:50 PM > >>> Subject: Hbase export / import Why doubling the Table Size ? > >>> > >>> Hi , > >>> > >>> We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this > >>> we make a separate cluster with same config and installed CDH3u2. > >>> > >>> We exported our hbase table from cdh3b4 cluster and import it to the > >>> new cdh3u2 cluster. Table is LZO and both cluster config is same. > >>> > >>> After import finished hbase table size doubled!! even its configured > >>> to use LZO. We changed table to snappy import again and same result. > >>> Table size multiplied x 2 in new cdh3u2 cluster. > >>> > >>> We didnt find why ? Is there any ideas for this ? > >>> > >>> thanks > >>> > >>> Khan > >>> > >>> > >> > > > > +
Lord Khan Han 2011-12-09, 22:45
-
Re: Hbase export / import Why doubling the Table Size ?lars hofhansl 2011-12-09, 23:39
Hmm... I'm afraid I am out of options. If you want you can try to copy the table
from CHD3u2 and your CDH3B4 system, and see if the size remains doubled. Does this happen with very small table, too? If so, you could take a small sample HFile and upload it (both the CHD3B4 and CDH3u2 versions) somewhere so that we can have a look. -- Lars ----- Original Message ----- From: Lord Khan Han <[EMAIL PROTECTED]> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> Cc: Sent: Friday, December 9, 2011 2:45 PM Subject: Re: Hbase export / import Why doubling the Table Size ? in same configured cluster (carbon copy) when I made import there is no increas on size.. same size.. problem in the cdh3u2.. On Sat, Dec 10, 2011 at 12:42 AM, lars hofhansl <[EMAIL PROTECTED]> wrote: > What happens when you export/import into the same (CDH3B4) cluster using a > new table name? > Does the size double as well? > > > > ----- Original Message ----- > From: Lord Khan Han <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > Cc: > Sent: Friday, December 9, 2011 2:27 PM > Subject: Re: Hbase export / import Why doubling the Table Size ? > > I flush ed and major_compact ed .. nothing changed... i am stuck this > last two days...:( any idea? > > > On Sat, Dec 10, 2011 at 12:11 AM, Lord Khan Han <[EMAIL PROTECTED] > >wrote: > > > Now flushed and compacting again.. > > > > one more clue: > > > > I tested to import CDH3B4 (same as exported cluster) with lzo.. all is > > okay.. table size is same.. > > than I upgrade to cdh3u2 table also is ok and same size.. > > > > But when I try to import in cdh3u2 this size doubling happens.. > > > > > > > > > > On Sat, Dec 10, 2011 at 12:07 AM, Lord Khan Han <[EMAIL PROTECTED] > >wrote: > > > >> I made major_compact but not flush... will do now with flush.. > >> > >> > >> > >> On Fri, Dec 9, 2011 at 11:58 PM, lars hofhansl <[EMAIL PROTECTED] > >wrote: > >> > >>> Can you try flushing and compacting the table? How did you measure the > >>> size? > >>> > >>> Both can be done from the shell using the 'flush' and 'major_compact' > >>> commands, resp. > >>> > >>> > >>> > >>> ----- Original Message ----- > >>> From: Lord Khan Han <[EMAIL PROTECTED]> > >>> To: [EMAIL PROTECTED] > >>> Cc: > >>> Sent: Friday, December 9, 2011 1:50 PM > >>> Subject: Hbase export / import Why doubling the Table Size ? > >>> > >>> Hi , > >>> > >>> We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this > >>> we make a separate cluster with same config and installed CDH3u2. > >>> > >>> We exported our hbase table from cdh3b4 cluster and import it to the > >>> new cdh3u2 cluster. Table is LZO and both cluster config is same. > >>> > >>> After import finished hbase table size doubled!! even its configured > >>> to use LZO. We changed table to snappy import again and same result. > >>> Table size multiplied x 2 in new cdh3u2 cluster. > >>> > >>> We didnt find why ? Is there any ideas for this ? > >>> > >>> thanks > >>> > >>> Khan > >>> > >>> > >> > > > > +
lars hofhansl 2011-12-09, 23:39
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-10, 00:05
Thanks for your time..
Is there any reliable way to copy table between these cluster instead of export/import? On Sat, Dec 10, 2011 at 1:39 AM, lars hofhansl <[EMAIL PROTECTED]> wrote: > Hmm... I'm afraid I am out of options. If you want you can try to copy the > table > from CHD3u2 and your CDH3B4 system, and see if the size remains doubled. > > Does this happen with very small table, too? If so, you could take a small > sample > HFile and upload it (both the CHD3B4 and CDH3u2 versions) somewhere so > that we can have a look. > > > -- Lars > > > ----- Original Message ----- > From: Lord Khan Han <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > Cc: > Sent: Friday, December 9, 2011 2:45 PM > Subject: Re: Hbase export / import Why doubling the Table Size ? > > in same configured cluster (carbon copy) when I made import there is no > increas on size.. same size.. > > problem in the cdh3u2.. > > > On Sat, Dec 10, 2011 at 12:42 AM, lars hofhansl <[EMAIL PROTECTED]> > wrote: > > > What happens when you export/import into the same (CDH3B4) cluster using > a > > new table name? > > Does the size double as well? > > > > > > > > ----- Original Message ----- > > From: Lord Khan Han <[EMAIL PROTECTED]> > > To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > > Cc: > > Sent: Friday, December 9, 2011 2:27 PM > > Subject: Re: Hbase export / import Why doubling the Table Size ? > > > > I flush ed and major_compact ed .. nothing changed... i am stuck > this > > last two days...:( any idea? > > > > > > On Sat, Dec 10, 2011 at 12:11 AM, Lord Khan Han <[EMAIL PROTECTED] > > >wrote: > > > > > Now flushed and compacting again.. > > > > > > one more clue: > > > > > > I tested to import CDH3B4 (same as exported cluster) with lzo.. all is > > > okay.. table size is same.. > > > than I upgrade to cdh3u2 table also is ok and same size.. > > > > > > But when I try to import in cdh3u2 this size doubling happens.. > > > > > > > > > > > > > > > On Sat, Dec 10, 2011 at 12:07 AM, Lord Khan Han < > [EMAIL PROTECTED] > > >wrote: > > > > > >> I made major_compact but not flush... will do now with flush.. > > >> > > >> > > >> > > >> On Fri, Dec 9, 2011 at 11:58 PM, lars hofhansl <[EMAIL PROTECTED] > > >wrote: > > >> > > >>> Can you try flushing and compacting the table? How did you measure > the > > >>> size? > > >>> > > >>> Both can be done from the shell using the 'flush' and 'major_compact' > > >>> commands, resp. > > >>> > > >>> > > >>> > > >>> ----- Original Message ----- > > >>> From: Lord Khan Han <[EMAIL PROTECTED]> > > >>> To: [EMAIL PROTECTED] > > >>> Cc: > > >>> Sent: Friday, December 9, 2011 1:50 PM > > >>> Subject: Hbase export / import Why doubling the Table Size ? > > >>> > > >>> Hi , > > >>> > > >>> We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this > > >>> we make a separate cluster with same config and installed CDH3u2. > > >>> > > >>> We exported our hbase table from cdh3b4 cluster and import it to > the > > >>> new cdh3u2 cluster. Table is LZO and both cluster config is same. > > >>> > > >>> After import finished hbase table size doubled!! even its configured > > >>> to use LZO. We changed table to snappy import again and same > result. > > >>> Table size multiplied x 2 in new cdh3u2 cluster. > > >>> > > >>> We didnt find why ? Is there any ideas for this ? > > >>> > > >>> thanks > > >>> > > >>> Khan > > >>> > > >>> > > >> > > > > > > > > > +
Lord Khan Han 2011-12-10, 00:05
-
Re: Hbase export / import Why doubling the Table Size ?lars hofhansl 2011-12-10, 00:19
There's copytable (also an MR job - written by J-D), but it reuses the mapper class from the Import.java, so it
probably won't make a difference. What I meant to say below... When you export/import the table from your CDH3u2 cluster back to your CDH3B4 cluster, is the size still doubled? If both clusters are shutdown, you can use Hadoop's distcp to copy directly on the filesystem level; in fact that might be your best option. -- Lars ----- Original Message ----- From: Lord Khan Han <[EMAIL PROTECTED]> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> Cc: Sent: Friday, December 9, 2011 4:05 PM Subject: Re: Hbase export / import Why doubling the Table Size ? Thanks for your time.. Is there any reliable way to copy table between these cluster instead of export/import? On Sat, Dec 10, 2011 at 1:39 AM, lars hofhansl <[EMAIL PROTECTED]> wrote: > Hmm... I'm afraid I am out of options. If you want you can try to copy the > table > from CHD3u2 and your CDH3B4 system, and see if the size remains doubled. > > Does this happen with very small table, too? If so, you could take a small > sample > HFile and upload it (both the CHD3B4 and CDH3u2 versions) somewhere so > that we can have a look. > > > -- Lars > > > ----- Original Message ----- > From: Lord Khan Han <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > Cc: > Sent: Friday, December 9, 2011 2:45 PM > Subject: Re: Hbase export / import Why doubling the Table Size ? > > in same configured cluster (carbon copy) when I made import there is no > increas on size.. same size.. > > problem in the cdh3u2.. > > > On Sat, Dec 10, 2011 at 12:42 AM, lars hofhansl <[EMAIL PROTECTED]> > wrote: > > > What happens when you export/import into the same (CDH3B4) cluster using > a > > new table name? > > Does the size double as well? > > > > > > > > ---> > From: Lord Khan Han <[EMAIL PROTECTED]> > > To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > > Cc: > > Sent: Friday, December 9, 2011 2:27 PM > > Subject: Re: Hbase export / import Why doubling the Table Size ? > > > > I flush ed and major_compact ed .. nothing changed... i am stuck > this > > last two days...:( any idea? > > > > > > On Sat, Dec 10, 2011 at 12:11 AM, Lord Khan Han <[EMAIL PROTECTED] > > >wrote: > > > > > Now flushed and compacting again.. > > > > > > one more clue: > > > > > > I tested to import CDH3B4 (same as exported cluster) with lzo.. all is > > > okay.. table size is same.. > > > than I upgrade to cdh3u2 table also is ok and same size.. > > > > > > But when I try to import in cdh3u2 this size doubling happens.. > > > > > > > > > > > > > > > On Sat, Dec 10, 2011 at 12:07 AM, Lord Khan Han < > [EMAIL PROTECTED] > > >wrote: > > > > > >> I made major_compact but not flush... will do now with flush.. > > >> > > >> > > >> > > >> On Fri, Dec 9, 2011 at 11:58 PM, lars hofhansl <[EMAIL PROTECTED] > > >wrote: > > >> > > >>> Can you try flushing and compacting the table? How did you measure > the > > >>> size? > > >>> > > >>> Both can be done from the shell using the 'flush' and 'major_compact' > > >>> commands, resp. > > >>> > > >>> > > >>> > > >>> ----- Original Message ----- > > >>> From: Lord Khan Han <[EMAIL PROTECTED]> > > >>> To: [EMAIL PROTECTED] > > >>> Cc: > > >>> Sent: Friday, December 9, 2011 1:50 PM > > >>> Subject: Hbase export / import Why doubling the Table Size ? > > >>> > > >>> Hi , > > >>> > > >>> We are usng CDH3B4 and want to upgrade to CDH3u2. Before doing this > > >>> we make a separate cluster with same config and installed CDH3u2. > > >>> > > >>> We exported our hbase table from cdh3b4 cluster and import it to > the > > >>> new cdh3u2 cluster. Table is LZO and both cluster config is same. > > >>> > > >>> After import finished hbase table size doubled!! even its configured > > >>> to use LZO. We changed table to snappy import again and same > result. > > >>> Table size multiplied x 2 in new cdh3u2 cluster. +
lars hofhansl 2011-12-10, 00:19
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-10, 07:53
I will check the reverse export imprt to cdh3b4 today to see is it same
size in the cluster.. when we use the hadoop dst copy how we candeal with the .META ? because we are copying 1 tabel not all and also there is region info in .META including their dns which is different offcoures in new cluster. I tried the import again today with no compression.. It is doubled the exported file size!! I mean I have 200gig exported hbase table size. when import without compression its going 400gig.. Its definitely writing twice something.. thanks On Sat, Dec 10, 2011 at 2:19 AM, lars hofhansl <[EMAIL PROTECTED]> wrote: > There's copytable (also an MR job - written by J-D), but it reuses the > mapper class from the Import.java, so it > probably won't make a difference. > > What I meant to say below... When you export/import the table from your > CDH3u2 cluster back to your CDH3B4 > cluster, is the size still doubled? > > > If both clusters are shutdown, you can use Hadoop's distcp to copy > directly on the filesystem level; in fact that might be your > best option. > > -- Lars > > > ----- Original Message ----- > From: Lord Khan Han <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > Cc: > Sent: Friday, December 9, 2011 4:05 PM > Subject: Re: Hbase export / import Why doubling the Table Size ? > > Thanks for your time.. > > Is there any reliable way to copy table between these cluster instead of > export/import? > > > > On Sat, Dec 10, 2011 at 1:39 AM, lars hofhansl <[EMAIL PROTECTED]> > wrote: > > > Hmm... I'm afraid I am out of options. If you want you can try to copy > the > > table > > from CHD3u2 and your CDH3B4 system, and see if the size remains doubled. > > > > Does this happen with very small table, too? If so, you could take a > small > > sample > > HFile and upload it (both the CHD3B4 and CDH3u2 versions) somewhere so > > that we can have a look. > > > > > > -- Lars > > > > > > ----- Original Message ----- > > From: Lord Khan Han <[EMAIL PROTECTED]> > > To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > > Cc: > > Sent: Friday, December 9, 2011 2:45 PM > > Subject: Re: Hbase export / import Why doubling the Table Size ? > > > > in same configured cluster (carbon copy) when I made import there is no > > increas on size.. same size.. > > > > problem in the cdh3u2.. > > > > > > On Sat, Dec 10, 2011 at 12:42 AM, lars hofhansl <[EMAIL PROTECTED]> > > wrote: > > > > > What happens when you export/import into the same (CDH3B4) cluster > using > > a > > > new table name? > > > Does the size double as well? > > > > > > > > > > > > ----- Original Message ----- > > > From: Lord Khan Han <[EMAIL PROTECTED]> > > > To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > > > Cc: > > > Sent: Friday, December 9, 2011 2:27 PM > > > Subject: Re: Hbase export / import Why doubling the Table Size ? > > > > > > I flush ed and major_compact ed .. nothing changed... i am stuck > > this > > > last two days...:( any idea? > > > > > > > > > On Sat, Dec 10, 2011 at 12:11 AM, Lord Khan Han < > [EMAIL PROTECTED] > > > >wrote: > > > > > > > Now flushed and compacting again.. > > > > > > > > one more clue: > > > > > > > > I tested to import CDH3B4 (same as exported cluster) with lzo.. all > is > > > > okay.. table size is same.. > > > > than I upgrade to cdh3u2 table also is ok and same size.. > > > > > > > > But when I try to import in cdh3u2 this size doubling happens.. > > > > > > > > > > > > > > > > > > > > On Sat, Dec 10, 2011 at 12:07 AM, Lord Khan Han < > > [EMAIL PROTECTED] > > > >wrote: > > > > > > > >> I made major_compact but not flush... will do now with flush.. > > > >> > > > >> > > > >> > > > >> On Fri, Dec 9, 2011 at 11:58 PM, lars hofhansl <[EMAIL PROTECTED] > > > >wrote: > > > >> > > > >>> Can you try flushing and compacting the table? How did you measure > > the > > > >>> size? > > > >>> > > > >>> Both can be done from the shell using the 'flush' and +
Lord Khan Han 2011-12-10, 07:53
-
Re: Hbase export / import Why doubling the Table Size ?Lars George 2011-12-10, 11:03
Could you use the ComressionTest to verify that the library path is set up properly?
$ hbase org.apache.hadoop.hbase.util.CompressionTest hdfs://<your-namenode>:8020/<some-writable-path>/test.lzo lzo Does it report OK? Same for Snappy? The reason I am asking is that when it does not find the native libs it uses no compression at all, and if your original was compressed then you will see the copied one being uncompressed and therefore much larger. Also, what is the content like? How large are the cells that are stored? Lars On Dec 10, 2011, at 8:53 AM, Lord Khan Han wrote: > I will check the reverse export imprt to cdh3b4 today to see is it same > size in the cluster.. > > when we use the hadoop dst copy how we candeal with the .META ? because we > are copying 1 tabel not all and also there is region info in .META > including their dns which is different offcoures in new cluster. > > I tried the import again today with no compression.. It is doubled the > exported file size!! I mean I have 200gig exported hbase table size. when > import without compression its going 400gig.. Its definitely writing twice > something.. > > thanks > > > > On Sat, Dec 10, 2011 at 2:19 AM, lars hofhansl <[EMAIL PROTECTED]> wrote: > >> There's copytable (also an MR job - written by J-D), but it reuses the >> mapper class from the Import.java, so it >> probably won't make a difference. >> >> What I meant to say below... When you export/import the table from your >> CDH3u2 cluster back to your CDH3B4 >> cluster, is the size still doubled? >> >> >> If both clusters are shutdown, you can use Hadoop's distcp to copy >> directly on the filesystem level; in fact that might be your >> best option. >> >> -- Lars >> >> >> ----- Original Message ----- >> From: Lord Khan Han <[EMAIL PROTECTED]> >> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> >> Cc: >> Sent: Friday, December 9, 2011 4:05 PM >> Subject: Re: Hbase export / import Why doubling the Table Size ? >> >> Thanks for your time.. >> >> Is there any reliable way to copy table between these cluster instead of >> export/import? >> >> >> >> On Sat, Dec 10, 2011 at 1:39 AM, lars hofhansl <[EMAIL PROTECTED]> >> wrote: >> >>> Hmm... I'm afraid I am out of options. If you want you can try to copy >> the >>> table >>> from CHD3u2 and your CDH3B4 system, and see if the size remains doubled. >>> >>> Does this happen with very small table, too? If so, you could take a >> small >>> sample >>> HFile and upload it (both the CHD3B4 and CDH3u2 versions) somewhere so >>> that we can have a look. >>> >>> >>> -- Lars >>> >>> >>> ----- Original Message ----- >>> From: Lord Khan Han <[EMAIL PROTECTED]> >>> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> >>> Cc: >>> Sent: Friday, December 9, 2011 2:45 PM >>> Subject: Re: Hbase export / import Why doubling the Table Size ? >>> >>> in same configured cluster (carbon copy) when I made import there is no >>> increas on size.. same size.. >>> >>> problem in the cdh3u2.. >>> >>> >>> On Sat, Dec 10, 2011 at 12:42 AM, lars hofhansl <[EMAIL PROTECTED]> >>> wrote: >>> >>>> What happens when you export/import into the same (CDH3B4) cluster >> using >>> a >>>> new table name? >>>> Does the size double as well? >>>> >>>> >>>> >>>> ----- Original Message ----- >>>> From: Lord Khan Han <[EMAIL PROTECTED]> >>>> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> >>>> Cc: >>>> Sent: Friday, December 9, 2011 2:27 PM >>>> Subject: Re: Hbase export / import Why doubling the Table Size ? >>>> >>>> I flush ed and major_compact ed .. nothing changed... i am stuck >>> this >>>> last two days...:( any idea? >>>> >>>> >>>> On Sat, Dec 10, 2011 at 12:11 AM, Lord Khan Han < >> [EMAIL PROTECTED] >>>>> wrote: >>>> >>>>> Now flushed and compacting again.. >>>>> >>>>> one more clue: >>>>> >>>>> I tested to import CDH3B4 (same as exported cluster) with lzo.. all >> is >>>>> okay.. table size is same.. +
Lars George 2011-12-10, 11:03
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-10, 16:40
It is a succes for both lzo snappy. Content is the html document.. Web
document hbase org.apache.hadoop.hbase.util.CompressionTest hdfs://localhost:8020/user/root/testfile.lzo lzo 11/12/10 18:37:04 INFO lzo.GPLNativeCodeLoader: Loaded native gpl library 11/12/10 18:37:04 INFO lzo.LzoCodec: Successfully loaded & initialized native-lzo library [hadoop-lzo rev 2ad6654f3e9cad97d13f716e51a0509253c0aabb] 11/12/10 18:37:04 INFO compress.CodecPool: Got brand-new compressor SUCCESS On Sat, Dec 10, 2011 at 1:03 PM, Lars George <[EMAIL PROTECTED]> wrote: > Could you use the ComressionTest to verify that the library path is set up > properly? > > $ hbase org.apache.hadoop.hbase.util.CompressionTest > hdfs://<your-namenode>:8020/<some-writable-path>/test.lzo lzo > > Does it report OK? Same for Snappy? The reason I am asking is that when it > does not find the native libs it uses no compression at all, and if your > original was compressed then you will see the copied one being uncompressed > and therefore much larger. > > Also, what is the content like? How large are the cells that are stored? > > Lars > > > On Dec 10, 2011, at 8:53 AM, Lord Khan Han wrote: > > > I will check the reverse export imprt to cdh3b4 today to see is it same > > size in the cluster.. > > > > when we use the hadoop dst copy how we candeal with the .META ? because > we > > are copying 1 tabel not all and also there is region info in .META > > including their dns which is different offcoures in new cluster. > > > > I tried the import again today with no compression.. It is doubled the > > exported file size!! I mean I have 200gig exported hbase table size. > when > > import without compression its going 400gig.. Its definitely writing > twice > > something.. > > > > thanks > > > > > > > > On Sat, Dec 10, 2011 at 2:19 AM, lars hofhansl <[EMAIL PROTECTED]> > wrote: > > > >> There's copytable (also an MR job - written by J-D), but it reuses the > >> mapper class from the Import.java, so it > >> probably won't make a difference. > >> > >> What I meant to say below... When you export/import the table from your > >> CDH3u2 cluster back to your CDH3B4 > >> cluster, is the size still doubled? > >> > >> > >> If both clusters are shutdown, you can use Hadoop's distcp to copy > >> directly on the filesystem level; in fact that might be your > >> best option. > >> > >> -- Lars > >> > >> > >> ----- Original Message ----- > >> From: Lord Khan Han <[EMAIL PROTECTED]> > >> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > >> Cc: > >> Sent: Friday, December 9, 2011 4:05 PM > >> Subject: Re: Hbase export / import Why doubling the Table Size ? > >> > >> Thanks for your time.. > >> > >> Is there any reliable way to copy table between these cluster instead of > >> export/import? > >> > >> > >> > >> On Sat, Dec 10, 2011 at 1:39 AM, lars hofhansl <[EMAIL PROTECTED]> > >> wrote: > >> > >>> Hmm... I'm afraid I am out of options. If you want you can try to copy > >> the > >>> table > >>> from CHD3u2 and your CDH3B4 system, and see if the size remains > doubled. > >>> > >>> Does this happen with very small table, too? If so, you could take a > >> small > >>> sample > >>> HFile and upload it (both the CHD3B4 and CDH3u2 versions) somewhere so > >>> that we can have a look. > >>> > >>> > >>> -- Lars > >>> > >>> > >>> ----- Original Message ----- > >>> From: Lord Khan Han <[EMAIL PROTECTED]> > >>> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > >>> Cc: > >>> Sent: Friday, December 9, 2011 2:45 PM > >>> Subject: Re: Hbase export / import Why doubling the Table Size ? > >>> > >>> in same configured cluster (carbon copy) when I made import there is > no > >>> increas on size.. same size.. > >>> > >>> problem in the cdh3u2.. > >>> > >>> > >>> On Sat, Dec 10, 2011 at 12:42 AM, lars hofhansl <[EMAIL PROTECTED]> > >>> wrote: > >>> > >>>> What happens when you export/import into the same (CDH3B4) cluster > >> using > >>> a > >>>> new table name? +
Lord Khan Han 2011-12-10, 16:40
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-10, 18:09
When we exporting from hbase table which is LZO compression on it, the
exported file is decompressed or as is with LZO columns? On Sat, Dec 10, 2011 at 6:40 PM, Lord Khan Han <[EMAIL PROTECTED]>wrote: > It is a succes for both lzo snappy. Content is the html document.. Web > document > > > hbase org.apache.hadoop.hbase.util.CompressionTest > hdfs://localhost:8020/user/root/testfile.lzo lzo > > 11/12/10 18:37:04 INFO lzo.GPLNativeCodeLoader: Loaded native gpl library > > 11/12/10 18:37:04 INFO lzo.LzoCodec: Successfully loaded & initialized > native-lzo library [hadoop-lzo rev 2ad6654f3e9cad97d13f716e51a0509253c0aabb] > > 11/12/10 18:37:04 INFO compress.CodecPool: Got brand-new compressor > > SUCCESS > > > > > > On Sat, Dec 10, 2011 at 1:03 PM, Lars George <[EMAIL PROTECTED]>wrote: > >> Could you use the ComressionTest to verify that the library path is set >> up properly? >> >> $ hbase org.apache.hadoop.hbase.util.CompressionTest >> hdfs://<your-namenode>:8020/<some-writable-path>/test.lzo lzo >> >> Does it report OK? Same for Snappy? The reason I am asking is that when >> it does not find the native libs it uses no compression at all, and if your >> original was compressed then you will see the copied one being uncompressed >> and therefore much larger. >> >> Also, what is the content like? How large are the cells that are stored? >> >> Lars >> >> >> On Dec 10, 2011, at 8:53 AM, Lord Khan Han wrote: >> >> > I will check the reverse export imprt to cdh3b4 today to see is it same >> > size in the cluster.. >> > >> > when we use the hadoop dst copy how we candeal with the .META ? because >> we >> > are copying 1 tabel not all and also there is region info in .META >> > including their dns which is different offcoures in new cluster. >> > >> > I tried the import again today with no compression.. It is doubled the >> > exported file size!! I mean I have 200gig exported hbase table size. >> when >> > import without compression its going 400gig.. Its definitely writing >> twice >> > something.. >> > >> > thanks >> > >> > >> > >> > On Sat, Dec 10, 2011 at 2:19 AM, lars hofhansl <[EMAIL PROTECTED]> >> wrote: >> > >> >> There's copytable (also an MR job - written by J-D), but it reuses the >> >> mapper class from the Import.java, so it >> >> probably won't make a difference. >> >> >> >> What I meant to say below... When you export/import the table from your >> >> CDH3u2 cluster back to your CDH3B4 >> >> cluster, is the size still doubled? >> >> >> >> >> >> If both clusters are shutdown, you can use Hadoop's distcp to copy >> >> directly on the filesystem level; in fact that might be your >> >> best option. >> >> >> >> -- Lars >> >> >> >> >> >> ----- Original Message ----- >> >> From: Lord Khan Han <[EMAIL PROTECTED]> >> >> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> >> >> Cc: >> >> Sent: Friday, December 9, 2011 4:05 PM >> >> Subject: Re: Hbase export / import Why doubling the Table Size ? >> >> >> >> Thanks for your time.. >> >> >> >> Is there any reliable way to copy table between these cluster instead >> of >> >> export/import? >> >> >> >> >> >> >> >> On Sat, Dec 10, 2011 at 1:39 AM, lars hofhansl <[EMAIL PROTECTED]> >> >> wrote: >> >> >> >>> Hmm... I'm afraid I am out of options. If you want you can try to copy >> >> the >> >>> table >> >>> from CHD3u2 and your CDH3B4 system, and see if the size remains >> doubled. >> >>> >> >>> Does this happen with very small table, too? If so, you could take a >> >> small >> >>> sample >> >>> HFile and upload it (both the CHD3B4 and CDH3u2 versions) somewhere so >> >>> that we can have a look. >> >>> >> >>> >> >>> -- Lars >> >>> >> >>> >> >>> ----- Original Message ----- >> >>> From: Lord Khan Han <[EMAIL PROTECTED]> >> >>> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> >> >>> Cc: >> >>> Sent: Friday, December 9, 2011 2:45 PM >> >>> Subject: Re: Hbase export / import Why doubling the Table Size ? >> >>> >> >>> in same configured cluster (carbon copy) when I made import there +
Lord Khan Han 2011-12-10, 18:09
-
Re: Hbase export / import Why doubling the Table Size ?lars hofhansl 2011-12-11, 01:07
You need enable LZO compression on the target table (the table you are importing to), but I assume you did that.
----- Original Message ----- From: Lord Khan Han <[EMAIL PROTECTED]> To: [EMAIL PROTECTED] Cc: Sent: Saturday, December 10, 2011 10:09 AM Subject: Re: Hbase export / import Why doubling the Table Size ? When we exporting from hbase table which is LZO compression on it, the exported file is decompressed or as is with LZO columns? On Sat, Dec 10, 2011 at 6:40 PM, Lord Khan Han <[EMAIL PROTECTED]>wrote: > It is a succes for both lzo snappy. Content is the html document.. Web > document > > > hbase org.apache.hadoop.hbase.util.CompressionTest > hdfs://localhost:8020/user/root/testfile.lzo lzo > > 11/12/10 18:37:04 INFO lzo.GPLNativeCodeLoader: Loaded native gpl library > > 11/12/10 18:37:04 INFO lzo.LzoCodec: Successfully loaded & initialized > native-lzo library [hadoop-lzo rev 2ad6654f3e9cad97d13f716e51a0509253c0aabb] > > 11/12/10 18:37:04 INFO compress.CodecPool: Got brand-new compressor > > SUCCESS > > > > > > On Sat, Dec 10, 2011 at 1:03 PM, Lars George <[EMAIL PROTECTED]>wrote: > >> Could you use the ComressionTest to verify that the library path is set >> up properly? >> >> $ hbase org.apache.hadoop.hbase.util.CompressionTest >> hdfs://<your-namenode>:8020/<some-writable-path>/test.lzo lzo >> >> Does it report OK? Same for Snappy? The reason I am asking is that when >> it does not find the native libs it uses no compression at all, and if your >> original was compressed then you will see the copied one being uncompressed >> and therefore much larger. >> >> Also, what is the content like? How large are the cells that are stored? >> >> Lars >> >> >> On Dec 10, 2011, at 8:53 AM, Lord Khan Han wrote: >> >> > I will check the reverse export imprt to cdh3b4 today to see is it same >> > size in the cluster.. >> > >> > when we use the hadoop dst copy how we candeal with the .META ? because >> we >> > are copying 1 tabel not all and also there is region info in .META >> > including their dns which is different offcoures in new cluster. >> > >> > I tried the import again today with no compression.. It is doubled the >> > exported file size!! I mean I have 200gig exported hbase table size. >> when >> > import without compression its going 400gig.. Its definitely writing >> twice >> > something.. >> > >> > thanks >> > >> > >> > >> > On Sat, Dec 10, 2011 at 2:19 AM, lars hofhansl <[EMAIL PROTECTED]> >> wrote: >> > >> >> There's copytable (also an MR job - written by J-D), but it reuses the >> >> mapper class from the Import.java, so it >> >> probably won't make a difference. >> >> >> >> What I meant to say below... When you export/import the table from your >> >> CDH3u2 cluster back to your CDH3B4 >> >> cluster, is the size still doubled? >> >> >> >> >> >> If both clusters are shutdown, you can use Hadoop's distcp to copy >> >> directly on the filesystem level; in fact that might be your >> >> best option. >> >> >> >> -- Lars >> >> >> >> >> >> ----- Original Message ----- >> >> From: Lord Khan Han <[EMAIL PROTECTED]> >> >> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> >> >> Cc: >> >> Sent: Friday, December 9, 2011 4:05 PM >> >> Subject: Re: Hbase export / import Why doubling the Table Size ? >> >> >> >> Thanks for your time.. >> >> >> >> Is there any reliable way to copy table between these cluster instead >> of >> >> export/import? >> >> >> >> >> >> >> >> On Sat, Dec 10, 2011 at 1:39 AM, lars hofhansl <[EMAIL PROTECTED]> >> >> wrote: >> >> >> >>> Hmm... I'm afraid I am out of options. If you want you can try to copy >> >> the >> >>> table >> >>> from CHD3u2 and your CDH3B4 system, and see if the size remains >> doubled. >> >>> >> >>> Does this happen with very small table, too? If so, you could take a >> >> small >> >>> sample >> >>> HFile and upload it (both the CHD3B4 and CDH3u2 versions) somewhere so >> >>> that we can have a look. >> >>> >> >>> >> >>> -- Lars +
lars hofhansl 2011-12-11, 01:07
-
Re: Hbase export / import Why doubling the Table Size ?Lord Khan Han 2011-12-13, 13:15
Hi Again,
One more symptom : When I look the one of the hbase table column ie: PureText (which we configured to be LZO) from the hadoop dfs , I can read the file. Isnt it looks like a lzo'ed file ? Is this shows LZO is not worked ? Or when I look the hbase LZO'ed file from dfs its automatically unlzo'ed to show me ? On Sun, Dec 11, 2011 at 3:07 AM, lars hofhansl <[EMAIL PROTECTED]> wrote: > You need enable LZO compression on the target table (the table you are > importing to), but I assume you did that. > > > > ----- Original Message ----- > From: Lord Khan Han <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED] > Cc: > Sent: Saturday, December 10, 2011 10:09 AM > Subject: Re: Hbase export / import Why doubling the Table Size ? > > When we exporting from hbase table which is LZO compression on it, the > exported file is decompressed or as is with LZO columns? > > > > On Sat, Dec 10, 2011 at 6:40 PM, Lord Khan Han <[EMAIL PROTECTED] > >wrote: > > > It is a succes for both lzo snappy. Content is the html document.. Web > > document > > > > > > hbase org.apache.hadoop.hbase.util.CompressionTest > > hdfs://localhost:8020/user/root/testfile.lzo lzo > > > > 11/12/10 18:37:04 INFO lzo.GPLNativeCodeLoader: Loaded native gpl library > > > > 11/12/10 18:37:04 INFO lzo.LzoCodec: Successfully loaded & initialized > > native-lzo library [hadoop-lzo rev > 2ad6654f3e9cad97d13f716e51a0509253c0aabb] > > > > 11/12/10 18:37:04 INFO compress.CodecPool: Got brand-new compressor > > > > SUCCESS > > > > > > > > > > > > On Sat, Dec 10, 2011 at 1:03 PM, Lars George <[EMAIL PROTECTED] > >wrote: > > > >> Could you use the ComressionTest to verify that the library path is set > >> up properly? > >> > >> $ hbase org.apache.hadoop.hbase.util.CompressionTest > >> hdfs://<your-namenode>:8020/<some-writable-path>/test.lzo lzo > >> > >> Does it report OK? Same for Snappy? The reason I am asking is that when > >> it does not find the native libs it uses no compression at all, and if > your > >> original was compressed then you will see the copied one being > uncompressed > >> and therefore much larger. > >> > >> Also, what is the content like? How large are the cells that are stored? > >> > >> Lars > >> > >> > >> On Dec 10, 2011, at 8:53 AM, Lord Khan Han wrote: > >> > >> > I will check the reverse export imprt to cdh3b4 today to see is it > same > >> > size in the cluster.. > >> > > >> > when we use the hadoop dst copy how we candeal with the .META ? > because > >> we > >> > are copying 1 tabel not all and also there is region info in .META > >> > including their dns which is different offcoures in new cluster. > >> > > >> > I tried the import again today with no compression.. It is doubled the > >> > exported file size!! I mean I have 200gig exported hbase table size. > >> when > >> > import without compression its going 400gig.. Its definitely writing > >> twice > >> > something.. > >> > > >> > thanks > >> > > >> > > >> > > >> > On Sat, Dec 10, 2011 at 2:19 AM, lars hofhansl <[EMAIL PROTECTED]> > >> wrote: > >> > > >> >> There's copytable (also an MR job - written by J-D), but it reuses > the > >> >> mapper class from the Import.java, so it > >> >> probably won't make a difference. > >> >> > >> >> What I meant to say below... When you export/import the table from > your > >> >> CDH3u2 cluster back to your CDH3B4 > >> >> cluster, is the size still doubled? > >> >> > >> >> > >> >> If both clusters are shutdown, you can use Hadoop's distcp to copy > >> >> directly on the filesystem level; in fact that might be your > >> >> best option. > >> >> > >> >> -- Lars > >> >> > >> >> > >> >> ----- Original Message ----- > >> >> From: Lord Khan Han <[EMAIL PROTECTED]> > >> >> To: [EMAIL PROTECTED]; lars hofhansl <[EMAIL PROTECTED]> > >> >> Cc: > >> >> Sent: Friday, December 9, 2011 4:05 PM > >> >> Subject: Re: Hbase export / import Why doubling the Table Size ? > >> >> > >> >> Thanks for your time.. > >> >> > >> >> Is there any reliable way to copy table between these cluster instead +
Lord Khan Han 2011-12-13, 13:15
-
Re: Hbase export / import Why doubling the Table Size ?Stack 2011-12-14, 06:00
On Tue, Dec 13, 2011 at 5:15 AM, Lord Khan Han <[EMAIL PROTECTED]> wrote:
> Hi Again, > > One more symptom : When I look the one of the hbase table column ie: > PureText (which we configured to be LZO) from the hadoop dfs , I can read > the file. Isnt it looks like a lzo'ed file ? Is this shows LZO is not > worked ? Or when I look the hbase LZO'ed file from dfs its automatically > unlzo'ed to show me ? > hdfs won't unlzo it for you. Sounds like hbase didn't compress it? St.Ack +
Stack 2011-12-14, 06:00
|