Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Which LZO library to use?


Copy link to this message
-
Re: Which LZO library to use?
Thanks. Now I feel much more comfortable using Kevin's code.

On Mon, Aug 2, 2010 at 21:12, Alex Kozlov <[EMAIL PROTECTED]> wrote:

> The code is currently maintained by Kevin Weil and Todd Lipcon.  For
> completeness, there is one more distribution at
> http://github.com/toddlipcon/hadoop-lzo.  AFAIK, the Todd Lipcon's and
> Kevin
> Weil's distribution are synced.
>
> Most of the differences with google's code are bug fixes: the lzo file
> format itself had not changed and you can actually read the files created
> with lzop (the LZO command line tool).   Their are no version compatibility
> issues currently.
>
> Alex K
>
> On Mon, Aug 2, 2010 at 3:25 AM, Eran Kutner <[EMAIL PROTECTED]> wrote:
>
> > Hi,
> > I want to enable LZO compression on my cluster but see there are a few
> > alternatives and the wiki page itself is very confusing so it's not clear
> > what is the right choice. I was looking at this page:
> > http://wiki.apache.org/hadoop/UsingLzoCompression, at the top it
> > recommends
> > using Kevin Weil's version (which seems to be the same one released by
> > Twitter) but warns it doesn't contain all fixes and lower in the article
> it
> > refers to the original google code repository (
> > http://code.google.com/p/hadoop-gpl-compression/).
> > The thing the concerns me most is future compatibility, so whichever
> > library
> > I pick now I want to be certain my data compressed will still be readable
> > when I I upgrade to the next major version of Hadoop and Hbase. It seems
> > that only the Google code project has newer releases compatible with
> future
> > version of Hadoop.
> >
> > So I'm looking for recommendations on which library to use.
> >
> >
> > Thanks,
> > Eran
> >
>