Hadoop, mail # dev - Do we support concatenated/splittable bzip2 files in branch-1?


Re: Do we support concatenated/splittable bzip2 files in branch-1?
Harsh J 2012-12-03, 11:42
Hi Yu Li,

HADOOP-7823 backported support for splitting Bzip2 files, along with
the MR support for it, into branch-1. It is already available in the
current 1.1.x releases.

Support for concatenated Bzip2 files, i.e., HADOOP-7386, is not
implemented yet (AFAIK), but Chris, over on HADOOP-6335, suggests that
HADOOP-4012 may have fixed it. Can you try it and report back?
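
If it helps with trying this out, here is a minimal sketch (plain Python
standard library, not Hadoop code) for producing a small concatenated
Bzip2 file to test with. The helper names and the output file name are
made up for illustration. Each chunk is compressed as its own bzip2
stream and the streams are simply appended, which should match what
concatenating separate .bz2 files produces.

```python
import bz2


def make_concatenated_bz2(chunks):
    """Compress each chunk as its own bzip2 stream and append the streams."""
    return b"".join(bz2.compress(chunk) for chunk in chunks)


def count_streams(data):
    """Count the bzip2 streams in data by decoding them one at a time."""
    count = 0
    while data:
        decompressor = bz2.BZ2Decompressor()
        decompressor.decompress(data)
        # Bytes left over after the end of one stream belong to the next one.
        data = decompressor.unused_data
        count += 1
    return count


# Two easily distinguishable chunks (hypothetical test data).
chunks = [b"first stream\n" * 1000, b"second stream\n" * 1000]
blob = make_concatenated_bz2(chunks)

# Hypothetical output file name; feed this file to the job under test.
with open("concatenated-test.bz2", "wb") as out:
    out.write(blob)

print("streams in file:", count_streams(blob))
print("lines expected from a full read:", bz2.decompress(blob).count(b"\n"))
```

A reader that handles concatenation should return lines from both
chunks; if only the "first stream" lines come back, the reader is
presumably stopping at the end of the first stream.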

On Mon, Dec 3, 2012 at 3:19 PM, Yu Li <[EMAIL PROTECTED]> wrote:
> Dear all,
>
> Regarding splitting support for bzip2, I checked the JIRA list and found
> HADOOP-7386 marked as "Won't fix". I also found some work done in
> branch-0.21 (also in trunk), namely HADOOP-4012 and MAPREDUCE-830, but not
> integrated/migrated into branch-1, so I guess we don't support concatenated
> bzip2 in branch-1, correct? If so, is there any special reason? Many thanks!
>
> --
> Best Regards,
> Li Yu

--
Harsh J