Re: cloudera vs apache hadoop migration stories ?
Todd Lipcon 2012-04-05, 08:45
Probably makes sense to move this to the cdh-user list if you have any
Cloudera-specific questions. But I just wanted to clarify: CDH doesn't
make any API changes that aren't already upstream. So, in some places,
CDH may be ahead of whatever Apache release you are comparing against,
but it is always made up of patches from the Apache trunk. In the
specific case of MultipleInputs, we did backport the new API
implementation from Apache Hadoop 0.21+.
If you find something in CDH that you would like backported to
upstream Apache Hadoop 1.0.x, please feel free to file a JIRA and
assign it to me - I'm happy to look into it for you.
On Wed, Apr 4, 2012 at 10:15 AM, Jay Vyas <[EMAIL PROTECTED]> wrote:
> Seems like cloudera and standard apache-hadoop are really not cross
> compatible. Things like MultipleInputs and stuff that we are finding don't
> work the same. Any good (recent) war stories on the migration between the
> two ?
> It's interesting to me that cloudera and amazon are that difficult to swap
> in/out in cloud.
Software Engineer, Cloudera