Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop >> mail # user >> cloudera vs apache hadoop migration stories ?

Copy link to this message
Re: cloudera vs apache hadoop migration stories ?
Hi Jay,

Probably makes sense to move this to the cdh-user list if you have any
Cloudera-specific questions. But I just wanted to clarify: CDH doesn't
make any API changes that aren't already upstream. So, in some places,
CDH may be ahead of whatever Apache release you are comparing against,
but it is always made up of patches from the Apache trunk. In the
specific case of MultipleInputs, we did backport the new API
implementation from Apache Hadoop 0.21+.

If you find something in CDH that you would like backported to
upstream Apache Hadoop 1.0.x, please feel free to file a JIRA and
assign it to me - I'm happy to look into it for you.


On Wed, Apr 4, 2012 at 10:15 AM, Jay Vyas <[EMAIL PROTECTED]> wrote:
> Seems like cloudera and standard apache-hadoop are really not cross
> compatible.  Things like MultipleInputs and stuff that we are finding don't
> work the same.  Any good (recent) war stories on the migration between the
> two ?
> Its interesting to me that cloudera and amazon are that difficult to swap
> in/out in cloud.

Todd Lipcon
Software Engineer, Cloudera