Development time in Pig and Hive is much shorter than for the equivalent MapReduce code, and for generic cases they are very efficient.
If your requirement is complex and you need low-level control over your code, MapReduce is the better choice. If you are an expert in MapReduce, your code can be more efficient because it is written specifically for your application, whereas the MapReduce jobs that Hive and Pig generate are more generic.
To write your own custom map and reduce functions, basic knowledge of Java is enough. The better you are with Java, the better you will understand the internals.
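As a rough illustration of how little Java the map and reduce functions themselves need, here is a minimal in-memory sketch of word count. It does not use the Hadoop API. The class and method names (WordCountSketch, map, reduce, run) are made up for this example, and the grouping step only imitates what the framework's shuffle does. A real job would extend Hadoop's Mapper and Reducer classes and be submitted through a Job driver.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative in-memory sketch only: no Hadoop classes are used,
// and every name below is hypothetical.
class WordCountSketch {

    // "Map" step: turn one input line into (word, 1) pairs.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<Map.Entry<String, Integer>>();
        for (String word : line.toLowerCase().split("\\s+")) {
            if (!word.isEmpty()) {
                pairs.add(new AbstractMap.SimpleEntry<String, Integer>(word, 1));
            }
        }
        return pairs;
    }

    // "Reduce" step: sum all values collected for one key.
    static int reduce(String key, List<Integer> values) {
        int sum = 0;
        for (int v : values) {
            sum += v;
        }
        return sum;
    }

    // Driver: group the mapped pairs by key (a stand-in for the shuffle),
    // then call reduce once per key.
    static Map<String, Integer> run(List<String> lines) {
        Map<String, List<Integer>> grouped = new TreeMap<String, List<Integer>>();
        for (String line : lines) {
            for (Map.Entry<String, Integer> pair : map(line)) {
                List<Integer> values = grouped.get(pair.getKey());
                if (values == null) {
                    values = new ArrayList<Integer>();
                    grouped.put(pair.getKey(), values);
                }
                values.add(pair.getValue());
            }
        }
        Map<String, Integer> counts = new TreeMap<String, Integer>();
        for (Map.Entry<String, List<Integer>> e : grouped.entrySet()) {
            counts.put(e.getKey(), reduce(e.getKey(), e.getValue()));
        }
        return counts;
    }

    public static void main(String[] args) {
        // prints {hive=2, mapreduce=1, pig=3}
        System.out.println(run(Arrays.asList("pig hive pig", "hive mapreduce pig")));
    }
}
```

The map and reduce methods are only loops and collections. The parts that call for deeper Java knowledge in a real job are the ones this sketch leaves out, such as serialization, partitioning and job configuration.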
From: <[EMAIL PROTECTED]>
Date: Wed, 7 Nov 2012 15:33:07
To: <[EMAIL PROTECTED]>
Reply-To: [EMAIL PROTECTED]
Subject: Map-Reduce V/S Hadoop Ecosystem
Hello Hadoop Champs,
Please give some suggestions.
The Hadoop ecosystem tools (Hive, Pig, ...) internally run Map-Reduce jobs to process data.
My questions are:
1) Where do Map-Reduce programs (written in Java, Python, etc.) outperform the Hadoop ecosystem tools?
2) What are the limitations of the Hadoop ecosystem tools compared with writing Map-Reduce programs?
3) For writing Map-Reduce jobs in Java, how much Java skill do we need, out of 10 (?/10)?
Please shed some light on this.
Thanks & Regards