|
|
lee.sungeun@... 2010-04-14, 04:35
Hi i am studying about hadoop and sending a mail to know how to split data. if the block size is 64MB and i am supposed to hadle 129MB. How will it split? (64,64,1) or (64,64,64) . i think first one is right. but i want to get informaiton more detail. so if possible, just let me know it.
Regards sungeun lee
Please do not print this email unless it is absolutely necessary.
The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments.
WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email.
www.wipro.com
Segel, Mike 2010-04-17, 22:43
Sungeun,
Its really the second one. Files are split in to blocks depending on a configuration parameter. By default its 64MB. Using your example, the splits are 64,64,1 however the 1 MB is within an allocated block of size 64MB.
HTH -Mike
-----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] Sent: Tuesday, April 13, 2010 11:35 PM To: [EMAIL PROTECTED] Subject: how to split data
Hi i am studying about hadoop and sending a mail to know how to split data. if the block size is 64MB and i am supposed to hadle 129MB. How will it split? (64,64,1) or (64,64,64) . i think first one is right. but i want to get informaiton more detail. so if possible, just let me know it. Regards sungeun lee
Please do not print this email unless it is absolutely necessary.
The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments.
WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email.
www.wipro.com The information contained in this communication may be CONFIDENTIAL and is intended only for the use of the recipient(s) named above. If you are not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication, or any of its contents, is strictly prohibited. If you have received this communication in error, please notify the sender and delete/destroy the original message and any copy of it from your computer or paper files.
Doug Cutting 2010-04-21, 17:42
[EMAIL PROTECTED] wrote: > if the block size is 64MB and i am supposed to hadle 129MB. How will it split? (64,64,1) or (64,64,64)
(64,65)
FileInputFormat permits the final split to exceed the desired split size by up to 10%.
Doug
|
|