

Re: Logic of isSplittable() of class FileInputFormat
Hi Sugandha,

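Take a gz file as an example. It is not splittable because of the compression
algorithm it uses: a gzip stream has to be decompressed from the start, so a
reader cannot begin at an arbitrary block boundary. If the file were split
anyway, nothing guarantees that a record sits inside one block, and a task
handed only part of a record cannot read the whole record, so your program
would fail.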
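
To make the gz point concrete, here is a small sketch that uses only java.util.zip, not Hadoop itself. The class and method names (GzipSplitDemo, sampleRecords, canDecodeFrom) are made up for illustration. It compresses some newline-delimited records and then tries to start decompressing from the middle of the compressed bytes, which is roughly what a task assigned the second split of a gz file would have to do.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical demo class; it does not use or imitate any Hadoop API.
class GzipSplitDemo {

    // Build newline-delimited records to stand in for an input file.
    static byte[] sampleRecords() {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 1000; i++) {
            sb.append("record-").append(i).append('\n');
        }
        return sb.toString().getBytes(StandardCharsets.UTF_8);
    }

    // Compress the raw bytes into a single gzip stream.
    static byte[] gzip(byte[] raw) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(out)) {
            gz.write(raw);
        }
        return out.toByteArray();
    }

    // Returns true only if a reader that starts at `offset` can decode
    // the rest of the stream without an I/O error.
    static boolean canDecodeFrom(byte[] gz, int offset) {
        try (GZIPInputStream in = new GZIPInputStream(
                new ByteArrayInputStream(gz, offset, gz.length - offset))) {
            byte[] buf = new byte[4096];
            while (in.read(buf) != -1) {
                // Drain the stream; we only care whether decoding succeeds.
            }
            return true;
        } catch (IOException e) {
            // Typically a ZipException: the bytes at `offset` are not a gzip header.
            return false;
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] gz = gzip(sampleRecords());
        System.out.println("from start:  " + canDecodeFrom(gz, 0));
        System.out.println("from middle: " + canDecodeFrom(gz, gz.length / 2));
    }
}
```

Reading from offset 0 should succeed, while the mid-file read should be rejected because the bytes there do not form a gzip header. That is the practical reason a gz input ends up being processed as one split per file.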
On Wed, Feb 26, 2014 at 1:24 PM, Sugandha Naolekar
<[EMAIL PROTECTED]> wrote: