Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce >> mail # user >> How does mapper process partial records?


Copy link to this message
-
Re: How does mapper process partial records?
Hello Praveen,

       Do you mean the InputFormat splits the file across record boundaries??I
actually didn't get your question. What do you mean by 'record' with
respect to HDFS. Did you mean HDFS block?

Warm Regards,
Tariq
https://mtariq.jux.com/
cloudfront.blogspot.com
On Thu, Jan 24, 2013 at 10:20 PM, Praveen Sripati
<[EMAIL PROTECTED]>wrote:

> Hi,
>
> HDFS splits the file across record boundaries. So, how does the mapper
> processing the second block (b2) determine that the first record is
> incomplete and should process starting from the second record in the block
> (b2)?
>
> Thanks,
> Praveen
>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB