Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce >> mail # user >> custom format

hello user,

I am trying to create a map reduce program which will have splits
based on a specific length. The content has to be extracted in a way
such that the newline(\n) or tab(\t) etc characters will be considered
as a byte and not as a mapper instance. is this possible through
custom input? if yes, how will i create a custom file split based on a
specific length value. Any suggestions?