Subject: CustomInputFormat (MapReduce user mailing list)

I have a small situation.

I need to pass a group of lines as a single input to my mapper. The number
of lines varies from group to group.

Basically, those lines are logs, grouped by the test that produced them.

A test that passes produces fewer lines; a test that fails produces more.

There is a definite keyword that marks the start and the end of each test.
How can I pass each test's log, as a whole, to the mapper as one input
record?
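
To make the requirement concrete, here is a minimal sketch of the grouping
logic only, using just the Java standard library. The marker strings
TEST_START and TEST_END and the class name TestLogGrouper are hypothetical
placeholders, since the actual keywords are not given above. In a Hadoop job
this logic would presumably live in the nextKeyValue() method of a custom
RecordReader returned by a FileInputFormat subclass, likely with
isSplitable() returning false so that a test log is not cut across splits;
that wiring is not shown here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: turns a stream of log lines into one record per test.
final class TestLogGrouper {
    // Assumed marker keywords; replace with the real ones from the logs.
    static final String START_MARKER = "TEST_START";
    static final String END_MARKER = "TEST_END";

    // Returns one string per test, holding every line from the start
    // marker through the end marker, inclusive.
    static List<String> group(Iterable<String> lines) {
        List<String> records = new ArrayList<>();
        StringBuilder current = null; // null while outside a test block
        for (String line : lines) {
            if (current == null && line.contains(START_MARKER)) {
                current = new StringBuilder(); // a new test block begins
            }
            if (current == null) {
                continue; // lines outside any block are skipped
            }
            current.append(line).append('\n');
            if (line.contains(END_MARKER)) {
                records.add(current.toString()); // block complete
                current = null;
            }
        }
        return records; // a final block with no end marker is dropped
    }

    public static void main(String[] args) {
        List<String> log = Arrays.asList(
            "TEST_START a", "ok", "TEST_END a",
            "TEST_START b", "err 1", "err 2", "TEST_END b");
        for (String record : group(log)) {
            System.out.print("--- record ---\n" + record);
        }
    }
}
```

Each element of the returned list corresponds to what a single map() call
would receive as its value, however many lines the test produced.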