Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce, mail # user - RE: Any method to get input splits by column?


Copy link to this message
-
RE: Any method to get input splits by column?
java8964 2013-12-23, 16:48
You need to store your data into "column-based" format, checking out Hive RCFile, and its InputFormat option.
Yong

Date: Mon, 23 Dec 2013 21:37:23 +0800
Subject: Any method to get input splits by column?
From: [EMAIL PROTECTED]
To: [EMAIL PROTECTED]

Hi,

By default, MR inputformat classes break input file into splits by rows. However, we have a specilal requirement on MR app: get input splits by column.

Is there any good method?
Thanks!