Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce >> mail # user >> RE: Any method to get input splits by column?


Copy link to this message
-
RE: Any method to get input splits by column?
You need to store your data into "column-based" format, checking out Hive RCFile, and its InputFormat option.
Yong

Date: Mon, 23 Dec 2013 21:37:23 +0800
Subject: Any method to get input splits by column?
From: [EMAIL PROTECTED]
To: [EMAIL PROTECTED]

Hi,

By default, MR inputformat classes break input file into splits by rows. However, we have a specilal requirement on MR app: get input splits by column.

Is there any good method?
Thanks!
     
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB