Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce >> mail # dev >> specific implementations of CombineFileInputFormat


Copy link to this message
-
specific implementations of CombineFileInputFormat
My apologies if this has already been discussed in the past (couldn't find
any discussion threads on Google).

CombineFileInputFormat is abstract, and its specific equivalents to
TextInputFormat, SequenceFileInputFormat, etc. are currently not in the
hadoop code base. Is there a reason why these specific classes are not part
of hadoop-mapreduce-client-core?

To me these sound like very common need whenever CombineFileInputFormat is
used, and different folks would write the same code over and over to
achieve the same goal. It sounds very natural for hadoop to provide at
least the text and sequence file implementations of the
CombineFileInputFormat class. Thoughts?

Thanks,
Sangjin
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB