Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce >> mail # user >> Processing Large XML in Hadoop


Copy link to this message
-
Processing Large XML in Hadoop
Hi,

Could you kindly explain the pros and cons of using Hadoop's
StreamInputFormat and Mahout XmlInputFormat.
How the record reader reads the record if it across the other blocks when
dealing with large size xml files?

Thanks in advance.

Cheers!
Manoj.
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB