Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce >> mail # user >> Processing Large XML in Hadoop

Copy link to this message
Processing Large XML in Hadoop

Could you kindly explain the pros and cons of using Hadoop's
StreamInputFormat and Mahout XmlInputFormat.
How the record reader reads the record if it across the other blocks when
dealing with large size xml files?

Thanks in advance.