

Processing Large XML in Hadoop
Hi,

Could you kindly explain the pros and cons of using Hadoop's
StreamInputFormat versus Mahout's XmlInputFormat?
How does the record reader read a record that spans multiple blocks when
dealing with large XML files?

Thanks in advance.

Cheers!
Manoj.