Could you describe your use case in more detail?
Generally, HDFS behaves poorly when it has to hold many small files. Could
you perhaps colocate several records in one file? That would reduce both
the relative overhead of the schema and the pressure on the HDFS NameNode.
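
To make the schema-overhead point concrete, here is a rough sketch in plain
Python (standard library only). It is not the file format of any real
serialization library: the schema, the record contents, and the helper names
(frame, pack_one_per_file, pack_together, unpack) are all made up for
illustration. It only assumes that each file has to carry its own copy of the
schema, and compares one file per record against one file for all records.

```python
import json
import struct

# Hypothetical schema, for illustration only.
SCHEMA = json.dumps({
    "type": "record",
    "name": "Event",
    "fields": [
        {"name": "id", "type": "long"},
        {"name": "msg", "type": "string"},
    ],
}).encode("utf-8")


def frame(payload: bytes) -> bytes:
    """Prefix a payload with its length as a 4-byte big-endian integer."""
    return struct.pack(">I", len(payload)) + payload


def pack_one_per_file(records):
    """One blob per record: every blob repeats the schema header."""
    return [frame(SCHEMA) + frame(r) for r in records]


def pack_together(records):
    """One blob for all records: the schema header is written once."""
    return frame(SCHEMA) + b"".join(frame(r) for r in records)


def unpack(blob):
    """Read back the schema and the list of records from a blob."""
    frames = []
    offset = 0
    while offset < len(blob):
        (size,) = struct.unpack_from(">I", blob, offset)
        offset += 4
        frames.append(blob[offset:offset + size])
        offset += size
    return frames[0], frames[1:]


# Made-up sample data: 1000 small records.
records = [
    json.dumps({"id": i, "msg": "hello"}).encode("utf-8") for i in range(1000)
]
small_files = pack_one_per_file(records)
combined = pack_together(records)

print("files:", len(small_files), "vs 1")
print("bytes:", sum(len(b) for b in small_files), "vs", len(combined))
```

In this toy layout the combined blob saves one schema header for every file
it replaces, and a single file also means far fewer namespace entries for the
NameNode to keep track of.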
On Mon, Mar 17, 2014 at 2:55 PM, Salman Haq <[EMAIL PROTECTED]> wrote: