Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HDFS >> mail # user >> When to use DFSInputStream and HdfsDataInputStream


Copy link to this message
-
When to use DFSInputStream and HdfsDataInputStream
Hi

What is the use case difference between:
- DFSInputStream and HdfsDataInputStream
- DFSOutputStream and HdfsDataOutputStream

When one should be preferred over other? From sources I see they have
similar functionality, only HdfsData*Stream "follows" Data*Stream instead
of *Stream. Also is DFS*Stream more general than HdfsData*Stream, in the
sense it works on higher abstraction layer, can work with other Distributed
FS (even though it contact HDFS specific components), or its just naming
convention?

Which one should I chose to read/write data from/to HDFS and why (sounds
like academic question ;) )?

* -> means both Input and Output

regards
tmp
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB