Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HDFS >> mail # user >> When to use DFSInputStream and HdfsDataInputStream

Copy link to this message
When to use DFSInputStream and HdfsDataInputStream

What is the use case difference between:
- DFSInputStream and HdfsDataInputStream
- DFSOutputStream and HdfsDataOutputStream

When one should be preferred over other? From sources I see they have
similar functionality, only HdfsData*Stream "follows" Data*Stream instead
of *Stream. Also is DFS*Stream more general than HdfsData*Stream, in the
sense it works on higher abstraction layer, can work with other Distributed
FS (even though it contact HDFS specific components), or its just naming

Which one should I chose to read/write data from/to HDFS and why (sounds
like academic question ;) )?

* -> means both Input and Output