I was not able to find an answer to the following question. If the
question has already been answered please give me the pointer to the
Which are actually the differences between read file from HDFS in one
mapper and use DistributedCache.
I saw that with DistributedCache you can give an hdfs path and the
task nodes will get the data on local file system. But which
advantages we have compared with a simple HDFS read with
Thank you very much,