-Tell Hadoop to store pairs of files at the same location(s) on HDFS
I have been wondering if there's a way (hack'ish would be okay too) to tell
Hadoop that two files shall be stored together at the same location(s). It
would benefit map-side join performance if it could be done somehow because
all map tasks would be able to read data from a local copy. Does anyone
know a way?