-When copying a file to HDFS, how to control what nodes that file will reside on?
jeremy p 2013-04-09, 20:49
I'm dealing with kind of a bizarre use case where I need to make sure that
File A is local to Machine A, File B is local to Machine B, etc. When
copying a file to HDFS, is there a way to control which machines that file
will reside on? I know that any given file will be replicated across three
machines, but I need to be able to say "File A will DEFINITELY exist on
Machine A". I don't really care about the other two machines -- they could
be any machines on my cluster.