This should be a simple question, I think. Disclosure, I am not a java
We're getting ready to build our Dev and Prod clusters. I'm pretty
comfortable with HDFS and how it sits atop several local file systems on
multiple servers. I'm fairly comfortable with the concept of Map/Reduce
and why it's cool and we want it.
Now for the question. Where should my developers, put and store their jar
files? Or asked another way, what's the best entry point for submitting
We have separate physical systems for NN, Checkpoint Node (formerly 2nn),
Job Tracker and Standby NN. Should I run from the JT node? Do I keep all
of my finished .jar's on the JT local file system?
Or should I expect that jobs will be run via Oozie? Do I put jars on the
local Oozie FS?
Thanks in advance.