I'm looking at Distributed Shell as an example for writing a YARN application.
My question is why are the script path and associated metadata saved as environment variables? Are there any other ways besides environment variables or command line arguments for passing data from the Client to the ApplicationMaster?
Command line arguments & env variables are the most direct options.
A more onerous option is to write some data to a file in HDFS, use LocalResource to ship it to the container on each node and get application code to read that file locally. (In MRv1 parlance that is "Distributed Cache").
On Jan 29, 2014, at 12:59 PM, Brian C. Huffman <[EMAIL PROTECTED]> wrote: Arun C. Murthy Hortonworks Inc. http://hortonworks.com/
CONFIDENTIALITY NOTICE NOTICE: This message is intended for the use of the individual or entity to which it is addressed and may contain information that is confidential, privileged and exempt from disclosure under applicable law. If the reader of this message is not the intended recipient, you are hereby notified that any printing, copying, dissemination, distribution, disclosure or forwarding of this communication is strictly prohibited. If you have received this communication in error, please contact the sender immediately and delete it from your system. Thank You.
while your at it, what about adding values to the Configuration() object, does that still work as a hack for information passing? On Wed, Jan 29, 2014 at 5:25 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote:
Apache Lucene, Apache Solr and all other Apache Software Foundation project and their respective logos are trademarks of the Apache Software Foundation.
Elasticsearch, Kibana, Logstash, and Beats are trademarks of Elasticsearch BV, registered in the U.S. and in other countries. This site and Sematext Group is in no way affiliated with Elasticsearch BV.
Service operated by Sematext