| clear query|facets|time |
Search criteria: .
Results from 1 to 10 from
70 (0.132s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: Shuffle design: optimization tradeoffs - MapReduce - [mail # user]
|
|
...On the academic side, you might be interested to read about *resilient distributed datasets (RDDs)* : http://www.cs.berkeley.edu/~matei/papers/2012/nsdi_spark.pdf. Not exactly the same subje...
|
|
|
Author: Bertrand Dechoux,
2013-06-18, 22:04
|
|
|
Re: Assignment of data splits to mappers - MapReduce - [mail # user]
|
|
...1) The tradeoff is between reducing the overhead of distributed computing and reducing the cost of failure. Less tasks, less overhead but the cost of failure will be bigger, mainly because t...
|
|
|
Author: Bertrand Dechoux,
2013-06-18, 21:54
|
|
|
Re: Configuring SSH - is it required? for a psedo distriburted mode? - MapReduce - [mail # user]
|
|
...The scrits themselves will use ssh to connect to every machine (even localhost). It's up to you if you want to type the password everytime. For a pseudo-distributed system, I don't see the i...
|
|
|
Author: Bertrand Dechoux,
2013-05-17, 11:15
|
|
|
Re: Submitting a hadoop job in large clusters. - MapReduce - [mail # user]
|
|
...Which doesn't imply that you should log yourself to the physical machine where the JobTracker is hosted. It only implies that the hadoop client must be able to reach the JobTracker. It could...
|
|
|
Author: Bertrand Dechoux,
2013-05-12, 20:02
|
|
|
Re: why multiple checkpoint nodes? - MapReduce - [mail # user]
|
|
...For more information : https://issues.apache.org/jira/browse/HADOOP-7297 It has been corrected but the stable documentation is still the 1.0.4 (previous to correction). See * htt...
|
|
|
Author: Bertrand Dechoux,
2013-04-18, 20:05
|
|
|
Re: why multiple checkpoint nodes? - MapReduce - [mail # user]
|
|
...It would be important to point the document (which I believe is http://hadoop.apache.org/docs/stable/hdfs_user_guide.html) and the version of Hadoop you are interested in. At one time, the d...
|
|
|
Author: Bertrand Dechoux,
2013-04-18, 19:45
|
|
|
Re: Using Hadoop for codec functionality - MapReduce - [mail # user]
|
|
...Your question could be interpreted in another way : should I use Hadoop in order to perform massive compression/decompression using my own (eventually, proprietary) utility? So yes, Ha...
|
|
|
Author: Bertrand Dechoux,
2013-03-31, 09:38
|
|
|
Re: Which hadoop installation should I use on ubuntu server? - MapReduce - [mail # user]
|
|
...For information, the 50 node limit on CDH is a past limitation. It is no longer the case. *Support for unlimited nodes*. Previous versions of Cloudera Manager Free https://ccp.cl...
|
|
|
Author: Bertrand Dechoux,
2013-03-29, 10:10
|
|
|
Re: disk used percentage is not symmetric on datanodes (balancer) - MapReduce - [mail # user]
|
|
...Hi, It is not explicitly said but did you use the balancer? http://hadoop.apache.org/docs/r1.0.4/commands_manual.html#balancer Regards Bertrand On Mon, Mar 18, 2013 a...
|
|
|
Author: Bertrand Dechoux,
2013-03-18, 21:43
|
|
|
Re: Replication factor - MapReduce - [mail # user]
|
|
...The best way would be to test it. The provided links indeed do not seem to help. I would say the default replication factor is the one found in the configuration and it can be overwrit...
|
|
|
Author: Bertrand Dechoux,
2013-03-17, 18:07
|
|
|
|