Results 1 to 10 of 48.
Re: How to sort in a WordCount - Hadoop - [mail # user]
...You need a second MapReduce job. Take your WordCount input, have the mapper swap keys and values, i.e. map(word, count) => (count, word), then your reducer will get the records sorted...
   Author: Kai Voigt, 2014-08-17, 04:51
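The key/value swap described in this reply can be illustrated with a small local simulation. This is only a sketch in plain Python, not Hadoop code: the function names `swap_mapper`, `shuffle_sort`, `identity_reducer`, and `sort_by_count` are hypothetical, and `shuffle_sort` merely stands in for the framework's sort-by-key step between mappers and reducers.

```python
from typing import Iterable, Iterator, List, Tuple


def swap_mapper(records: Iterable[Tuple[str, int]]) -> Iterator[Tuple[int, str]]:
    """Second-job mapper: emit (count, word) for every (word, count) input."""
    for word, count in records:
        yield count, word


def shuffle_sort(pairs: Iterable[Tuple[int, str]]) -> List[Tuple[int, str]]:
    """Stand-in for the shuffle phase, which delivers records sorted by key."""
    return sorted(pairs, key=lambda kv: kv[0])


def identity_reducer(pairs: Iterable[Tuple[int, str]]) -> Iterator[Tuple[int, str]]:
    """Reducer that passes records through; they already arrive in key order."""
    for count, word in pairs:
        yield count, word


def sort_by_count(word_counts: Iterable[Tuple[str, int]]) -> List[Tuple[int, str]]:
    """Chain the three simulated phases over WordCount-style (word, count) pairs."""
    return list(identity_reducer(shuffle_sort(swap_mapper(word_counts))))
```

Calling `sort_by_count([("a", 3), ("b", 1), ("c", 2)])` yields the pairs ordered by ascending count, with the count now in the key position.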
[HIVE-7713] hive.session.silent in .hiverc has no effect - Hive - [issue]
...When setting "set hive.session.silent=true;" in ~/.hiverc, it doesn't have the same effect as running "hive -S". The map reduce output is still shown....
http://issues.apache.org/jira/browse/HIVE-7713    Author: Kai Voigt, 2014-08-13, 15:09
Re: Started learning Hadoop. Which distribution is best for native install in pseudo distributed mode? - Hadoop - [mail # user]
...3. seems a biased and incomplete statement. Cloudera’s distribution CDH is fully open source. The proprietary "stuff" you refer to is most likely Cloudera Manager, an additional tool to make ...
   Author: Kai Voigt, 2014-08-12, 21:11
Re: about rack awareness - Hadoop - [mail # user]
...Rack Awareness actually should be called Switch Awareness, and that's what people typically do: nodes in a rack are on the same switch. Also, you should have balanced capacity across racks/s...
   Author: Kai Voigt, 2014-07-04, 03:44
Re: Map reduce Query - Hadoop - [mail # user]
...That’s exactly what MapReduce does. The input is processed by the mapper function, and its output will be automatically sent into the reducer function. Between mappers and reducers we have t...
   Author: Kai Voigt, 2014-06-19, 10:07
Re: Counters in MapReduce - Hadoop - [mail # user]
...Like you said, just wrap your 3 jobs into a while loop and check the built-in counters, like the number of reduce output records to check if the job output was empty. Unfortunately, Oozie can...
   Author: Kai Voigt, 2014-06-09, 09:47
Re: Object in mapreduce - MapReduce - [mail # user]
...Check out the Distributed Cache feature: http://hadoop.apache.org/docs/stable/api/org/apache/hadoop/filecache/DistributedCache.html  Kai  On 28.12.2013 at 10:27, unmesha sre...
   Author: Kai Voigt, 2013-12-28, 09:44
Re: Relationship between heap sizes and mapred.child.java.opt configuration - HDFS - [mail # user]
...mapred.child.java.opts refers to the settings for the JVMs spawned by the TaskTracker. These JVMs actually run the tasks (mappers and reducers).  The heap sizes for TaskTracke...
   Author: Kai Voigt, 2013-11-25, 15:02
Re: ALL HDFS Blocks on the Same Machine if Replication factor = 1 - MapReduce - [mail # user]
...Hello,  On 10.06.2013 at 15:36, Razen Al Harbi wrote:   Yes, this is normal behavior. When an HDFS client happens to run on a host that is also a DataNode (always the case when a ...
   Author: Kai Voigt, 2013-06-10, 13:47
Re: Not saving any output - HDFS - [mail # user]
...You can have your python streaming script simply not write any key/value pairs to stdout, so you'll get an empty job output.  Independently, your script could do anything external, such...
   Author: Kai Voigt, 2013-05-28, 20:43
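As a rough illustration of this reply, a streaming-style script can consume its input without printing any key/value pairs. The sketch below is an assumption-laden example, not the poster's script: `run_mapper`, `main`, and the `side_effect` callback are hypothetical names, and the callback is a placeholder for whatever external action the script performs.

```python
import sys
from typing import Callable, Iterable


def run_mapper(lines: Iterable[str], side_effect: Callable[[str], None]) -> int:
    """Consume every input line, hand it to side_effect, and emit nothing.

    Because nothing is written to stdout, a streaming job built around this
    function would produce no key/value pairs. Returns the number of lines read.
    """
    processed = 0
    for line in lines:
        side_effect(line.rstrip("\n"))  # hypothetical external action
        processed += 1
    return processed


def main() -> None:
    """Entry point for use as a streaming script: read stdin, print nothing."""
    run_mapper(sys.stdin, lambda record: None)
```

When used as an actual script, `main()` would be called at the bottom of the file; the no-op lambda would be replaced by the real external action.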
Kai Voigt