Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 27 (0.056s).
Loading phrases to help you
refine your search...
Re: about rack awareness - Hadoop - [mail # user]
...Rack Awareness actually should be called Switch Awareness, and that¡¯s what people typically do: Nodes in a rack are at the same switch, also you should have balanced capacity across racks/s...
   Author: Kai Voigt, 2014-07-04, 03:44
Re: Map  reduce Query - Hadoop - [mail # user]
...That’s exactly what MapReduce does. The input is processed by the mapper function, and its output will be automatically sent into the reducer function. Between mappers and reducers we have t...
   Author: Kai Voigt, 2014-06-19, 10:07
Re: Counters in MapReduce - Hadoop - [mail # user]
...Like you said, just wrap your 3 jobs into a while loop and check the built-in counters, like the number of reduce output records to check if the job output was empty.Unfortunately, oozie can...
   Author: Kai Voigt, 2014-06-09, 09:47
Re: - Hadoop - [mail # user]
...In my opinion, another 2782829 times, give or take a few.  Or try reading and understanding http://hadoop.apache.org/mailing_lists.html otherwise which tells you to send an email to [EM...
   Author: Kai Voigt, 2013-03-06, 13:04
Re: aggregation by time window - Hadoop - [mail # user]
...Hi again,  the idea is that you emit every event multiple times. So your map input record (event1, 10:07) will be emitted seven times during the map() call. Like I said, (10:04,event1),...
[+ more]    Author: Kai Voigt, 2013-01-28, 13:48
Re: Get the name of node where mapper is running - Hadoop - [mail # user]
...Hello,  the JobTracker has a built-in Web UI (http://hostname_of_jobtracker:50030/) where you can get details for all completed and running jobs. For the map phase, it will tell you on ...
   Author: Kai Voigt, 2012-11-21, 18:06
Re: Transfer large file >50Gb with DistCp from s3 to cluster - Hadoop - [mail # user]
...Hi,  my guess is that you run "hadoop distcp" on one of the datanodes... In  that case, the node will get the first replica of each block. But you  should also see copies on m...
   Author: Kai Voigt, 2012-09-04, 20:09
Re: Hadoop or HBase - Hadoop - [mail # user]
...Having a distributed filesystem doesn't save you from having backups. If  someone deletes a file in HDFS, it's gone.  What backend storage is supported by your CMS?  Kai  ...
[+ more]    Author: Kai Voigt, 2012-08-28, 10:18
Re: Simple hadoop processes/testing on windows machine - Hadoop - [mail # user]
...I suggest using a virtual machine with all required services installed  and configured.  Cloudera offers a distribution as a VM, at  https://ccp.cloudera.com/display/SUPPORT/C...
   Author: Kai Voigt, 2012-07-25, 22:15
Re: Counting records - Hadoop - [mail # user]
...Hi,  an additional idea is to use the counter API inside the framework.   http://diveintodata.org/2011/03/15/an-example-of-hadoop-mapreduce-counter/  has a good example.  ...
   Author: Kai Voigt, 2012-07-23, 14:32
Hadoop (27)
MapReduce (11)
HDFS (5)
Pig (1)
Sqoop (1)
mail # user (25)
mail # dev (2)
last 7 days (0)
last 30 days (1)
last 90 days (3)
last 6 months (3)
last 9 months (27)
Harsh J (554)
Owen O'Malley (394)
Steve Loughran (382)
Todd Lipcon (238)
Eli Collins (182)
Alejandro Abdelnur (163)
Arun C Murthy (162)
Chris Nauroth (142)
Allen Wittenauer (128)
Tom White (120)
Ted Yu (118)
Nigel Daley (115)
Daryn Sharp (110)
Konstantin Shvachko (107)
Doug Cutting (95)
Aaron Kimball (94)
Edward Capriolo (87)
Colin Patrick McCabe (86)
Mark Kerzner (86)
jason hadoop (82)
Hairong Kuang (74)
Konstantin Boudnik (72)
Runping Qi (72)
Benoy Antony (69)
Suresh Srinivas (65)
Kai Voigt