Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 11 to 20 from 41 (0.124s).
Loading phrases to help you
refine your search...
Re: java.io.IOException: Type mismatch in key from map: expected org.apache.hadoop.io.LongWritable, recieved org.apache.hadoop.io.Text - Hadoop - [mail # user]
...Hi Harit  You need to set the Key Type as well. If you are using different Data Type for Key and Values in your map output with respect to reduce output then you need to specify both. &...
   Author: Bejoy Ks, 2012-08-02, 23:07
Re: No Support for setting MapFileOutputFormat in newer API - Hadoop - [mail # user]
...HI Abhinav  MapFileOutputFormat is currently not available for the new mapreduce API in hadoop 1.x . However a jira is in place to accommodate it in the future releases.  https://i...
   Author: Bejoy Ks, 2012-07-27, 12:29
Re: Hadoop 1.0.3 start-daemon.sh doesn't start all the expected daemons - Hadoop - [mail # user]
...Hi Dinesh  Try using $HADOOP_HOME/bin/start-all.sh . It starts all the hadoop daemons including TT and DN.   Regards Bejoy KS...
   Author: Bejoy Ks, 2012-07-27, 12:16
Re: Unexpected end of input stream (GZ) - Hadoop - [mail # user]
...Hi Oleg  which was the file split processed by that task. The split information is available under the status column for each task.  The file split information is not available on ...
   Author: Bejoy Ks, 2012-07-24, 10:10
Re: Error using MultipleInputs - Hadoop - [mail # user]
...Hi Sanchita  Try your code after commenting the following Line of code,  //conf.setInputFormat(TextInputFormat.class);  AFAIK This explicitly sets the input format as TextInpu...
   Author: Bejoy Ks, 2012-07-05, 12:08
Re: how to fine tuning my map reduce job that is generating a lot of intermediate key-value pairs (a lot of I/O operations) - Hadoop - [mail # user]
...Jane,        From my first look, properties that can help you could be - Increase io sort factor to 100 - Increase io.sort.mb to 512Mb - increase map task heap size to 2G...
   Author: Bejoy Ks, 2012-04-03, 11:48
Re: 0 tasktrackers in jobtracker but all datanodes present - Hadoop - [mail # user]
...Gaurav        NN memory might have hit its upper bound. As a bench mark, for every 1 million files/blocks/directories 1GB of memory is required on the NN. The number of f...
   Author: Bejoy Ks, 2012-04-02, 08:25
Re: tasktracker/jobtracker.. expectation.. - Hadoop - [mail # user]
...Hi Patai      JobTracker automatically handles this situation by attempting the task on different nodes.Could you verify the number of attempts that these failed tasks made. W...
   Author: Bejoy Ks, 2012-03-22, 18:18
Re: setNumTasks - Hadoop - [mail # user]
...Hi Mohit       The number of map tasks is determined by your number of input splits and the Input Format used by your MR job. Setting this value won't help you control the sam...
   Author: Bejoy Ks, 2012-03-22, 15:01
[expand - 1 more] - Re: how to implements the 'diff' cmd in hadoop - Hadoop - [mail # user]
...Yes, if you are having more than 2 files to be compared against then, the file name/ id is required from mapper. If it is just two files  and you just want to know which lines are not u...
   Author: Bejoy Ks, 2012-03-20, 11:13
Hive (94)
MapReduce (51)
Hadoop (40)
HDFS (12)
Pig (1)
mail # user (41)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (41)
Harsh J (559)
Owen O'Malley (394)
Steve Loughran (390)
Todd Lipcon (238)
Eli Collins (182)
Alejandro Abdelnur (178)
Arun C Murthy (163)
Allen Wittenauer (148)
Chris Nauroth (146)
Ted Yu (125)
Tom White (120)
Daryn Sharp (115)
Nigel Daley (115)
Konstantin Shvachko (107)
Doug Cutting (96)
Aaron Kimball (94)
Colin Patrick McCabe (92)
Edward Capriolo (88)
Mark Kerzner (87)
jason hadoop (82)
Hairong Kuang (74)
Konstantin Boudnik (72)
Runping Qi (72)
Benoy Antony (69)
Suresh Srinivas (64)
Bejoy Ks