Results 61 to 70 of 76.
Re: question for understanding partitioning - MapReduce - [mail # user]
...1) you need not have 26 reducers but you want 26 partitions - you might send to int reducer = character % Math.min(26,nreducers); // this ensures that all items with A go...
   Author: Steve Lewis, 2011-01-18, 20:33
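The modulo idea in the snippet above can be sketched in plain Java. This is a minimal illustration under assumptions, not the code from the original message: the class name LetterPartitionSketch and the method partitionFor are hypothetical, the letter is taken as an offset from 'A' rather than the raw character code, and keys that do not start with a letter are sent to partition 0. It has no Hadoop dependency; in a real job the same arithmetic would sit inside a custom Partitioner's getPartition method.

```java
public class LetterPartitionSketch {

    // Hypothetical helper: maps the first letter of a key to a reducer index.
    // Assumes numReducers >= 1.
    static int partitionFor(String key, int numReducers) {
        if (key == null || key.isEmpty()) {
            return 0; // assumption: empty keys go to partition 0
        }
        char first = Character.toUpperCase(key.charAt(0));
        if (first < 'A' || first > 'Z') {
            return 0; // assumption: non-letter keys go to partition 0
        }
        int letter = first - 'A'; // 0 for A, 25 for Z
        // With 26 or more reducers each letter gets its own partition;
        // with fewer, letters wrap around but a given letter always
        // lands on the same reducer.
        return letter % Math.min(26, numReducers);
    }

    public static void main(String[] args) {
        for (String key : new String[] {"Apple", "apricot", "Kiwi", "Zebra"}) {
            System.out.println(key + " -> " + partitionFor(key, 10));
        }
    }
}
```

Because the result depends only on the first letter and the reducer count, all keys starting with the same letter reach the same reducer regardless of how many reducers are configured.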
Re: Mapper runs only on one machine - MapReduce - [mail # user]
...Are you sure your input file is splittable? Many files (say, gzip) are not, and such files must be processed on a single machine. On Tue, Nov 16, 2010 at 9:24 AM, wrote: ...
   Author: Steve Lewis, 2010-11-16, 17:33
Re: Big split file to Partitioner - MapReduce - [mail # user]
...It is a good idea to ask what the meaning of the split is. Typically a split is one per line but I have written splits which return the entire file for a small file - say an XML document ...
   Author: Steve Lewis, 2010-08-22, 17:47
Re: Fixing a failed reduce task - MapReduce - [mail # user]
...Yes - of course, but the question is whether there is a way to do it while the job is running rather than restarting with different parameters. On Tue, Jul 13, 2010 at 4:51 PM, Ted Yu ...
   Author: Steve Lewis, 2010-07-14, 01:57
Restarting a cluster - MapReduce - [mail # user]
...We have a cluster with 4 Cloudera VMs -   hadoop fs -ls / says 10/07/12 05:42:22 INFO ipc.Client: Retrying connect to server: localhost/ Already tried 0 time(s). 10/07/1...
   Author: Steve Lewis, 2010-07-12, 18:51
How does performance scale with the size of the data? - MapReduce - [mail # user]
...Assume we have a medium size cluster - say 20 nodes and that the cluster is used for one job and cannot change in size. Assume we are sorting a large data set. As we increase the size of the...
   Author: Steve Lewis, 2010-07-01, 05:15
Re: Who creates job.jar? - MapReduce - [mail # user]
...I have attached code for creating a Hadoop Jar - All you need to do is run HadoopDeployer in the same environment that your hadoop job runs as a local process (You did test your job in this ...
   Author: Steve Lewis, 2010-06-25, 15:55
Custom File reader - MapReduce - [mail # user]
...I have a number of files which can be read and converted into a series of lines of text - however the means of reading the file is not known to the standard Hadoop splitters. I understand th...
   Author: Steve Lewis, 2010-06-24, 19:44
Using a custom FileSplitter? - MapReduce - [mail # user]
...Assume I have one of the two situations (I have both) 1) I have a directory with several hundred files - of these some fraction need to be passed to the mapper (say the ones ending in ".foo"...
   Author: Steve Lewis, 2010-06-23, 17:21
Newbie - question - how do I use Hadoop to sort a very large file - MapReduce - [mail # user]
...Assume I have a large file called *BigData.unsorted* (say 500GB) consisting of lines of text. Assume that these lines are in random order - I understand how to assign a key to lines a...
   Author: Steve Lewis, 2010-06-23, 17:15