Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 31 to 40 from 68 (0.089s).
Loading phrases to help you
refine your search...
RE: Yarn HDFS and Yarn Exceptions when processing "larger" datasets. - MapReduce - [mail # user]
...Blah blah, Can you build and run the DistributedShell example?  If it does not run correctly this would tend to implicate your configuration.  If it run correctly then your code is...
   Author: John Lilley, 2013-07-02, 18:35
RE: some idea about the Data Compression - MapReduce - [mail # user]
...Geelong,   1.       These files will probably be some standard format like .gz or .bz2 or .zip.  In that case, pick an appropriate InputFormat.  See e.g. http:/...
   Author: John Lilley, 2013-07-02, 16:18
typical JSON data sets - MapReduce - [mail # user]
...I would like to hear your experiences working with large JSON data sets, specifically:  1)      How large is each JSON document?  2)      Do they tend...
   Author: John Lilley, 2013-07-02, 16:04
[expand - 2 more] - RE: intermediate results files - MapReduce - [mail # user]
...Replication also has downstream effects: it puts pressure on the available network bandwidth and disk I/O bandwidth when the cluster is loaded. john  From: Mohammad Tariq [mailto:[EMAIL...
   Author: John Lilley, 2013-07-02, 15:39
[expand - 1 more] - RE: Assignment of data splits to mappers - MapReduce - [mail # user]
...Bertrand,  Ah yes, I can see the wisdom of smaller tasks in (1).  Given that, does MR attempt to assign multiple blocks per task when the #blocks >> #nodes?  Regarding (...
   Author: John Lilley, 2013-07-01, 23:07
Hadoop database ecosystem overview - MapReduce - [mail # user]
...I'd like to find a web site or some slides that clearly delineate the "databases" of the Hadoop ecosystem and what they are each good at.  If we look at HBase, Hive, and Cassandra (and ...
   Author: John Lilley, 2013-06-17, 14:22
HDFS file reader and buffering - MapReduce - [mail # user]
...Do the HDFS file-reader classes perform internal buffering? Thanks John  ...
   Author: John Lilley, 2013-06-16, 13:33
RE: Shuffle design: optimization tradeoffs - MapReduce - [mail # user]
...Albert, Thanks for the link.  This is indeed what I am talking about. The authors have taken the idea even further, avoiding disk writes on either the mapper or reducer side.  It's...
   Author: John Lilley, 2013-06-15, 13:39
RE: Why/When partitioner is used. - MapReduce - [mail # user]
...There are kind of two parts to this.  The semantics of MapReduce promise that all tuples sharing the same key value are sent to the same reducer, so that you can write useful MR applica...
   Author: John Lilley, 2013-06-07, 14:03
efficiency of LocalResources and archives - MapReduce - [mail # user]
...Suppose that I have a large archive in HDFS, say, containing 500 files and 4GB.  I want to make this available via YARN LocalResource.  The archive doesn't change very often (maybe...
   Author: John Lilley, 2013-06-06, 20:10
MapReduce (59)
Hadoop (26)
HDFS (17)
Avro (1)
HBase (1)
mail # user (67)
mail # dev (1)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (68)
Harsh J (454)
Arun C Murthy (326)
Vinod Kumar Vavilapalli (309)
Todd Lipcon (223)
Amar Kamat (181)
Thomas Graves (166)
Jason Lowe (162)
Amareshwari Sriramadasu (152)
Sandy Ryza (124)
Tom White (111)
Siddharth Seth (109)
Aaron Kimball (107)
Owen O'Malley (105)
Alejandro Abdelnur (103)
Devaraj K (103)
Ramya Sunil (103)
Robert Joseph Evans (101)
Hemanth Yamijala (97)
Steve Loughran (90)
Ted Yu (80)
Eli Collins (77)
Ravi Gummadi (76)
Karthik Kambatla (71)
Mahadev konar (67)
Ravi Prakash (66)
John Lilley