Results 21 to 30 of 71 (0.064s).
Re: Join-package in new API? - MapReduce - [mail # user]
...It was ported in a later version:  https://issues.apache.org/jira/browse/MAPREDUCE-355  -C  On Wed, Oct 10, 2012 at 7:47 AM, Sigurd Spieckermann  wrote:...
   Author: Chris Douglas, 2012-10-18, 18:56
Re: How does map-merge work exactly? - MapReduce - [mail # user]
...On Thu, Sep 13, 2012 at 7:04 AM, Martin Dobmeier  wrote:  A segment in this context is a fraction of spill output for a particular reduce. Each spill contains a segment for every r...
   Author: Chris Douglas, 2012-09-17, 08:21
[MAPREDUCE-4585] Checkpoint shuffle aggregation as map output - MapReduce - [issue]
...Map output collected during the shuffle can be spilled and written as a composite of map outputs. Particularly if the job employs a combiner, this checkpoint can provide fault tolerance and ...
http://issues.apache.org/jira/browse/MAPREDUCE-4585    Author: Chris Douglas, 2012-08-27, 20:19
[MAPREDUCE-4592] Collect statistics on key distributions/samples in intermediate data - MapReduce - [issue]
...Some jobs would benefit from statistics about the key distribution of intermediate data, including sampling for jobs implementing a total-order in the job. These data can inform a policy for...
http://issues.apache.org/jira/browse/MAPREDUCE-4592    Author: Chris Douglas, 2012-08-25, 01:21
[MAPREDUCE-4591] Extend IFile format to include optional metadata - MapReduce - [issue]
...By including sections for per-segment and per-spill metadata, one can embed information useful to consumers (e.g., information on key ranges, sampling, etc.). These changes make IFiles ideal...
http://issues.apache.org/jira/browse/MAPREDUCE-4591    Author: Chris Douglas, 2012-08-25, 01:21
[MAPREDUCE-4590] ReduceTask preemption - MapReduce - [issue]
...With a facility for addressing subsequences of the reduce input keygroups (such as MAPREDUCE-4587), a ReduceTask can efficiently checkpoint and restart itself. In some cases, a memento (i.e....
http://issues.apache.org/jira/browse/MAPREDUCE-4590    Author: Chris Douglas, 2012-08-25, 01:21
[MAPREDUCE-4589] MapTask preemption - MapReduce - [issue]
...For many input types, it is possible to restore the state of a RecordReader by writing a new split for the remaining data (e.g., storing the inflater state with a file offset for gzip text)....
http://issues.apache.org/jira/browse/MAPREDUCE-4589    Author: Chris Douglas, 2012-08-25, 01:20
[MAPREDUCE-4588] Map local segments as on-disk segments - MapReduce - [issue]
...Local map segments should never be handled as though they were remote (i.e., copied through a servlet to local disk). This optimization is uniformly more efficient for the fetch, though it...
http://issues.apache.org/jira/browse/MAPREDUCE-4588    Author: Chris Douglas, 2012-08-25, 01:20
[MAPREDUCE-4587] Support fetch by key boundaries for memcmp types - MapReduce - [issue]
...Intermediate data addressable by key support not only restartable streams, but partitioning after the map output is written. With sampling of map output, a job can implement a total-order a...
http://issues.apache.org/jira/browse/MAPREDUCE-4587    Author: Chris Douglas, 2012-08-25, 01:20
Re: question about org.apache.hadoop.mapred.join - MapReduce - [mail # user]
...Your understanding is correct. The framework doesn't do anything to align input splits across datasets. In the situation you describe, where one can't seek among key groups in the input data...
   Author: Chris Douglas, 2012-04-10, 17:33