Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 276 (0.124s).
Loading phrases to help you
refine your search...
[HIVE-7328] Refactoring: make Hive reduce side data processing reusable - Hive - [issue]
...ExecReducer is Hive's reducer implementation for MapReduce. Table rows are shuffled by MR framework to ExecReducer and further processed by ExecReducer.reduce() method, which invokes Hive's ...
http://issues.apache.org/jira/browse/HIVE-7328    Author: Xuefu Zhang, 2014-07-30, 10:51
[HIVE-7327] Refactoring: make Hive map side data processing reusable - Hive - [issue]
...ExecMapper is Hive's mapper implementation for MapReduce. Table rows are read by MR framework and processed by ExecMapper.map() method, which invokes Hive's map-side operator tree starting f...
http://issues.apache.org/jira/browse/HIVE-7327    Author: Xuefu Zhang, 2014-07-30, 10:49
[HIVE-7541] Support union all on Spark - Hive - [issue]
...For union all operator, we will use Spark's union transformation. Refer to the design doc on wiki for more information....
http://issues.apache.org/jira/browse/HIVE-7541    Author: Xuefu Zhang, 2014-07-30, 07:01
[HIVE-7292] Hive on Spark - Hive - [issue]
...Spark as an open-source data analytics cluster computing framework has gained significant momentum recently. Many Hive users already have Spark installed as their computing backbone. To take...
http://issues.apache.org/jira/browse/HIVE-7292    Author: Xuefu Zhang, 2014-07-30, 06:55
[HIVE-7330] Create SparkTask - Hive - [issue]
...SparkTask handles the execution of SparkWork. It will execute a graph of map and reduce work using a SparkClient instance....
http://issues.apache.org/jira/browse/HIVE-7330    Author: Xuefu Zhang, 2014-07-30, 06:44
[HIVE-7334] Create SparkShuffler, shuffling data between map-side data processing and reduce-side processing - Hive - [issue]
...Please refer to the design spec....
http://issues.apache.org/jira/browse/HIVE-7334    Author: Xuefu Zhang, 2014-07-30, 06:36
[HIVE-7526] Research to use groupby transformation to replace Hive existing partitionByKey and SparkCollector combination - Hive - [issue]
...Currently SparkClient shuffles data by calling paritionByKey(). This transformation outputs <key, value> tuples. However, Hive's ExecMapper expects <key, iterator<value>> t...
http://issues.apache.org/jira/browse/HIVE-7526    Author: Xuefu Zhang, 2014-07-30, 05:32
[HIVE-7439] Spark job monitoring and error reporting - Hive - [issue]
...After Hive submits a job to Spark cluster, we need to report to user the job progress, such as the percentage done, to the user. This is especially important for long running queries. Moreov...
http://issues.apache.org/jira/browse/HIVE-7439    Author: Xuefu Zhang, 2014-07-30, 04:51
[HIVE-7493] Enhance HiveReduceFunction's row clustering - Hive - [issue]
...HiveReduceFunction is backed by Hive's ExecReducer, whose reduce function takes an input in the form of <key, value list>. However, HiveReduceFunction's input is an iterator over a set...
http://issues.apache.org/jira/browse/HIVE-7493    Author: Xuefu Zhang, 2014-07-30, 02:25
[HIVE-7492] Enhance SparkCollector - Hive - [issue]
...SparkCollector is used to collect the rows generated by HiveMapFunction or HiveReduceFunction. It currently is backed by a ArrayList, and thus has unbounded memory usage. Ideally, the collec...
http://issues.apache.org/jira/browse/HIVE-7492    Author: Xuefu Zhang, 2014-07-30, 02:23
Sort:
project
Hive (276)
Pig (21)
Spark (3)
Bigtop (1)
type
mail # dev (141)
issue (130)
mail # user (3)
wiki (2)
date
last 7 days (28)
last 30 days (51)
last 90 days (84)
last 6 months (143)
last 9 months (276)
author
Namit Jain (645)
Carl Steinbach (417)
Zheng Shao (382)
Brock Noland (313)
Ashutosh Chauhan (300)
Edward Capriolo (298)
Navis (278)
Gunther Hagleitner (222)
John Sichi (212)
Thejas M Nair (198)
Lefty Leverenz (190)
Xuefu Zhang (189)
Ning Zhang (171)
Kevin Wilfong (152)
He Yongqiang (139)
Sergey Shelukhin (136)
Harish Butani (121)
Eugene Koifman (119)
Thejas Nair (113)
Jason Dere (109)
Eric Hanson (108)
Nitin Pawar (108)
Szehon Ho (106)
Prasad Mujumdar (104)
Vaibhav Gumashta (103)