Results 81 to 90 of 245 (0.107s).
[HIVE-7528] Support cluster by and distributed by [Spark Branch] - Hive - [issue]
...clustered by = distributed by + sort by, so this is related to HIVE-7527. If sort by is in place, the assumption is that we don't need to do anything about distributed by or clustered by. St...
http://issues.apache.org/jira/browse/HIVE-7528    Author: Xuefu Zhang, 2014-08-19, 04:10
[HIVE-7527] Support order by and sort by on Spark [Spark Branch] - Hive - [issue]
...Currently Hive depends completely on MapReduce's sorting as part of shuffling to achieve order by (global sort, one reducer) and sort by (local sort). Spark has a sort by transformation in di...
http://issues.apache.org/jira/browse/HIVE-7527    Author: Xuefu Zhang, 2014-08-19, 02:42
[HIVE-7530] Go thru the common code to find references to HIVE_EXECUCTION_ENGINE to make sure conditions works with Spark [Spark Branch] - Hive - [issue]
...In common code, such as Utilities.java, I found a lot of references to this conf variable and special handling to a specific engine such as following: ...
http://issues.apache.org/jira/browse/HIVE-7530    Author: Xuefu Zhang, 2014-08-19, 01:21
[HIVE-7516] Add capacity control over queries running on Spark cluster [Spark Branch] - Hive - [issue]
...Add a capacity control mechanism in Hive to limit the number of queries running concurrently on Spark, which might otherwise overwhelm the cluster. The idea can be borrowed from Tez....
http://issues.apache.org/jira/browse/HIVE-7516    Author: Xuefu Zhang, 2014-08-18, 21:27
[HIVE-7541] Support union all on Spark [Spark Branch] - Hive - [issue]
...For union all operator, we will use Spark's union transformation. Refer to the design doc on wiki for more information....
http://issues.apache.org/jira/browse/HIVE-7541    Author: Xuefu Zhang, 2014-08-18, 18:36
[HIVE-7328] Refactoring: make Hive reduce side data processing reusable [Spark Branch] - Hive - [issue]
...ExecReducer is Hive's reducer implementation for MapReduce. Table rows are shuffled by MR framework to ExecReducer and further processed by ExecReducer.reduce() method, which invokes Hive's ...
http://issues.apache.org/jira/browse/HIVE-7328    Author: Xuefu Zhang, 2014-08-18, 17:55
[HIVE-7332] Create SparkClient, interface to Spark cluster [Spark Branch] - Hive - [issue]
...SparkClient is responsible for Spark job submission, monitoring, progress and error reporting, etc....
http://issues.apache.org/jira/browse/HIVE-7332    Author: Xuefu Zhang, 2014-08-18, 17:54
[HIVE-7335] Create SparkPlan, DAG representation of a Spark job [Spark Branch] - Hive - [issue]
...Encapsulate RDD, MapFunction, ReduceFunction, and SparkShuffler in a graph representation....
http://issues.apache.org/jira/browse/HIVE-7335    Author: Xuefu Zhang, 2014-08-18, 17:54
[HIVE-7525] Research to find out if it's possible to submit Spark jobs concurrently using shared SparkContext [Spark Branch] - Hive - [issue]
...Refer to HIVE-7503 and SPARK-2688. Find out if it's possible to submit multiple spark jobs concurrently using a shared SparkContext. SparkClient's code can be manipulated for this test. Here...
http://issues.apache.org/jira/browse/HIVE-7525    Author: Xuefu Zhang, 2014-08-18, 17:49
[HIVE-7597] Support analyze table - Hive - [issue]
...Both MR and Tez have a visitor processing the "analyze table ..." command. We cloned the code from Tez, but may need to adapt it for Spark, verify, and test....
http://issues.apache.org/jira/browse/HIVE-7597    Author: Xuefu Zhang, 2014-08-14, 09:49