Results 161 to 170 of 316 (0.113s).
[HIVE-7328] Refactoring: make Hive reduce side data processing reusable [Spark Branch] - Hive - [issue]
...ExecReducer is Hive's reducer implementation for MapReduce. Table rows are shuffled by MR framework to ExecReducer and further processed by ExecReducer.reduce() method, which invokes Hive's ...
http://issues.apache.org/jira/browse/HIVE-7328    Author: Xuefu Zhang, 2014-08-18, 17:55
[HIVE-7332] Create SparkClient, interface to Spark cluster [Spark Branch] - Hive - [issue]
...SparkClient is responsible for Spark job submission, monitoring, progress and error reporting, etc....
http://issues.apache.org/jira/browse/HIVE-7332    Author: Xuefu Zhang, 2014-08-18, 17:54
[HIVE-7335] Create SparkPlan, DAG representation of a Spark job [Spark Branch] - Hive - [issue]
...Encapsulate RDD, MapFunction, ReduceFunction, and SparkShuffler in a graph representation....
http://issues.apache.org/jira/browse/HIVE-7335    Author: Xuefu Zhang, 2014-08-18, 17:54
[HIVE-7525] Research to find out if it's possible to submit Spark jobs concurrently using shared SparkContext [Spark Branch] - Hive - [issue]
...Refer to HIVE-7503 and SPARK-2688. Find out if it's possible to submit multiple Spark jobs concurrently using a shared SparkContext. SparkClient's code can be manipulated for this test. Here...
http://issues.apache.org/jira/browse/HIVE-7525    Author: Xuefu Zhang, 2014-08-18, 17:49
[HIVE-7597] Support analyze table - Hive - [issue]
...Both MR and Tez have a visitor processing the "analyze table ..." command. We cloned the code from Tez, but may need to adapt it for Spark, verify, and test....
http://issues.apache.org/jira/browse/HIVE-7597    Author: Xuefu Zhang, 2014-08-14, 09:49
[HIVE-7493] Enhance HiveReduceFunction's row clustering - Hive - [issue]
...HiveReduceFunction is backed by Hive's ExecReducer, whose reduce function takes an input in the form of <key, value list>. However, HiveReduceFunction's input is an iterator over a set...
http://issues.apache.org/jira/browse/HIVE-7493    Author: Xuefu Zhang, 2014-08-12, 20:26
[HIVE-7569] Make sure multi-MR queries work - Hive - [issue]
...With the latest dev effort, queries that would involve multiple MR jobs should now be supported by Spark, except for sorting, multi-insert, union, and join (map join and SMB might just work)...
http://issues.apache.org/jira/browse/HIVE-7569    Author: Xuefu Zhang, 2014-08-12, 20:26
[HIVE-7492] Enhance SparkCollector - Hive - [issue]
...SparkCollector is used to collect the rows generated by HiveMapFunction or HiveReduceFunction. It is currently backed by an ArrayList, and thus has unbounded memory usage. Ideally, the collec...
http://issues.apache.org/jira/browse/HIVE-7492    Author: Xuefu Zhang, 2014-08-07, 22:57
Re: [jira] [Commented] (HIVE-7624) Reduce operator initialization failed when running multiple MR query on spark - Hive - [mail # dev]
...Another thing to watch is HiveConf's thread safety. I see it uses many static variables, but I'm not sure if this is the cause. On Tue, Aug 5, 2014 at 10:39 PM, Chao (JIRA) wrote: ...
   Author: Xuefu Zhang, 2014-08-07, 00:51
[HIVE-7526] Research to use groupby transformation to replace Hive existing partitionByKey and SparkCollector combination - Hive - [issue]
...Currently SparkClient shuffles data by calling partitionByKey(). This transformation outputs <key, value> tuples. However, Hive's ExecMapper expects <key, iterator<value>> t...
http://issues.apache.org/jira/browse/HIVE-7526    Author: Xuefu Zhang, 2014-08-04, 22:33