Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 274 (0.059s).
Loading phrases to help you
refine your search...
[HIVE-7527] Support order by and sort by on Spark - Hive - [issue]
...Currently Hive depends completely on MapReduce's sorting as part of shuffling to achieve order by (global sort, one reducer) and sort by (local sort).Spark has a sort by transformation in di...
http://issues.apache.org/jira/browse/HIVE-7527    Author: Xuefu Zhang, 2014-07-27, 23:40
[HIVE-7528] Support cluster by and distributed by - Hive - [issue]
...clustered by = distributed by + sort by, so this is related to HIVE-7527. If sort by is in place, the assumption is that we don't need to do anything about distributed by or clustered by. St...
http://issues.apache.org/jira/browse/HIVE-7528    Author: Xuefu Zhang, 2014-07-27, 23:38
[HIVE-7526] Research to use groupby transformation to replace Hive existing partitionByKey and SparkCollector combination - Hive - [issue]
...Currently SparkClient shuffles data by calling paritionByKey(). This transformation outputs <key, value> tuples. However, Hive's ExecMapper expects <key, iterator<value>> t...
http://issues.apache.org/jira/browse/HIVE-7526    Author: Xuefu Zhang, 2014-07-27, 22:55
[HIVE-7493] Enhance HiveReduceFunction's row clustering - Hive - [issue]
...HiveReduceFunction is backed by Hive's ExecReducer, whose reduce function takes an input in the form of <key, value list>. However, HiveReduceFunction's input is an iterator over a set...
http://issues.apache.org/jira/browse/HIVE-7493    Author: Xuefu Zhang, 2014-07-27, 21:18
[HIVE-7292] Hive on Spark - Hive - [issue]
...Spark as an open-source data analytics cluster computing framework has gained significant momentum recently. Many Hive users already have Spark installed as their computing backbone. To take...
http://issues.apache.org/jira/browse/HIVE-7292    Author: Xuefu Zhang, 2014-07-27, 21:18
[HIVE-7525] Research to find out if it's possible to submit Spark jobs concurrently using shared SparkContext - Hive - [issue]
...Refer to HIVE-7503 and SPARK-2688. Find out if it's possible to submit multiple spark jobs concurrently using a shared SparkContext. SparkClient's code can be manipulated for this test. Here...
http://issues.apache.org/jira/browse/HIVE-7525    Author: Xuefu Zhang, 2014-07-27, 21:04
[HIVE-7503] Support Hive's multi-table insert query with Spark - Hive - [issue]
...For Hive's multi insert query (https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DML), there may be an MR job for each insert.  When we achieve this with Spark, it would b...
http://issues.apache.org/jira/browse/HIVE-7503    Author: Xuefu Zhang, 2014-07-27, 21:03
[HIVE-7384] Research into reduce-side join - Hive - [issue]
...Hive's join operator is very sophisticated, especially for reduce-side join. While we expect that other types of join, such as map-side join and SMB map-side join, will work out of the box w...
http://issues.apache.org/jira/browse/HIVE-7384    Author: Xuefu Zhang, 2014-07-26, 07:00
[HIVE-7373] Hive should not remove trailing zeros for decimal numbers - Hive - [issue]
...Currently Hive blindly removes trailing zeros of a decimal input number as sort of standardization. This is questionable in theory and problematic in practice.1. In decimal context,  nu...
http://issues.apache.org/jira/browse/HIVE-7373    Author: Xuefu Zhang, 2014-07-25, 21:59
Re: Hive User Group Meeting - Hive - [mail # dev]
...Dear Hive users and developers,As an update, the hive user group meeting during Hadoop World will be heldon Oct. 15th, from 6:30pm to 9:00pm at about.com's office at 1500 Broadway,6th floor,...
   Author: Xuefu Zhang, 2014-07-25, 21:37
Sort:
project
Hive (274)
Pig (21)
Spark (3)
Bigtop (1)
type
mail # dev (141)
issue (128)
mail # user (3)
wiki (2)
date
last 7 days (19)
last 30 days (48)
last 90 days (83)
last 6 months (142)
last 9 months (274)
author
Namit Jain (645)
Carl Steinbach (417)
Zheng Shao (382)
Brock Noland (312)
Ashutosh Chauhan (301)
Edward Capriolo (298)
Navis (277)
Gunther Hagleitner (221)
John Sichi (212)
Thejas M Nair (197)
Lefty Leverenz (190)
Xuefu Zhang (187)
Ning Zhang (171)
Kevin Wilfong (152)
He Yongqiang (139)
Sergey Shelukhin (135)
Harish Butani (121)
Eugene Koifman (119)
Thejas Nair (113)
Eric Hanson (108)
Jason Dere (108)
Nitin Pawar (107)
Prasad Mujumdar (104)
Szehon Ho (103)
Vaibhav Gumashta (103)