Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 25 (0.089s).
Loading phrases to help you
refine your search...
Multi-group-by with transform leads to incorrect optimization - Hive - [mail # user]
...Hello,I've encountered an issue with hive's predicate push down optimization whenmulti-group-by is used together with transform. Here is a simple testcaseto illustrate my point:CREATE TABLE ...
   Author: Jan Dolinár, 2014-01-23, 09:28
Bug when adding multiple partitions - Hive - [mail # user]
...Hi everyone,  Consider following DDL:      CREATE TABLE partition_test       (a INT)     PARTITIONED BY (b INT);      ALTER TABLE ...
   Author: Jan Dolinár, 2013-08-15, 05:50
Re: Calling same UDF multiple times in a SELECT query - Hive - [mail # user]
...Hi,  If you use annotation, Hive should be able to optimize it to single call:   @UDFType(deterministic = true)  The obvious condition is that it must always return the identi...
   Author: Jan Dolinár, 2013-07-23, 19:35
Re: Table Wrapper - Hive - [mail # user]
...Slightly less "hackish" way to do this without joins is to write custom UDF that will take data.BLOCK__OFFSET__INSIDE__FILE as input parameter and return the corresponding data from the smal...
   Author: Jan Dolinár, 2013-06-27, 09:59
Re: Run queries from external files as subqueries - Hive - [mail # user]
...Quick and dirty way to do such thing would be to use some kind of preprocessor. To avoid writing one, you could use e.g. the one from GCC, with just a little help from sed:     &nb...
   Author: Jan Dolinár, 2013-06-20, 20:54
Re: How to convert Array to String in Hive0.7 - Hive - [mail # user]
...Hi,  I was facing the same problem few weeks ago. The final solution was quick and dirty - I grabbed the code for this UDF from http://svn.apache.org/viewvc/hive/trunk/ql/src/java/org/a...
   Author: Jan Dolinár, 2013-06-14, 10:07
Re: Loopup objects in distributed cache - Hive - [mail # user]
...Hello Vivek,  GenericUDTF has method initialize() which is only called once per task. So if you read your files in this method and store the structures in memory then the overhead is re...
   Author: Jan Dolinár, 2013-04-04, 07:11
Re: how to make data statistics efficiency in hive? - Hive - [mail # user]
...Hi Andy,  I'm not sure if I entirely understood your question, but I think you're looking for something like this:  select     concat(date,':',uid),     sum(1) ...
   Author: Jan Dolinár, 2013-03-27, 06:12
Re: Hive NR map progress inconsistent and regurlarly restart from 0% - Hive - [mail # user]
...This usually happens when some task fail, their progress is then not counted, hence the 'restart'. Check your task logs for failures.  Jan   On Wed, Nov 7, 2012 at 12:30 PM, Alexan...
   Author: Jan Dolinár, 2012-11-07, 11:36
Re: Regexp character classes clarification - Hive - [mail # user]
...Hi Neil,  Have you tried to test your regexes in Java? I was using one of the applets available on the web (e.g. http://www.cis.upenn.edu/~matuszek/General/RegexTester/regex-tester.html...
   Author: Jan Dolinár, 2012-11-01, 14:32
Sort:
project
Hive (25)
type
mail # user (25)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (25)
author
Namit Jain (645)
Carl Steinbach (416)
Brock Noland (390)
Zheng Shao (382)
Ashutosh Chauhan (338)
Edward Capriolo (297)
Navis (288)
Gunther Hagleitner (231)
Lefty Leverenz (217)
Thejas M Nair (217)
John Sichi (212)
Xuefu Zhang (208)
Ning Zhang (171)
Kevin Wilfong (152)
Sergey Shelukhin (148)
Harish Butani (144)
He Yongqiang (139)
Thejas Nair (135)
Jason Dere (132)
Eugene Koifman (123)
Szehon Ho (120)
Nitin Pawar (113)
Vaibhav Gumashta (113)
Eric Hanson (108)
Prasad Mujumdar (106)
Jan Dolinár