Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 21 to 25 from 25 (0.125s).
Loading phrases to help you
refine your search...
Re: count of distinct FROM multiple columns - Hive - [mail # user]
...Hi  A quick solution that comes first to my mind is to join the columns you want to combine into an array and then use the explode UDTF:  SELECT col1, COUNT(distinct combined) FROM...
   Author: Jan Dolinár, 2012-06-22, 12:52
[expand - 1 more] - Re: UDTF fails when used in LATERAL VIEW - Hive - [mail # user]
...Hi Mark,  Thanks for suggestion, it is not that naïve :) I tried a lot of things and combinations, including Text and even LazyString (as I was getting exceptions about converting Strin...
   Author: Jan Dolinár, 2012-06-22, 05:59
[expand - 1 more] - Re: Quering RDBMS table in a Hive query - Hive - [mail # user]
...On 6/15/12, Ruslan Al-Fakikh  wrote:  Both is possible, InputFormat and/or UD(T)F. It all depends on what you need. I actually use both - in Input format I load lists of allowed va...
   Author: Jan Dolinár, 2012-06-15, 12:35
[expand - 5 more] - Re: Multi-group-by select always scans entire table - Hive - [mail # user]
...Thank you very much Mark for your investigation and explanations.  I'm well aware of the fact that hadoop 0.7.1 is quite an old code and that newer version might perform better - that i...
   Author: Jan Dolinár, 2012-06-08, 05:42
[expand - 2 more] - Re: Multi-GroupBy-Insert optimization - Hive - [mail # user]
...Hi Shan,  If you happen to have a lot of repeated data (in the most general grouping), you might get some speedup by little pre-aggregation. The following code should produce the same r...
   Author: Jan Dolinár, 2012-06-05, 12:47
Sort:
project
Hive (25)
type
mail # user (25)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (25)
author
Namit Jain (645)
Carl Steinbach (409)
Brock Noland (397)
Zheng Shao (382)
Navis (300)
Edward Capriolo (299)
Ashutosh Chauhan (298)
Gunther Hagleitner (242)
Thejas M Nair (235)
Lefty Leverenz (223)
John Sichi (212)
Xuefu Zhang (193)
Ning Zhang (171)
Sergey Shelukhin (162)
Kevin Wilfong (152)
He Yongqiang (139)
Eugene Koifman (130)
Alan Gates (122)
Jason Dere (120)
Nitin Pawar (113)
Vaibhav Gumashta (113)
Harish Butani (111)
Prasanth J (111)
Joydeep Sen Sarma (95)
Szehon Ho (94)
Jan Dolinár
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB