Results 31 to 40 of 40.
Re: Hive in EC2 - Hive - [mail # user]
...The only caveat is that you are at Amazon's mercy in terms of the latest version of Hive. Also, they have their own versioning so EMR Hive's latest version 0.7.1 could be Apache Hive's 0.6.5...
   Author: Igor Tatarinov, 2011-08-31, 03:26
how to disable mapred.reduce.tasks - Hive - [mail # user]
...I set mapred.reduce.tasks manually to have a single wave of reducers (does that make sense, by the way?)  When I save the data, I often end up with a bunch of small files because we use...
   Author: Igor Tatarinov, 2011-06-29, 23:16
Re: loading datafiles in s3 - Hive - [mail # user]
...I think the answer to 1 is No but you can confirm on the AWS EMR forum.  The problem I've been having is that if you have x=foo in the prefix of your S3 path, EMR will try to use it as...
   Author: Igor Tatarinov, 2011-06-28, 19:18
Re: Hive running out of memory - Hive - [mail # user]
...Yes, that's probably it. I found a related JIRA: https://issues.apache.org/jira/browse/HIVE-1316  It doesn't look like the EMR installation has this fix. I am going to increase the heap si...
   Author: Igor Tatarinov, 2011-06-21, 21:31
Re: left outer join on same table - Hive - [mail # user]
...The condition T2.field6='yyyyyyy;' is tested after the outer join. As a result you won't see any non-matching results. You'll need a subquery to enforce that condition. Alternatively, adding...
   Author: Igor Tatarinov, 2011-06-11, 04:30
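
The point about the filter being applied after the outer join can be illustrated with a small sketch. It uses Python's built-in sqlite3 module rather than Hive, and the table `t`, its columns, and the sample rows are hypothetical; only the idea of moving the condition into a subquery comes from the message above.

```python
import sqlite3

# Hypothetical self-referencing table; SQLite stands in for Hive here.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t (id INTEGER, parent_id INTEGER, field6 TEXT);
    INSERT INTO t VALUES
        (1, NULL, 'a'),
        (2, 1, 'yyyyyyy'),
        (3, 1, 'zzz'),
        (4, NULL, 'b');
""")

# Condition in WHERE: evaluated after the join, so rows of T1 that have
# no matching T2 row (T2.field6 is NULL) are filtered out.
filtered_after_join = conn.execute("""
    SELECT T1.id, T2.id
    FROM t T1
    LEFT OUTER JOIN t T2 ON T2.parent_id = T1.id
    WHERE T2.field6 = 'yyyyyyy'
    ORDER BY T1.id
""").fetchall()

# Condition in a subquery: T2 is restricted before the join, so every
# T1 row survives and non-matching rows carry NULL for T2.
filtered_in_subquery = conn.execute("""
    SELECT T1.id, T2.id
    FROM t T1
    LEFT OUTER JOIN (SELECT * FROM t WHERE field6 = 'yyyyyyy') T2
        ON T2.parent_id = T1.id
    ORDER BY T1.id
""").fetchall()

print(filtered_after_join)
print(filtered_in_subquery)
```

With the condition in WHERE only the matching pair remains; with the subquery all four T1 rows are kept.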
Re: Skew Join Optimization in hive - Hive - [mail # user]
...Have you tried splitting the query into 2 or 3 steps and/or enabling map joins (SET hive.auto.convert.join = true;) if some of the tables are smallish?   On Tue, Jun 7, 2011 at 12:31 PM,...
   Author: Igor Tatarinov, 2011-06-07, 19:58
Re: question about number of map tasks for small file - Hive - [mail # user]
...Can you pre-aggregate your historical data to reduce the number of files?  We used to partition our data by date but that created too many output files so now we partition by month...
   Author: Igor Tatarinov, 2011-06-01, 17:12
Re: Hive assert()? - Hive - [mail # user]
...Here is one example. I want to make sure I don't have negative prices in my data. I would like to write something like:  assert_empty(select * from Prices where price < 0) ...
   Author: Igor Tatarinov, 2011-05-26, 21:46
Re: HiveQL for 'rank() over (partition by ... order by ...)'? - Hive - [mail # user]
...Yes, we have UDF functions that compute cumulative (such as rank) and moving aggregates. In each case, the first parameter is the partitioning key so that the function knows when to 'reset' a...
   Author: Igor Tatarinov, 2011-05-25, 14:43
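
The "reset on a new partitioning key" idea can be sketched in a few lines of Python. This is not the poster's UDF, whose code is not shown; the class name `PartitionedRank` and the sample rows are hypothetical, and the sketch assumes the rows arrive already grouped by key and sorted within each group. It produces a running position and does not handle ties.

```python
class PartitionedRank:
    """Running rank that restarts whenever the partition key changes."""

    def __init__(self):
        self._last_key = object()  # sentinel that equals no real key
        self._rank = 0

    def __call__(self, partition_key):
        # A new key means a new partition: reset the counter.
        if partition_key != self._last_key:
            self._last_key = partition_key
            self._rank = 0
        self._rank += 1
        return self._rank


# Hypothetical input, already grouped by key and ordered within each group.
rows = [("a", 10), ("a", 20), ("b", 5), ("b", 7), ("b", 9)]

rank = PartitionedRank()
ranked = [(key, value, rank(key)) for key, value in rows]
print(ranked)
```

Because the function only compares each key with the previous one, it gives meaningful results only when all rows for a key are adjacent in the input.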
Re: Can Hive 0.7 Rebuild partitions ? - Hive - [mail # user]
...That's Amazon's extension to Hive and it's really handy.  On Thu, May 19, 2011 at 2:01 PM, Tim Spence wrote:  ...
   Author: Igor Tatarinov, 2011-05-19, 22:23