| clear query|facets|time |
Search criteria: .
Results from 31 to 40 from
44 (0.154s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: Confusion about IN clause - Hive - [mail # user]
|
|
...I think the doc refers to an IN subquery WHERE x IN (SELECT blah FROM ...) the simple WHERE col IN ('x', 'y', 'z') works fine. I imagine none of these work: http://www.dba-...
|
|
|
Author: Igor Tatarinov,
2011-12-13, 06:19
|
|
|
Re: CombineHiveInputFormat and Merge files not working for compressed text files - Hive - [mail # user]
|
|
...I might be wrong but I think EMR inserts a reduce job when writing data into S3. At least in my case, I am able to create a single output file by SET mapred.reduce.tasks = 1; INSERT OV...
|
|
|
Author: Igor Tatarinov,
2011-11-30, 08:07
|
|
|
Re: Hive in EC2 - Hive - [mail # user]
|
|
...The only caveat is that you are at Amazon's mercy in terms of the latest version of Hive. Also, they have their own versioning so EMR Hive's latest version 0.7.1 could be Apache Hive's 0.6.5...
|
|
|
Author: Igor Tatarinov,
2011-08-31, 03:26
|
|
|
how to disable mapred.reduce.tasks - Hive - [mail # user]
|
|
...I set mapred.reduce.tasks manually to have a single wave of reducers (does that make sense, by the way?) When I save the data, I often end up with a bunch of small files because we use...
|
|
|
Author: Igor Tatarinov,
2011-06-29, 23:16
|
|
|
Re: loading datafiles in s3 - Hive - [mail # user]
|
|
...I think the answer to 1 is No but you can confirm on the AWS EMR forum. The problem I've been having is that if you have x=foo in the prefix of y our S3 path, EMR will try to use it as...
|
|
|
Author: Igor Tatarinov,
2011-06-28, 19:18
|
|
|
Re: Hive running out of memory - Hive - [mail # user]
|
|
...Yes, that's probably it. I found a related JIRA: https://issues.apache.org/jira/browse/HIVE-1316 doesn't look like the EMR installation has this fix. I am going to increase the heap si...
|
|
|
Author: Igor Tatarinov,
2011-06-21, 21:31
|
|
|
Hive running out of memory - Hive - [mail # user]
|
|
...I have a table with 3 levels of partitioning and about 10,000 files (one file at every 'leaf'). I am using EMR and the table is stored in S3. For some reason, Hive can't even start running a...
|
|
|
Author: Igor Tatarinov,
2011-06-21, 07:19
|
|
|
Re: left outer join on same table - Hive - [mail # user]
|
|
...The condition T2.field6='yyyyyyy;' is tested after the outer join. As a result you won't see any non-matching results. You'll need a subquery to enforce that condition. Alternatively, adding...
|
|
|
Author: Igor Tatarinov,
2011-06-11, 04:30
|
|
|
Re: Skew Join Optimization in hive - Hive - [mail # user]
|
|
...Have you tried splitting the query into 2 or 3 steps and/or enabling map jons (SET hive.auto.convert.join = true;) if some of the tables are smallish? On Tue, Jun 7, 2011 at 12:31 PM,...
|
|
|
Author: Igor Tatarinov,
2011-06-07, 19:58
|
|
|
Re: question about number of map tasks for small file - Hive - [mail # user]
|
|
...Can you pre-aggregate your historical data to reduce the number of files? We used to partition our data by date but that created too many output files so now we partition by month. &nb...
|
|
|
Author: Igor Tatarinov,
2011-06-01, 17:12
|
|
|
|