Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 44 (1.039s).
Loading phrases to help you
refine your search...
string to int conversion (with leading zeros) - Hive - [mail # user]
...Hi all,  I need to convert a string attribute that contains ints with occasional leading zeroes. Unfortunately, I can't simply cast() as Hive will try to parse those numbers as octals. ...
   Author: Igor Tatarinov, 2013-04-11, 18:23
Re: Huge join performance issue - Hive - [mail # user]
...Did you verify that all your available mappers are running (and reducers too)? If you have a small number of partitions with huge files, you might me underutilizing mappers (check that the f...
   Author: Igor Tatarinov, 2013-04-08, 18:39
Re: Need rank() - Hive - [mail # user]
...You are getting the error because you are ORDERing BY rank but rank is not in the top SELECT  Also, DISTRIBUTE BY/SORT BY are done after SELECT so you have to use a subquery: SELECT ......
   Author: Igor Tatarinov, 2013-04-02, 17:56
Re: question about machine learning on Hive - Hive - [mail # user]
...Here is how Twitter does it with Pig: http://www.umiacs.umd.edu/~jimmylin/publications/Lin_Kolcz_SIGMOD2012.pdf  We use a similar approach and I think that Pig, being somewhat lower-lev...
   Author: Igor Tatarinov, 2013-01-17, 21:29
Re: Rolling MAU computation - Hive - [mail # user]
...You just need to put the join condition in the WHERE clause. That way Hive will do a cartesian product followed by a filter.  On Fri, Oct 12, 2012 at 1:02 PM, Tom Hubina  wrote: &n...
   Author: Igor Tatarinov, 2012-10-12, 20:08
Re: Rolling MAU computation - Hive - [mail # user]
...If you have a lot of data, you might have to write a custom reducer (in python) to keep track of the moving date window.  If you don't have that much data, you might want to use a temp ...
   Author: Igor Tatarinov, 2012-10-11, 06:05
Re: Long running Join Query - Reduce task fails due to failing to report status - Hive - [mail # user]
...Why don't you try splitting the big query into smaller ones?   On Fri, Aug 24, 2012 at 10:20 AM, Tim Havens  wrote:  te: e s 000 rows: used memory = 408582240 000 rows: used m...
   Author: Igor Tatarinov, 2012-08-24, 17:44
Re: Changing table types from managed to external - Hive - [mail # user]
...Try ALTER TABLE  SET TBLPROPERTIES('EXTERNAL'='TRUE');  It worked for me.  igor decide.com  On Mon, Aug 6, 2012 at 11:08 PM, Babe Ruth wrote:  ...
   Author: Igor Tatarinov, 2012-08-07, 06:26
Re: schema of hive database - Hive - [mail # user]
...Try describe formatted   igor decide.com  On Thu, Aug 2, 2012 at 12:04 PM, Anson Abraham wrote:  ...
   Author: Igor Tatarinov, 2012-08-02, 19:13
Nested SELECT DISTINCT runs out of memory - Hive - [mail # user]
...I have this query that consistently fails with out-of-memory errors. I know it can be re-written without a nested subquery (using count distinct) and then it runs fine.  Why does this q...
   Author: Igor Tatarinov, 2012-07-26, 18:40
Sort:
project
Hive (44)
type
mail # user (44)
date
last 7 days (0)
last 30 days (0)
last 90 days (3)
last 6 months (4)
last 9 months (44)
author
Namit Jain (802)
Edward Capriolo (631)
Zheng Shao (613)
Carl Steinbach (604)
John Sichi (297)
Mark Grover (269)
Ning Zhang (255)
Ashutosh Chauhan (250)
Nitin Pawar (220)
Ashish Thusoo (169)
Kevin Wilfong (161)
He Yongqiang (156)
Prasad Chakka (152)
Bejoy Ks (132)
Navis (128)
Igor Tatarinov