Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 40 (0.065s).
Loading phrases to help you
refine your search...
Re: single output file per partition? - Hive - [mail # user]
...Actually, using a temp table doesn't work either. Apparently, a single mapper can read from multiple partitions (and output multiple files). There is no way to force a single mapper per part...
   Author: Igor Tatarinov, 2013-08-21, 20:19
Re: only one mapper - Hive - [mail # user]
...LZO files are combinable so check your max split setting. http://mail-archives.apache.org/mod_mbox/hive-user/201107.mbox/%[EMAIL PROTECTED]%3E  igor decide.com    On Wed, Aug ...
   Author: Igor Tatarinov, 2013-08-21, 17:39
Re: UDAF terminatePartial structure - Hive - [mail # user]
...I found this Cloudera example helpful: http://grepcode.com/file/repository.cloudera.com/content/repositories/releases/org.apache.hadoop.hive/hive-contrib/0.7.0-cdh3u0/org/apache/hadoop/hive/...
   Author: Igor Tatarinov, 2013-07-29, 23:37
Re: Enhancing Query Join to speed up Query - Hive - [mail # user]
...I would expect no difference because of predicate pushdown.  igor decide.com   On Thu, Jun 13, 2013 at 11:31 AM, Naga Vijay  wrote:  ...
   Author: Igor Tatarinov, 2013-06-13, 22:29
string to int conversion (with leading zeros) - Hive - [mail # user]
...Hi all,  I need to convert a string attribute that contains ints with occasional leading zeroes. Unfortunately, I can't simply cast() as Hive will try to parse those numbers as octals. ...
   Author: Igor Tatarinov, 2013-04-11, 18:23
Re: Huge join performance issue - Hive - [mail # user]
...Did you verify that all your available mappers are running (and reducers too)? If you have a small number of partitions with huge files, you might me underutilizing mappers (check that the f...
   Author: Igor Tatarinov, 2013-04-08, 18:39
Re: Need rank() - Hive - [mail # user]
...You are getting the error because you are ORDERing BY rank but rank is not in the top SELECT  Also, DISTRIBUTE BY/SORT BY are done after SELECT so you have to use a subquery: SELECT ......
   Author: Igor Tatarinov, 2013-04-02, 17:56
Re: question about machine learning on Hive - Hive - [mail # user]
...Here is how Twitter does it with Pig: http://www.umiacs.umd.edu/~jimmylin/publications/Lin_Kolcz_SIGMOD2012.pdf  We use a similar approach and I think that Pig, being somewhat lower-lev...
   Author: Igor Tatarinov, 2013-01-17, 21:29
Re: Rolling MAU computation - Hive - [mail # user]
...You just need to put the join condition in the WHERE clause. That way Hive will do a cartesian product followed by a filter.  On Fri, Oct 12, 2012 at 1:02 PM, Tom Hubina  wrote: &n...
   Author: Igor Tatarinov, 2012-10-12, 20:08
Re: Long running Join Query - Reduce task fails due to failing to report status - Hive - [mail # user]
...Why don't you try splitting the big query into smaller ones?   On Fri, Aug 24, 2012 at 10:20 AM, Tim Havens  wrote:  te: e s 000 rows: used memory = 408582240 000 rows: used m...
   Author: Igor Tatarinov, 2012-08-24, 17:44
Hive (40)
mail # user (40)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (40)
Namit Jain (645)
Carl Steinbach (419)
Zheng Shao (382)
Brock Noland (294)
Edward Capriolo (291)
Ashutosh Chauhan (256)
Navis (248)
John Sichi (212)
Gunther Hagleitner (198)
Thejas M Nair (177)
Ning Zhang (170)
Lefty Leverenz (157)
Kevin Wilfong (152)
He Yongqiang (139)
Xuefu Zhang (132)
Igor Tatarinov