Re: single output file per partition? - Hive - [mail # user]
...Actually, using a temp table doesn't work either. Apparently, a single mapper can read from multiple partitions (and output multiple files). There is no way to force a single mapper per part...
   Author: Igor Tatarinov, 2013-08-21, 20:19
Re: only one mapper - Hive - [mail # user]
...LZO files are combinable so check your max split setting. http://mail-archives.apache.org/mod_mbox/hive-user/201107.mbox/%[EMAIL PROTECTED]%3E  igor decide.com    On Wed, Aug ...
   Author: Igor Tatarinov, 2013-08-21, 17:39
Re: UDAF terminatePartial structure - Hive - [mail # user]
...I found this Cloudera example helpful: http://grepcode.com/file/repository.cloudera.com/content/repositories/releases/org.apache.hadoop.hive/hive-contrib/0.7.0-cdh3u0/org/apache/hadoop/hive/...
   Author: Igor Tatarinov, 2013-07-29, 23:37
Re: Enhancing Query Join to speed up Query - Hive - [mail # user]
...I would expect no difference because of predicate pushdown.  igor decide.com   On Thu, Jun 13, 2013 at 11:31 AM, Naga Vijay  wrote:  ...
   Author: Igor Tatarinov, 2013-06-13, 22:29
string to int conversion (with leading zeros) - Hive - [mail # user]
...Hi all,  I need to convert a string attribute that contains ints with occasional leading zeroes. Unfortunately, I can't simply cast() as Hive will try to parse those numbers as octals. ...
   Author: Igor Tatarinov, 2013-04-11, 18:23
Re: Huge join performance issue - Hive - [mail # user]
...Did you verify that all your available mappers are running (and reducers too)? If you have a small number of partitions with huge files, you might be underutilizing mappers (check that the f...
   Author: Igor Tatarinov, 2013-04-08, 18:39
Re: Need rank() - Hive - [mail # user]
...You are getting the error because you are ORDERing BY rank but rank is not in the top SELECT.  Also, DISTRIBUTE BY/SORT BY are done after SELECT so you have to use a subquery: SELECT ......
   Author: Igor Tatarinov, 2013-04-02, 17:56
Re: question about machine learning on Hive - Hive - [mail # user]
...Here is how Twitter does it with Pig: http://www.umiacs.umd.edu/~jimmylin/publications/Lin_Kolcz_SIGMOD2012.pdf  We use a similar approach and I think that Pig, being somewhat lower-lev...
   Author: Igor Tatarinov, 2013-01-17, 21:29
Re: Rolling MAU computation - Hive - [mail # user]
...You just need to put the join condition in the WHERE clause. That way Hive will do a cartesian product followed by a filter.  On Fri, Oct 12, 2012 at 1:02 PM, Tom Hubina  wrote: ...
   Author: Igor Tatarinov, 2012-10-12, 20:08
Re: Long running Join Query - Reduce task fails due to failing to report status - Hive - [mail # user]
...Why don't you try splitting the big query into smaller ones?   On Fri, Aug 24, 2012 at 10:20 AM, Tim Havens  wrote:  ...000 rows: used memory = 408582240 ...000 rows: used m...
   Author: Igor Tatarinov, 2012-08-24, 17:44