Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 11 to 20 from 58 (0.091s).
Loading phrases to help you
refine your search...
Re: Union in Multi Insert - Hive - [mail # user]
...You are correct on the what I am hoping to do, basically emit two records for every row.  What was interesting was when I just did the union in the from, it didn't see to do a double ta...
   Author: John Omernik, 2013-02-12, 14:59
Union in Multi Insert - Hive - [mail # user]
...I am trying to do a union, group by, and multi insert all at once. I know this convoluted but I what I am trying to do is avoid having to scan through the original table more than once... if...
   Author: John Omernik, 2013-02-12, 03:34
Re: Combine multiple row values based upon a condition. - Hive - [mail # user]
...Yes, I agree with this. If you did a hive transform to say a python script that collected your offsets per doc id and used "distributed by" to ensure that the script you sent the data too ha...
   Author: John Omernik, 2013-02-03, 22:54
Re: Combine multiple row values based upon a condition. - Hive - [mail # user]
...Is there some think akin to a document I'd so we can assure all rows belonging to the same document can be sent to one mapper? On Feb 3, 2013 1:00 PM, "Martijn van Leeuwen"  wrote: &nbs...
   Author: John Omernik, 2013-02-03, 19:07
[HIVE-3979] Provide syntax for unescaped regex on rlike, and other regexp_* functions - Hive - [issue]
...To make hive more intuitive, can we provide a syntax for hive functions that when encompassing a regex, that regex doesn't have to the standard hive double escape? For examplecolumn rlike 'd...
http://issues.apache.org/jira/browse/HIVE-3979    Author: John Omernik, 2013-02-03, 12:15
Re: Combine multiple row values based upon a condition. - Hive - [mail # user]
...Well there are some methods that may work, but I'd have to understand your data and your constraints more. You want to be able to (As it sounds) sort by offset, and then look at the one row,...
   Author: John Omernik, 2013-02-03, 12:05
Re: The dreaded Heap Space Issue on a Transform - Hive - [mail # user]
...I am realizing one of my challenges is that I have quite a few cores and map tasks per node, but (I didn't set it up) I am only running 4 GB per physical core (12) with 18 map slots.  I...
   Author: John Omernik, 2013-01-30, 17:35
Re: The dreaded Heap Space Issue on a Transform - Hive - [mail # user]
...So it's filling up on the emitting stage, so I need to look at the task logs and or my script that's printing to stdout as the likely culprits I am guessing.    On Wed, Jan 30, 201...
   Author: John Omernik, 2013-01-30, 15:54
Re: The dreaded Heap Space Issue on a Transform - Hive - [mail # user]
...So just a follow-up. I am less looking for specific troubleshooting on how to fix my problem, and more looking for a general understanding of heap space usage with Hive.  When I get an ...
   Author: John Omernik, 2013-01-30, 14:28
The dreaded Heap Space Issue on a Transform - Hive - [mail # user]
...I am running a transform script that parses through a bunch of binary data. In 99% of the cases it runs, it runs fine, but on certain files I get a failure (as seen below).  Funny thing...
   Author: John Omernik, 2013-01-29, 13:43
Sort:
project
Hive (58)
type
mail # user (56)
issue (2)
date
last 7 days (2)
last 30 days (5)
last 90 days (8)
last 6 months (33)
last 9 months (58)
author
Namit Jain (802)
Edward Capriolo (631)
Zheng Shao (613)
Carl Steinbach (604)
John Sichi (297)
Mark Grover (269)
Ning Zhang (255)
Ashutosh Chauhan (250)
Nitin Pawar (220)
Ashish Thusoo (169)
Kevin Wilfong (161)
He Yongqiang (156)
Prasad Chakka (152)
Bejoy Ks (132)
Navis (128)
John Omernik