Results from 11 to 20 from 78 (0.079s).
Re: ClassCastException: org.apache.pig.data.DataByteArray cannot be cast to java.lang.Number - Pig - [mail # user]
...One possibility off the top of my head is that the delimiter might be wrong. Can you try specifying the correct delimiter to PigStorage. E.g. For CSV files A = LOAD 'file_A' USING PigStorage(',...
   Author: Pradeep Gollakota, 2014-04-24, 21:32
Re: Number of map task - Pig - [mail # user]
...Pig is a little too smart when dealing with data. It has a feature called split combination. If you set it to false, you should see more mappers. SET pig.noSplitCombination true; On Tue, Apr 22...
   Author: Pradeep Gollakota, 2014-04-22, 20:01
Re: Strange CROSS behavior - Pig - [mail # user]
...What is the storage func you're using? My guess is that there is some shared state in the Storage func. Take a look at this SO that is dealing with shared state in Stores. http://stackoverflow....
   Author: Pradeep Gollakota, 2014-04-18, 21:29
Re: Pig script : Need help - Pig - [mail # user]
...That is because you're calling REPLACE on a bag of tuples and not a string. What you would want to do is write a UDF (suggested name JOIN_ON), that takes as an argument a join char and will jo...
   Author: Pradeep Gollakota, 2014-04-07, 20:28
Re: 回复:Re: Any way to join two aliases without using CROSS - Pig - [mail # user]
...Unfortunately, the Enumerate UDF from DataFu would not work in this case. The UDF works on Bags and in this case, we want to enumerate a relation. Implementing RANK is a very tricky thing to d...
   Author: Pradeep Gollakota, 2014-03-26, 04:38
Re: Unable to add file paths when registering a UDF - Pig - [mail # user]
...According to the docs, it should work. http://pig.apache.org/docs/r0.12.0/basic.html#register Stupid question, but is the path correct? Is it on HDFS or local disk? On Tue, Mar 11, 2014 at 8:36...
   Author: Pradeep Gollakota, 2014-03-13, 03:43
Re: one MR job for group-bys and cube-bys - Pig - [mail # user]
...I forgot to mention that there are also other 3rd party libraries that make examining the physical plan easier. For example take a look at Lipstick from Netflix. On Tue, Mar 11, 2014 at 11:41 AM...
   Author: Pradeep Gollakota, 2014-03-11, 18:49
Re: Nested foreach with order by - Pig - [mail # user]
...No... that wouldn't be related since you're not doing a GROUP ALL. The `FLATTEN(MY_UDF(t))` has me a little wary. Something is possibly going wrong in your UDF. The output of your UDF is goin...
   Author: Pradeep Gollakota, 2014-02-28, 00:13
Re: how to control nested CROSS parallelism? - Pig - [mail # user]
...It's strange that it's being executed on the Map-side. The group is a reduce side operation (I'm assuming) and it seems that the nested foreach would happen on Reduce-side after grouping. Ha...
   Author: Pradeep Gollakota, 2014-01-20, 18:27
Re: Spilling issue - Optimize "GROUP BY" - Pig - [mail # user]
...Did you mean to say "timeout" instead of "spill"? Spills don't cause task failures (unless a spill fails). Default timeout for a task is 10 min. It would be very helpful to have a stack trac...
   Author: Pradeep Gollakota, 2014-01-10, 18:23