Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 14 (0.055s).
Loading phrases to help you
refine your search...
Re: Is pig maddening to work with because it's so slow? - Pig - [mail # user]
...Seconded for PigUnit.As for a faster debugging procedure, I've gone modular. First I JUnit testindividual UDFs against their functional requirements and use cases apriori.  Then I mocku...
   Author: Dan DeCapria, CivicScienc..., 2014-05-20, 19:40
Re: Reading json file. - Pig - [mail # user]
...I've had poor experiences getting the default json loaders to work as well.  I would highly recommend writing your own UDF JsonLoader extending LoadFunc over, say, importing twitter's e...
   Author: Dan DeCapria, CivicScienc..., 2013-08-30, 14:28
Re: Getting dimension values for Facts - Pig - [mail # user]
...It seems like your fact table and its corresponding dimension tables follow a traditional data warehousing star topology relational diagram.  I would have to ask what is the purpose/del...
   Author: Dan DeCapria, CivicScienc..., 2013-07-19, 14:33
Unique Self Cross Optimization - Pig - [mail # user]
...I have a data input of aliases and many identifying attributes per each alias. The order of aliases is ~1E8 and for all attributes is ~1E5.  I am attempting to generate a network of ali...
   Author: Dan DeCapria, CivicScienc..., 2013-06-17, 18:59
Re: Pig 0.11.1 on AWS EMR/S3 fails to cleanup failed task output file before retrying that task - Pig - [mail # user]
...Hi Alan,  I believe this is expected behavior wrt EMR and S3.  There cannot exist a duplicate file path in S3 prior to commit; in your case it looks like bucket: n2ygk, path: reduc...
   Author: Dan DeCapria, CivicScienc..., 2013-06-13, 13:57
Re: LEFT OUTER JOIN? - Pig - [mail # user]
...Have a solution which I personally don't like, but it seems to work for now:  T = LOAD 'T.dat' AS (a:chararray, b:chararray, c:chararray, d:chararray, x:chararray, y:chararray); U = LOA...
   Author: Dan DeCapria, CivicScienc..., 2013-04-19, 21:10
Re: Massive ILLUSTRATE - Pig - [mail # user]
...You wouldn't happen to know of a way to make some of those statements more verbose (remove the ellipsis)? In example:  | lyrics     | track_id:bytearray      | ...
   Author: Dan DeCapria, CivicScienc..., 2013-04-10, 21:49
Re: Pig Illustrate more verbose, remove Ellipsis - Pig - [mail # user]
...Hi Johnny,  Yes, with ILLUSTRATE I am expecting a small sample of the data to be returned for each LHS statement in my Pig script. ILLUSTRATE renders out the data in fields, regardless ...
   Author: Dan DeCapria, CivicScienc..., 2013-04-09, 18:09
UDF Complex Pig Object to JsonObject - Pig - [mail # user]
...pig Object to a Json Object. The converse operation is also desired.  Use Case 1: DataBag {(a,1.0)}  with Schema b1:bag{t1:tuple(t:chararray,s:double)} return JsonObject {[a,1.0]} ...
   Author: Dan DeCapria, CivicScienc..., 2013-04-02, 19:59
Utf8StorageConverter Not Handling Empty Tuples Properly - Pig - [mail # user]
...For Pig 0.10.1, I came across a use case for the caster * Utf8StorageConverter.consumeTuple()* method, whereby passing an empty tuple to the caster did not create a valid empty tuple output....
   Author: Dan DeCapria, CivicScienc..., 2013-03-21, 15:57
Sort:
project
Pig (14)
type
mail # user (14)
date
last 7 days (0)
last 30 days (0)
last 90 days (1)
last 6 months (1)
last 9 months (14)
author
Daniel Dai (361)
Dmitriy Ryaboy (346)
Alan Gates (333)
Cheolsoo Park (292)
Jonathan Coveney (237)
Rohini Palaniswamy (175)
Russell Jurney (174)
Bill Graham (131)
Olga Natkovich (130)
Prashant Kommireddi (107)
Aniket Mokashi (87)
Julien Le Dem (84)
Thejas Nair (69)
Thejas M Nair (63)
Mridul Muralidharan (61)
Ashutosh Chauhan (41)
pi song (41)
Gianmarco De Francisci Mo...(38)
"Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Koji Noguchi (34)
Pradeep Gollakota (33)
Jeff Zhang (32)
Santhosh Srinivasan (29)
Dan DeCapria, CivicScienc...
Dan DeCapria, CivicScienc...