Results 1 to 10 of 15 (0.175s).
Re: Using PIG with complex SQL Statement - Pig - [mail # user]
...Hi Vineet, Check the UI to see what resources you are using per each of those jobs from the full Pig script (ie, >lynx localhost:9100).  Sometimes the process may not use the entire clu...
   Author: Dan DeCapria, CivicScienc..., 2014-11-05, 14:53
Re: Is pig maddening to work with because it's so slow? - Pig - [mail # user]
...Seconded for PigUnit. As for a faster debugging procedure, I've gone modular. First I JUnit test individual UDFs against their functional requirements and use cases a priori.  Then I mocku...
   Author: Dan DeCapria, CivicScienc..., 2014-05-20, 19:40
Re: Reading json file. - Pig - [mail # user]
...I've had poor experiences getting the default json loaders to work as well.  I would highly recommend writing your own UDF JsonLoader extending LoadFunc over, say, importing twitter's e...
   Author: Dan DeCapria, CivicScienc..., 2013-08-30, 14:28
Re: Getting dimension values for Facts - Pig - [mail # user]
...It seems like your fact table and its corresponding dimension tables follow a traditional data warehousing star topology relational diagram.  I would have to ask what is the purpose/del...
   Author: Dan DeCapria, CivicScienc..., 2013-07-19, 14:33
Unique Self Cross Optimization - Pig - [mail # user]
...I have a data input of aliases and many identifying attributes per each alias. The order of aliases is ~1E8 and for all attributes is ~1E5.  I am attempting to generate a network of ali...
   Author: Dan DeCapria, CivicScienc..., 2013-06-17, 18:59
Re: Pig 0.11.1 on AWS EMR/S3 fails to cleanup failed task output file before retrying that task - Pig - [mail # user]
...Hi Alan,  I believe this is expected behavior wrt EMR and S3.  There cannot exist a duplicate file path in S3 prior to commit; in your case it looks like bucket: n2ygk, path: reduc...
   Author: Dan DeCapria, CivicScienc..., 2013-06-13, 13:57
Re: LEFT OUTER JOIN? - Pig - [mail # user]
...Have a solution which I personally don't like, but it seems to work for now:  T = LOAD 'T.dat' AS (a:chararray, b:chararray, c:chararray, d:chararray, x:chararray, y:chararray); U = LOA...
   Author: Dan DeCapria, CivicScienc..., 2013-04-19, 21:10
Re: Massive ILLUSTRATE - Pig - [mail # user]
...You wouldn't happen to know of a way to make some of those statements more verbose (remove the ellipsis)? In example:  | lyrics     | track_id:bytearray      | ...
   Author: Dan DeCapria, CivicScienc..., 2013-04-10, 21:49
Re: Pig Illustrate more verbose, remove Ellipsis - Pig - [mail # user]
...Hi Johnny,  Yes, with ILLUSTRATE I am expecting a small sample of the data to be returned for each LHS statement in my Pig script. ILLUSTRATE renders out the data in fields, regardless ...
   Author: Dan DeCapria, CivicScienc..., 2013-04-09, 18:09
UDF Complex Pig Object to JsonObject - Pig - [mail # user]
...pig Object to a Json Object. The converse operation is also desired.  Use Case 1: DataBag {(a,1.0)}  with Schema b1:bag{t1:tuple(t:chararray,s:double)} return JsonObject {[a,1.0]} ...
   Author: Dan DeCapria, CivicScienc..., 2013-04-02, 19:59