Re: Best Practice: lookup table - Pig - [mail # user]
...Hi Markus,  I would start with a "replicated" join:  join InputTable by BrowserId, BrowserLookup by Id USING 'replicated';  The idea is to perform a map-side join by loading t...
   Author: Stan Rosenberg, 2012-03-27, 15:21
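
A rough sketch of what a map-side ("replicated") join does conceptually: the small lookup relation is held in memory and each row of the large input is matched against it as it streams past, so no reduce phase is needed. This is plain Java for illustration, not Pig's implementation; the class name, the method name, and the two-column row layout are assumptions.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

class ReplicatedJoinSketch {
    // Each input row is assumed to be {browserId, payload}; the lookup maps Id -> browser name.
    static List<String> join(List<String[]> inputRows, Map<Integer, String> browserLookup) {
        List<String> joined = new ArrayList<>();
        for (String[] row : inputRows) {
            int browserId = Integer.parseInt(row[0]);
            // The lookup lives entirely in memory, which is why it must be the small relation.
            String browserName = browserLookup.get(browserId);
            if (browserName != null) { // inner join: rows without a match are dropped
                joined.add(row[1] + "\t" + browserName);
            }
        }
        return joined;
    }
}
```

The trade-off is memory: the approach only works while the lookup table fits comfortably in each map task's heap.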
Re: Trying to store a bag of tuples using AvroStorage. - Pig - [mail # user]
...Hi Dan,  Could you attach your script and sample input files for both cases (with and without the schema).  In the case where no schema is provided, the stack trace shows that the ...
   Author: Stan Rosenberg, 2012-03-26, 15:11
Re: What should storefuncs do on parse errors while reading? - Pig - [mail # user]
...I typically increment a counter and have a bounded log of randomly sampled erroneous data.  stan  ...
   Author: Stan Rosenberg, 2012-03-25, 14:44
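
One way to read "a counter plus a bounded log of randomly sampled erroneous data" is reservoir sampling: every bad record is counted, but only a fixed number are retained, each with equal probability. The sketch below is an assumption about how that could look, not the author's code; the class and method names are hypothetical, and in a real loader the count would presumably be reported through a Hadoop counter rather than a field.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

class BadRecordSampler {
    private final int capacity;          // upper bound on retained bad records
    private final Random random;
    private final List<String> sample = new ArrayList<>();
    private long errorCount = 0;         // total bad records seen so far

    BadRecordSampler(int capacity, long seed) {
        this.capacity = capacity;
        this.random = new Random(seed);
    }

    // Call once per record that fails to parse.
    void record(String badLine) {
        errorCount++;
        if (sample.size() < capacity) {
            sample.add(badLine);         // reservoir not yet full: always keep
        } else {
            // Pick a slot uniformly in [0, errorCount); keep the record only if it lands in the reservoir.
            long slot = (long) (random.nextDouble() * errorCount);
            if (slot < capacity) {
                sample.set((int) slot, badLine);
            }
        }
    }

    long errorCount() { return errorCount; }

    List<String> sample() { return sample; }
}
```

Memory stays bounded by `capacity` no matter how many records fail, while the counter still reflects the true error total.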
Re: Globbing several AVRO files with different (extended) schemes - Pig - [mail # user]
...There is a patch for AvroStorage which computes a union schema thereby allowing input avro files having different schemas, specifically (un-nested) records with different fields.  https...
   Author: Stan Rosenberg, 2012-03-22, 02:13
Re: config/reference data files for UDFS - Pig - [mail # user]
...Hi Alan,  I am also curious to see how the distributed cache is used in a UDF. However, the code you reference in the patch doesn't appear to contain such an example.  What is the ...
   Author: Stan Rosenberg, 2012-03-13, 20:55
Re: Understanding LoadFunc sequence - Pig - [mail # user]
...Hi Bill,  I've used the following in my UDFs:  public static boolean isBackend(JobContext ctx) { // HACK borrowed from HCatLoader: this property should only be set on the backend...
   Author: Stan Rosenberg, 2012-02-04, 04:04
Re: Passing schema inside Load function - Pig - [mail # user]
...Hi Praveenesh,  Maybe this will get you started.  Suppose we have the desired schema parsed and stored in 'map' of type LinkedHashMap.  The key is your field name, and the val...
   Author: Stan Rosenberg, 2012-02-04, 02:40
Re: Pig/Avro Question - Pig - [mail # user]
...Check the code in PigAvroInputFormat; it overrides 'listStatus' from FileInputFormat so that files not ending in .avro are filtered.  stan  On Fri, Feb 3, 2012 at 1:58 PM, Russell ...
   Author: Stan Rosenberg, 2012-02-03, 21:17
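
The filtering described above amounts to a suffix check on each candidate file. The following stand-in shows only that check on plain path strings so it runs without Hadoop on the classpath; it is not the PigAvroInputFormat source, and the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

class AvroSuffixFilterSketch {
    // Keeps only paths ending in ".avro", mirroring the effect described for listStatus.
    static List<String> keepAvroFiles(List<String> paths) {
        List<String> kept = new ArrayList<>();
        for (String path : paths) {
            if (path.endsWith(".avro")) {
                kept.add(path);
            }
        }
        return kept;
    }
}
```

A practical consequence is that input files lacking the `.avro` extension are silently skipped rather than reported as errors.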
Re: explode operation - Pig - [mail # user]
...On Mon, Jan 30, 2012 at 2:25 AM, Aniket Mokashi  wrote:  Not quite. EXPLODE would take a record with n fields and generate n records....
   Author: Stan Rosenberg, 2012-01-30, 16:05
Re: Multiple files with AvroStorage and comma separated lists - Pig - [mail # user]
...Hi Guys,  Patch finally submitted: https://issues.apache.org/jira/browse/PIG-2492  Best,  stan  P.S. I classified it as an "improvement" rather than a "bug" since I don't...
   Author: Stan Rosenberg, 2012-01-25, 19:05
