Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> Best Practice: LOAD returns null

Copy link to this message
Re: Best Practice: LOAD returns null
Typically, file pattern globing is very strict and LOADs fail if not all
glob variants are met. This makes sense when you think that someone might
pass a glob path with each of the 24 hours in a day. If one of those hours
doesn't exist you want the LOAD to fail.

On Tue, Apr 10, 2012 at 8:58 AM, Markus Resch <[EMAIL PROTECTED]>wrote:

> Hey everyone,
> I have a new question about how to handle a very common issue the best:
> We have a LOAD statement loading AVRO files using globbing by a given
> regex. By some wired reason this might return null as there is no file
> matching the regex.
> There are two thinkable cases where this can happen:
> On purpose: There is no data gathered in this e.g. time frame.
> On error: some nasty guy deleted a very important look up table for my
> join. Great hint the stuff with the replicated join, btw :).
> Do you have any suggestion about how to handle this?
> Thanks
> Markus
*Note that I'm no longer using my Yahoo! email address. Please email me at
[EMAIL PROTECTED] going forward.*