To add a bit more detail: you basically load the images as the bytearray
type, which Pig won't interpret. Then you can write a UDF that takes
these bytearrays as input. You can choose to work directly on the
bytearray or create image objects out of it, do your processing, and
return a bytearray back to Pig. You can then do other SQL-like
operations on your images (filter, group, etc.) using Pig
primitives, which lets you leverage Hadoop's strengths.
So, in short, the answer to your question is yes.
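
As a rough sketch of what the inside of such a UDF could look like, here
is the image-comparison part in plain Java, with no Pig dependency. The
class ImageDiff and its method names are made up for illustration, and
"fraction of differing pixels" is just one possible way to differentiate
two images. In a real UDF you would presumably extend EvalFunc, pull the
raw bytes out of each DataByteArray field with get(), and pass them to a
helper like this one.

```java
import java.awt.image.BufferedImage;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import javax.imageio.ImageIO;

// Hypothetical helper (not part of Pig): the image logic a UDF could call.
class ImageDiff {

    // Decode encoded image bytes (PNG, JPEG, ...) into an image object.
    static BufferedImage decode(byte[] data) throws IOException {
        BufferedImage img = ImageIO.read(new ByteArrayInputStream(data));
        if (img == null) {
            throw new IOException("unrecognized image format");
        }
        return img;
    }

    // Encode an image as PNG bytes, i.e. what a UDF would hand back to Pig.
    static byte[] encodePng(BufferedImage img) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        if (!ImageIO.write(img, "png", out)) {
            throw new IOException("no PNG writer available");
        }
        return out.toByteArray();
    }

    // Fraction of pixels whose ARGB values differ, between 0.0 and 1.0.
    // Images with different dimensions are treated as fully different.
    static double differingFraction(byte[] left, byte[] right) throws IOException {
        BufferedImage a = decode(left);
        BufferedImage b = decode(right);
        if (a.getWidth() != b.getWidth() || a.getHeight() != b.getHeight()) {
            return 1.0;
        }
        long differing = 0;
        for (int y = 0; y < a.getHeight(); y++) {
            for (int x = 0; x < a.getWidth(); x++) {
                if (a.getRGB(x, y) != b.getRGB(x, y)) {
                    differing++;
                }
            }
        }
        return (double) differing / ((long) a.getWidth() * a.getHeight());
    }

    public static void main(String[] args) throws IOException {
        ImageIO.setUseCache(false); // keep encoding and decoding in memory
        BufferedImage first = new BufferedImage(2, 2, BufferedImage.TYPE_INT_RGB);
        BufferedImage second = new BufferedImage(2, 2, BufferedImage.TYPE_INT_RGB);
        second.setRGB(0, 0, 0xFFFFFF); // change one of the four pixels
        System.out.println(differingFraction(encodePng(first), encodePng(second)));
    }
}
```

Keeping the image logic separate from the Pig wrapper like this also
means you can test it locally before running anything on the cluster.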
On Mon, Jul 26, 2010 at 13:27, Alan Gates <[EMAIL PROTECTED]> wrote:
> Pig itself does not contain image processing primitives. But if you write
> your image processing in a UDF, then Pig can be a great framework for
> dealing with the parallelism, running it on Hadoop, etc.
> On Jul 26, 2010, at 11:56 AM, Ifeanyichukwu Osuji wrote:
>> Hi all,
>> I was wondering if it would be possible to process images on a
>> low level using PIG. I want to be able to write a pig script
>> that can differentiate between two images.