Pig >> mail # user >> DataFu library


Re: DataFu library
Hi Matt and the rest of the LinkedIn team:

Thanks for open-sourcing this project!

Norbert

On Tue, Sep 27, 2011 at 1:35 PM, Lakshminarayana Motamarri <[EMAIL PROTECTED]> wrote:

> Thank you for sharing the UDFs...
>
> On Tue, Sep 27, 2011 at 10:19 AM, Matthew Hayes <[EMAIL PROTECTED]> wrote:
>
> > Hi Pig users,
> >
> > I wanted to share with you all that we recently open sourced a library we
> > have been developing at LinkedIn.  In it we have collected many of the
> > useful UDFs we have developed for products such as "People You May Know" and
> > "Skills".  There are UDFs for median, quantiles, set operations, bag
> > operations, pagerank, etc.  All the UDFs are pretty well documented and unit
> > tested (also tracking code coverage).  It would be great to get people's
> > feedback on it.  Also if anyone would like to contribute please let us know
> > :)
> >
> > Project page:
> > http://sna-projects.com/datafu/
> >
> > Github:
> > https://github.com/linkedin/datafu
> >
> > Thanks,
> > Matt Hayes
> >
> >
>