Avro >> mail # dev >> HUG talk on PTD/Avro

HUG talk on PTD/Avro
Hi all,

Just wrote a blog post about the talk I gave on Wed night at the  
Hadoop Bay Area user group meetup:


Key points about Avro:

1. The Avro scheme for Cascading worked well for writing out fetch  
results, and we are using it in the example analysis code to read the  
same files for processing.

2. A sample Avro file (one of 613, from the first loop) is available on S3
(/bixolabs-ptd-demo/ptd-sample.avro), and we're working with Amazon to
get this initial set into the Amazon public dataset.

3. It would be great to get feedback on both the Avro Cascading scheme  
(http://github.com/bixolabs/cascading.avro) and the content we're  
currently saving in the Avro file.
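
For anyone who wants to see what is stored in the sample file before sending feedback, here is a rough sketch that pulls the embedded schema out of an Avro file header. It uses only the Python standard library and follows the object container file layout from the Avro spec (four magic bytes, then a metadata map that includes avro.schema). The function names and the local file path are made up for illustration and are not part of cascading.avro; in practice the Avro library's own DataFileReader is the better tool.

```python
import io
import json

MAGIC = b"Obj\x01"  # magic bytes at the start of an Avro object container file


def read_long(stream):
    """Decode one Avro long (zigzag-encoded, variable-length integer)."""
    shift = 0
    accum = 0
    while True:
        raw = stream.read(1)
        if not raw:
            raise EOFError("unexpected end of data while reading a long")
        byte = raw[0]
        accum |= (byte & 0x7F) << shift
        if not byte & 0x80:  # high bit clear marks the last byte
            break
        shift += 7
    return (accum >> 1) ^ -(accum & 1)  # undo zigzag encoding


def read_bytes(stream):
    """Decode Avro bytes or string data: a long length, then that many bytes."""
    length = read_long(stream)
    data = stream.read(length)
    if len(data) != length:
        raise EOFError("unexpected end of data while reading bytes")
    return data


def read_header_metadata(stream):
    """Return the header metadata map (str -> bytes) of an Avro container file."""
    if stream.read(4) != MAGIC:
        raise ValueError("not an Avro object container file")
    meta = {}
    while True:
        count = read_long(stream)
        if count == 0:  # a zero count ends the map
            break
        if count < 0:  # negative count: a block byte size follows
            read_long(stream)
            count = -count
        for _ in range(count):
            key = read_bytes(stream).decode("utf-8")
            meta[key] = read_bytes(stream)
    return meta


def print_schema(path):
    """Pretty-print the schema of a local Avro file (path is hypothetical)."""
    with open(path, "rb") as f:
        meta = read_header_metadata(f)
    print(json.dumps(json.loads(meta["avro.schema"]), indent=2))
```

After downloading the sample, calling print_schema("ptd-sample.avro") on the local copy should print the record schema, which is probably the easiest starting point for comments on the fields we're saving.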


-- Ken

Ken Krugler
+1 530-210-6378
e l a s t i c   w e b   m i n i n g