The thing about ORC is that it is great for tables created from other
tables (like the other columnar formats), but if you are logging directly
to HDFS, a columnar format is not easy (or even possible) to write directly.
Normally people store data in a very direct row-oriented form, and then
their first MapReduce job buckets/partitions/columnarizes it.
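
To make that two-step pattern concrete, here is a rough sketch in plain Python (not Hive or MapReduce code). It assumes tab-separated log lines, and the field names, sample records, and the columnarize helper are all made up for illustration. It only shows the idea of taking rows as they were logged, partitioning them by a key, and pivoting each partition into per-column lists.

```python
from collections import defaultdict


def columnarize(rows, field_names, partition_key):
    """Group row-oriented records by partition_key, then pivot each group into columns."""
    # One dict of column lists per partition value (hypothetical in-memory layout).
    partitions = defaultdict(lambda: {name: [] for name in field_names})
    for line in rows:
        values = line.rstrip("\n").split("\t")
        record = dict(zip(field_names, values))
        columns = partitions[record[partition_key]]
        for name in field_names:
            columns[name].append(record[name])
    return dict(partitions)


# Rows as they might land on HDFS: one append-friendly record per line.
raw_log = [
    "2014-01-27\tGET\t/index.html",
    "2014-01-27\tPOST\t/login",
    "2014-01-28\tGET\t/about.html",
]

result = columnarize(raw_log, ["day", "method", "path"], "day")
```

The row form is trivial to append to one record at a time, whereas the per-column lists can only be built once a whole batch of rows is in hand, which is why the conversion tends to happen in a later job.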
On Mon, Jan 27, 2014 at 2:44 PM, Thilina Gunarathne <[EMAIL PROTECTED]> wrote: