I recommend trying different values with parquet-cli. It's an easy way to
see how different row group and page sizes perform, and it's how I tune all
of our tables.
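
If you'd rather script the comparison, here is a rough sketch of the same idea. It uses pyarrow rather than parquet-cli, so it is a stand-in for the workflow above and not what I run myself. The helper names (`write_with_pyarrow`, `sweep`), the synthetic table, and the candidate sizes are all made up for illustration, and it assumes pyarrow is installed. Note that pyarrow's `row_group_size` is a row count, not a byte size.

```python
import itertools
import os
import tempfile
import time


def write_with_pyarrow(path, row_group_rows, page_bytes):
    """Hypothetical writer: dumps a small synthetic table with pyarrow."""
    # Imported here so the sweep helper below works without pyarrow.
    import pyarrow as pa
    import pyarrow.parquet as pq

    n = 200_000
    # Synthetic data; replace with a sample of the real table being tuned.
    table = pa.table({
        "id": list(range(n)),
        "category": [f"cat-{i % 50}" for i in range(n)],
    })
    pq.write_table(
        table,
        path,
        row_group_size=row_group_rows,  # rows per row group (not bytes)
        data_page_size=page_bytes,      # target data page size in bytes
    )


def sweep(write_fn, row_group_sizes, page_sizes):
    """Call write_fn for every size combination; record file size and write time."""
    results = []
    with tempfile.TemporaryDirectory() as tmp:
        for rg, page in itertools.product(row_group_sizes, page_sizes):
            path = os.path.join(tmp, f"rg{rg}_page{page}.parquet")
            start = time.perf_counter()
            write_fn(path, rg, page)
            elapsed = time.perf_counter() - start
            results.append({
                "row_group": rg,
                "page": page,
                "file_bytes": os.path.getsize(path),
                "write_seconds": elapsed,
            })
    return results
```

Calling `sweep(write_with_pyarrow, [10_000, 100_000], [64 * 1024, 1024 * 1024])` returns one record per combination. This only measures file size and write time; for real tuning you would also time the reads your queries actually do, on a sample of your own data.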


On Fri, Jan 12, 2018 at 10:43 AM, ALeX Wang <[EMAIL PROTECTED]> wrote:

Ryan Blue
Software Engineer