Hi everyone,

Currently I am working on the implementation of the Parquet page index for
(design doc is here if you are interested:

During our discussions it came up that DataPageHeaderV2 states that page
boundaries are also record boundaries:


DataPageHeader (V1) doesn't have this statement, which means that in theory
it allows records to span multiple pages. Is this really the case, or is it
something that is missing from the specification?

I ask because filtering pages based on the page index is much simpler if
page boundaries are record boundaries as well.
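
To illustrate why this matters, here is a rough sketch (not the actual
implementation). It assumes every page starts on a record boundary, so each
page can be described by the index of its first record. The function name
and the list of first-row indexes are hypothetical, made up for this example.
Under that assumption, mapping a range of matching records to the pages of
another column is a simple binary search:

```python
from bisect import bisect_right


def pages_for_row_range(first_row_indexes, row_start, row_end):
    """Return the indexes of pages overlapping records [row_start, row_end).

    first_row_indexes[i] is the index of the first record in page i
    (hypothetical input). This is only well defined if every page
    starts on a record boundary.
    """
    if row_start >= row_end:
        return []
    # Last page whose first record is <= row_start.
    first = bisect_right(first_row_indexes, row_start) - 1
    # Last page whose first record is <= the last requested record.
    last = bisect_right(first_row_indexes, row_end - 1) - 1
    return list(range(max(first, 0), last + 1))


# Made-up layout: pages of two columns in the same row group.
col_a = [0, 100, 250]          # column A pages start at these records
col_b = [0, 60, 120, 180, 240]  # column B pages start at these records

# Suppose only page 1 of column A (records 100..249) can match the filter.
print(pages_for_row_range(col_b, 100, 250))  # [1, 2, 3, 4]
```

If a record could span two pages, a page would have no single first record
index, and this kind of lookup would need extra handling at every page edge.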
