Avro >> mail # user >> Synchronization Markers

Synchronization Markers
As I understand it, Avro container files contain synchronization markers
every so often to support splitting the file.  See:

(1) Why isn't the synchronization marker the same for every container
file?  (i.e., what is the point of generating it randomly every time?)

(2) Is it possible, at least in theory, for naturally occurring data to
contain bytes that match the sync marker? If so, would this break