Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop >> mail # user >> Creating Sequence File in C++

Copy link to this message
Creating Sequence File in C++

Let my Key-Value be something like BinaryWritables (my own class, but
something like this).  Is there a way to create the Sequence File
composed of several such key - values, without using Java?


I create objects using protocol buffers, my key and values are
serialized versions of these protocol buffer messages. These hadoop k-v
pairs that are exchanged in the mapreduce (and stored in both output and
input) are the serialized versions of these.

I would like to directly create sequence files using C++
and was curious if there is way to do this outside Java (and not have to
use JNI), as currently, its best to use a mapreduce job to convert my
textfiles to sequence files.

Thank you