There has been some work in the Tika  project recently on looking at NetCDF4  and HDF4/5  and extracting metadata/text content from them. Though this doesn't directly apply to your question below, it might be worth perhaps looking at how to marry Tika and Hadoop in that regard.
On 5/3/10 10:36 AM, "Andrew Nguyen" <[EMAIL PROTECTED]> wrote:
Does anyone know of any existing work integrating HDF5 (http://www.hdfgroup.org/HDF5/whatishdf5.html) with Hadoop?
I don't know much about HDF5 but it was recently brought to my attention as a way to store high-density scientific data. Since I've confirmed that having Hadoop dramatically speeds up our analysis, it seems like marrying the two might have some benefits.
I've done some searches on google and it doesn't turn up much.
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: [EMAIL PROTECTED]
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA