

typical JSON data sets
I would like to hear your experiences working with large JSON data sets, specifically:

1) How large is each JSON document?

2) Do files tend to hold a single JSON document, or several documents per file?

3) Do the JSON schemas change over time?

4) Are there interesting public data sets you would recommend for experimentation?
Thanks
John
