Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
MapReduce >> mail # user >> typical JSON data sets

Copy link to this message
typical JSON data sets
I would like to hear your experiences working with large JSON data sets, specifically:

1)      How large is each JSON document?

2)      Do they tend to be a single JSON doc per file, or multiples per file?

3)      Do the JSON schemas change over time?

4)      Are there interesting public data sets you would recommend for experiment?