Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
HDFS >> mail # user >> Re: Reading json format input


+
jamal sasha 2013-05-30, 18:43
+
Shahab Yunus 2013-05-30, 18:46
+
jamal sasha 2013-05-30, 20:57
Copy link to this message
-
Reading json format input
Hi,
   I am stuck again. :(
My input data is in hdfs. I am again trying to do wordcount but there is
slight difference.
The data is in json format.
So each line of data is:

{"author":"foo", "text": "hello"}
{"author":"foo123", "text": "hello world"}
{"author":"foo234", "text": "hello this world"}

So I want to do wordcount for text part.
I understand that in mapper, I just have to pass this data as json and
extract "text" and rest of the code is just the same but I am trying to
switch from python to java hadoop.
How do I do this.
Thanks
+
Russell Jurney 2013-05-29, 22:13
+
Michael Segel 2013-05-29, 23:30
+
jamal sasha 2013-05-29, 23:44
+
Rahul Bhattacharjee 2013-05-30, 03:12
+
Rishi Yadav 2013-05-29, 23:43
+
jamal sasha 2013-05-29, 23:45
+
Rishi Yadav 2013-05-30, 00:15