Re: Word count on cluster configuration
Thanks!
On Mon, Apr 1, 2013 at 12:23 PM, Wenming Ye <[EMAIL PROTECTED]> wrote:

>   Because many of the “words” are Unicode, check the following blog post:
>
> http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
>
>  *From:* Varsha Raveendran <[EMAIL PROTECTED]>
> *Sent:* Sunday, March 31, 2013 11:43 PM
> *To:* [EMAIL PROTECTED]
> *Subject:* Word count on cluster configuration
>
>   Hello!
>
> I set up a cluster configuration of Hadoop. After running the
> word count example, the output in the part-r-00000 file is as follows:
>
> hduser@MT2012158:/usr/local/hadoop$ head
> /tmp/gutenberg-output/gutenberg-output
>     40
>     2
>     4
> ��� � � � �@��    2
> ��� � � � �@�@��    1
> ���� � � � �@�@��    1
> P�������� j l k m �������� g��������������������EXTH � j 2004-01-01d
> Leonardo    1
> P�������� � � � � �������� ���������������������EXTH � t    1
> �P�������� � � � ������������ � � � ���������EXTH � j 2004-01-01d
> Leonardo    1
> �P�������� � � � ������������ � � � � �����EXTH � t    1
>
>
>
> Can you please tell me why this is happening?
>
>
>
>
> --
> *-Varsha *
>

--
*-Varsha *
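
Output like the listing quoted above is typical when word count is fed bytes that are not plain text. The thread attributes it to Unicode "words"; the EXTH marker in the output may also point to an e-book container file rather than a .txt file, though that is an inference and not something the thread confirms. A minimal sketch of a pre-check one could run on each input file before submitting the job, assuming UTF-8 plain text is expected (the function names and the 1% threshold are hypothetical, not part of Hadoop):

```python
import re
from collections import Counter


def looks_like_text(data: bytes, max_control_ratio: float = 0.01) -> bool:
    """Heuristic check: data decodes as UTF-8 and contains few control characters.

    The 1% default threshold is an arbitrary assumption for illustration.
    """
    if not data:
        return True
    try:
        text = data.decode("utf-8")
    except UnicodeDecodeError:
        # Invalid UTF-8 byte sequences: most likely a binary file.
        return False
    # Count control characters other than tab, newline, and carriage return.
    control = sum(1 for ch in text if ord(ch) < 32 and ch not in "\t\n\r")
    return control / len(text) <= max_control_ratio


def word_count(data: bytes) -> Counter:
    """Count lowercase word tokens in UTF-8 encoded text."""
    text = data.decode("utf-8")
    return Counter(re.findall(r"\w+", text.lower()))


if __name__ == "__main__":
    sample = b"Leonardo da Vinci\nleonardo\n"
    if looks_like_text(sample):
        print(word_count(sample).most_common(3))
    else:
        print("input does not look like plain text; skipping")
```

Files that fail the check can be left out of the input directory or converted to plain text first.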