RE: puzzled at the output
Bertrand/Mohamed,

You guys are awesome! Thanks a million. Commenting out the Combiner class in the driver solved the issue.
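
A note on why disabling the combiner helps: when the reducer class is also registered as the combiner, its wrapping logic can run on the map side first and then again in the reduce phase. The sketch below is a plain-Java simulation, not Hadoop code. The class and method names are made up for illustration, and it assumes the combiner ran exactly once (Hadoop may run it zero, one, or several times).

```java
import java.util.Arrays;
import java.util.List;

public class CombinerDoubleWrap {
    // Same idea as the reduce() in this thread: concatenate the values, wrap them once.
    static String wrap(List<String> values) {
        StringBuilder sb = new StringBuilder();
        for (String v : values) {
            sb.append(v);
        }
        return "<tag>" + sb + "</tag>";
    }

    // Simulates a job with no combiner: the reducer sees the raw map output.
    static String withoutCombiner(List<String> mapOutput) {
        return wrap(mapOutput);
    }

    // Simulates a job where the reducer class is also the combiner (assumed to run once):
    // the wrapping happens on the map side, then again in the reducer.
    static String withCombiner(List<String> mapOutput) {
        String combined = wrap(mapOutput);
        return wrap(Arrays.asList(combined));
    }

    public static void main(String[] args) {
        List<String> mapOutput = Arrays.asList("Value1", "Value2");
        System.out.println(withoutCombiner(mapOutput)); // <tag>Value1Value2</tag>
        System.out.println(withCombiner(mapOutput));    // <tag><tag>Value1Value2</tag></tag>
    }
}
```

A combiner is only safe when applying it any number of times leaves the final result unchanged (summing counts, for example). Wrapping values in tags does not have that property, so the wrapping should happen in the reducer only.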

P.S. I have one more small dilemma.
I am trying to create XML from two files. The input for my 3rd MR job is the (Text, Text) output from two MapReduce jobs. I feed my input to MR3 as KeyValueTextInputFormat, and when I simply call out.collect(key, key), the keys are there.
My keys are single-value Text(), but the values are multi-line XML Text().
Can you point me in the right direction on how I can properly bring in these values to combine them into a larger XML? For now the output of the 3rd MR job is a mess of values with the keys trailing somewhere at the bottom.
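
One likely cause, offered as a hedged sketch rather than a diagnosis: KeyValueTextInputFormat reads its input one line at a time and splits each line at the first separator (a tab by default). A line with no separator becomes a key with an empty value. A multi-line XML value written by an earlier job would therefore be read back as several unrelated records. The plain-Java simulation below imitates that line splitting without using Hadoop. The class name, method names, and sample data are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

public class KeyValueLineSplit {
    // Splits one line at the first tab, the way a line-oriented key/value reader would.
    // A line without a tab becomes a key with an empty value.
    static String[] splitLine(String line) {
        int tab = line.indexOf('\t');
        if (tab < 0) {
            return new String[] { line, "" };
        }
        return new String[] { line.substring(0, tab), line.substring(tab + 1) };
    }

    // Treats every line of the file as its own record.
    static List<String[]> readRecords(String fileContent) {
        List<String[]> records = new ArrayList<>();
        for (String line : fileContent.split("\n")) {
            records.add(splitLine(line));
        }
        return records;
    }

    public static void main(String[] args) {
        // Hypothetical output of an earlier job: one key, one multi-line XML value.
        String file = "2459\t<tag>\nValue1\nValue2\n</tag>";
        for (String[] kv : readRecords(file)) {
            System.out.println("key=[" + kv[0] + "] value=[" + kv[1] + "]");
        }
    }
}
```

Only the first line keeps the real key. Every following line of the XML turns into a key of its own. Two directions that may help: keep each value on a single line when the earlier jobs write it, or pass the intermediate data between jobs as SequenceFiles (SequenceFileOutputFormat and SequenceFileInputFormat) so that values are not re-parsed line by line.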

Andy Kartashov
MPAC
Architecture R&D, Co-op
1340 Pickering Parkway, Pickering, L1V 0C4
• Phone : (905) 837 6269
• Mobile: (416) 722 1787
[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>

From: Mohamed Trad [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, October 02, 2012 1:35 PM
To: [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Subject: Re: puzzled at the output

I agree with Bertrand. Try disabling the combiner.

Sent from my iPhone

On Oct 2, 2012, at 19:02, Bertrand Dechoux <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>> wrote:
Combiner? And you are only using 'Text' as type?

Please do a real test with a specified input. We can only guess.

Bertrand
On Tue, Oct 2, 2012 at 6:52 PM, Chris Nauroth <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>> wrote:
Is there also a Mapper?  Is there any chance that logic in the Mapper wrapped the values with the tags too, so that the records were already wrapped when they entered the reducer logic?

Thank you,
--Chris
On Tue, Oct 2, 2012 at 9:01 AM, Kartashov, Andy <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>> wrote:
I want:

Key  <tag>
          Value1
          Value2
        </tag>

I get double tags:
Key  <tag><tag>
          Value1
          Value2
        </tag></tag>

Here is my latest attempt, which also failed in Reduce.

...
public void reduce (.....
....
StringBuilder sb = new StringBuilder();
while (values.hasNext()){
     sb.append(values.next().toString());
    }

output.collect(key, new Text("\n<tag>"+sb.toString()+"</tag>" ));

Kind regards,
-----Original Message-----
From: Harsh J [mailto:[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>]
Sent: Tuesday, October 02, 2012 10:03 AM
To: [EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>
Subject: Re: puzzled at the output

Hi,

Could you clarify your post to show what you expect your code to have actually printed and what it has printed?

On Tue, Oct 2, 2012 at 7:01 PM, Kartashov, Andy <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>> wrote:
> Guys, I have been scratching my head for the past couple of days.  Why
> are my tags duplicated while the content they wrap around, i.e. my
> StringBuilder sb, is not?
>
>
>
>
>
> My Reduce code is:
>
> while (values.hasNext()){
>
>     sb.append(values.next().toString());
>
>    }
>
>     output.collect(key, new Text("\n<aaa>\n"+sb.toString()+"\n</bbbb>\n" ));
>
>
>
> The output:
>
> 2459
>
> <aaa>
>
>
>
> <aaa>
>
>         <value....
>
>                 .....
>
>                 ......
>
> </bbbb>
>
>
>
> </bbbb>
>
>
>
> NOTICE: This e-mail message and any attachments are confidential,
> subject to copyright and may be privileged. Any unauthorized use,
> copying or disclosure is prohibited. If you are not the intended
> recipient, please delete and contact the sender immediately. Please
> consider the environment before printing this e-mail.

Harsh J
Bertrand Dechoux