Hive >> mail # user >> question..


RE: question..
I think the query did finish properly. Can you recheck the data to see if you would really get a few rows of output?
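
One way to recheck, as a rough sketch: the two counts below reuse only the table and column names from the query in the original message (callbacks, pages, page, id) and assume nothing else about the schema. The first shows whether any pages rows match the patterns at all; the second shows whether any of those rows actually join to callbacks. If either count comes back as 0, an empty result from the full query would be expected rather than a sign of failure.

```sql
-- Step 1: how many rows in pages match any of the search-engine patterns?
select count(1)
    from pages p
    where p.page like '%google.com/search%'
       or p.page like '%google.com/custom%'
       or p.page like '%google.com/#hl%'
       or p.page like '%google.com/cse%'
       or p.page like '%search.yahoo.com/search%'
       or p.page like '%bing.com/search%'
       or p.page like '%google.com/product%';

-- Step 2: how many callbacks rows join to that filtered set on id?
select count(1)
    from callbacks cb JOIN
      (select p.id from pages p where
               p.page like '%google.com/search%'
            or p.page like '%google.com/custom%'
            or p.page like '%google.com/#hl%'
            or p.page like '%google.com/cse%'
            or p.page like '%search.yahoo.com/search%'
            or p.page like '%bing.com/search%'
            or p.page like '%google.com/product%' ) s
    ON s.id = cb.id;
```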

Ashish

________________________________
From: Ronak Bhatt [mailto:[EMAIL PROTECTED]]
Sent: Friday, August 20, 2010 12:12 PM
To: [EMAIL PROTECTED]
Subject: question..

Hi - in my Hive environment, I ran the following query and expected to see rows (the data is present). However, after 2339 seconds, the output I get is shown below (I've pasted the last 5~10 lines of screen output).

Is there anything that I'm missing? Did the process finish correctly? Is there something that could point me to how to debug this?

----------------------------- Query ------------------------------

select substr(CB.EXEC_DATE,1,10), count(CB.ID)
    from callbacks CB JOIN
      (select * from pages p where
               p.page like '%google.com/search%'
            or p.page like '%google.com/custom%'
            or p.page like '%google.com/#hl%'
            or p.page like '%google.com/cse%'
            or p.page like '%search.yahoo.com/search%'
            or p.page like '%bing.com/search%'
            or p.page like '%google.com/product%' ) s
ON s.id = cb.id
    group by substr(CB.EXEC_DATE,1,10);
================= o/p on screen =================
10/08/20 11:13:38 INFO mapred.TaskRunner: Task 'attempt_local_0001_r_000000_0' done.
2010-08-20 11:13:39,526 map = 100%,  reduce =100%
10/08/20 11:13:39 INFO exec.ExecDriver: 2010-08-20 11:13:39,526 map = 100%,  reduce =100%
Ended Job = job_local_0001
10/08/20 11:13:39 INFO exec.ExecDriver: Ended Job = job_local_0001
10/08/20 11:13:39 INFO exec.FileSinkOperator: Moving tmp dir: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001 to: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate
10/08/20 11:13:39 INFO exec.FileSinkOperator: Moving tmp dir: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/_tmp.10001.intermediate to: hdfs://hdp01.billeo.com:54310/tmp/hive-hadoop/234840696/10001

OK
Time taken: 2339.331 seconds
====================================================

thanks, ronak

408 504 4847
My Blog : http://ronakbaps.posterous.com
