Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
Hive >> mail # user >> Hive sample test


+
Kyle B 2013-03-05, 18:45
Copy link to this message
-
RE: Hive sample test
Using the Hive sampling feature would also help. This is exactly what that feature is designed for.

Chuck
From: Kyle B [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, March 05, 2013 1:45 PM
To: [EMAIL PROTECTED]
Subject: Hive sample test
Hello,

I was wondering if there is a way to quick-verify a Hive query before it is run against a big dataset? The tables I am querying against have millions of records, and I'd like to verify my Hive query before I run it against all records.

Is there a way to test the query against a small subset of the data, without going into full MapReduce? As silly as this sounds, is there a way to MapReduce without the overhead of MapReduce? That way I can check my query is doing what I want before I run it against all records.

Thanks,

-Kyle
+
Joey DAntoni 2013-03-05, 18:48
+
Dean Wampler 2013-03-05, 18:57
+
Mark Grover 2013-03-05, 19:26
+
Dean Wampler 2013-03-05, 19:44
+
Ramki Palle 2013-03-08, 11:30
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB