Search results 111 to 118 of 118
Re: How to make the query compiler not determine the number of reducers? - Hive - [mail # user]
...Hi Thomas, Hive automatically sets the number of reducers for you, but you can easily override it at the CLI. Before executing your query: hive> SET mapred.red...
   Author: Bejoy KS, 2012-04-28, 05:43
Re: external table on flume log files in S3 - Hive - [mail # user]
...Hi Soren, if you can collect or order the log files into date-based subdirectories in S3, then you can partition the table by date. With partitions you can query a subset...
   Author: Bejoy KS, 2012-04-24, 14:30
Re: When/how to use partitions and buckets usefully? - Hive - [mail # user]
...If the data is in HDFS, then you can bucket it only after loading it into a temp/staging table and then into the final bucketed table. Bucketing needs a MapReduce job. Regards Bejoy KS ...
   Author: Bejoy KS, 2012-04-23, 16:39
Re: When/how to use partitions and buckets usefully? - Hive - [mail # user]
...For a bucketed map join, both tables should be bucketed and the number of buckets of one should be a multiple of the other. Regards Bejoy KS. Sent from handheld, please excuse typos...
   Author: Bejoy KS, 2012-04-23, 16:05
Re: When/how to use partitions and buckets usefully? - Hive - [mail # user]
...A bucketed map join would be good, I guess. What is the total size of the smaller table, and what is its expected size in the next few years? The size should be good enough to be put in Di...
   Author: Bejoy KS, 2012-04-23, 16:03
Re: When/how to use partitions and buckets usefully? - Hive - [mail # user]
...Partitions are good when you want to run your queries on a subset of the whole data, so the choice of partition column depends on your queries. But one point to take care of is that every partition ha...
   Author: Bejoy KS, 2012-04-23, 15:31
Re: Postgres JDBC + dboutput UDF to export from Hive to remote Postgres - Hive - [mail # user]
...Hi Abshiek, to transfer data between an RDBMS and Hadoop, Sqoop is the preferred and recommended option. Once you have the process done in Hive, the output data can be e...
   Author: Bejoy KS, 2012-03-29, 10:12
Re: Zero Byte file in HDFS - Hive - [mail # user]
...Hi Abshiek, merging happens as the last stage of Hive jobs. Say your Hive query is translated to n MR jobs; when you enable merge, you can set a size that is needed to merge (usually block...
   Author: Bejoy KS, 2012-03-26, 21:51