Parag Sarda 2013-02-11, 22:59
Mark Grover 2013-02-12, 04:57
-Re: How to load hive metadata from conf dir
Parag Sarda 2013-02-12, 08:18
Thanks Mark for your reply.
My program is like a workflow management application and it runs on client machine and not on hadoop cluster. I use 'hadoop jar' so that my application has access to DFS and hadoop API. I would also like my application to have access to Hive metadata the same way it has access to DFS. Users can then write the rules for their workflow against hive metadata.
Since users for my application are already using Hive, I need to support hive metadata and I can not ask them to move to Hcatlog.
From: Mark Grover <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>>
Reply-To: "[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>" <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>>
Date: Tuesday, 12 February 2013 10:27 AM
To: "[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>" <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>>
Subject: Re: How to load hive metadata from conf dir
I think your question boils down to:
How does one access Hive metadata from MapReduce jobs?
In the past, when I've had to write MR jobs and needed Hive metadata, I ended up writing a wrapper Hive query that used a custom mapper and reducer by using hive's transform functionality to do the job.
However, if you want to stick to MR job, you seem to be along the right lines.
Also, it seems that HCatalog's (http://incubator.apache.org/hcatalog/docs/r0.4.0/) premise is to make metadata access among Hive, Pig and MR easier. Perhaps, you want to take a look at that and see if that fits your use case?
On Mon, Feb 11, 2013 at 2:59 PM, Parag Sarda <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>> wrote:
Hello Hive Users,
I am writing a program in java which is bundled as JAR and executed using
hadoop jar command. I would like to access hive metadata (read partitions
informations) in this program. I can ask user to set HIVE_CONF_DIR
environment variable before calling my program or ask for any reasonable
parameters to be passed. I do not want to force user to run hive megastore
service if possible to increase reliability of program by avoiding
What is the recommended way to get partitions information? Here is my
1. Make sure my jar is bundled with hive-metastore library.
2. Use HiveMetastoreClient
Is this correct? If yes, how to read the hive configuration from
Thanks in advance,
Nitin Pawar 2013-02-12, 08:25
Parag Sarda 2013-02-12, 16:36
Dean Wampler 2013-02-12, 14:41
Edward Capriolo 2013-02-12, 16:53
Parag Sarda 2013-02-12, 17:31
Dean Wampler 2013-02-12, 16:55