Re: HIVE and S3 via EMR?
Good catch, Pedro!

Russell: I'm not sure how you can be using Hive 0.9 on EMR, since EMR only supports up to Hive 0.7.1.

Check this for details: http://aws.amazon.com/elasticmapreduce/faqs/#hive-9


----- Original Message -----
From: "Russell Jurney" <[EMAIL PROTECTED]>
Sent: Wednesday, May 30, 2012 3:52:27 PM
Subject: Re: HIVE and S3 via EMR?
You = Excellent
Russell Jurney

On May 29, 2012, at 11:06 PM, Pedro Figueiredo < [EMAIL PROTECTED] > wrote:
On 30 May 2012, at 02:17, Russell Jurney wrote:
I've made the bucket - which is derived from the Enron emails - available at s3:///rjurney_public_web/from_to_date and a sample is available at http://s3.amazonaws.com/rjurney_public_web/from_to_date/part-m-00004

The problem is that your bucket name contains the '_' character. When Hive (or anything else, really) tries to resolve the hostname rjurney_public_web.s3.amazonaws.com, it fails, because '_' is not a legal character in DNS hostnames. It has nothing to do with Hive or your table definition.
You can have a look at "Rules for bucket naming" in http://docs.amazonwebservices.com/AmazonS3/latest/dev/BucketRestrictions.html
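
To make the point concrete, here is a rough Python sketch of the kind of check that would have flagged this bucket name before it reached Hive. It is not the official S3 validation: it only assumes the commonly documented constraints for DNS-compatible names (3 to 63 characters, dot-separated labels made of lowercase letters, digits and hyphens, each starting and ending with a letter or digit). The function names and the hyphenated replacement name are made up for illustration.

```python
import re

# One DNS hostname label: lowercase letters, digits and hyphens,
# beginning and ending with a letter or digit, at most 63 characters.
# Assumption: this mirrors the DNS-compatible subset of bucket names,
# not every rule S3 may enforce.
_LABEL = re.compile(r"^[a-z0-9]([a-z0-9-]{0,61}[a-z0-9])?$")


def is_dns_safe_bucket_name(name):
    """Return True if `name` can be used as a hostname prefix."""
    if not 3 <= len(name) <= 63:
        return False
    # Every dot-separated label must be a valid hostname label.
    return all(_LABEL.match(label) for label in name.split("."))


def virtual_hosted_host(name):
    """Build the hostname a client would try to resolve for this bucket."""
    return name + ".s3.amazonaws.com"


for bucket in ("rjurney_public_web", "rjurney-public-web"):  # second name is hypothetical
    print(virtual_hosted_host(bucket), is_dns_safe_bucket_name(bucket))
```

The underscore version fails the check while the hyphenated one passes, which matches the hostname resolution failure described above.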

Pedro Figueiredo
Skype: pfig.89clouds
http://89clouds.com/ - Big Data Consulting