Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hive >> mail # user >> HIVE and S3 via EMR?


Copy link to this message
-
Re: HIVE and S3 via EMR?
Good catch, Pedro!

Russell: Not sure how you can be using Hive 0.9 on EMR since EMR only supports upto Hive 0.7.1.

Check this for details: http://aws.amazon.com/elasticmapreduce/faqs/#hive-9

Mark

----- Original Message -----
From: "Russell Jurney" <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Wednesday, May 30, 2012 3:52:27 PM
Subject: Re: HIVE and S3 via EMR?
You = Excellent
Russell Jurney
twitter.com/rjurney
[EMAIL PROTECTED]
datasyndrome.com

On May 29, 2012, at 11:06 PM, Pedro Figueiredo < [EMAIL PROTECTED] > wrote:
On 30 May 2012, at 02:17, Russell Jurney wrote:
I've made the bucket - which is derived from the enron emails - available at s3:///rjurney_public_web/from_to_date and a sample is available at http://s3.amazonaws.com/rjurney_public_web/from_to_date/part-m-00004

The problem is that your bucket name contains the '_' character. When Hive (or whatever, really) tries to resolve the hostname rjurney_public_web.s3.amazonaws.com it fails, because '_' is an illegal character in DNS. It's got nothing to do with Hive, or your table definition.
You can have a look at "Rules for bucket naming" in http://docs.amazonwebservices.com/AmazonS3/latest/dev/BucketRestrictions.html
Cheers,
Pedro

Pedro Figueiredo
Skype: pfig.89clouds
http://89clouds.com/ - Big Data Consulting
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB