Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 11 (0.141s).
Loading phrases to help you
refine your search...
Re: any suggestions on IIS log storage and analysis? - MapReduce - [mail # user]
...You can run a series of map-reduce jobs on your data, if some log line is related to another line, e.g. based on sessionId, you can emit the sessionId as the key of your mapper output with t...
   Author: Peyman Mohajerian, 2014-01-01, 01:40
Re: Migrating from Legacy to Hadoop. - MapReduce - [mail # user]
...I wonder if JDBC driver over Hive could help you. If you legacy ETL job can talk to a jdbc driver, it is a slow way of writing to HDFS and I don't have any experience doing it, e.g.: http://...
   Author: Peyman Mohajerian, 2013-10-09, 02:16
Re: File formats in Hadoop: Sequence files vs AVRO vs RC vs ORC - MapReduce - [mail # user]
...It is not recommended to keep the data at rest in sequences format, because it is Java specific and you cannot share it with other none-java systems easily, it is ideal for running map/reduc...
   Author: Peyman Mohajerian, 2013-09-30, 17:40
Re: Retrieve and compute input splits - MapReduce - [mail # user]
...For the JobClient to compute the input splits doesn't it need to contact Name Node. Only Name Node knows where the splits are, how can it compute it without that additional call?   On F...
   Author: Peyman Mohajerian, 2013-09-27, 23:02
Re: issue about invisible data in haoop file - MapReduce - [mail # user]
...In my experience with Flume and this issue, it occurs when the file is not properly closed. If it was then it would show you the correct size and Hive will read the content.   On Wed, S...
   Author: Peyman Mohajerian, 2013-09-25, 15:08
Re: Oozie dynamic action - MapReduce - [mail # user]
...If you want to see a simple example of what you are looking for: https://github.com/cloudera/cdh-twitter-example It is part of this article: http://blog.cloudera.com/blog/2012/09/analyzing-t...
   Author: Peyman Mohajerian, 2013-09-17, 21:43
Re: Hdfs questions - MapReduce - [mail # user]
...In Amazon the best approach and I think cheapest is to first copy to s3, there is a command in EMR to facilitate that, if you aren't using EMR you may still be able to install it.   On ...
   Author: Peyman Mohajerian, 2013-09-10, 16:19
Re: Hadoop Clients (Hive,Pig) and Hadoop Cluster - MapReduce - [mail # user]
...Regarding Sqoop, you can install it wherever you would have access to your database and HDFS cluster, you could e.g. install it on the namenode if you want it as long as it has access to the...
   Author: Peyman Mohajerian, 2013-08-29, 22:28
Re: Mapreduce jobs to download job input from across the internet - MapReduce - [mail # user]
...Apache Flume may help you for this use case. I read an article on Cloudera's site about using Flume to pull tweets and same idea may apply here.   On Tue, Apr 16, 2013 at 9:26 PM, David...
   Author: Peyman Mohajerian, 2013-04-17, 16:41
Re: Input path with no Output path - MapReduce - [mail # user]
...I think this does it: http://hadoop.apache.org/docs/r0.20.1/api/org/apache/hadoop/mapreduce/lib/output/NullOutputFormat.html  On Fri, Dec 7, 2012 at 10:06 AM, Oleg Zhurakousky  wro...
   Author: Peyman Mohajerian, 2012-12-07, 18:21
Hadoop (20)
Hive (14)
MapReduce (11)
HDFS (5)
Flume (1)
mail # user (11)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (11)
Harsh J (454)
Arun C Murthy (326)
Vinod Kumar Vavilapalli (309)
Todd Lipcon (223)
Amar Kamat (181)
Thomas Graves (166)
Jason Lowe (162)
Amareshwari Sriramadasu (152)
Sandy Ryza (124)
Tom White (111)
Siddharth Seth (109)
Aaron Kimball (107)
Owen O'Malley (105)
Alejandro Abdelnur (103)
Devaraj K (103)
Ramya Sunil (103)
Robert Joseph Evans (101)
Hemanth Yamijala (97)
Steve Loughran (90)
Ted Yu (80)
Eli Collins (77)
Ravi Gummadi (76)
Karthik Kambatla (71)
Mahadev konar (67)
Ravi Prakash (66)
Peyman Mohajerian