Results 61 to 70 of 75
Re: Loading data from a SQL database? - Pig - [mail # user]
...Hi, In my project, we had to develop our own Loader for that purpose. Thanks On Fri, Aug 10, 2012 at 2:28 PM, Vincent Barat wrote: ... Best Regards,...
   Author: Ruslan Al-Fakikh, 2012-08-10, 13:39
Re: how can I delete a file in pig only after checking if the file exists? - Pig - [mail # user]
...hi Sheng,  Try something like sh bash -c 'if hadoop fs -test -e $LOOKUP_HDFS_TEMP; then echo Deleting old local file lookup; hadoop fs -rm $LOOKUP_HDFS_TEMP; else echo Local file lookup...
   Author: Ruslan Al-Fakikh, 2012-07-23, 12:50
Re: Best Practice: store depending on data content - Pig - [mail # user]
...That is a very interesting offtopic:) I think I will reinvestigate HCatalog some day and come up with specific questions.  Thanks a lot for explaining  On Wed, Jul 4, 2012 at 4:37 ...
   Author: Ruslan Al-Fakikh, 2012-07-05, 15:01
Re: Using average function is really slow - Pig - [mail # user]
...Hi James, AVG is Algebraic, which means that it will use the combiner when it can. It seems that your job is not using the combiner. Can you give the full script? Also check the job config of t...
   Author: Ruslan Al-Fakikh, 2012-07-04, 21:05
Re: Does pig support in clause? - Pig - [mail # user]
...Hi Johannes,  Try this C = LOAD 'in.dat' AS (A1); A = LOAD 'in2.dat' AS (A1);  joined = JOIN A BY A1 LEFT OUTER, C BY A1;  DESCRIBE joined;  newEntries = FILTER joined BY...
   Author: Ruslan Al-Fakikh, 2012-07-04, 13:53
Re: What is the best way to do counting in pig? - Pig - [mail # user]
...Hi, As was said, COUNT is algebraic and should be fast, because it forces the combiner. You should make sure that the combiner is really used here. It can be disabled in some situations. I'...
   Author: Ruslan Al-Fakikh, 2012-07-03, 10:03
Re: suggestion - Pig - [mail # user]
...Hey Yang, For debugging you may want the local mode, try pig -x local. Also there are some useful commands like DESCRIBE and ILLUSTRATE. Ruslan On Fri, Jun 29, 2012 at 7:...
   Author: Ruslan Al-Fakikh, 2012-06-29, 12:02
Re: Unable to open iterator for alias A - Pig - [mail # user]
...Hi, It seems that you are using MapReduce 2.0. Why? As far as I know it is an alpha version. Also an extract from here http://hortonworks.com/blog/new-features-in-apache-pig-0-10/ ...
   Author: Ruslan Al-Fakikh, 2012-06-28, 15:01
Re: Hive error when loading csv data. - Pig - [mail # user]
...Hi, You may try Cloudera's pseudo-distributed mode https://ccp.cloudera.com/display/CDHDOC/CDH3+Deployment+in+Pseudo-Distributed+Mode You may also try Cloudera's demo VM https://ccp.c...
   Author: Ruslan Al-Fakikh, 2012-06-27, 15:38
Re: how can I distinct one field of a relation - Pig - [mail # user]
...Hey Haitao,  I didn't get exactly what your requirement was and your example seems to be incomplete. Here it is:  A is: 1,2,3 1,2,3 4,5,6  What I want is : 1,2,3 4,5,6  W...
   Author: Ruslan Al-Fakikh, 2012-06-27, 14:09
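Note on the "Using average function is really slow" reply above: it says AVG is algebraic and can therefore use the combiner. The sketch below illustrates that idea in Python. It is not Pig's implementation. The function names (initial, intermediate, final, algebraic_avg) are hypothetical and only loosely mirror the three stages of an algebraic function, and the "splits" stand in for the input of separate map tasks.

```python
def initial(value):
    # Map side: turn one input value into a partial (sum, count) pair.
    return (value, 1)


def intermediate(partials):
    # Combiner side: merge partial (sum, count) pairs into one pair.
    # This step is associative, so it can run any number of times.
    total = sum(p[0] for p in partials)
    count = sum(p[1] for p in partials)
    return (total, count)


def final(partials):
    # Reduce side: merge what is left and only now divide.
    total, count = intermediate(partials)
    return total / count if count else None


def algebraic_avg(splits):
    # Each split is combined locally, so only one small pair per split
    # would need to cross the network instead of every raw value.
    combined = [intermediate([initial(v) for v in split]) for split in splits]
    return final(combined)


print(algebraic_avg([[1, 2, 3], [4, 5], [6]]))
```

Carrying (sum, count) pairs instead of per-split averages is what keeps the result exact when splits have different sizes. If the combiner is not used, every raw value is sent to the reducer, which is one plausible reason for the slowness discussed in that thread.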