Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 91 to 100 from 24985 (0.225s).
Loading phrases to help you
refine your search...
GROUP BY Issue - Pig - [mail # user]
...Hi,  On running the following query I am getting multiple records with same value of F1  SELECT F1, COUNT(*) FROM ( SELECT F1, F2, COUNT(*) FROM TABLE1 GROUP BY F1, F2 ) a GROUP BY...
   Author: Gourav Sengupta, 2013-06-10, 14:39
Re: save several 64MB files in Pig Latin - Pig - [mail # user]
...The hadoop wiki give a brief explanation : http://wiki.apache.org/hadoop/HowManyMapsAndReduces  The logic is indeed the same for Pig because, under the hood, Pig will generated and opti...
   Author: Bertrand Dechoux, 2013-06-10, 11:42
Re: save several 64MB files in Pig Latin - Pig - [mail # user]
...Hi Pedro,  Yes, Pig Latin is always compiled to MapReduce. Usually you don't have to specify the number of mappers (I am not sure whether you really can). If you have a file of 500MB an...
   Author: Ruslan Al-Fakikh, 2013-06-10, 11:34
Re: save several 64MB files in Pig Latin - Pig - [mail # user]
...Yes, I understand the previous answers now. The reason of my question is because I was trying to "split" a file with pig latin by loading the file and writing portions of the file again in H...
   Author: Pedro Sá da Costa, 2013-06-10, 09:36
Re: save several 64MB files in Pig Latin - Pig - [mail # user]
...I wasn't clear. Specifying the size of the files is not your real aim, I guess. But you think that's what is needed in order to solve your problem that we don't know about. 500MB is not a re...
   Author: Bertrand Dechoux, 2013-06-10, 09:21
Re: save several 64MB files in Pig Latin - Pig - [mail # user]
...Hi, Pedro: Basically how many splits of files depends on how many reducer you have in your Pig job. So if total result data size is 100MB, and you have 10 reducers, you will get 10 files and...
   Author: Johnny Zhang, 2013-06-10, 06:58
Re: save several 64MB files in Pig Latin - Pig - [mail # user]
...I don't understand why my purpose is not clear. The previous e-mails explain it very clearly.  I want to split a 500MB single txt in HDFS into multiple files using Pig latin. Is it poss...
   Author: Pedro Sá da Costa, 2013-06-10, 05:42
Re: save several 64MB files in Pig Latin - Pig - [mail # user]
...The purpose is not really clear. But if you are looking for how to specify multiple Reducer task, it is well explained in the documentation. http://pig.apache.org/docs/r0.11.1/perf.html#para...
   Author: Bertrand Dechoux, 2013-06-10, 05:29
Re: save several 64MB files in Pig Latin - Pig - [mail # user]
...I said 64MB, but it can be 128MB, or 5KB. It doesn't matter the number. I just want to extract data and put into several files with specific size. Basically, I am doing a cat to a big txt fi...
   Author: Pedro Sá da Costa, 2013-06-10, 04:53
[PIG-3250] Pig dryrun generates wrong output in .expanded file for 'SPLIT....OTHERWISE...' command - Pig - [issue]
...step to reproduce it:1. input files 'users'123452. pig script split.pigdefine group_and_count (A,key) returns B {    SPLIT $A INTO $B IF $key<7, Y IF $key==5, Z OTHERWISE;}...
http://issues.apache.org/jira/browse/PIG-3250    Author: Johnny Zhang, 2013-06-10, 01:31
Sort:
project
Hadoop (69216)
HBase (58649)
Hive (38927)
Pig (24933)
Accumulo (21639)
MapReduce (18316)
HDFS (9112)
Flume (8389)
Zookeeper (8249)
Kafka (8003)
Avro (7285)
Sqoop (6851)
Chukwa (5949)
Drill (1854)
Bigtop (1568)
type
mail # user (12122)
javadoc (7260)
mail # dev (3289)
source code (1132)
issue (1011)
web site (74)
Sematext # blog (52)
wiki (45)
date
last 7 days (64)
last 30 days (348)
last 90 days (1082)
last 6 months (2320)
last 9 months (16588)
author
Dmitriy Ryaboy (1351)
Alan Gates (959)
Jonathan Coveney (736)
Daniel Dai (545)
Russell Jurney (493)
Olga Natkovich (453)
Prashant Kommireddi (370)
Bill Graham (333)
Cheolsoo Park (263)
Mridul Muralidharan (201)
Thejas Nair (195)
Ashutosh Chauhan (169)
Julien Le Dem (156)
Jeff Zhang (146)
Santhosh Srinivasan (142)