Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> DATA not storing as comma-separted


Copy link to this message
-
Re: DATA not storing as comma-separted
Why are you trying 0.7, yogesh? It's ancient at this point.

" Unable to create input splits for: file:///hello/demotry.txt "
implies the file does not exist.

Can you show a whole session in which you load data, store it using
PigStorage(','), cat it, and it comes out wrong?
So far I've been unable to reproduce your results.

D

On Wed, Jul 25, 2012 at 7:09 AM, Mohammad Tariq <[EMAIL PROTECTED]> wrote:
> Hello Yogesh,
>
>        Also add these lines, export PIG_CLASSPATH=/HADOOP_HOME/conf &
> export HADOOP_CONF_DIR=/HADOOP_HOME/conf, and see if it works for you.
>
> Regards,
>     Mohammad Tariq
>
>
> On Wed, Jul 25, 2012 at 6:01 PM,  <[EMAIL PROTECTED]> wrote:
>> Hi mohammad,
>>
>> when I try the command
>>
>> Pig
>>
>> its shows error for 0.7.0 version
>>
>> mediaadmin$ pig
>> 12/07/25 17:54:15 INFO pig.Main: Logging error messages to: /users/mediaadmin/pig_1343219055229.log
>> 2012-07-25 17:54:15,451 [main] INFO  org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: file:///
>>
>> and this  .log file doesn't exist /users/mediaadmin/
>>
>> Wht is it so, I have set the thses properties in pig-0.70.0/bin/pig file.
>>
>> ---------------------------------------------------------------------
>>  The Pig command script
>> #
>> # Environment Variables
>> #
>>     export JAVA_HOME=/Library/Java/Home
>> #
>> #     PIG_CLASSPATH Extra Java CLASSPATH entries.
>> #
>>       export HADOOP_HOME=/HADOOP/hadoop-0.20.2
>>
>>         export HADOOP_CONF_DIR=/HADOOP/hadoop-0.20.2/conf
>>
>> #     PIG_HEAPSIZE    The maximum amount of heap to use, in MB.
>> #                                        Default is 1000.
>> #
>> #     PIG_OPTS            Extra Java runtime options.
>> #
>>      export PIG_CONF_DIR=/HADOOP/pig-0.7.0/conf
>> #
>> #     PIG_ROOT_LOGGER The root appender. Default is INFO,console
>> #
>> #     PIG_HADOOP_VERSION Version of hadoop to run with.    Default is 20 (0.20).
>>
>> ----------------------------------------------------------------
>>
>>
>>
>>
>> ________________________________________
>> From: Mohammad Tariq [[EMAIL PROTECTED]]
>> Sent: Wednesday, July 25, 2012 5:34 PM
>> To: [EMAIL PROTECTED]
>> Subject: Re: DATA not storing as comma-separted
>>
>> Also, it would be help to go to the MapReduce web UI and having a look
>> at the details of the job corresponding to this query.
>>
>> Regards,
>>     Mohammad Tariq
>>
>>
>> On Wed, Jul 25, 2012 at 5:31 PM, Mohammad Tariq <[EMAIL PROTECTED]> wrote:
>>> I have worked with pig-0.7.0 once and it was working fine. Try to see
>>> if there is anything interesting in the log files. Also, if possible,
>>> share 2-3 lines of your file..I'll give it a try on my machine.
>>>
>>> Regards,
>>>     Mohammad Tariq
>>>
>>>
>>> On Wed, Jul 25, 2012 at 5:20 PM,  <[EMAIL PROTECTED]> wrote:
>>>> Hi Mohammad,
>>>>
>>>> I have switched from pig 0.10.0 to 0.7.0 and its horrible experience.
>>>> I do perform
>>>>
>>>> grunt> A = load '/hello/demotry.txt'
>>>>>> as (name:chararray, roll:int, mssg:chararray);
>>>>
>>>> grunt> dump A;
>>>>
>>>> it shows this error:
>>>>
>>>> grunt> dump A;
>>>> 2012-07-25 17:20:34,081 [main] INFO  org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No column pruned for A
>>>> 2012-07-25 17:20:34,081 [main] INFO  org.apache.pig.impl.logicalLayer.optimizer.PruneColumns - No map keys pruned for A
>>>> 2012-07-25 17:20:34,102 [main] INFO  org.apache.hadoop.metrics.jvm.JvmMetrics - Initializing JVM Metrics with processName=JobTracker, sessionId>>>> 2012-07-25 17:20:34,169 [main] INFO  org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - (Name: Store(file:/tmp/temp61624047/tmp1087576502:org.apache.pig.builtin.BinStorage) - 1-18 Operator Key: 1-18)
>>>> 2012-07-25 17:20:34,195 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
>>>> 2012-07-25 17:20:34,195 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1