Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig, mail # user - Pig job result output and schema


Copy link to this message
-
Pig job result output and schema
Jeff Yuan 2013-03-05, 19:18
I have a couple of questions regarding job result and schema. The
context is that I'm trying to create a custom entry point for Pig that
takes a script, executes it, and always stores the last declared
alias/variable in a file. Would appreciate any insights to the 2
questions I have below or any advice in general.

1. I'm looking to automatically dump or store the last variable/alias
that the user has set. I know PigServer.getAliasKeySet or getAliases
will return a Set or Map of the alias. But they are unordered, is
there a way to get an ordered list of aliases?

2. I'm interested in getting the result schema and the raw result set.
Is the best way to do this just PigServer.dumpSchema(alias) to get the
result schema, and PigServer.openIterator(alias) to get the resulting
Tuples?

Thanks,
Jeff