Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> RowCount

I've been doing the following to count rows:

x = foreach (group foo all) generate COUNT($1);

Is that the current best practice?  If so, would there be interest in a patch that simply did: x = rowcount(foo) ?  I find myself doing sanity checks on scripts a fair amount and am starting to wish for something less verbose.


Grant Ingersoll