Panshul Whisper 2013-02-26, 23:19
AFAIK, SAMPLE operator internally uses reservoir sampling. So it reads entire data to randomly generate 10% data.
On Feb 26, 2013, at 6:19 PM, Panshul Whisper <[EMAIL PROTECTED]> wrote:
> Can somebody please explain me the difference between Limit and Sample
> Does it read the entire input file in case of Sample if the value is set to
> 0.1 or it reads randomly only till 10% of the data has been collected.
> Thanking You for any help.
> Ouch Whisper
Gianmarco De Francisci Mo... 2013-02-28, 10:01
Prasanth J 2013-02-28, 10:08
Panshul Whisper 2013-02-28, 10:41