AFAIK, SAMPLE operator internally uses reservoir sampling. So it reads entire data to randomly generate 10% data.
On Feb 26, 2013, at 6:19 PM, Panshul Whisper <[EMAIL PROTECTED]> wrote:
> Can somebody please explain me the difference between Limit and Sample
> Does it read the entire input file in case of Sample if the value is set to
> 0.1 or it reads randomly only till 10% of the data has been collected.
> Thanking You for any help.
> Ouch Whisper