|
Olga Natkovich
2011-10-20, 23:40
Santhosh Srinivasan
2011-10-20, 23:58
Thejas Nair
2011-10-21, 17:45
Santhosh Srinivasan
2011-10-21, 17:59
Thejas Nair
2011-10-21, 18:22
Santhosh Srinivasan
2011-10-21, 20:50
Milind.Bhandarkar@...
2011-10-21, 21:10
Gianmarco De Francisci Mo...
2011-10-22, 09:31
Daniel Dai
2011-10-24, 18:59
Dmitriy Ryaboy
2011-10-24, 19:43
Thejas Nair
2011-10-24, 19:55
Dmitriy Ryaboy
2011-10-24, 21:32
Thejas Nair
2011-10-25, 16:54
Dmitriy Ryaboy
2011-10-25, 00:38
Olga Natkovich
2011-10-25, 00:54
Thejas Nair
2011-10-25, 01:10
Dmitriy Ryaboy
2011-10-25, 01:45
Santhosh Srinivasan
2011-10-25, 06:53
Dmitriy Ryaboy
2011-10-25, 07:30
Santhosh Srinivasan
2011-10-25, 07:52
Thejas Nair
2011-10-25, 17:01
Olga Natkovich
2011-10-25, 17:37
Olga Natkovich
2011-10-24, 20:12
Scott Carey
2011-10-24, 23:05
Gianmarco De Francisci Mo...
2011-10-25, 09:22
Alan Gates
2011-10-21, 18:00
Olga Natkovich
2011-10-25, 00:51
|
-
Next Pig release proposalOlga Natkovich 2011-10-20, 23:40
Hi,
Here is what I propose we do for the next Pig release: (1) Branch early next week - we have major features and many bug fixes in and will be fixing remaining bugs on the branch (2) Publish the release by 11/15 - that will give us a couple of weeks to stabilize the branch and get last minute bug fixes in (3) Make this release a 1.0 release. Reasons to go for 1.0 and not 0.10 a. This release has minimal number of features and was focused on code stabilization and bug fixes. We believe it will be a stable release b. It has no major interface changes c. Going from 0.10 to 1.0 would be very confusing Comments? Olga +
Olga Natkovich 2011-10-20, 23:40
-
RE: Next Pig release proposalSanthosh Srinivasan 2011-10-20, 23:58
Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0)
How far have we progressed from our last discussion in March. There was no consensus on the 1.0 release. Opinions ranged from having more releases to bake in the maturity of the new parser and logical plan changes to compatibility with Hadoop API (was compared to Social Security - a very hot topic these days). My concerns were around Hadoop API stability. I have heard that the APIs will not be stable for at least 1 year. This is taking me away from the Hadoop API stability factor (They passed healthcare in that duration. Really!) Do we want compatibility with 0.23 as a gating factor - not sure if this is anywhere close to getting done in the near future. Will we support append (0.20.205)? Btw, Hbase has been doing 0.90.1, 0.90.2, etc. So we can take a look at this option too. Santhosh -----Original Message----- From: Olga Natkovich [mailto:[EMAIL PROTECTED]] Sent: Thursday, October 20, 2011 4:40 PM To: [EMAIL PROTECTED] Subject: Next Pig release proposal Hi, Here is what I propose we do for the next Pig release: (1) Branch early next week - we have major features and many bug fixes in and will be fixing remaining bugs on the branch (2) Publish the release by 11/15 - that will give us a couple of weeks to stabilize the branch and get last minute bug fixes in (3) Make this release a 1.0 release. Reasons to go for 1.0 and not 0.10 a. This release has minimal number of features and was focused on code stabilization and bug fixes. We believe it will be a stable release b. It has no major interface changes c. Going from 0.10 to 1.0 would be very confusing Comments? Olga +
Santhosh Srinivasan 2011-10-20, 23:58
-
Re: Next Pig release proposalThejas Nair 2011-10-21, 17:45
On 10/20/11 4:58 PM, Santhosh Srinivasan wrote:
> Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0) > > How far have we progressed from our last discussion in March. There was no consensus on the 1.0 release. Opinions ranged from having more releases to bake in the maturity of the new parser and logical plan changes to compatibility with Hadoop API (was compared to Social Security - a very hot topic these days). > > My concerns were around Hadoop API stability. Over the next year or so, there are going to be two API versions of hadoop to be supported - 0.20.x api's and 0.23 apis, as we will have userbase on both. I think it is just a matter of releasing pig 1.0 for 0.20.x api's and 1.0 for 0.23.x api's. We will have to come up with a numbering scheme that reflects 'for hadoop version X' in our pig releases, regardless of it being 0.10 or 1.0. As there will be support for different api's of hadoop in pig releases, I don't see a reason why the hadoop api stability should stop pig from going 1.0 . -Thejas +
Thejas Nair 2011-10-21, 17:45
-
RE: Next Pig release proposalSanthosh Srinivasan 2011-10-21, 17:59
Thejas,
I guess you did not read my email completely. You are referring to the premise without examining the conclusion. I am repasting my entire email to avoid confusion (I hate truncated references). If you could respond again, it will bring us onto the same page. <email> Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0) How far have we progressed from our last discussion in March. There was no consensus on the 1.0 release. Opinions ranged from having more releases to bake in the maturity of the new parser and logical plan changes to compatibility with Hadoop API (was compared to Social Security - a very hot topic these days). My concerns were around Hadoop API stability. I have heard that the APIs will not be stable for at least 1 year. This is taking me away from the Hadoop API stability factor (They passed healthcare in that duration. Really!) Do we want compatibility with 0.23 as a gating factor - not sure if this is anywhere close to getting done in the near future. Will we support append (0.20.205)? Btw, Hbase has been doing 0.90.1, 0.90.2, etc. So we can take a look at this option too. Santhosh -----Original Message----- From: Olga Natkovich [mailto:[EMAIL PROTECTED]] Sent: Thursday, October 20, 2011 4:40 PM To: [EMAIL PROTECTED] Subject: Next Pig release proposal Hi, Here is what I propose we do for the next Pig release: (1) Branch early next week - we have major features and many bug fixes in and will be fixing remaining bugs on the branch (2) Publish the release by 11/15 - that will give us a couple of weeks to stabilize the branch and get last minute bug fixes in (3) Make this release a 1.0 release. Reasons to go for 1.0 and not 0.10 a. This release has minimal number of features and was focused on code stabilization and bug fixes. We believe it will be a stable release <email/> Thanks, Santhosh -----Original Message----- From: Thejas Nair [mailto:[EMAIL PROTECTED]] Sent: Friday, October 21, 2011 10:45 AM To: [EMAIL PROTECTED] Subject: Re: Next Pig release proposal On 10/20/11 4:58 PM, Santhosh Srinivasan wrote: > Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0) > > How far have we progressed from our last discussion in March. There was no consensus on the 1.0 release. Opinions ranged from having more releases to bake in the maturity of the new parser and logical plan changes to compatibility with Hadoop API (was compared to Social Security - a very hot topic these days). > > My concerns were around Hadoop API stability. Over the next year or so, there are going to be two API versions of hadoop to be supported - 0.20.x api's and 0.23 apis, as we will have userbase on both. I think it is just a matter of releasing pig 1.0 for 0.20.x api's and 1.0 for 0.23.x api's. We will have to come up with a numbering scheme that reflects 'for hadoop version X' in our pig releases, regardless of it being 0.10 or 1.0. As there will be support for different api's of hadoop in pig releases, I don't see a reason why the hadoop api stability should stop pig from going 1.0 . -Thejas +
Santhosh Srinivasan 2011-10-21, 17:59
-
Re: Next Pig release proposalThejas Nair 2011-10-21, 18:22
Santosh, I thought you meant API stability for hadoop across major versions, but I guess you are referring to stability within 0.23 versions. But argument applies to that as well, if 0.23.1 is not compatible with 0.23.0, we need to call the release for 0.23.1 as 'pig 1.x for 0.23.1 api' . We just need to communicate to the users that the InputFormat/OutputFormat api's (and any anything else we expose from hadoop) depends on the hadoop version they are using. I think it is just like different JNI libraries that you would write for different OS. But the java version remains the same across OSs. -Thejas On 10/21/11 10:59 AM, Santhosh Srinivasan wrote: > Thejas, > > I guess you did not read my email completely. You are referring to the premise without examining the conclusion. I am repasting my entire email to avoid confusion (I hate truncated references). If you could respond again, it will bring us onto the same page. > > <email> > > Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0) > > How far have we progressed from our last discussion in March. There was no consensus on the 1.0 release. Opinions ranged from having more releases to bake in the maturity of the new parser and logical plan changes to compatibility with Hadoop API (was compared to Social Security - a very hot topic these days). > > My concerns were around Hadoop API stability. I have heard that the APIs will not be stable for at least 1 year. This is taking me away from the Hadoop API stability factor (They passed healthcare in that duration. Really!) Do we want compatibility with 0.23 as a gating factor - not sure if this is anywhere close to getting done in the near future. Will we support append (0.20.205)? > > Btw, Hbase has been doing 0.90.1, 0.90.2, etc. So we can take a look at this option too. > > Santhosh > > > > -----Original Message----- > From: Olga Natkovich [mailto:[EMAIL PROTECTED]] > Sent: Thursday, October 20, 2011 4:40 PM > To: [EMAIL PROTECTED] > Subject: Next Pig release proposal > > Hi, > > Here is what I propose we do for the next Pig release: > > > (1) Branch early next week - we have major features and many bug fixes in and will be fixing remaining bugs on the branch > > (2) Publish the release by 11/15 - that will give us a couple of weeks to stabilize the branch and get last minute bug fixes in > > (3) Make this release a 1.0 release. Reasons to go for 1.0 and not 0.10 > > a. This release has minimal number of features and was focused on code stabilization and bug fixes. We believe it will be a stable release > > <email/> > > Thanks, > Santhosh > > -----Original Message----- > From: Thejas Nair [mailto:[EMAIL PROTECTED]] > Sent: Friday, October 21, 2011 10:45 AM > To: [EMAIL PROTECTED] > Subject: Re: Next Pig release proposal > > On 10/20/11 4:58 PM, Santhosh Srinivasan wrote: >> Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0) >> >> How far have we progressed from our last discussion in March. There was no consensus on the 1.0 release. Opinions ranged from having more releases to bake in the maturity of the new parser and logical plan changes to compatibility with Hadoop API (was compared to Social Security - a very hot topic these days). >> >> My concerns were around Hadoop API stability. > > Over the next year or so, there are going to be two API versions of hadoop to be supported - 0.20.x api's and 0.23 apis, as we will have userbase on both. > > I think it is just a matter of releasing pig 1.0 for 0.20.x api's and 1.0 for 0.23.x api's. We will have to come up with a numbering scheme that reflects 'for hadoop version X' in our pig releases, regardless of it being 0.10 or 1.0. > > As there will be support for different api's of hadoop in pig releases, I don't see a reason why the hadoop api stability should stop pig from going 1.0 . > > -Thejas +
Thejas Nair 2011-10-21, 18:22
-
RE: Next Pig release proposalSanthosh Srinivasan 2011-10-21, 20:50
If I was not clear in my earlier email, I apologize for the lack of clarity. I am no longer in favour of waiting for Hadoop API stability across Hadoop versions. It's a pipe dream.
When we had PigInputFormat and PigOutputFormat, your reasoning would be spot on. I am concerned about the following. Our tight integration with Hadoop due to the use of Input and Output format might lead to a break in backward compatibility. I am not sure if the comparison with that of Java is valid. Probably a majority of the users don't use JNI. Its very hard to use Pig without writing custom load and store functions. The default load and store don't suffice for a majority of use cases that I have observed. I am trying to get all factors that might influence this decision. From the few emails that have been exchanged since yesterday, we have the following factors: 1. Hadoop 0.20.205 (support for Append) 2. Hadoop 0.22 3. Hadoop 0.23 4. Maturity of the new parser 5. Stability of the new logical plan 6. Other components in the eco system. - Avro (1.5.4, 1.4.1, ...) - Cassandra (1.0.0, 0.8.7, ...) - Chukwa (0.4.0, 0.3.0, ...) - Hama (0.3.0, 0.2.0, ...) - Hbase (0.90.4, 0.90.3, 0.90.2, 0.90.1, ...) - Hive (Releases - 0.7.1, 0.7.0, 0.6.0, ...) - Zookeeper (3.3.3, 3.3.2, 3.2.2, 3.1.2, ...) Santhosh -----Original Message----- From: Thejas Nair [mailto:[EMAIL PROTECTED]] Sent: Friday, October 21, 2011 11:22 AM To: [EMAIL PROTECTED] Subject: Re: Next Pig release proposal Santosh, I thought you meant API stability for hadoop across major versions, but I guess you are referring to stability within 0.23 versions. But argument applies to that as well, if 0.23.1 is not compatible with 0.23.0, we need to call the release for 0.23.1 as 'pig 1.x for 0.23.1 api' . We just need to communicate to the users that the InputFormat/OutputFormat api's (and any anything else we expose from hadoop) depends on the hadoop version they are using. I think it is just like different JNI libraries that you would write for different OS. But the java version remains the same across OSs. -Thejas On 10/21/11 10:59 AM, Santhosh Srinivasan wrote: > Thejas, > > I guess you did not read my email completely. You are referring to the premise without examining the conclusion. I am repasting my entire email to avoid confusion (I hate truncated references). If you could respond again, it will bring us onto the same page. > > <email> > > Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0) > > How far have we progressed from our last discussion in March. There was no consensus on the 1.0 release. Opinions ranged from having more releases to bake in the maturity of the new parser and logical plan changes to compatibility with Hadoop API (was compared to Social Security - a very hot topic these days). > > My concerns were around Hadoop API stability. I have heard that the APIs will not be stable for at least 1 year. This is taking me away from the Hadoop API stability factor (They passed healthcare in that duration. Really!) Do we want compatibility with 0.23 as a gating factor - not sure if this is anywhere close to getting done in the near future. Will we support append (0.20.205)? > > Btw, Hbase has been doing 0.90.1, 0.90.2, etc. So we can take a look at this option too. > > Santhosh > > > > -----Original Message----- > From: Olga Natkovich [mailto:[EMAIL PROTECTED]] > Sent: Thursday, October 20, 2011 4:40 PM > To: [EMAIL PROTECTED] > Subject: Next Pig release proposal > > Hi, > > Here is what I propose we do for the next Pig release: > > > (1) Branch early next week - we have major features and many bug fixes in and will be fixing remaining bugs on the branch > > (2) Publish the release by 11/15 - that will give us a couple of weeks to stabilize the branch and get last minute bug fixes in > > (3) Make this release a 1.0 release. Reasons to go for 1.0 and not 0.10 > > a. This release has minimal number of features and was focused on code stabilization and bug fixes. We believe it will be a stable release +
Santhosh Srinivasan 2011-10-21, 20:50
-
Re: Next Pig release proposalMilind.Bhandarkar@... 2011-10-21, 21:10
If one were to rewrite input and output formats to use the webhdfs://
APIs, this would not be an issue, right ? - milind On 10/21/11 1:50 PM, "Santhosh Srinivasan" <[EMAIL PROTECTED]> wrote: >If I was not clear in my earlier email, I apologize for the lack of >clarity. I am no longer in favour of waiting for Hadoop API stability >across Hadoop versions. It's a pipe dream. > >When we had PigInputFormat and PigOutputFormat, your reasoning would be >spot on. I am concerned about the following. Our tight integration with >Hadoop due to the use of Input and Output format might lead to a break in >backward compatibility. I am not sure if the comparison with that of Java >is valid. Probably a majority of the users don't use JNI. Its very hard >to use Pig without writing custom load and store functions. The default >load and store don't suffice for a majority of use cases that I have >observed. > >I am trying to get all factors that might influence this decision. From >the few emails that have been exchanged since yesterday, we have the >following factors: > >1. Hadoop 0.20.205 (support for Append) >2. Hadoop 0.22 >3. Hadoop 0.23 >4. Maturity of the new parser >5. Stability of the new logical plan >6. Other components in the eco system. > - Avro (1.5.4, 1.4.1, ...) > - Cassandra (1.0.0, 0.8.7, ...) > - Chukwa (0.4.0, 0.3.0, ...) > - Hama (0.3.0, 0.2.0, ...) > - Hbase (0.90.4, 0.90.3, 0.90.2, 0.90.1, ...) > - Hive (Releases - 0.7.1, 0.7.0, 0.6.0, ...) > - Zookeeper (3.3.3, 3.3.2, 3.2.2, 3.1.2, ...) > >Santhosh > > >-----Original Message----- >From: Thejas Nair [mailto:[EMAIL PROTECTED]] >Sent: Friday, October 21, 2011 11:22 AM >To: [EMAIL PROTECTED] >Subject: Re: Next Pig release proposal > > >Santosh, >I thought you meant API stability for hadoop across major versions, but I >guess you are referring to stability within 0.23 versions. But argument >applies to that as well, if 0.23.1 is not compatible with 0.23.0, we need >to call the release for 0.23.1 as 'pig 1.x for 0.23.1 api' . > >We just need to communicate to the users that the >InputFormat/OutputFormat api's (and any anything else we expose from >hadoop) depends on the hadoop version they are using. > >I think it is just like different JNI libraries that you would write for >different OS. But the java version remains the same across OSs. > >-Thejas > > >On 10/21/11 10:59 AM, Santhosh Srinivasan wrote: >> Thejas, >> >> I guess you did not read my email completely. You are referring to the >>premise without examining the conclusion. I am repasting my entire email >>to avoid confusion (I hate truncated references). If you could respond >>again, it will bring us onto the same page. >> >> <email> >> >> Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0) >> >> How far have we progressed from our last discussion in March. There was >>no consensus on the 1.0 release. Opinions ranged from having more >>releases to bake in the maturity of the new parser and logical plan >>changes to compatibility with Hadoop API (was compared to Social >>Security - a very hot topic these days). >> >> My concerns were around Hadoop API stability. I have heard that the >>APIs will not be stable for at least 1 year. This is taking me away from >>the Hadoop API stability factor (They passed healthcare in that >>duration. Really!) Do we want compatibility with 0.23 as a gating factor >>- not sure if this is anywhere close to getting done in the near future. >>Will we support append (0.20.205)? >> >> Btw, Hbase has been doing 0.90.1, 0.90.2, etc. So we can take a look at >>this option too. >> >> Santhosh >> >> >> >> -----Original Message----- >> From: Olga Natkovich [mailto:[EMAIL PROTECTED]] >> Sent: Thursday, October 20, 2011 4:40 PM >> To: [EMAIL PROTECTED] >> Subject: Next Pig release proposal >> >> Hi, >> >> Here is what I propose we do for the next Pig release: >> >> >> (1) Branch early next week - we have major features and many bug >>fixes in and will be fixing remaining bugs on the branch +
Milind.Bhandarkar@... 2011-10-21, 21:10
-
Re: Next Pig release proposalGianmarco De Francisci Mo... 2011-10-22, 09:31
Hi,
just my 2 cents. I think the issue here is not 1.0 vs 0.10, but what's the versioning scheme we want to use for Pig. Up to now it has been just an increasing number after a '0.' prefix, changed when the community felt it was time. I think this works well for a small project, but it is somewhat fuzzy. I like the idea of having <major>.<minor>.<patch> versions like many other projects. It's a very clear and almost standard way of versioning a piece of software. It has clear rules on when to change each of the numbers, and lets the user get an idea of backward compatibility at a glance. So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we decide a clear versioning policy (whichever it is). So that the 1.0 milestone would mark the beginning of our new policy. Cheers, -- Gianmarco On Fri, Oct 21, 2011 at 23:10, <[EMAIL PROTECTED]> wrote: > If one were to rewrite input and output formats to use the webhdfs:// > APIs, this would not be an issue, right ? > > - milind > > > On 10/21/11 1:50 PM, "Santhosh Srinivasan" <[EMAIL PROTECTED]> wrote: > > >If I was not clear in my earlier email, I apologize for the lack of > >clarity. I am no longer in favour of waiting for Hadoop API stability > >across Hadoop versions. It's a pipe dream. > > > >When we had PigInputFormat and PigOutputFormat, your reasoning would be > >spot on. I am concerned about the following. Our tight integration with > >Hadoop due to the use of Input and Output format might lead to a break in > >backward compatibility. I am not sure if the comparison with that of Java > >is valid. Probably a majority of the users don't use JNI. Its very hard > >to use Pig without writing custom load and store functions. The default > >load and store don't suffice for a majority of use cases that I have > >observed. > > > >I am trying to get all factors that might influence this decision. From > >the few emails that have been exchanged since yesterday, we have the > >following factors: > > > >1. Hadoop 0.20.205 (support for Append) > >2. Hadoop 0.22 > >3. Hadoop 0.23 > >4. Maturity of the new parser > >5. Stability of the new logical plan > >6. Other components in the eco system. > > - Avro (1.5.4, 1.4.1, ...) > > - Cassandra (1.0.0, 0.8.7, ...) > > - Chukwa (0.4.0, 0.3.0, ...) > > - Hama (0.3.0, 0.2.0, ...) > > - Hbase (0.90.4, 0.90.3, 0.90.2, 0.90.1, ...) > > - Hive (Releases - 0.7.1, 0.7.0, 0.6.0, ...) > > - Zookeeper (3.3.3, 3.3.2, 3.2.2, 3.1.2, ...) > > > >Santhosh > > > > > >-----Original Message----- > >From: Thejas Nair [mailto:[EMAIL PROTECTED]] > >Sent: Friday, October 21, 2011 11:22 AM > >To: [EMAIL PROTECTED] > >Subject: Re: Next Pig release proposal > > > > > >Santosh, > >I thought you meant API stability for hadoop across major versions, but I > >guess you are referring to stability within 0.23 versions. But argument > >applies to that as well, if 0.23.1 is not compatible with 0.23.0, we need > >to call the release for 0.23.1 as 'pig 1.x for 0.23.1 api' . > > > >We just need to communicate to the users that the > >InputFormat/OutputFormat api's (and any anything else we expose from > >hadoop) depends on the hadoop version they are using. > > > >I think it is just like different JNI libraries that you would write for > >different OS. But the java version remains the same across OSs. > > > >-Thejas > > > > > >On 10/21/11 10:59 AM, Santhosh Srinivasan wrote: > >> Thejas, > >> > >> I guess you did not read my email completely. You are referring to the > >>premise without examining the conclusion. I am repasting my entire email > >>to avoid confusion (I hate truncated references). If you could respond > >>again, it will bring us onto the same page. > >> > >> <email> > >> > >> Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0) > >> > >> How far have we progressed from our last discussion in March. There was > >>no consensus on the 1.0 release. Opinions ranged from having more > >>releases to bake in the maturity of the new parser and logical plan +
Gianmarco De Francisci Mo... 2011-10-22, 09:31
-
Re: Next Pig release proposalDaniel Dai 2011-10-24, 18:59
Yes, we need a versioning scheme. There are two versioning scheme I can
think of: Scheme 1: <major>.<patch> <major> will be the feature rich release every 3 month <patch> will be the bug fix release when necessary Nov release will be 1.0, Feb release will be 2.0. There will be 1.1, 2.1 etc for bug fixes. Scheme 2: <major>.<minor>.<patch> Most of our 3 month release will be counted as <minor> release unless there are major user facing/disruptive changes. Nov release will be 1.0.0, Feb release will be 1.1.0. There will be 1.0.1, 1.1.1 etc for bug fixes. I personally prefer scheme 2, increasing major version too frequently might be confusing to users. How's other folks feel? Daniel On Sat, Oct 22, 2011 at 2:31 AM, Gianmarco De Francisci Morales < [EMAIL PROTECTED]> wrote: > Hi, > > just my 2 cents. > > I think the issue here is not 1.0 vs 0.10, but what's the versioning scheme > we want to use for Pig. > Up to now it has been just an increasing number after a '0.' prefix, > changed > when the community felt it was time. I think this works well for a small > project, but it is somewhat fuzzy. > > I like the idea of having <major>.<minor>.<patch> versions like many other > projects. It's a very clear and almost standard way of versioning a piece > of > software. It has clear rules on when to change each of the numbers, and > lets > the user get an idea of backward compatibility at a glance. > > So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we decide > a clear versioning policy (whichever it is). > So that the 1.0 milestone would mark the beginning of our new policy. > > Cheers, > -- > Gianmarco > > > > On Fri, Oct 21, 2011 at 23:10, <[EMAIL PROTECTED]> wrote: > > > If one were to rewrite input and output formats to use the webhdfs:// > > APIs, this would not be an issue, right ? > > > > - milind > > > > > > On 10/21/11 1:50 PM, "Santhosh Srinivasan" <[EMAIL PROTECTED]> wrote: > > > > >If I was not clear in my earlier email, I apologize for the lack of > > >clarity. I am no longer in favour of waiting for Hadoop API stability > > >across Hadoop versions. It's a pipe dream. > > > > > >When we had PigInputFormat and PigOutputFormat, your reasoning would be > > >spot on. I am concerned about the following. Our tight integration with > > >Hadoop due to the use of Input and Output format might lead to a break > in > > >backward compatibility. I am not sure if the comparison with that of > Java > > >is valid. Probably a majority of the users don't use JNI. Its very hard > > >to use Pig without writing custom load and store functions. The default > > >load and store don't suffice for a majority of use cases that I have > > >observed. > > > > > >I am trying to get all factors that might influence this decision. From > > >the few emails that have been exchanged since yesterday, we have the > > >following factors: > > > > > >1. Hadoop 0.20.205 (support for Append) > > >2. Hadoop 0.22 > > >3. Hadoop 0.23 > > >4. Maturity of the new parser > > >5. Stability of the new logical plan > > >6. Other components in the eco system. > > > - Avro (1.5.4, 1.4.1, ...) > > > - Cassandra (1.0.0, 0.8.7, ...) > > > - Chukwa (0.4.0, 0.3.0, ...) > > > - Hama (0.3.0, 0.2.0, ...) > > > - Hbase (0.90.4, 0.90.3, 0.90.2, 0.90.1, ...) > > > - Hive (Releases - 0.7.1, 0.7.0, 0.6.0, ...) > > > - Zookeeper (3.3.3, 3.3.2, 3.2.2, 3.1.2, ...) > > > > > >Santhosh > > > > > > > > >-----Original Message----- > > >From: Thejas Nair [mailto:[EMAIL PROTECTED]] > > >Sent: Friday, October 21, 2011 11:22 AM > > >To: [EMAIL PROTECTED] > > >Subject: Re: Next Pig release proposal > > > > > > > > >Santosh, > > >I thought you meant API stability for hadoop across major versions, but > I > > >guess you are referring to stability within 0.23 versions. But argument > > >applies to that as well, if 0.23.1 is not compatible with 0.23.0, we > need > > >to call the release for 0.23.1 as 'pig 1.x for 0.23.1 api' . +
Daniel Dai 2011-10-24, 18:59
-
Re: Next Pig release proposalDmitriy Ryaboy 2011-10-24, 19:43
I am good with Scheme 2.
We are finding a fair number of issues trying to move from Pig 0.8.1 to 0.9, and I don't think those issues are fixed in 10, either.. not sure that this "stabilization" process has happened yet. D On Mon, Oct 24, 2011 at 11:59 AM, Daniel Dai <[EMAIL PROTECTED]> wrote: > Yes, we need a versioning scheme. There are two versioning scheme I can > think of: > > Scheme 1: > <major>.<patch> > <major> will be the feature rich release every 3 month > <patch> will be the bug fix release when necessary > > Nov release will be 1.0, Feb release will be 2.0. There will be 1.1, 2.1 > etc > for bug fixes. > > Scheme 2: > <major>.<minor>.<patch> > Most of our 3 month release will be counted as <minor> release unless there > are major user facing/disruptive changes. > > Nov release will be 1.0.0, Feb release will be 1.1.0. There will be 1.0.1, > 1.1.1 etc for bug fixes. > > I personally prefer scheme 2, increasing major version too frequently might > be confusing to users. How's other folks feel? > > Daniel > > > On Sat, Oct 22, 2011 at 2:31 AM, Gianmarco De Francisci Morales < > [EMAIL PROTECTED]> wrote: > > > Hi, > > > > just my 2 cents. > > > > I think the issue here is not 1.0 vs 0.10, but what's the versioning > scheme > > we want to use for Pig. > > Up to now it has been just an increasing number after a '0.' prefix, > > changed > > when the community felt it was time. I think this works well for a small > > project, but it is somewhat fuzzy. > > > > I like the idea of having <major>.<minor>.<patch> versions like many > other > > projects. It's a very clear and almost standard way of versioning a piece > > of > > software. It has clear rules on when to change each of the numbers, and > > lets > > the user get an idea of backward compatibility at a glance. > > > > So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we > decide > > a clear versioning policy (whichever it is). > > So that the 1.0 milestone would mark the beginning of our new policy. > > > > Cheers, > > -- > > Gianmarco > > > > > > > > On Fri, Oct 21, 2011 at 23:10, <[EMAIL PROTECTED]> wrote: > > > > > If one were to rewrite input and output formats to use the webhdfs:// > > > APIs, this would not be an issue, right ? > > > > > > - milind > > > > > > > > > On 10/21/11 1:50 PM, "Santhosh Srinivasan" <[EMAIL PROTECTED]> wrote: > > > > > > >If I was not clear in my earlier email, I apologize for the lack of > > > >clarity. I am no longer in favour of waiting for Hadoop API stability > > > >across Hadoop versions. It's a pipe dream. > > > > > > > >When we had PigInputFormat and PigOutputFormat, your reasoning would > be > > > >spot on. I am concerned about the following. Our tight integration > with > > > >Hadoop due to the use of Input and Output format might lead to a break > > in > > > >backward compatibility. I am not sure if the comparison with that of > > Java > > > >is valid. Probably a majority of the users don't use JNI. Its very > hard > > > >to use Pig without writing custom load and store functions. The > default > > > >load and store don't suffice for a majority of use cases that I have > > > >observed. > > > > > > > >I am trying to get all factors that might influence this decision. > From > > > >the few emails that have been exchanged since yesterday, we have the > > > >following factors: > > > > > > > >1. Hadoop 0.20.205 (support for Append) > > > >2. Hadoop 0.22 > > > >3. Hadoop 0.23 > > > >4. Maturity of the new parser > > > >5. Stability of the new logical plan > > > >6. Other components in the eco system. > > > > - Avro (1.5.4, 1.4.1, ...) > > > > - Cassandra (1.0.0, 0.8.7, ...) > > > > - Chukwa (0.4.0, 0.3.0, ...) > > > > - Hama (0.3.0, 0.2.0, ...) > > > > - Hbase (0.90.4, 0.90.3, 0.90.2, 0.90.1, ...) > > > > - Hive (Releases - 0.7.1, 0.7.0, 0.6.0, ...) > > > > - Zookeeper (3.3.3, 3.3.2, 3.2.2, 3.1.2, ...) > > > > > > > >Santhosh > > > > > > > > > > > >-----Original Message----- +
Dmitriy Ryaboy 2011-10-24, 19:43
-
Re: Next Pig release proposalThejas Nair 2011-10-24, 19:55
On 10/24/11 12:43 PM, Dmitriy Ryaboy wrote:
> We are finding a fair number of issues trying to move from Pig 0.8.1 to 0.9, > and I don't think those issues are fixed in 10, either.. not sure that this > "stabilization" process has happened yet. > > D > What kind of issues are these ? Are they related to major changes in 0.8 (logical plan) or 0.9 (antlr parser, or semantic cleanup (in terms of backward compat) ) ? -Thejas +
Thejas Nair 2011-10-24, 19:55
-
Re: Next Pig release proposalDmitriy Ryaboy 2011-10-24, 21:32
search for emails from Jonathan Coveney to pig-user that are not about jruby
:) On Mon, Oct 24, 2011 at 12:55 PM, Thejas Nair <[EMAIL PROTECTED]>wrote: > On 10/24/11 12:43 PM, Dmitriy Ryaboy wrote: > >> We are finding a fair number of issues trying to move from Pig 0.8.1 to >> 0.9, >> and I don't think those issues are fixed in 10, either.. not sure that >> this >> "stabilization" process has happened yet. >> >> D >> >> > What kind of issues are these ? Are they related to major changes in 0.8 > (logical plan) or 0.9 (antlr parser, or semantic cleanup (in terms of > backward compat) ) ? > > -Thejas > +
Dmitriy Ryaboy 2011-10-24, 21:32
-
Re: Next Pig release proposalThejas Nair 2011-10-25, 16:54
I can find two issues reported by Jonathan, both related to the parser.
One having to do with nested statement syntax, and other to do with speed of parsing. Both of these should get looked into before next release. But I don't see a whole lot issues with the parser changes or 0.9 in general, it seems to be fairly stable. -Thejas On 10/24/11 2:32 PM, Dmitriy Ryaboy wrote: > search for emails from Jonathan Coveney to pig-user that are not about jruby > :) > > > On Mon, Oct 24, 2011 at 12:55 PM, Thejas Nair<[EMAIL PROTECTED]>wrote: > >> On 10/24/11 12:43 PM, Dmitriy Ryaboy wrote: >> >>> We are finding a fair number of issues trying to move from Pig 0.8.1 to >>> 0.9, >>> and I don't think those issues are fixed in 10, either.. not sure that >>> this >>> "stabilization" process has happened yet. >>> >>> D >>> >>> >> What kind of issues are these ? Are they related to major changes in 0.8 >> (logical plan) or 0.9 (antlr parser, or semantic cleanup (in terms of >> backward compat) ) ? >> >> -Thejas >> > +
Thejas Nair 2011-10-25, 16:54
-
Re: Next Pig release proposalDmitriy Ryaboy 2011-10-25, 00:38
To be a little more concrete about what I am saying here -- I don't think we
should put a "1.0" label on any *.0 release. 0.8.1 is pretty solid; 0.9.0 has some holes, 0.9.1 is better. If we put 1.0 on what is currently being thought of as 0.10, it will have some stability / usability issues (things tend to show up after we make a release and people in the wild start trying it), and those issues will make a poor impression on those who expect 1.0 to be shiny and polished after so much time. I'm in favor of waiting a couple of dot releases, promoting a stabilized release into 1.0, and going from there. So, pictorially: -- trunk --- 0.11-dev ----------0.12-dev------------------| 1.2-dev! \ \ \ \ ---------------- 0.11.0 --------------------| 1.1.0! \ \------- 0.10.0 ------- 0.10.1 ------- 0.10.2 --------| 1.0.0 !! On Mon, Oct 24, 2011 at 12:43 PM, Dmitriy Ryaboy <[EMAIL PROTECTED]> wrote: > I am good with Scheme 2. > > We are finding a fair number of issues trying to move from Pig 0.8.1 to > 0.9, and I don't think those issues are fixed in 10, either.. not sure that > this "stabilization" process has happened yet. > > D > > > On Mon, Oct 24, 2011 at 11:59 AM, Daniel Dai <[EMAIL PROTECTED]>wrote: > >> Yes, we need a versioning scheme. There are two versioning scheme I can >> think of: >> >> Scheme 1: >> <major>.<patch> >> <major> will be the feature rich release every 3 month >> <patch> will be the bug fix release when necessary >> >> Nov release will be 1.0, Feb release will be 2.0. There will be 1.1, 2.1 >> etc >> for bug fixes. >> >> Scheme 2: >> <major>.<minor>.<patch> >> Most of our 3 month release will be counted as <minor> release unless >> there >> are major user facing/disruptive changes. >> >> Nov release will be 1.0.0, Feb release will be 1.1.0. There will be 1.0.1, >> 1.1.1 etc for bug fixes. >> >> I personally prefer scheme 2, increasing major version too frequently >> might >> be confusing to users. How's other folks feel? >> >> Daniel >> >> >> On Sat, Oct 22, 2011 at 2:31 AM, Gianmarco De Francisci Morales < >> [EMAIL PROTECTED]> wrote: >> >> > Hi, >> > >> > just my 2 cents. >> > >> > I think the issue here is not 1.0 vs 0.10, but what's the versioning >> scheme >> > we want to use for Pig. >> > Up to now it has been just an increasing number after a '0.' prefix, >> > changed >> > when the community felt it was time. I think this works well for a small >> > project, but it is somewhat fuzzy. >> > >> > I like the idea of having <major>.<minor>.<patch> versions like many >> other >> > projects. It's a very clear and almost standard way of versioning a >> piece >> > of >> > software. It has clear rules on when to change each of the numbers, and >> > lets >> > the user get an idea of backward compatibility at a glance. >> > >> > So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we >> decide >> > a clear versioning policy (whichever it is). >> > So that the 1.0 milestone would mark the beginning of our new policy. >> > >> > Cheers, >> > -- >> > Gianmarco >> > >> > >> > >> > On Fri, Oct 21, 2011 at 23:10, <[EMAIL PROTECTED]> wrote: >> > >> > > If one were to rewrite input and output formats to use the webhdfs:// >> > > APIs, this would not be an issue, right ? >> > > >> > > - milind >> > > >> > > >> > > On 10/21/11 1:50 PM, "Santhosh Srinivasan" <[EMAIL PROTECTED]> wrote: >> > > >> > > >If I was not clear in my earlier email, I apologize for the lack of >> > > >clarity. I am no longer in favour of waiting for Hadoop API stability >> > > >across Hadoop versions. It's a pipe dream. >> > > > >> > > >When we had PigInputFormat and PigOutputFormat, your reasoning would >> be >> > > >spot on. I am concerned about the following. Our tight integration >> with >> > > >Hadoop due to the use of Input and Output format might lead to a >> break >> > in >> > > >backward compatibility. I am not sure if the comparison with that of >> > Java >> > > >is valid. Probably a majority of the users don't use JNI. Its very +
Dmitriy Ryaboy 2011-10-25, 00:38
-
RE: Next Pig release proposalOlga Natkovich 2011-10-25, 00:54
I actually find this to be a bit confusing scheme. I have not seen any project or product doing something similar.
Olga -----Original Message----- From: Dmitriy Ryaboy [mailto:[EMAIL PROTECTED]] Sent: Monday, October 24, 2011 5:38 PM To: [EMAIL PROTECTED] Subject: Re: Next Pig release proposal To be a little more concrete about what I am saying here -- I don't think we should put a "1.0" label on any *.0 release. 0.8.1 is pretty solid; 0.9.0 has some holes, 0.9.1 is better. If we put 1.0 on what is currently being thought of as 0.10, it will have some stability / usability issues (things tend to show up after we make a release and people in the wild start trying it), and those issues will make a poor impression on those who expect 1.0 to be shiny and polished after so much time. I'm in favor of waiting a couple of dot releases, promoting a stabilized release into 1.0, and going from there. So, pictorially: -- trunk --- 0.11-dev ----------0.12-dev------------------| 1.2-dev! \ \ \ \ ---------------- 0.11.0 --------------------| 1.1.0! \ \------- 0.10.0 ------- 0.10.1 ------- 0.10.2 --------| 1.0.0 !! On Mon, Oct 24, 2011 at 12:43 PM, Dmitriy Ryaboy <[EMAIL PROTECTED]> wrote: > I am good with Scheme 2. > > We are finding a fair number of issues trying to move from Pig 0.8.1 to > 0.9, and I don't think those issues are fixed in 10, either.. not sure that > this "stabilization" process has happened yet. > > D > > > On Mon, Oct 24, 2011 at 11:59 AM, Daniel Dai <[EMAIL PROTECTED]>wrote: > >> Yes, we need a versioning scheme. There are two versioning scheme I can >> think of: >> >> Scheme 1: >> <major>.<patch> >> <major> will be the feature rich release every 3 month >> <patch> will be the bug fix release when necessary >> >> Nov release will be 1.0, Feb release will be 2.0. There will be 1.1, 2.1 >> etc >> for bug fixes. >> >> Scheme 2: >> <major>.<minor>.<patch> >> Most of our 3 month release will be counted as <minor> release unless >> there >> are major user facing/disruptive changes. >> >> Nov release will be 1.0.0, Feb release will be 1.1.0. There will be 1.0.1, >> 1.1.1 etc for bug fixes. >> >> I personally prefer scheme 2, increasing major version too frequently >> might >> be confusing to users. How's other folks feel? >> >> Daniel >> >> >> On Sat, Oct 22, 2011 at 2:31 AM, Gianmarco De Francisci Morales < >> [EMAIL PROTECTED]> wrote: >> >> > Hi, >> > >> > just my 2 cents. >> > >> > I think the issue here is not 1.0 vs 0.10, but what's the versioning >> scheme >> > we want to use for Pig. >> > Up to now it has been just an increasing number after a '0.' prefix, >> > changed >> > when the community felt it was time. I think this works well for a small >> > project, but it is somewhat fuzzy. >> > >> > I like the idea of having <major>.<minor>.<patch> versions like many >> other >> > projects. It's a very clear and almost standard way of versioning a >> piece >> > of >> > software. It has clear rules on when to change each of the numbers, and >> > lets >> > the user get an idea of backward compatibility at a glance. >> > >> > So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we >> decide >> > a clear versioning policy (whichever it is). >> > So that the 1.0 milestone would mark the beginning of our new policy. >> > >> > Cheers, >> > -- >> > Gianmarco >> > >> > >> > >> > On Fri, Oct 21, 2011 at 23:10, <[EMAIL PROTECTED]> wrote: >> > >> > > If one were to rewrite input and output formats to use the webhdfs:// >> > > APIs, this would not be an issue, right ? >> > > >> > > - milind >> > > >> > > >> > > On 10/21/11 1:50 PM, "Santhosh Srinivasan" <[EMAIL PROTECTED]> wrote: >> > > >> > > >If I was not clear in my earlier email, I apologize for the lack of >> > > >clarity. I am no longer in favour of waiting for Hadoop API stability >> > > >across Hadoop versions. It's a pipe dream. >> > > > >> > > >When we had PigInputFormat and PigOutputFormat, your reasoning would +
Olga Natkovich 2011-10-25, 00:54
-
Re: Next Pig release proposalThejas Nair 2011-10-25, 01:10
Dmitriy,
I think what you are saying is something similar to alpha/beta releases. (maybe beta1, beta2 .. is better). So the first release could be 1.0.0_beta1. I scheme will be easier for users to understand. But I am not sure what the criteria for promoting a release from betaX to general release should be. Thanks, Thejas On 10/24/11 5:38 PM, Dmitriy Ryaboy wrote: > To be a little more concrete about what I am saying here -- I don't think we > should put a "1.0" label on any *.0 release. 0.8.1 is pretty solid; 0.9.0 > has some holes, 0.9.1 is better. If we put 1.0 on what is currently being > thought of as 0.10, it will have some stability / usability issues (things > tend to show up after we make a release and people in the wild start trying > it), and those issues will make a poor impression on those who expect 1.0 to > be shiny and polished after so much time. I'm in favor of waiting a couple > of dot releases, promoting a stabilized release into 1.0, and going from > there. So, pictorially: > > -- trunk --- 0.11-dev ----------0.12-dev------------------| 1.2-dev! > \ \ > \ \ ---------------- 0.11.0 --------------------| 1.1.0! > \ > \------- 0.10.0 ------- 0.10.1 ------- 0.10.2 --------| 1.0.0 !! > > On Mon, Oct 24, 2011 at 12:43 PM, Dmitriy Ryaboy<[EMAIL PROTECTED]> wrote: > >> I am good with Scheme 2. >> >> We are finding a fair number of issues trying to move from Pig 0.8.1 to >> 0.9, and I don't think those issues are fixed in 10, either.. not sure that >> this "stabilization" process has happened yet. >> >> D >> >> >> On Mon, Oct 24, 2011 at 11:59 AM, Daniel Dai<[EMAIL PROTECTED]>wrote: >> >>> Yes, we need a versioning scheme. There are two versioning scheme I can >>> think of: >>> >>> Scheme 1: >>> <major>.<patch> >>> <major> will be the feature rich release every 3 month >>> <patch> will be the bug fix release when necessary >>> >>> Nov release will be 1.0, Feb release will be 2.0. There will be 1.1, 2.1 >>> etc >>> for bug fixes. >>> >>> Scheme 2: >>> <major>.<minor>.<patch> >>> Most of our 3 month release will be counted as<minor> release unless >>> there >>> are major user facing/disruptive changes. >>> >>> Nov release will be 1.0.0, Feb release will be 1.1.0. There will be 1.0.1, >>> 1.1.1 etc for bug fixes. >>> >>> I personally prefer scheme 2, increasing major version too frequently >>> might >>> be confusing to users. How's other folks feel? >>> >>> Daniel >>> >>> >>> On Sat, Oct 22, 2011 at 2:31 AM, Gianmarco De Francisci Morales< >>> [EMAIL PROTECTED]> wrote: >>> >>>> Hi, >>>> >>>> just my 2 cents. >>>> >>>> I think the issue here is not 1.0 vs 0.10, but what's the versioning >>> scheme >>>> we want to use for Pig. >>>> Up to now it has been just an increasing number after a '0.' prefix, >>>> changed >>>> when the community felt it was time. I think this works well for a small >>>> project, but it is somewhat fuzzy. >>>> >>>> I like the idea of having<major>.<minor>.<patch> versions like many >>> other >>>> projects. It's a very clear and almost standard way of versioning a >>> piece >>>> of >>>> software. It has clear rules on when to change each of the numbers, and >>>> lets >>>> the user get an idea of backward compatibility at a glance. >>>> >>>> So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we >>> decide >>>> a clear versioning policy (whichever it is). >>>> So that the 1.0 milestone would mark the beginning of our new policy. >>>> >>>> Cheers, >>>> -- >>>> Gianmarco >>>> >>>> >>>> >>>> On Fri, Oct 21, 2011 at 23:10,<[EMAIL PROTECTED]> wrote: >>>> >>>>> If one were to rewrite input and output formats to use the webhdfs:// >>>>> APIs, this would not be an issue, right ? >>>>> >>>>> - milind >>>>> >>>>> >>>>> On 10/21/11 1:50 PM, "Santhosh Srinivasan"<[EMAIL PROTECTED]> wrote: >>>>> >>>>>> If I was not clear in my earlier email, I apologize for the lack of >>>>>> clarity. I am no longer in favour of waiting for Hadoop API stability +
Thejas Nair 2011-10-25, 01:10
-
Re: Next Pig release proposalDmitriy Ryaboy 2011-10-25, 01:45
I am just saying based on what's in trunk, 10.0 should not be 1.0. I am not
sure alphas and betas will work cause people just won't install them... Olga -- my diagramming skills leave something to be desired :). I was just saying we let 0.10 stabilize (via a few dot releases), then move all versions up in one fell swoop -- so 0.10 line becomes 1.0, 0.11 becomes 1.1, and if at that point 0.12 also exists, it becomes 1.2. D On Mon, Oct 24, 2011 at 6:10 PM, Thejas Nair <[EMAIL PROTECTED]> wrote: > Dmitriy, > I think what you are saying is something similar to alpha/beta releases. > (maybe beta1, beta2 .. is better). > So the first release could be 1.0.0_beta1. I scheme will be easier for > users to understand. > But I am not sure what the criteria for promoting a release from betaX to > general release should be. > > > Thanks, > Thejas > > > > On 10/24/11 5:38 PM, Dmitriy Ryaboy wrote: > >> To be a little more concrete about what I am saying here -- I don't think >> we >> should put a "1.0" label on any *.0 release. 0.8.1 is pretty solid; 0.9.0 >> has some holes, 0.9.1 is better. If we put 1.0 on what is currently being >> thought of as 0.10, it will have some stability / usability issues (things >> tend to show up after we make a release and people in the wild start >> trying >> it), and those issues will make a poor impression on those who expect 1.0 >> to >> be shiny and polished after so much time. I'm in favor of waiting a couple >> of dot releases, promoting a stabilized release into 1.0, and going from >> there. So, pictorially: >> >> -- trunk --- 0.11-dev ----------0.12-dev------------**------| 1.2-dev! >> \ \ >> \ \ ---------------- 0.11.0 --------------------| >> 1.1.0! >> \ >> \------- 0.10.0 ------- 0.10.1 ------- 0.10.2 --------| 1.0.0 !! >> >> On Mon, Oct 24, 2011 at 12:43 PM, Dmitriy Ryaboy<[EMAIL PROTECTED]> >> wrote: >> >> I am good with Scheme 2. >>> >>> We are finding a fair number of issues trying to move from Pig 0.8.1 to >>> 0.9, and I don't think those issues are fixed in 10, either.. not sure >>> that >>> this "stabilization" process has happened yet. >>> >>> D >>> >>> >>> On Mon, Oct 24, 2011 at 11:59 AM, Daniel Dai<[EMAIL PROTECTED]>** >>> wrote: >>> >>> Yes, we need a versioning scheme. There are two versioning scheme I can >>>> think of: >>>> >>>> Scheme 1: >>>> <major>.<patch> >>>> <major> will be the feature rich release every 3 month >>>> <patch> will be the bug fix release when necessary >>>> >>>> Nov release will be 1.0, Feb release will be 2.0. There will be 1.1, 2.1 >>>> etc >>>> for bug fixes. >>>> >>>> Scheme 2: >>>> <major>.<minor>.<patch> >>>> Most of our 3 month release will be counted as<minor> release unless >>>> there >>>> are major user facing/disruptive changes. >>>> >>>> Nov release will be 1.0.0, Feb release will be 1.1.0. There will be >>>> 1.0.1, >>>> 1.1.1 etc for bug fixes. >>>> >>>> I personally prefer scheme 2, increasing major version too frequently >>>> might >>>> be confusing to users. How's other folks feel? >>>> >>>> Daniel >>>> >>>> >>>> On Sat, Oct 22, 2011 at 2:31 AM, Gianmarco De Francisci Morales< >>>> [EMAIL PROTECTED]> wrote: >>>> >>>> Hi, >>>>> >>>>> just my 2 cents. >>>>> >>>>> I think the issue here is not 1.0 vs 0.10, but what's the versioning >>>>> >>>> scheme >>>> >>>>> we want to use for Pig. >>>>> Up to now it has been just an increasing number after a '0.' prefix, >>>>> changed >>>>> when the community felt it was time. I think this works well for a >>>>> small >>>>> project, but it is somewhat fuzzy. >>>>> >>>>> I like the idea of having<major>.<minor>.<patch> versions like many >>>>> >>>> other >>>> >>>>> projects. It's a very clear and almost standard way of versioning a >>>>> >>>> piece >>>> >>>>> of >>>>> software. It has clear rules on when to change each of the numbers, and >>>>> lets >>>>> the user get an idea of backward compatibility at a glance. >>>>> >>>>> So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we +
Dmitriy Ryaboy 2011-10-25, 01:45
-
RE: Next Pig release proposalSanthosh Srinivasan 2011-10-25, 06:53
My understanding of Dmitriy's proposal is as follows:
1. We need to establish (more) stability before we transition to 1.0 2. Assuming that we have satisfied the constraints in point #1 and that we are calling the next release 0.10.0, we could have the following situation: i. Trunk ii. 0.11.0 branch iii. 0.10.0 branch iv. 0.9.X branch 3. The next release on trunk will be 1.2 4. The next dot release on the 0.11.0 branch will be 1.1, the next dot release on 0.10.0 will be 1.0, etc. I am with Dmitriy on the first two points. For the latter two points, I would approach it as follows: 1. The next release on trunk will 1.0 2. The 0.11.0 dot releases will continue with 0.11.1, 0.11.2, etc., 0.10.0 dot releases will continue with 0.10.1, 0.10.2, etc. 3. All subsequent releases based off of trunk and the 1.0 branch will bear the 1.X.Y signature till we hit the next major release Santhosh -----Original Message----- From: Dmitriy Ryaboy [mailto:[EMAIL PROTECTED]] Sent: Monday, October 24, 2011 6:46 PM To: [EMAIL PROTECTED] Subject: Re: Next Pig release proposal I am just saying based on what's in trunk, 10.0 should not be 1.0. I am not sure alphas and betas will work cause people just won't install them... Olga -- my diagramming skills leave something to be desired :). I was just saying we let 0.10 stabilize (via a few dot releases), then move all versions up in one fell swoop -- so 0.10 line becomes 1.0, 0.11 becomes 1.1, and if at that point 0.12 also exists, it becomes 1.2. D On Mon, Oct 24, 2011 at 6:10 PM, Thejas Nair <[EMAIL PROTECTED]> wrote: > Dmitriy, > I think what you are saying is something similar to alpha/beta releases. > (maybe beta1, beta2 .. is better). > So the first release could be 1.0.0_beta1. I scheme will be easier for > users to understand. > But I am not sure what the criteria for promoting a release from betaX > to general release should be. > > > Thanks, > Thejas > > > > On 10/24/11 5:38 PM, Dmitriy Ryaboy wrote: > >> To be a little more concrete about what I am saying here -- I don't >> think we should put a "1.0" label on any *.0 release. 0.8.1 is pretty >> solid; 0.9.0 has some holes, 0.9.1 is better. If we put 1.0 on what >> is currently being thought of as 0.10, it will have some stability / >> usability issues (things tend to show up after we make a release and >> people in the wild start trying it), and those issues will make a >> poor impression on those who expect 1.0 to be shiny and polished >> after so much time. I'm in favor of waiting a couple of dot releases, >> promoting a stabilized release into 1.0, and going from there. So, >> pictorially: >> >> -- trunk --- 0.11-dev ----------0.12-dev------------**------| 1.2-dev! >> \ \ >> \ \ ---------------- 0.11.0 --------------------| >> 1.1.0! >> \ >> \------- 0.10.0 ------- 0.10.1 ------- 0.10.2 --------| 1.0.0 !! >> >> On Mon, Oct 24, 2011 at 12:43 PM, Dmitriy Ryaboy<[EMAIL PROTECTED]> >> wrote: >> >> I am good with Scheme 2. >>> >>> We are finding a fair number of issues trying to move from Pig 0.8.1 >>> to 0.9, and I don't think those issues are fixed in 10, either.. not >>> sure that this "stabilization" process has happened yet. >>> >>> D >>> >>> >>> On Mon, Oct 24, 2011 at 11:59 AM, Daniel >>> Dai<[EMAIL PROTECTED]>** >>> wrote: >>> >>> Yes, we need a versioning scheme. There are two versioning scheme I >>> can >>>> think of: >>>> >>>> Scheme 1: >>>> <major>.<patch> >>>> <major> will be the feature rich release every 3 month <patch> >>>> will be the bug fix release when necessary >>>> >>>> Nov release will be 1.0, Feb release will be 2.0. There will be >>>> 1.1, 2.1 etc for bug fixes. >>>> >>>> Scheme 2: >>>> <major>.<minor>.<patch> >>>> Most of our 3 month release will be counted as<minor> release >>>> unless there are major user facing/disruptive changes. >>>> >>>> Nov release will be 1.0.0, Feb release will be 1.1.0. There will be >>>> 1.0.1, +
Santhosh Srinivasan 2011-10-25, 06:53
-
Re: Next Pig release proposalDmitriy Ryaboy 2011-10-25, 07:30
Thanks Santhosh, you understood my meaning precisely.
I believe that unlike other releases, 1.0 is "special" in people's minds, it's a "we are ready" label. I don't think we should promote trunk, even in gloriously stable future, to 1.0, for the same reason I don't think we should promote the current trunk to 1.0 (but would be ok with, say, promoting 9.2...) -- our 1.0 release should be stable, and trunk is not by nature, even when we make a release off it. I would prefer we not tie our hands in search of stability and avoid adding new features / refactors / etc in trunk because it's got the 1.0 label hanging over it. We already have a history of policing what goes into dot-dot releases (8.1, 9.1), and I think those have been working well for creating more stable releases while we can do more radical stuff on trunk. On Mon, Oct 24, 2011 at 11:53 PM, Santhosh Srinivasan <[EMAIL PROTECTED]> wrote: > My understanding of Dmitriy's proposal is as follows: > > 1. We need to establish (more) stability before we transition to 1.0 > 2. Assuming that we have satisfied the constraints in point #1 and that we are calling the next release 0.10.0, we could have the following situation: > i. Trunk > ii. 0.11.0 branch > iii. 0.10.0 branch > iv. 0.9.X branch > 3. The next release on trunk will be 1.2 > 4. The next dot release on the 0.11.0 branch will be 1.1, the next dot release on 0.10.0 will be 1.0, etc. > > I am with Dmitriy on the first two points. For the latter two points, I would approach it as follows: > > 1. The next release on trunk will 1.0 > 2. The 0.11.0 dot releases will continue with 0.11.1, 0.11.2, etc., 0.10.0 dot releases will continue with 0.10.1, 0.10.2, etc. > 3. All subsequent releases based off of trunk and the 1.0 branch will bear the 1.X.Y signature till we hit the next major release > > Santhosh > > -----Original Message----- > From: Dmitriy Ryaboy [mailto:[EMAIL PROTECTED]] > Sent: Monday, October 24, 2011 6:46 PM > To: [EMAIL PROTECTED] > Subject: Re: Next Pig release proposal > > I am just saying based on what's in trunk, 10.0 should not be 1.0. I am not sure alphas and betas will work cause people just won't install them... > > Olga -- my diagramming skills leave something to be desired :). I was just saying we let 0.10 stabilize (via a few dot releases), then move all versions up in one fell swoop -- so 0.10 line becomes 1.0, 0.11 becomes 1.1, and if at that point 0.12 also exists, it becomes 1.2. > > D > > On Mon, Oct 24, 2011 at 6:10 PM, Thejas Nair <[EMAIL PROTECTED]> wrote: > >> Dmitriy, >> I think what you are saying is something similar to alpha/beta releases. >> (maybe beta1, beta2 .. is better). >> So the first release could be 1.0.0_beta1. I scheme will be easier for >> users to understand. >> But I am not sure what the criteria for promoting a release from betaX >> to general release should be. >> >> >> Thanks, >> Thejas >> >> >> >> On 10/24/11 5:38 PM, Dmitriy Ryaboy wrote: >> >>> To be a little more concrete about what I am saying here -- I don't >>> think we should put a "1.0" label on any *.0 release. 0.8.1 is pretty >>> solid; 0.9.0 has some holes, 0.9.1 is better. If we put 1.0 on what >>> is currently being thought of as 0.10, it will have some stability / >>> usability issues (things tend to show up after we make a release and >>> people in the wild start trying it), and those issues will make a >>> poor impression on those who expect 1.0 to be shiny and polished >>> after so much time. I'm in favor of waiting a couple of dot releases, >>> promoting a stabilized release into 1.0, and going from there. So, >>> pictorially: >>> >>> -- trunk --- 0.11-dev ----------0.12-dev------------**------| 1.2-dev! >>> \ \ >>> \ \ ---------------- 0.11.0 --------------------| >>> 1.1.0! >>> \ >>> \------- 0.10.0 ------- 0.10.1 ------- 0.10.2 --------| 1.0.0 !! >>> >>> On Mon, Oct 24, 2011 at 12:43 PM, Dmitriy Ryaboy<[EMAIL PROTECTED]> +
Dmitriy Ryaboy 2011-10-25, 07:30
-
RE: Next Pig release proposalSanthosh Srinivasan 2011-10-25, 07:52
Fair enough. Now its boiling down to ...
Which release should be promoted to 1.0? What are the stability related concerns/bugs that need to be addressed before we promote the last release to 1.0? Santhosh -----Original Message----- From: Dmitriy Ryaboy [mailto:[EMAIL PROTECTED]] Sent: Tuesday, October 25, 2011 12:31 AM To: [EMAIL PROTECTED] Subject: Re: Next Pig release proposal Thanks Santhosh, you understood my meaning precisely. I believe that unlike other releases, 1.0 is "special" in people's minds, it's a "we are ready" label. I don't think we should promote trunk, even in gloriously stable future, to 1.0, for the same reason I don't think we should promote the current trunk to 1.0 (but would be ok with, say, promoting 9.2...) -- our 1.0 release should be stable, and trunk is not by nature, even when we make a release off it. I would prefer we not tie our hands in search of stability and avoid adding new features / refactors / etc in trunk because it's got the 1.0 label hanging over it. We already have a history of policing what goes into dot-dot releases (8.1, 9.1), and I think those have been working well for creating more stable releases while we can do more radical stuff on trunk. On Mon, Oct 24, 2011 at 11:53 PM, Santhosh Srinivasan <[EMAIL PROTECTED]> wrote: > My understanding of Dmitriy's proposal is as follows: > > 1. We need to establish (more) stability before we transition to 1.0 > 2. Assuming that we have satisfied the constraints in point #1 and that we are calling the next release 0.10.0, we could have the following situation: > i. Trunk > ii. 0.11.0 branch > iii. 0.10.0 branch > iv. 0.9.X branch > 3. The next release on trunk will be 1.2 > 4. The next dot release on the 0.11.0 branch will be 1.1, the next dot release on 0.10.0 will be 1.0, etc. > > I am with Dmitriy on the first two points. For the latter two points, I would approach it as follows: > > 1. The next release on trunk will 1.0 > 2. The 0.11.0 dot releases will continue with 0.11.1, 0.11.2, etc., 0.10.0 dot releases will continue with 0.10.1, 0.10.2, etc. > 3. All subsequent releases based off of trunk and the 1.0 branch will bear the 1.X.Y signature till we hit the next major release > > Santhosh > > -----Original Message----- > From: Dmitriy Ryaboy [mailto:[EMAIL PROTECTED]] > Sent: Monday, October 24, 2011 6:46 PM > To: [EMAIL PROTECTED] > Subject: Re: Next Pig release proposal > > I am just saying based on what's in trunk, 10.0 should not be 1.0. I am not sure alphas and betas will work cause people just won't install them... > > Olga -- my diagramming skills leave something to be desired :). I was just saying we let 0.10 stabilize (via a few dot releases), then move all versions up in one fell swoop -- so 0.10 line becomes 1.0, 0.11 becomes 1.1, and if at that point 0.12 also exists, it becomes 1.2. > > D > > On Mon, Oct 24, 2011 at 6:10 PM, Thejas Nair <[EMAIL PROTECTED]> wrote: > >> Dmitriy, >> I think what you are saying is something similar to alpha/beta releases. >> (maybe beta1, beta2 .. is better). >> So the first release could be 1.0.0_beta1. I scheme will be easier for >> users to understand. >> But I am not sure what the criteria for promoting a release from betaX >> to general release should be. >> >> >> Thanks, >> Thejas >> >> >> >> On 10/24/11 5:38 PM, Dmitriy Ryaboy wrote: >> >>> To be a little more concrete about what I am saying here -- I don't >>> think we should put a "1.0" label on any *.0 release. 0.8.1 is pretty >>> solid; 0.9.0 has some holes, 0.9.1 is better. If we put 1.0 on what >>> is currently being thought of as 0.10, it will have some stability / >>> usability issues (things tend to show up after we make a release and >>> people in the wild start trying it), and those issues will make a >>> poor impression on those who expect 1.0 to be shiny and polished >>> after so much time. I'm in favor of waiting a couple of dot releases, >>> promoting a stabilized release into 1.0, and going from there. So, +
Santhosh Srinivasan 2011-10-25, 07:52
-
Re: Next Pig release proposalThejas Nair 2011-10-25, 17:01
Dmitriy,
I haven't understood how you propose the code in future trunk would get into 1.x releases, once the 1.0 is out. Will it be possible to meet the stability criteria that you are applying to 1.0 for all 1.x releases ? Are you suggesting that all future releases go into a 0.x release before going into 1.x release ? -Thejas On 10/25/11 12:30 AM, Dmitriy Ryaboy wrote: > Thanks Santhosh, you understood my meaning precisely. > > I believe that unlike other releases, 1.0 is "special" in people's > minds, it's a "we are ready" label. I don't think we should promote > trunk, even in gloriously stable future, to 1.0, for the same reason I > don't think we should promote the current trunk to 1.0 (but would be > ok with, say, promoting 9.2...) -- our 1.0 release should be stable, > and trunk is not by nature, even when we make a release off it. I > would prefer we not tie our hands in search of stability and avoid > adding new features / refactors / etc in trunk because it's got the > 1.0 label hanging over it. We already have a history of policing what > goes into dot-dot releases (8.1, 9.1), and I think those have been > working well for creating more stable releases while we can do more > radical stuff on trunk. > > On Mon, Oct 24, 2011 at 11:53 PM, Santhosh Srinivasan<[EMAIL PROTECTED]> wrote: >> My understanding of Dmitriy's proposal is as follows: >> >> 1. We need to establish (more) stability before we transition to 1.0 >> 2. Assuming that we have satisfied the constraints in point #1 and that we are calling the next release 0.10.0, we could have the following situation: >> i. Trunk >> ii. 0.11.0 branch >> iii. 0.10.0 branch >> iv. 0.9.X branch >> 3. The next release on trunk will be 1.2 >> 4. The next dot release on the 0.11.0 branch will be 1.1, the next dot release on 0.10.0 will be 1.0, etc. >> >> I am with Dmitriy on the first two points. For the latter two points, I would approach it as follows: >> >> 1. The next release on trunk will 1.0 >> 2. The 0.11.0 dot releases will continue with 0.11.1, 0.11.2, etc., 0.10.0 dot releases will continue with 0.10.1, 0.10.2, etc. >> 3. All subsequent releases based off of trunk and the 1.0 branch will bear the 1.X.Y signature till we hit the next major release >> >> Santhosh >> >> -----Original Message----- >> From: Dmitriy Ryaboy [mailto:[EMAIL PROTECTED]] >> Sent: Monday, October 24, 2011 6:46 PM >> To: [EMAIL PROTECTED] >> Subject: Re: Next Pig release proposal >> >> I am just saying based on what's in trunk, 10.0 should not be 1.0. I am not sure alphas and betas will work cause people just won't install them... >> >> Olga -- my diagramming skills leave something to be desired :). I was just saying we let 0.10 stabilize (via a few dot releases), then move all versions up in one fell swoop -- so 0.10 line becomes 1.0, 0.11 becomes 1.1, and if at that point 0.12 also exists, it becomes 1.2. >> >> D >> >> On Mon, Oct 24, 2011 at 6:10 PM, Thejas Nair<[EMAIL PROTECTED]> wrote: >> >>> Dmitriy, >>> I think what you are saying is something similar to alpha/beta releases. >>> (maybe beta1, beta2 .. is better). >>> So the first release could be 1.0.0_beta1. I scheme will be easier for >>> users to understand. >>> But I am not sure what the criteria for promoting a release from betaX >>> to general release should be. >>> >>> >>> Thanks, >>> Thejas >>> >>> >>> >>> On 10/24/11 5:38 PM, Dmitriy Ryaboy wrote: >>> >>>> To be a little more concrete about what I am saying here -- I don't >>>> think we should put a "1.0" label on any *.0 release. 0.8.1 is pretty >>>> solid; 0.9.0 has some holes, 0.9.1 is better. If we put 1.0 on what >>>> is currently being thought of as 0.10, it will have some stability / >>>> usability issues (things tend to show up after we make a release and >>>> people in the wild start trying it), and those issues will make a >>>> poor impression on those who expect 1.0 to be shiny and polished >>>> after so much time. I'm in favor of waiting a couple of dot releases, +
Thejas Nair 2011-10-25, 17:01
-
RE: Next Pig release proposalOlga Natkovich 2011-10-25, 17:37
I don't see this discussion really converging any time soon and I don't want to see the release delayed by weeks because we can't agree on the label.
I am going to branch as 0.10 tomorrow and keep the proposed schedule unless I hear otherwise. Olga -----Original Message----- From: Thejas Nair [mailto:[EMAIL PROTECTED]] Sent: Tuesday, October 25, 2011 10:01 AM To: [EMAIL PROTECTED] Subject: Re: Next Pig release proposal Dmitriy, I haven't understood how you propose the code in future trunk would get into 1.x releases, once the 1.0 is out. Will it be possible to meet the stability criteria that you are applying to 1.0 for all 1.x releases ? Are you suggesting that all future releases go into a 0.x release before going into 1.x release ? -Thejas On 10/25/11 12:30 AM, Dmitriy Ryaboy wrote: > Thanks Santhosh, you understood my meaning precisely. > > I believe that unlike other releases, 1.0 is "special" in people's > minds, it's a "we are ready" label. I don't think we should promote > trunk, even in gloriously stable future, to 1.0, for the same reason I > don't think we should promote the current trunk to 1.0 (but would be > ok with, say, promoting 9.2...) -- our 1.0 release should be stable, > and trunk is not by nature, even when we make a release off it. I > would prefer we not tie our hands in search of stability and avoid > adding new features / refactors / etc in trunk because it's got the > 1.0 label hanging over it. We already have a history of policing what > goes into dot-dot releases (8.1, 9.1), and I think those have been > working well for creating more stable releases while we can do more > radical stuff on trunk. > > On Mon, Oct 24, 2011 at 11:53 PM, Santhosh Srinivasan<[EMAIL PROTECTED]> wrote: >> My understanding of Dmitriy's proposal is as follows: >> >> 1. We need to establish (more) stability before we transition to 1.0 >> 2. Assuming that we have satisfied the constraints in point #1 and that we are calling the next release 0.10.0, we could have the following situation: >> i. Trunk >> ii. 0.11.0 branch >> iii. 0.10.0 branch >> iv. 0.9.X branch >> 3. The next release on trunk will be 1.2 >> 4. The next dot release on the 0.11.0 branch will be 1.1, the next dot release on 0.10.0 will be 1.0, etc. >> >> I am with Dmitriy on the first two points. For the latter two points, I would approach it as follows: >> >> 1. The next release on trunk will 1.0 >> 2. The 0.11.0 dot releases will continue with 0.11.1, 0.11.2, etc., 0.10.0 dot releases will continue with 0.10.1, 0.10.2, etc. >> 3. All subsequent releases based off of trunk and the 1.0 branch will bear the 1.X.Y signature till we hit the next major release >> >> Santhosh >> >> -----Original Message----- >> From: Dmitriy Ryaboy [mailto:[EMAIL PROTECTED]] >> Sent: Monday, October 24, 2011 6:46 PM >> To: [EMAIL PROTECTED] >> Subject: Re: Next Pig release proposal >> >> I am just saying based on what's in trunk, 10.0 should not be 1.0. I am not sure alphas and betas will work cause people just won't install them... >> >> Olga -- my diagramming skills leave something to be desired :). I was just saying we let 0.10 stabilize (via a few dot releases), then move all versions up in one fell swoop -- so 0.10 line becomes 1.0, 0.11 becomes 1.1, and if at that point 0.12 also exists, it becomes 1.2. >> >> D >> >> On Mon, Oct 24, 2011 at 6:10 PM, Thejas Nair<[EMAIL PROTECTED]> wrote: >> >>> Dmitriy, >>> I think what you are saying is something similar to alpha/beta releases. >>> (maybe beta1, beta2 .. is better). >>> So the first release could be 1.0.0_beta1. I scheme will be easier for >>> users to understand. >>> But I am not sure what the criteria for promoting a release from betaX >>> to general release should be. >>> >>> >>> Thanks, >>> Thejas >>> >>> >>> >>> On 10/24/11 5:38 PM, Dmitriy Ryaboy wrote: >>> >>>> To be a little more concrete about what I am saying here -- I don't >>>> think we should put a "1.0" label on any *.0 release. 0.8.1 is pretty +
Olga Natkovich 2011-10-25, 17:37
-
RE: Next Pig release proposalOlga Natkovich 2011-10-24, 20:12
I am also is in favor of #2 and assumed that's the way we go:
(1) Major version change: major disruptive or non-backward compatible changes (2) Minor version changes: small features + bug fixes; *no breakage of backward compatibility* (3) Patch version changes: P1 bug fixes; no new features, no compatibility breakage. Olga -----Original Message----- From: Daniel Dai [mailto:[EMAIL PROTECTED]] Sent: Monday, October 24, 2011 12:00 PM To: [EMAIL PROTECTED] Subject: Re: Next Pig release proposal Yes, we need a versioning scheme. There are two versioning scheme I can think of: Scheme 1: <major>.<patch> <major> will be the feature rich release every 3 month <patch> will be the bug fix release when necessary Nov release will be 1.0, Feb release will be 2.0. There will be 1.1, 2.1 etc for bug fixes. Scheme 2: <major>.<minor>.<patch> Most of our 3 month release will be counted as <minor> release unless there are major user facing/disruptive changes. Nov release will be 1.0.0, Feb release will be 1.1.0. There will be 1.0.1, 1.1.1 etc for bug fixes. I personally prefer scheme 2, increasing major version too frequently might be confusing to users. How's other folks feel? Daniel On Sat, Oct 22, 2011 at 2:31 AM, Gianmarco De Francisci Morales < [EMAIL PROTECTED]> wrote: > Hi, > > just my 2 cents. > > I think the issue here is not 1.0 vs 0.10, but what's the versioning scheme > we want to use for Pig. > Up to now it has been just an increasing number after a '0.' prefix, > changed > when the community felt it was time. I think this works well for a small > project, but it is somewhat fuzzy. > > I like the idea of having <major>.<minor>.<patch> versions like many other > projects. It's a very clear and almost standard way of versioning a piece > of > software. It has clear rules on when to change each of the numbers, and > lets > the user get an idea of backward compatibility at a glance. > > So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we decide > a clear versioning policy (whichever it is). > So that the 1.0 milestone would mark the beginning of our new policy. > > Cheers, > -- > Gianmarco > > > > On Fri, Oct 21, 2011 at 23:10, <[EMAIL PROTECTED]> wrote: > > > If one were to rewrite input and output formats to use the webhdfs:// > > APIs, this would not be an issue, right ? > > > > - milind > > > > > > On 10/21/11 1:50 PM, "Santhosh Srinivasan" <[EMAIL PROTECTED]> wrote: > > > > >If I was not clear in my earlier email, I apologize for the lack of > > >clarity. I am no longer in favour of waiting for Hadoop API stability > > >across Hadoop versions. It's a pipe dream. > > > > > >When we had PigInputFormat and PigOutputFormat, your reasoning would be > > >spot on. I am concerned about the following. Our tight integration with > > >Hadoop due to the use of Input and Output format might lead to a break > in > > >backward compatibility. I am not sure if the comparison with that of > Java > > >is valid. Probably a majority of the users don't use JNI. Its very hard > > >to use Pig without writing custom load and store functions. The default > > >load and store don't suffice for a majority of use cases that I have > > >observed. > > > > > >I am trying to get all factors that might influence this decision. From > > >the few emails that have been exchanged since yesterday, we have the > > >following factors: > > > > > >1. Hadoop 0.20.205 (support for Append) > > >2. Hadoop 0.22 > > >3. Hadoop 0.23 > > >4. Maturity of the new parser > > >5. Stability of the new logical plan > > >6. Other components in the eco system. > > > - Avro (1.5.4, 1.4.1, ...) > > > - Cassandra (1.0.0, 0.8.7, ...) > > > - Chukwa (0.4.0, 0.3.0, ...) > > > - Hama (0.3.0, 0.2.0, ...) > > > - Hbase (0.90.4, 0.90.3, 0.90.2, 0.90.1, ...) > > > - Hive (Releases - 0.7.1, 0.7.0, 0.6.0, ...) > > > - Zookeeper (3.3.3, 3.3.2, 3.2.2, 3.1.2, ...) > > > > > >Santhosh > > > > > > > > >-----Original Message-- +
Olga Natkovich 2011-10-24, 20:12
-
Re: Next Pig release proposalScott Carey 2011-10-24, 23:05
On 10/24/11 1:12 PM, "Olga Natkovich" <[EMAIL PROTECTED]> wrote: >I am also is in favor of #2 and assumed that's the way we go: This is exactly what is needed IMO. Define what is acceptable for a major, minor, and patch change, but leave many of the minor details flexible for the project. "1.0" is just a number, its not a promise that there won't be API changes, that the project is 'production ready', or that there won't be a 2.0 any time soon. > >(1) Major version change: major disruptive or non-backward compatible >changes >(2) Minor version changes: small features + bug fixes; *no breakage of >backward compatibility* >(3) Patch version changes: P1 bug fixes; no new features, no >compatibility breakage. Minor version: "small features" is not well defined. I'd suggest leaving that out. A feature that does not affect backwards compatibility at all may be acceptable regardless of its subjective 'size'. I suggest the minor version definition should be broad: A minor release is a release that is neither a bug fix release nor major version release. It may include big new features, but is not a disruptive upgrade -- Users are not expected to have to modify any Pig scripts or UDFs. This allows major work to progress across many minor releases as long as the old APIs are still available. At a major release, users may have to change their Pig scrips, UDF classes, or APIs may be removed. I also suggest leaving out "no new features" from the Patch version definition -- a 'new feature' is often poorly defined and people will debate its meaning. Is an additional command line parameter that helps out some users in a backwards-compatible fashion a 'new feature' or a 'bug fix'? Its subjective. What is important is that what is in the patch version is very low risk and does not break compatibility in any way -- compatibility is objective, and 'low risk' although subjective, is more reasonable for committers to discuss than whether it is in the 'bug fix' or 'feature' category. -Scott > >Olga > >-----Original Message----- >From: Daniel Dai [mailto:[EMAIL PROTECTED]] >Sent: Monday, October 24, 2011 12:00 PM >To: [EMAIL PROTECTED] >Subject: Re: Next Pig release proposal > >Yes, we need a versioning scheme. There are two versioning scheme I can >think of: > >Scheme 1: ><major>.<patch> ><major> will be the feature rich release every 3 month ><patch> will be the bug fix release when necessary > >Nov release will be 1.0, Feb release will be 2.0. There will be 1.1, 2.1 >etc >for bug fixes. > >Scheme 2: ><major>.<minor>.<patch> >Most of our 3 month release will be counted as <minor> release unless >there >are major user facing/disruptive changes. > >Nov release will be 1.0.0, Feb release will be 1.1.0. There will be 1.0.1, >1.1.1 etc for bug fixes. > >I personally prefer scheme 2, increasing major version too frequently >might >be confusing to users. How's other folks feel? > >Daniel > > >On Sat, Oct 22, 2011 at 2:31 AM, Gianmarco De Francisci Morales < >[EMAIL PROTECTED]> wrote: > >> Hi, >> >> just my 2 cents. >> >> I think the issue here is not 1.0 vs 0.10, but what's the versioning >>scheme >> we want to use for Pig. >> Up to now it has been just an increasing number after a '0.' prefix, >> changed >> when the community felt it was time. I think this works well for a small >> project, but it is somewhat fuzzy. >> >> I like the idea of having <major>.<minor>.<patch> versions like many >>other >> projects. It's a very clear and almost standard way of versioning a >>piece >> of >> software. It has clear rules on when to change each of the numbers, and >> lets >> the user get an idea of backward compatibility at a glance. >> >> So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we >>decide >> a clear versioning policy (whichever it is). >> So that the 1.0 milestone would mark the beginning of our new policy. >> >> Cheers, >> -- >> Gianmarco >> >> >> >> On Fri, Oct 21, 2011 at 23:10, <[EMAIL PROTECTED]> wrote: +
Scott Carey 2011-10-24, 23:05
-
Re: Next Pig release proposalGianmarco De Francisci Mo... 2011-10-25, 09:22
Given that the majority agrees on option #2, I would suggest to formalize it
and put it in an official document. Are the bylaws the right place? This doesn't solve the 1.0(.0) issue but at least we have a clear path ahead. Cheers, -- Gianmarco De Francisci Morales On Mon, Oct 24, 2011 at 22:12, Olga Natkovich <[EMAIL PROTECTED]> wrote: > I am also is in favor of #2 and assumed that's the way we go: > > (1) Major version change: major disruptive or non-backward compatible > changes > (2) Minor version changes: small features + bug fixes; *no breakage of > backward compatibility* > (3) Patch version changes: P1 bug fixes; no new features, no compatibility > breakage. > > Olga > > -----Original Message----- > From: Daniel Dai [mailto:[EMAIL PROTECTED]] > Sent: Monday, October 24, 2011 12:00 PM > To: [EMAIL PROTECTED] > Subject: Re: Next Pig release proposal > > Yes, we need a versioning scheme. There are two versioning scheme I can > think of: > > Scheme 1: > <major>.<patch> > <major> will be the feature rich release every 3 month > <patch> will be the bug fix release when necessary > > Nov release will be 1.0, Feb release will be 2.0. There will be 1.1, 2.1 > etc > for bug fixes. > > Scheme 2: > <major>.<minor>.<patch> > Most of our 3 month release will be counted as <minor> release unless there > are major user facing/disruptive changes. > > Nov release will be 1.0.0, Feb release will be 1.1.0. There will be 1.0.1, > 1.1.1 etc for bug fixes. > > I personally prefer scheme 2, increasing major version too frequently might > be confusing to users. How's other folks feel? > > Daniel > > > On Sat, Oct 22, 2011 at 2:31 AM, Gianmarco De Francisci Morales < > [EMAIL PROTECTED]> wrote: > > > Hi, > > > > just my 2 cents. > > > > I think the issue here is not 1.0 vs 0.10, but what's the versioning > scheme > > we want to use for Pig. > > Up to now it has been just an increasing number after a '0.' prefix, > > changed > > when the community felt it was time. I think this works well for a small > > project, but it is somewhat fuzzy. > > > > I like the idea of having <major>.<minor>.<patch> versions like many > other > > projects. It's a very clear and almost standard way of versioning a piece > > of > > software. It has clear rules on when to change each of the numbers, and > > lets > > the user get an idea of backward compatibility at a glance. > > > > So, to conclude, I am in favor of going 1.0 (or 1.0.0) as long as we > decide > > a clear versioning policy (whichever it is). > > So that the 1.0 milestone would mark the beginning of our new policy. > > > > Cheers, > > -- > > Gianmarco > > > > > > > > On Fri, Oct 21, 2011 at 23:10, <[EMAIL PROTECTED]> wrote: > > > > > If one were to rewrite input and output formats to use the webhdfs:// > > > APIs, this would not be an issue, right ? > > > > > > - milind > > > > > > > > > On 10/21/11 1:50 PM, "Santhosh Srinivasan" <[EMAIL PROTECTED]> wrote: > > > > > > >If I was not clear in my earlier email, I apologize for the lack of > > > >clarity. I am no longer in favour of waiting for Hadoop API stability > > > >across Hadoop versions. It's a pipe dream. > > > > > > > >When we had PigInputFormat and PigOutputFormat, your reasoning would > be > > > >spot on. I am concerned about the following. Our tight integration > with > > > >Hadoop due to the use of Input and Output format might lead to a break > > in > > > >backward compatibility. I am not sure if the comparison with that of > > Java > > > >is valid. Probably a majority of the users don't use JNI. Its very > hard > > > >to use Pig without writing custom load and store functions. The > default > > > >load and store don't suffice for a majority of use cases that I have > > > >observed. > > > > > > > >I am trying to get all factors that might influence this decision. > From > > > >the few emails that have been exchanged since yesterday, we have the > > > >following factors: > > > > > > > >1. Hadoop 0.20.205 (support for Append) > > > >2. Hadoop 0.22 +
Gianmarco De Francisci Mo... 2011-10-25, 09:22
-
Re: Next Pig release proposalAlan Gates 2011-10-21, 18:00
On Oct 20, 2011, at 4:58 PM, Santhosh Srinivasan wrote: > Ref: http://tinyurl.com/4ng8upa (last discussion on 1.0) > > How far have we progressed from our last discussion in March. There was no consensus on the 1.0 release. Opinions ranged from having more releases to bake in the maturity of the new parser and logical plan changes to compatibility with Hadoop API (was compared to Social Security - a very hot topic these days). > > My concerns were around Hadoop API stability. I have heard that the APIs will not be stable for at least 1 year. This is taking me away from the Hadoop API stability factor (They passed healthcare in that duration. Really!) Do we want compatibility with 0.23 as a gating factor - not sure if this is anywhere close to getting done in the near future. Will we support append (0.20.205)? As Thejas points out, we'll be supporting both for some time. Given the size of the Hadoop user base, even assuming 0.23 is an instant success, the 0.20.x line will be around for two years or more. Pig releases will have to work with both 20 and 23 in this time. So I agree that support for 0.23 is not related to what we call our release. Also, some contributors seem interested in supporting Pig for Hadoop 0.22 (see https://issues.apache.org/jira/browse/PIG-2277), so we need to figure out how we make releases tailored to specific version of Hadoop. I agree we have not met Santhosh's criteria of Hadoop having a stable API, and I don't think we will for a couple more years. I believe we have met Dmitriy's criteria of letting the 0.9 changes bake for 6 months and not doing any more potentially destabilizing rewrites. So, here's my +1 for the branch and for going 1.0. Alan. > > Btw, Hbase has been doing 0.90.1, 0.90.2, etc. So we can take a look at this option too. > > Santhosh > > > > -----Original Message----- > From: Olga Natkovich [mailto:[EMAIL PROTECTED]] > Sent: Thursday, October 20, 2011 4:40 PM > To: [EMAIL PROTECTED] > Subject: Next Pig release proposal > > Hi, > > Here is what I propose we do for the next Pig release: > > > (1) Branch early next week - we have major features and many bug fixes in and will be fixing remaining bugs on the branch > > (2) Publish the release by 11/15 - that will give us a couple of weeks to stabilize the branch and get last minute bug fixes in > > (3) Make this release a 1.0 release. Reasons to go for 1.0 and not 0.10 > > a. This release has minimal number of features and was focused on code stabilization and bug fixes. We believe it will be a stable release > > b. It has no major interface changes > > c. Going from 0.10 to 1.0 would be very confusing > > Comments? > > Olga +
Alan Gates 2011-10-21, 18:00
-
RE: Next Pig release proposalOlga Natkovich 2011-10-25, 00:51
Based on the discussion - there does not seem to be a disagreement on the first 2 points but we still don't have a consensus on the version number. The problem is that we can't branch till we do.
I am going to send a separate email on the vote for the version number. According to our bylaws, we need Lazy Majority from Committers and the vote lasts 3 days. Olga -----Original Message----- From: Olga Natkovich [mailto:[EMAIL PROTECTED]] Sent: Thursday, October 20, 2011 4:40 PM To: [EMAIL PROTECTED] Subject: Next Pig release proposal Hi, Here is what I propose we do for the next Pig release: (1) Branch early next week - we have major features and many bug fixes in and will be fixing remaining bugs on the branch (2) Publish the release by 11/15 - that will give us a couple of weeks to stabilize the branch and get last minute bug fixes in (3) Make this release a 1.0 release. Reasons to go for 1.0 and not 0.10 a. This release has minimal number of features and was focused on code stabilization and bug fixes. We believe it will be a stable release b. It has no major interface changes c. Going from 0.10 to 1.0 would be very confusing Comments? Olga +
Olga Natkovich 2011-10-25, 00:51
|