|
Arun C Murthy
2011-05-02, 20:07
Doug Cutting
2011-05-02, 20:40
Arun C Murthy
2011-05-02, 21:05
Doug Cutting
2011-05-02, 21:21
Arun C Murthy
2011-05-02, 21:33
Doug Cutting
2011-05-02, 21:48
Ian Holsman
2011-05-02, 21:49
Doug Cutting
2011-05-02, 21:55
Andrew Purtell
2011-05-02, 22:05
Andrew Purtell
2011-05-02, 22:14
Arun C Murthy
2011-05-02, 22:15
Arun C Murthy
2011-05-02, 22:19
Eli Collins
2011-05-02, 22:19
Doug Cutting
2011-05-02, 22:25
Jake Cornelius
2011-05-02, 22:31
Konstantin Shvachko
2011-05-03, 08:39
Eli Collins
2011-05-03, 17:02
Todd Lipcon
2011-05-03, 23:58
Doug Cutting
2011-05-04, 00:16
Arun C Murthy
2011-05-04, 01:01
Owen O'Malley
2011-05-04, 17:31
Eli Collins
2011-05-04, 19:17
Allen Wittenauer
2011-05-04, 20:17
Eric Baldeschwieler
2011-05-04, 20:26
Eli Collins
2011-05-04, 22:03
Arun C Murthy
2011-05-04, 22:05
Suresh Srinivas
2011-05-04, 22:06
Doug Cutting
2011-05-04, 22:17
Todd Lipcon
2011-05-04, 22:23
Eli Collins
2011-05-04, 22:24
Jakob Homan
2011-05-04, 22:29
Todd Lipcon
2011-05-04, 22:30
Eli Collins
2011-05-04, 22:36
Suresh Srinivas
2011-05-04, 22:41
Eli Collins
2011-05-04, 22:51
Tsz Wo \
2011-05-04, 23:09
Eli Collins
2011-05-04, 23:11
Arun C Murthy
2011-05-04, 23:11
Owen O'Malley
2011-05-04, 23:19
Mahadev Konar
2011-05-04, 23:29
Konstantin Boudnik
2011-05-04, 23:32
Todd Lipcon
2011-05-04, 23:44
Doug Cutting
2011-05-04, 23:46
Arun C Murthy
2011-05-04, 23:55
Eric Baldeschwieler
2011-05-05, 00:05
Eli Collins
2011-05-05, 00:22
Mahadev Konar
2011-05-05, 00:30
Eli Collins
2011-05-05, 00:39
Konstantin Boudnik
2011-05-05, 00:43
Devaraj Das
2011-05-05, 01:10
Chris Douglas
2011-05-05, 01:12
Eric Baldeschwieler
2011-05-05, 01:18
Eli Collins
2011-05-05, 01:19
Eli Collins
2011-05-05, 01:24
Jean-Daniel Cryans
2011-05-05, 01:24
Milind Bhandarkar
2011-05-05, 01:30
Roy T. Fielding
2011-05-05, 01:56
Andrew Purtell
2011-05-05, 02:12
Dhruba Borthakur
2011-05-05, 02:20
Roy T. Fielding
2011-05-05, 02:22
Ian Holsman
2011-05-05, 03:34
Nigel Daley
2011-05-05, 04:17
Jeff Hammerbacher
2011-05-05, 04:54
Eric Sammer
2011-05-05, 05:47
Matei Zaharia
2011-05-05, 06:21
Sanjay Radia
2011-05-05, 06:32
Jane Chen
2011-05-05, 06:49
Jean-Daniel Cryans
2011-05-05, 06:52
Andrew Purtell
2011-05-05, 16:18
Stack
2011-05-05, 18:33
Suresh Srinivas
2011-05-05, 19:02
Jakob Homan
2011-05-05, 20:56
Roy T. Fielding
2011-05-05, 21:10
Tsz Wo \
2011-05-06, 00:42
Lars Francke
2011-05-06, 10:07
Steve Loughran
2011-05-06, 11:52
Andrew Purtell
2011-05-06, 18:30
Allen Wittenauer
2011-05-07, 00:49
Todd Papaioannou
2011-05-07, 01:43
Allen Wittenauer
2011-05-07, 02:31
Milind Bhandarkar
2011-05-07, 06:18
Allen Wittenauer
2011-05-07, 19:06
Konstantin Shvachko
2011-05-08, 05:41
Owen O'Malley
2011-05-11, 17:36
Ian Holsman
2011-05-11, 17:38
Konstantin Shvachko
2011-05-11, 18:40
Raghu Angadi
2011-05-11, 19:09
Owen O'Malley
2011-05-11, 22:10
|
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Arun C Murthy 2011-05-02, 20:07
Doug,
On May 2, 2011, at 10:58 AM, Doug Cutting wrote: > The patch selection process for this branch did not appear to be a > community process. A massive patch set was committed en-masse with no > public discussion before or after about its specific composition. Lets review: # You proposed to release off the Yahoo security patchset first in April, 2010: http://s.apache.org/5Gv # I started this discussion again in Jan, 2011: http://s.apache.org/uf # We went through several iterations: - I first committed a jumbo patch upon which some reservations were expressed. - Owen went ahead and broke them up to commit individual patches to incorporate the provided feedback. # Roy clearly clarified the way forward: http://s.apache.org/tD4 (which Owen has since incorporatedk by breaking into individual patches). Your current stance given the history, is surprising, to say the least... we have already discussed this. It is clear that the community (including downstream Apache projects like Pig, Hive and HCatalog) will substantially benefit from an Apache release of this improved codebase. thanks, Arun
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Doug Cutting 2011-05-02, 20:40
On 05/02/2011 01:07 PM, Arun C Murthy wrote:
> # Roy clearly clarified the way forward: http://s.apache.org/tD4 (which > Owen has since incorporated by breaking into individual patches). Roy suggested a three ways forward and possible outcomes: Roy Fielding wrote: > a) break the changes down into a sequence of patches, create jira > issues for each one (or append to the existing issue), and then > provide the group with a list of the issue links so that people > can quickly +1 each one. When it seems worthwhile to you, create > a branch off of some prior Apache release point in svn and commit > each patch to it until the branch is identical to (or, in your own > opinion, better than) the source code that you have tested locally. > Then RM a tarball and start a release vote. Since all of this is > being done in jira and svn, others can help you do all but the > first part (breaking down the big patch). > > or > > b) create a branch off of some prior Apache release point in svn > and replay the internal Y! commits on that branch until the branch > source code is identical to what you have tested locally. Then > RM a tarball based on that branch and start a release vote. > Since the history is now in svn, others could do the RM bit if > you don't have time. > > or > > c) create a branch off of some prior Apache release point in svn > and apply one big ugly patch to it. Then RM a tarball based > on that branch and ask for a release vote. > > You will note that none of the above requires a discussion on this > list prior to the release vote, though (a) would likely result in > more +1s than (b), and (b) would likely receive more +1s than (c). > Regardless, the release vote is a lazy majority decision. > > [ ... ] > > When the release vote happens, encourage folks to test and +1 > the release. If it passes, woohoo! If not, then listen to the > reasons given by the other PMC members and see if you can make > enough changes to the release to get those extra +1s. I believe that Owen chose (b). We're now at the release vote and I am a PMC member giving reasons for my vote. Also note that, on the common-dev thread, Eli & Tom have both noted a number of inconsistencies between this set of patches and trunk, 0.22 and even prior 0.20 branches and releases. In addition to the lack of community involvement in patch selection, these issues concern me. I cannot in good conscience vote for this release as a community product. Doug
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Arun C Murthy 2011-05-02, 21:05
Doug,
On May 2, 2011, at 1:40 PM, Doug Cutting wrote: > > Also note that, on the common-dev thread, Eli & Tom have both noted a > number of inconsistencies between this set of patches and trunk, 0.22 > and even prior 0.20 branches and releases. In addition to the lack of > community involvement in patch selection, these issues concern me. > > I cannot in good conscience vote for this release as a community > product. As I noted before you were the first one to propose this release off Yahoo security patch-set in April, 2010: http://s.apache.org/5Gv What has changed since? Clearly, the same situation exists today. Also, please note that of the ~450 commits in the branch, only 30 odd jiras are yet to be committed to trunk: http://s.apache.org/7Pe. So it's incorrect to state 'lack of community involvement'. Assuming the technical inconsistencies are sorted out, are you willing to withdraw you objection? thanks, Arun
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Doug Cutting 2011-05-02, 21:21
On 05/02/2011 02:05 PM, Arun C Murthy wrote:
> As I noted before you were the first one to propose this release off > Yahoo security patch-set in April, 2010: > http://s.apache.org/5Gv > > What has changed since? Clearly, the same situation exists today. I have absolutely no objection in principle to an Apache 0.20 release including security. I object to the fact that this patchset started from an arbitrary point and unilaterally applied a large set of patches that are not well correlated with Jira, trunk or other 0.20 branches. > Also, please note that of the ~450 commits in the branch, only 30 odd > jiras are yet to be committed to trunk: > http://s.apache.org/7Pe. So it's incorrect to state 'lack of community > involvement'. This should be easily discoverable from Jira: issues should use the "fix-for" field to indicate which branches they've been merged to. This standard practice has not been observed for over 400 patches included in this release candidate. > Assuming the technical inconsistencies are sorted out, are you willing > to withdraw you objection? These are not just technical concerns. How I vote on any future release candidate will in part depend on how the community is involved in its production. Thanks, Doug
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Arun C Murthy 2011-05-02, 21:33
On May 2, 2011, at 2:21 PM, Doug Cutting wrote: > On 05/02/2011 02:05 PM, Arun C Murthy wrote: >> As I noted before you were the first one to propose this release off >> Yahoo security patch-set in April, 2010: >> http://s.apache.org/5Gv >> >> What has changed since? Clearly, the same situation exists today. > > I have absolutely no objection in principle to an Apache 0.20 release > including security. I object to the fact that this patchset started > from an arbitrary point and unilaterally applied a large set of > patches > that are not well correlated with Jira, trunk or other 0.20 branches. Completely untrue. This patchset started from 0.20.1 has is complete superset of 0.20.1. We will work towards ensuring it is a complete superset of the last stable release: 0.20.2. > >> Also, please note that of the ~450 commits in the branch, only 30 odd >> jiras are yet to be committed to trunk: >> http://s.apache.org/7Pe. So it's incorrect to state 'lack of >> community >> involvement'. > > This should be easily discoverable from Jira: issues should use the > "fix-for" field to indicate which branches they've been merged to. > This > standard practice has not been observed for over 400 patches > included in > this release candidate. > This seems like parliamentary stalling procedures... sure they don't have 'fix-for' fields but they've been verified to be true from external committers: http://s.apache.org/yX Are you simply asking for someone to go through the 450 odd jiras and set 'fix-for' fields? >> Assuming the technical inconsistencies are sorted out, are you >> willing >> to withdraw you objection? > > These are not just technical concerns. How I vote on any future > release > candidate will in part depend on how the community is involved in its > production. > I understand they aren't technical concerns. I asked if you were willing to withdraw your objection if the technical concerns are satisfied. I think you answered my question - you will not withdraw your objection even if it's a technical issue. thanks, Arun
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Doug Cutting 2011-05-02, 21:48
On 05/02/2011 02:33 PM, Arun C Murthy wrote:
> On May 2, 2011, at 2:21 PM, Doug Cutting wrote: >> I have absolutely no objection in principle to an Apache 0.20 release >> including security. I object to the fact that this patchset started >> from an arbitrary point and unilaterally applied a large set of patches >> that are not well correlated with Jira, trunk or other 0.20 branches. > > Completely untrue. 'Completely'? Really? Not a true bit in there? Wow! > This patchset started from 0.20.1 has is complete superset of 0.20.1. 0.20.1 isn't a branch, it's a tag. The 0.20 branch includes many post-0.20.1 patches that are not in this candidate. Releases in a series normally share a branch. > I asked if you were willing to withdraw your objection if the technical > concerns are satisfied. I think you answered my question - you will not > withdraw your objection even if it's a technical issue. That is not what I said. If this release does not get enough votes then perhaps another 0.20.203 release candidate will be proposed. Its process and contents will be different and I will judge it on the basis of those when I vote. Doug
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Ian Holsman 2011-05-02, 21:49
On May 3, 2011, at 7:33 AM, Arun C Murthy wrote: > > This patchset started from 0.20.1 has is complete superset of 0.20.1. > > We will work towards ensuring it is a complete superset of the last stable release: 0.20.2. so are you intending to make it a superset for 203? or for a future release?
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Doug Cutting 2011-05-02, 21:55
On 05/02/2011 02:33 PM, Arun C Murthy wrote:
> We will work towards ensuring it is a complete superset of the last > stable release: 0.20.2. Great! Who's 'we'? Do you want any help with this? Doug
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Andrew Purtell 2011-05-02, 22:05
Most points in this thread are valid, having to do with the process of how the contribution was assembled; and specific technical aspects of it, e.g. JIRAs missing from branch 0.20.203 relative to branch 0.20. However,
> From: Doug Cutting <[EMAIL PROTECTED]> > > Assuming the technical inconsistencies are sorted out, > > are you willing to withdraw you objection? > > These are not just technical concerns. How I vote on any future > release candidate will in part depend on how the community is > involved in its production. What strikes me, as an observer to this discussion, is that here "community" does not seem equated with Yahoo by implication. Perhaps I misread. Nevertheless, Yahoo retains a good percentage of active Core developers with standing as both committers and high scale users, and these people produced the contribution that is branch 0.20.203, and therefore by definition "the community" was entirely involved in its production. Yahoo should be commended for advancing the state of branch 0.20 with an obvious commitment to donating the results to Apache. As a community we are lucky to have a strong contributor. Their security enhancements allow us and many others the option of strong authentication and user isolation for multitenant deployments. A commercial vendor's product already incorporates Yahoo's donated security enhancements. It would be regrettable if nontechnical factors ultimately prevents Apache from incorporating the value of these contributions into an official release. Some technical concerns seem reasonable. Regarding that: > From: Stack <[EMAIL PROTECTED]> > How hard would it be to get the patches Tom lists below into > branch-0.20-security-203? I'd think it'd be an easier > sell if it were a superset of all in 0.20, especially since it > bears its name. This suggestion makes a lot of sense. In addition, filing JIRAs for and posting the diffs of the remaining differences could help the process as well, and would be good faith actions of an active contributor. Best regards, - Andy Problems worthy of attack prove their worth by hitting back. - Piet Hein (via Tom White)
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Andrew Purtell 2011-05-02, 22:14
I would like to make one minor but important clarification:
From: > It would be regrettable if > nontechnical factors ultimately prevents Apache from > incorporating the value of these contributions into an > official release. To: It would be regrettable if nontechnical factors ultimately prevents Apache from incorporating the value of these contributions into an official release OF 0.20. There are some not yet ready to take the leap to 0.22; who do not consider it proven. So in this regard I do not wish to minimize concerns about distracting from the success of 0.22 or later releases. > From: Andrew Purtell <[EMAIL PROTECTED]> > Subject: Re: [VOTE] Release candidate 0.20.203.0-rc0 > To: [EMAIL PROTECTED] > Date: Monday, May 2, 2011, 3:05 PM > Most points in this thread are valid, > having to do with the process of how the contribution was > assembled; and specific technical aspects of it, e.g. JIRAs > missing from branch 0.20.203 relative to branch 0.20. > However, > > > > From: Doug Cutting <[EMAIL PROTECTED]> > > > Assuming the technical inconsistencies are sorted > > > out, are you willing to withdraw you objection? > > > > These are not just technical concerns. How I vote on > > any future release candidate will in part depend on how > > the community is involved in its production. > > What strikes me, as an observer to this discussion, is that > here "community" does not seem equated with Yahoo by > implication. Perhaps I misread. Nevertheless, Yahoo retains > a good percentage of active Core developers with standing as > both committers and high scale users, and these people > produced the contribution that is branch 0.20.203, and > therefore by definition "the community" was entirely > involved in its production. > > Yahoo should be commended for advancing the state of branch > 0.20 with an obvious commitment to donating the results to > Apache. As a community we are lucky to have a strong > contributor. Their security enhancements allow us and many > others the option of strong authentication and user > isolation for multitenant deployments. > > A commercial vendor's product already incorporates Yahoo's > donated security enhancements. It would be regrettable if > nontechnical factors ultimately prevents Apache from > incorporating the value of these contributions into an > official release. > > Some technical concerns seem reasonable. Regarding that: > > > From: Stack <[EMAIL PROTECTED]> > > How hard would it be to get the patches Tom lists > below into > > branch-0.20-security-203? I'd think it'd be an > easier > > sell if it were a superset of all in 0.20, especially > since it > > bears its name. > > This suggestion makes a lot of sense. In addition, filing > JIRAs for and posting the diffs of the remaining differences > could help the process as well, and would be good faith > actions of an active contributor. > > Best regards, > > - Andy > > Problems worthy of attack prove their worth by hitting > back. - Piet Hein (via Tom White) > >
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Arun C Murthy 2011-05-02, 22:15
On May 2, 2011, at 3:05 PM, Andrew Purtell wrote: > Some technical concerns seem reasonable. Regarding that: > >> From: Stack <[EMAIL PROTECTED]> >> How hard would it be to get the patches Tom lists below into >> branch-0.20-security-203? I'd think it'd be an easier >> sell if it were a superset of all in 0.20, especially since it >> bears its name. > > This suggestion makes a lot of sense. In addition, filing JIRAs for > and posting the diffs of the remaining differences could help the > process as well, and would be good faith actions of an active > contributor. > Agreed, I'm starting the effort to ensure the differences from 0.20.2 are resolved. From my msg on this thread to common-dev@: > # Remaining for 0.20.203 > * HADOOP-5611 > * HADOOP-5612 > * HADOOP-5623 > * HDFS-596 > * HDFS-723 > * HDFS-732 > * HDFS-579 > * MAPREDUCE-1070 > * HADOOP-6315 > * MAPREDUCE-1163 Suresh has kindly agreed to help me, appreciate help from others - particularly on the 0.20.3 changes. thanks, Arun
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Arun C Murthy 2011-05-02, 22:19
On May 2, 2011, at 2:49 PM, Ian Holsman wrote: > > On May 3, 2011, at 7:33 AM, Arun C Murthy wrote: > >> >> This patchset started from 0.20.1 has is complete superset of 0.20.1. >> >> We will work towards ensuring it is a complete superset of the last >> stable release: 0.20.2. > > so are you intending to make it a superset for 203? or for a future > release? For 0.20.203, it behooves us to incorporate feedback for that. Arun
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Eli Collins 2011-05-02, 22:19
On Mon, May 2, 2011 at 3:15 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote:
> > On May 2, 2011, at 3:05 PM, Andrew Purtell wrote: > > >> Some technical concerns seem reasonable. Regarding that: >> >>> From: Stack <[EMAIL PROTECTED]> >>> How hard would it be to get the patches Tom lists below into >>> branch-0.20-security-203? I'd think it'd be an easier >>> sell if it were a superset of all in 0.20, especially since it >>> bears its name. >> >> This suggestion makes a lot of sense. In addition, filing JIRAs for and >> posting the diffs of the remaining differences could help the process as >> well, and would be good faith actions of an active contributor. >> > > Agreed, I'm starting the effort to ensure the differences from 0.20.2 are > resolved. > > From my msg on this thread to common-dev@: > >> # Remaining for 0.20.203 >> * HADOOP-5611 >> * HADOOP-5612 >> * HADOOP-5623 >> * HDFS-596 >> * HDFS-723 >> * HDFS-732 >> * HDFS-579 >> * MAPREDUCE-1070 >> * HADOOP-6315 >> * MAPREDUCE-1163 > > Suresh has kindly agreed to help me, appreciate help from others - > particularly on the 0.20.3 changes. > I'd like to help. Perhaps let's coordinate in a separate thread? Thanks, Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Doug Cutting 2011-05-02, 22:25
On 05/02/2011 03:05 PM, Andrew Purtell wrote:
> What strikes me, as an observer to this discussion, is that here > "community" does not seem equated with Yahoo by implication. Perhaps > I misread. Nevertheless, Yahoo retains a good percentage of active > Core developers with standing as both committers and high scale > users, and these people produced the contribution that is branch > 0.20.203, and therefore by definition "the community" was entirely > involved in its production. Whether or not a subset of contributors acts as the community depends on whether others outside that subset have a reasonable opportunity to become involved. Until this release vote was called it wasn't entirely clear to all what was happening with these branches. Wider community involvement is now starting as folks work to rationalize this 450-issue patch with respect to past and future releases, Jira, etc. > Yahoo should be commended for advancing the state of branch 0.20 with > an obvious commitment to donating the results to Apache. As a > community we are lucky to have a strong contributor. Their security > enhancements allow us and many others the option of strong > authentication and user isolation for multitenant deployments. +1 > A commercial vendor's product already incorporates Yahoo's donated > security enhancements. It would be regrettable if nontechnical > factors ultimately prevents Apache from incorporating the value of > these contributions into an official release. +1 Cheers, Doug
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Jake Cornelius 2011-05-02, 22:31
Doug Cutting <[EMAIL PROTECTED]> wrote: On 05/02/2011 03:05 PM, Andrew Purtell wrote: > What strikes me, as an observer to this discussion, is that here > "community" does not seem equated with Yahoo by implication. Perhaps > I misread. Nevertheless, Yahoo retains a good percentage of active > Core developers with standing as both committers and high scale > users, and these people produced the contribution that is branch > 0.20.203, and therefore by definition "the community" was entirely > involved in its production. Whether or not a subset of contributors acts as the community depends on whether others outside that subset have a reasonable opportunity to become involved. Until this release vote was called it wasn't entirely clear to all what was happening with these branches. Wider community involvement is now starting as folks work to rationalize this 450-issue patch with respect to past and future releases, Jira, etc. > Yahoo should be commended for advancing the state of branch 0.20 with > an obvious commitment to donating the results to Apache. As a > community we are lucky to have a strong contributor. Their security > enhancements allow us and many others the option of strong > authentication and user isolation for multitenant deployments. +1 > A commercial vendor's product already incorporates Yahoo's donated > security enhancements. It would be regrettable if nontechnical > factors ultimately prevents Apache from incorporating the value of > these contributions into an official release. +1 Cheers, Doug
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Konstantin Shvachko 2011-05-03, 08:39
I think its a good idea to release hadoop-0.20.203. It moves Apache Hadoop a
step forward. Looks like the technical difficulties are resolved now with latest Arun's commits. Being a superset of hadoop-0.20.2 it can be considered based on one of the official Apache releases. I don't think there was a lack of discussions on the lists about the issues included in the release candidate. Todd did a thorough review of the entire security branch. Many developers participated in discussions. Agreeing with Stack I wish HBase was considered a primary target for Hadoop support. But it is not realistic to have it in hadoop-0.20.203. I have some experience running a version of this release candidate on a large cluster. It works. I would add a couple of patches, which make it run on Windows for me like HADOOP-7110, HADOOP-7126. But those are not blockers. Thanks, --Konstantin On Mon, May 2, 2011 at 5:12 PM, Ian Holsman <[EMAIL PROTECTED]> wrote: > > On May 3, 2011, at 9:58 AM, Arun C Murthy wrote: > > >> > >> Owen, Suresh and I have committed everything on this list except > >> HADOOP-6386 and HADOOP-6428. Not sure which of the two are relevant/ > >> necessary, I'll check with Cos. Other than that hadoop-0.20.203 now a > >> superset of hadoop-0.20.2. > >> > > > > Missed adding HADOOP-5759 to that list, I'll check with Amareshwari > before committing. > > > > Arun > > Thanks for doing this so fast Arun. > >
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Eli Collins 2011-05-03, 17:02
I think we still need to incorporate the patches currently checked
into branch 0.20. For example, Owen identified a major bug (BooleanWritable's comparator is broken) and filed a jira (HADOOP-6928) to put it in branch-0.20, where I reviewed it and checked it in, so this bug would be fixed in the next stable release. However this change is not in branch-0.20-security-203. Unless we put the delta from branch-0.20 into this release, it is missing important bug fixes that will cause it to regress against 20.3 (if it ever is released). I am also nervous about changes like the one identified by HADOOP-7255. It looks like this change caused a significant regression in TestDFSIO throughput. It changes the core Task class, the commit log is a single line, and as far as I can tell it was not discussed or reviewed by anyone in the community. Don't changes like this at least deserve a jira before we release them? Thanks, Eli On Tue, May 3, 2011 at 1:39 AM, Konstantin Shvachko <[EMAIL PROTECTED]> wrote: > I think its a good idea to release hadoop-0.20.203. It moves Apache Hadoop a > step forward. > > Looks like the technical difficulties are resolved now with latest Arun's > commits. > Being a superset of hadoop-0.20.2 it can be considered based on one of the > official Apache releases. > I don't think there was a lack of discussions on the lists about the issues > included in the release candidate. Todd did a thorough review of the entire > security branch. Many developers participated in discussions. > Agreeing with Stack I wish HBase was considered a primary target for Hadoop > support. But it is not realistic to have it in hadoop-0.20.203. > I have some experience running a version of this release candidate on a > large cluster. It works. I would add a couple of patches, which make it run > on Windows for me like HADOOP-7110, HADOOP-7126. But those are not blockers. > > Thanks, > --Konstantin > > > On Mon, May 2, 2011 at 5:12 PM, Ian Holsman <[EMAIL PROTECTED]> wrote: > >> >> On May 3, 2011, at 9:58 AM, Arun C Murthy wrote: >> >> >> >> >> Owen, Suresh and I have committed everything on this list except >> >> HADOOP-6386 and HADOOP-6428. Not sure which of the two are relevant/ >> >> necessary, I'll check with Cos. Other than that hadoop-0.20.203 now a >> >> superset of hadoop-0.20.2. >> >> >> > >> > Missed adding HADOOP-5759 to that list, I'll check with Amareshwari >> before committing. >> > >> > Arun >> >> Thanks for doing this so fast Arun. >> >> >
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Todd Lipcon 2011-05-03, 23:58
Just to gauge what amount of stuff is in branch-0.20-security-203 I wrote a
quick script which does a comparison based on JIRAs mention in the commit log. It output the following list of JIRAs that are in the branch but not committed to trunk. I've marked many as N/A meaning that they don't apply to trunk: < HADOOP-1026 < HADOOP-6304 N/A < HADOOP-6544 < HADOOP-6598 N/A < HADOOP-6638 < HADOOP-6653 N/A < HADOOP-6716 N/A < HADOOP-6718 N/A < HADOOP-6728 < HADOOP-6745 < HADOOP-6757 < HADOOP-6776 N/A < HADOOP-6808 < HADOOP-6810 N/A < HADOOP-6832 < HADOOP-6855 N/A < HADOOP-7143 N/A < HADOOP-7190 N/A < HADOOP-7232 N/A < HADOOP-7243 N/A < HADOOP-7246 (not sure about this) < HADOOP-7247 < HADOOP-7253 < HDFS-1020 N/A < HDFS-1022 < HDFS-1136 N/A < HDFS-1153 < HDFS-1158 < HDFS-1313 N/A < HDFS-495 N/A < HDFS-6599 <-- must be a typo in commit log < HDFS-740 N/A < MAPREDUCE-1088 < MAPREDUCE-1100 < MAPREDUCE-1118 < MAPREDUCE-1207 < MAPREDUCE-1361 N/A < MAPREDUCE-1376 N/A < MAPREDUCE-1442 N/A < MAPREDUCE-1521 < MAPREDUCE-1526 N/A < MAPREDUCE-1550 N/A < MAPREDUCE-1594 N/A < MAPREDUCE-1641 < MAPREDUCE-1671 < MAPREDUCE-1672 < MAPREDUCE-1676 < MAPREDUCE-1677 < MAPREDUCE-1682 < MAPREDUCE-1687 < MAPREDUCE-1699 Unclear < MAPREDUCE-1711 N/A < MAPREDUCE-1713 < MAPREDUCE-1716 < MAPREDUCE-1730 < MAPREDUCE-1741 < MAPREDUCE-1744 < MAPREDUCE-1758 < MAPREDUCE-1759 N/A < MAPREDUCE-1778 N/A < MAPREDUCE-1790 < MAPREDUCE-1807 N/A < MAPREDUCE-1854 < MAPREDUCE-1871 < MAPREDUCE-1872 < MAPREDUCE-1882 < MAPREDUCE-1889 < MAPREDUCE-1890 < MAPREDUCE-1914 < MAPREDUCE-1921 < MAPREDUCE-1933 < MAPREDUCE-1934 < MAPREDUCE-1938 < MAPREDUCE-1943 < MAPREDUCE-1954 < MAPREDUCE-1955 < MAPREDUCE-1960 < MAPREDUCE-1964 < MAPREDUCE-1966 < MAPREDUCE-1971 < MAPREDUCE-2005 N/A < MAPREDUCE-2019 < MAPREDUCE-2055 < MAPREDUCE-2316 < MAPREDUCE-2355 < MAPREDUCE-291 < MAPREDUCE-323 < MAPREDUCE-339 < MAPREDUCE-517 < MAPREDUCE-6419 <-- doesnt exist, probably typo Certainly some of these are new test cases, benchmark improvements, or system tests. But many others are large new features (e.g new metrics framework, separate JobHistory service). Others also introduce new configurations (eg new JT based limits). In the list above there are 58 that seem to be applicable, probably at least half of which are non-test code. This list above doesn't include 192 patches that were committed to the branch without reference to any JIRA in the commit message: todd@todd-w510:~/git/hadoop-common$ for x in $(git rev-list origin/branch-0.20..origin/branch-0.20-security-203 -- src) ; do git log -n1 $x | egrep -q '(MAPREDUCE|HDFS|HADOOP)[-:][0-9]+' || echo $x ; done | wc -l 192 Browsing through these, many have already been forward ported, or at least had corresponding JIRAs opened. But it's very difficult to match them up and evaluate which ones have been committed. Eli pointed out one earlier this week that was done by a non-committer with no public review that introduced an apparent performance regression; it's difficult to know whether there might be others as well. Rather than being a "maintenance release" (as is usually expected when incrementing the third component of a version string) this is essentially a separate trunk off of 0.20. I agree that the advancements in this branch are many, and are a great set of contributions for the community. User limits and security are two such that have been cited in this thread; unfortunately the new improvements in limits haven't been committed to trunk, and the security in trunk has a known root exploit. Do users really want to see us putting these things in 20 without making sure they'll also show up in future releases? Looking at recent history of 204 it seems some more patches have gone in there before going into trunk as well - for example MR-2429. Arun and Sid are working on forward-porting it, and it's obviously not due to any kind of bad intent that it was missed, but it underscores the dangers of having essentially two trunks in ASF. I completely agree that there should be long-term maintenance branches at the ASF, but we need to establish a clear process to make sure that "maintenance" doesn't diverge into something else. Here are two requests that others have made but I haven't seen an answer to yet: - Document the criteria by which developers can judge whether an improvement should be included in branch-0.20-security. The inclusion criteria for the branch as it stands is not clear -- given this branch's lineage, it clearly used to be "things important for the Yahoo clusters", but that doesn't seem like a reasonable community criterion. Up until now in Hadoop's history, the criteria has always been "compatible bug fixes only", which doesn't describe this branch either. - Clearly establish the process that all patches must either be committed to trunk first (and then backported), or include a comment on the JIRA explaining why this is not necessary. Additionally we should decide whether patches must be backported "through" 0.22 or if they may skip back from trunk to 20-security. (I'm assuming 21 is dead here) Perhaps this could go on a wiki page (or web site page) regarding the currently active branches? Thanks -Todd On Tue, May 3, 2011 at 10:02 AM, Eli Collins <[EMAIL PROTECTED]> wrote: Todd Lipcon Software Engineer, Cloudera
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Doug Cutting 2011-05-04, 00:16
On 05/02/2011 02:33 PM, Arun C Murthy wrote:
> Are you simply asking for someone to go through the 450 odd jiras and > set 'fix-for' fields? Every other release we've made is well-correlated with Jira. It should not be difficult to achieve that for this one. We could write a script to take all 450 bug IDs from the change log and use Jira's command-line tool to set the "fix-for" to be this 0.20+security release. Would you like help with that? Doug
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Arun C Murthy 2011-05-04, 01:01
On May 3, 2011, at 5:17 PM, "Doug Cutting" <[EMAIL PROTECTED]> wrote:
> On 05/02/2011 02:33 PM, Arun C Murthy wrote: >> Are you simply asking for someone to go through the 450 odd jiras and >> set 'fix-for' fields? > > Every other release we've made is well-correlated with Jira. It should > not be difficult to achieve that for this one. We could write a script > to take all 450 bug IDs from the change log and use Jira's command-line > tool to set the "fix-for" to be this 0.20+security release. Would you > like help with that? > Yes please, that would be great. Thanks! Arun
-
[VOTE] Release candidate 0.20.203.0-rc1Owen O'Malley 2011-05-04, 17:31
Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem.
The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ Please download it, inspect it, compile it, and test it. Clearly, I'm +1. -- Owen
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-04, 19:17
On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote:
> Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem. > > The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > Please download it, inspect it, compile it, and test it. Clearly, I'm +1. > > -- Owen Hey Owen, Thanks for incorporating all the feedback and additional changes. It's great that this release won't be a regression against our previous stable release. I would like to call out that we are not just voting to adopt a particular release, we are starting a new version scheme for the project, doing new feature development on maintenance release branches (before trunk), and we're saying it's OK to release software that hasn't been reviewed by the community. I'd like to hear from our development community not just that we want to do a release from this branch but that we want to adopt these other changes as well. Here's a summary of the major *remaining* issues and a recommendation on how to proceed: 1. There are about ~50 changes that have jiras that are committed to the branch that are not yet in trunk. The next release (0.22) will be a regression against this release, with respect to these particular changes. Recomendation: we should get these changes in trunk before releasing so that new features do not show up in maintenace branches first. 2. There are 192 patches that were committed to the branch without reference to any Jira in the commit message. Some of these may have already been forward ported, but it is very difficult to match them up and evaluate which ones have been committed. Some are troublesome, when spot checking the commits I found some that have been done by non-committers with no public review that introduced an apparent performance regressions (eg see HADOOP-7255). Recommendation: we should update the commit log to make sure there is a jira for each issue, and all changes have been reviewed/committed. This is the way we've always done releases. 3. The new versioning scheme major.minor.point.X the new "X" component allows for new feature development on point releases. Recomendation: we should discuss in a separate thread whether we want to do new feature development on maintenance branches and if so to adopt this new version scheme. Thanks, Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Allen Wittenauer 2011-05-04, 20:17
On May 4, 2011, at 10:31 AM, Owen O'Malley wrote: > Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem. > > The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > Please download it, inspect it, compile it, and test it. Clearly, I'm +1. Am I misreading this, or are the MR protocols out of sync between 0.20.203 and 0.21? It would also appear that this is marked stable in 0.21. What is the user impact?
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eric Baldeschwieler 2011-05-04, 20:26
Hi Folks,
This is a release vote, let's stay focused. On this thread I think appropriate responses are either +1 and some short commentary (assuming you've tried it and it works) or -1 and some short commentary. It would also be cool if you noted if you've tried it. ---- In the spirit of my feedback, I'll respond to this under another subject. Thanks, E14 On May 4, 2011, at 12:17 PM, Eli Collins wrote: > On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: >> Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem. >> >> The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ >> >> Please download it, inspect it, compile it, and test it. Clearly, I'm +1. >> >> -- Owen > > Hey Owen, > > Thanks for incorporating all the feedback and additional changes. It's > great that this release won't be a regression against our previous > stable release. > > I would like to call out that we are not just voting to adopt a > particular release, we are starting a new version scheme for the > project, doing new feature development on maintenance release branches > (before trunk), and we're saying it's OK to release software that > hasn't been reviewed by the community. > > I'd like to hear from our development community not just that we want > to do a release from this branch but that we want to adopt these other > changes as well. Here's a summary of the major *remaining* issues and > a recommendation on how to proceed: > > 1. There are about ~50 changes that have jiras that are committed to > the branch that are not yet in trunk. The next release (0.22) will be > a regression against this release, with respect to these particular > changes. Recomendation: we should get these changes in trunk before > releasing so that new features do not show up in maintenace branches > first. > > 2. There are 192 patches that were committed to the branch without > reference to any Jira in the commit message. Some of these may have > already been forward ported, but it is very difficult to match them up > and evaluate which ones have been committed. Some are troublesome, > when spot checking the commits I found some that have been done by > non-committers with no public review that introduced an apparent > performance regressions (eg see HADOOP-7255). Recommendation: we > should update the commit log to make sure there is a jira for each > issue, and all changes have been reviewed/committed. This is the way > we've always done releases. > > 3. The new versioning scheme major.minor.point.X the new "X" component > allows for new feature development on point releases. Recomendation: > we should discuss in a separate thread whether we want to do new > feature development on maintenance branches and if so to adopt this > new version scheme. > > Thanks, > Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-04, 22:03
On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote:
> Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem. > > The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > Please download it, inspect it, compile it, and test it. Clearly, I'm +1. > > -- Owen While rc2 is an improvement on rc1, I am -1 on this particular rc. Rationale: This rc contains many patches not yet committed to trunk. This would cause the next major release (0.22) to be a feature regression against our latest stable release (203), were 0.22 released soon. This rc contains many patches not yet reviewed by the community via the normal process (jira, patch against trunk, merge to a release branch). I think we should respect the existing community process that has been used for all previous releases. This rc introduces a new development and braching model (new feature development outside trunk) and Hadoop versioning scheme without sufficient discussion or proposal of these changes with the community. We should establish new process before the release, a release is not the appropriate mechanism for changing our review and development process or versioning . I do support a release from branch-0.20-security that follows the existing, established community process. Thanks, Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Arun C Murthy 2011-05-04, 22:05
On May 4, 2011, at 10:31 AM, Owen O'Malley wrote:
> Here's an updated release candidate for 0.20.203.0. I've > incorporated the feedback and included all of the patches from > 0.20.2, which is the last stable release. I also fixed the eclipse- > plugin problem. > > The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > Please download it, inspect it, compile it, and test it. Clearly, > I'm +1. > +1 Downloaded release, checked checksums, built, deployed single-node cluster. Arun
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Suresh Srinivas 2011-05-04, 22:06
Eli,
How many of these patches that you find troublesome are in CDH already? Regards, Suresh On 5/4/11 3:03 PM, "Eli Collins" <[EMAIL PROTECTED]> wrote: > On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: >> Here's an updated release candidate for 0.20.203.0. I've incorporated the >> feedback and included all of the patches from 0.20.2, which is the last >> stable release. I also fixed the eclipse-plugin problem. >> >> The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ >> >> Please download it, inspect it, compile it, and test it. Clearly, I'm +1. >> >> -- Owen > > While rc2 is an improvement on rc1, I am -1 on this particular rc. Rationale: > > This rc contains many patches not yet committed to trunk. This would > cause the next major release (0.22) to be a feature regression against > our latest stable release (203), were 0.22 released soon. > > This rc contains many patches not yet reviewed by the community via > the normal process (jira, patch against trunk, merge to a release > branch). I think we should respect the existing community process that > has been used for all previous releases. > > This rc introduces a new development and braching model (new feature > development outside trunk) and Hadoop versioning scheme without > sufficient discussion or proposal of these changes with the community. > > We should establish new process before the release, a release is not > the appropriate mechanism for changing our review and development > process or versioning . > > I do support a release from branch-0.20-security that follows the > existing, established community process. > > Thanks, > Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Doug Cutting 2011-05-04, 22:17
-1
This candidate has lots of patches that are not in trunk, potentially adding regressions to 0.22 and 0.23. This should be addressed before we release from 0.20-security. We should also not move to four-component version numbering. A release from the 0.20-security branch should perhaps be called 0.20.100. Doug On 05/04/2011 10:31 AM, Owen O'Malley wrote: > Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem. > > The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > Please download it, inspect it, compile it, and test it. Clearly, I'm +1. > > -- Owen
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Todd Lipcon 2011-05-04, 22:23
-1 for the same reasons I outlined in my email yesterday. This is not a
community artifact following the community's processes, and thus should not be an official release until those issues are addressed. On Wed, May 4, 2011 at 3:17 PM, Doug Cutting <[EMAIL PROTECTED]> wrote: > -1 > > This candidate has lots of patches that are not in trunk, potentially > adding regressions to 0.22 and 0.23. This should be addressed before we > release from 0.20-security. We should also not move to four-component > version numbering. A release from the 0.20-security branch should > perhaps be called 0.20.100. > > Doug > > On 05/04/2011 10:31 AM, Owen O'Malley wrote: > > Here's an updated release candidate for 0.20.203.0. I've incorporated the > feedback and included all of the patches from 0.20.2, which is the last > stable release. I also fixed the eclipse-plugin problem. > > > > The candidate is at: > http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > > > Please download it, inspect it, compile it, and test it. Clearly, I'm +1. > > > > -- Owen > -- Todd Lipcon Software Engineer, Cloudera
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-04, 22:24
With my Cloudera hat on..
When we went through the 10x and 20x patches we only pulled a subset of them, primarily for security and the general improvements that we thought were good. We found both incompatible changes and some sketchy changes that we did not pull in from a quality perspective. There is a big difference between a patch set that's acceptable for Yahoo!'s user base and one that's a more general artifact. When we evaluated the YDH patch sets we were using that frame of mind. I'm now looking it in terms of an Apache release. And the place to review changes for an Apache release is on jira. CDH3 is based on the latest stable Apache release (20.2) so it doesn't regress against it. I'm nervous about rebasing future releases on 203 because of the compatibility and quality implications. Thanks, Eli On Wed, May 4, 2011 at 3:06 PM, Suresh Srinivas <[EMAIL PROTECTED]> wrote: > Eli, > > How many of these patches that you find troublesome are in CDH already? > > Regards, > Suresh > > > On 5/4/11 3:03 PM, "Eli Collins" <[EMAIL PROTECTED]> wrote: > >> On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: >>> Here's an updated release candidate for 0.20.203.0. I've incorporated the >>> feedback and included all of the patches from 0.20.2, which is the last >>> stable release. I also fixed the eclipse-plugin problem. >>> >>> The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ >>> >>> Please download it, inspect it, compile it, and test it. Clearly, I'm +1. >>> >>> -- Owen >> >> While rc2 is an improvement on rc1, I am -1 on this particular rc. Rationale: >> >> This rc contains many patches not yet committed to trunk. This would >> cause the next major release (0.22) to be a feature regression against >> our latest stable release (203), were 0.22 released soon. >> >> This rc contains many patches not yet reviewed by the community via >> the normal process (jira, patch against trunk, merge to a release >> branch). I think we should respect the existing community process that >> has been used for all previous releases. >> >> This rc introduces a new development and braching model (new feature >> development outside trunk) and Hadoop versioning scheme without >> sufficient discussion or proposal of these changes with the community. >> >> We should establish new process before the release, a release is not >> the appropriate mechanism for changing our review and development >> process or versioning . >> >> I do support a release from branch-0.20-security that follows the >> existing, established community process. >> >> Thanks, >> Eli > >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Jakob Homan 2011-05-04, 22:29
@Eli >> This rc contains many patches not yet committed to trunk.
If you've compiled this list, can you post it? On Wed, May 4, 2011 at 3:24 PM, Eli Collins <[EMAIL PROTECTED]> wrote: > With my Cloudera hat on.. > > When we went through the 10x and 20x patches we only pulled a subset > of them, primarily for security and the general improvements that we > thought were good. We found both incompatible changes and some > sketchy changes that we did not pull in from a quality perspective. > There is a big difference between a patch set that's acceptable for > Yahoo!'s user base and one that's a more general artifact. > > When we evaluated the YDH patch sets we were using that frame of mind. > I'm now looking it in terms of an Apache release. And the place to > review changes for an Apache release is on jira. > > CDH3 is based on the latest stable Apache release (20.2) so it doesn't > regress against it. I'm nervous about rebasing future releases on 203 > because of the compatibility and quality implications. > > Thanks, > Eli > > > On Wed, May 4, 2011 at 3:06 PM, Suresh Srinivas <[EMAIL PROTECTED]> wrote: >> Eli, >> >> How many of these patches that you find troublesome are in CDH already? >> >> Regards, >> Suresh >> >> >> On 5/4/11 3:03 PM, "Eli Collins" <[EMAIL PROTECTED]> wrote: >> >>> On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: >>>> Here's an updated release candidate for 0.20.203.0. I've incorporated the >>>> feedback and included all of the patches from 0.20.2, which is the last >>>> stable release. I also fixed the eclipse-plugin problem. >>>> >>>> The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ >>>> >>>> Please download it, inspect it, compile it, and test it. Clearly, I'm +1. >>>> >>>> -- Owen >>> >>> While rc2 is an improvement on rc1, I am -1 on this particular rc. Rationale: >>> >>> This rc contains many patches not yet committed to trunk. This would >>> cause the next major release (0.22) to be a feature regression against >>> our latest stable release (203), were 0.22 released soon. >>> >>> This rc contains many patches not yet reviewed by the community via >>> the normal process (jira, patch against trunk, merge to a release >>> branch). I think we should respect the existing community process that >>> has been used for all previous releases. >>> >>> This rc introduces a new development and braching model (new feature >>> development outside trunk) and Hadoop versioning scheme without >>> sufficient discussion or proposal of these changes with the community. >>> >>> We should establish new process before the release, a release is not >>> the appropriate mechanism for changing our review and development >>> process or versioning . >>> >>> I do support a release from branch-0.20-security that follows the >>> existing, established community process. >>> >>> Thanks, >>> Eli >> >> >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Todd Lipcon 2011-05-04, 22:30
With Cloudera hat on, I agree with Eli's assessment.
With Apache hat on, I don't see how this is at all relevant to the task at hand. I would make the same arguments against taking CDH3 and releasing it as an ASF artifact -- we'd also have a certain amount of work to do to make sure that all of the patches are in trunk, first. Additionally, I'd want to outline what the inclusion criteria would be for that branch. -Todd On Wed, May 4, 2011 at 3:24 PM, Eli Collins <[EMAIL PROTECTED]> wrote: > With my Cloudera hat on.. > > When we went through the 10x and 20x patches we only pulled a subset > of them, primarily for security and the general improvements that we > thought were good. We found both incompatible changes and some > sketchy changes that we did not pull in from a quality perspective. > There is a big difference between a patch set that's acceptable for > Yahoo!'s user base and one that's a more general artifact. > > When we evaluated the YDH patch sets we were using that frame of mind. > I'm now looking it in terms of an Apache release. And the place to > review changes for an Apache release is on jira. > > CDH3 is based on the latest stable Apache release (20.2) so it doesn't > regress against it. I'm nervous about rebasing future releases on 203 > because of the compatibility and quality implications. > > Thanks, > Eli > > > On Wed, May 4, 2011 at 3:06 PM, Suresh Srinivas <[EMAIL PROTECTED]> > wrote: > > Eli, > > > > How many of these patches that you find troublesome are in CDH already? > > > > Regards, > > Suresh > > > > > > On 5/4/11 3:03 PM, "Eli Collins" <[EMAIL PROTECTED]> wrote: > > > >> On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> > wrote: > >>> Here's an updated release candidate for 0.20.203.0. I've incorporated > the > >>> feedback and included all of the patches from 0.20.2, which is the last > >>> stable release. I also fixed the eclipse-plugin problem. > >>> > >>> The candidate is at: > http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > >>> > >>> Please download it, inspect it, compile it, and test it. Clearly, I'm > +1. > >>> > >>> -- Owen > >> > >> While rc2 is an improvement on rc1, I am -1 on this particular rc. > Rationale: > >> > >> This rc contains many patches not yet committed to trunk. This would > >> cause the next major release (0.22) to be a feature regression against > >> our latest stable release (203), were 0.22 released soon. > >> > >> This rc contains many patches not yet reviewed by the community via > >> the normal process (jira, patch against trunk, merge to a release > >> branch). I think we should respect the existing community process that > >> has been used for all previous releases. > >> > >> This rc introduces a new development and braching model (new feature > >> development outside trunk) and Hadoop versioning scheme without > >> sufficient discussion or proposal of these changes with the community. > >> > >> We should establish new process before the release, a release is not > >> the appropriate mechanism for changing our review and development > >> process or versioning . > >> > >> I do support a release from branch-0.20-security that follows the > >> existing, established community process. > >> > >> Thanks, > >> Eli > > > > > -- Todd Lipcon Software Engineer, Cloudera
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-04, 22:36
On Wed, May 4, 2011 at 3:29 PM, Jakob Homan <[EMAIL PROTECTED]> wrote:
> @Eli >> This rc contains many patches not yet committed to trunk. > If you've compiled this list, can you post it? > Here's the list Todd posted yesterday: http://mail-archives.apache.org/mod_mbox/hadoop-general/201105.mbox/%[EMAIL PROTECTED]%3E Thanks, Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Suresh Srinivas 2011-05-04, 22:41
Here is a snippet from your blog -
http://www.cloudera.com/blog/2010/10/cdh3-beta-3-now-available/ ------ Security Enhancements As one of the primary contributors and largest production users of Hadoop, Yahoo! publishes the source tree for the version of Hadoop that they run on their production clusters. We are pleased to announce that we have merged Yahoo¹s source tree into CDH3b3. This merge brings many improvements developed at Yahoo! into CDH, including improvements for MapReduce scalability on 1000+-node clusters and several new tools for benchmarking and testing Hadoop. ------ It would be great, if you can list how many of 192 changes were reviewed and became part of CDH. Your -1 vote essentially blocks the changes that are already available in CDH to be available from Apache open source! On 5/4/11 3:30 PM, "Todd Lipcon" <[EMAIL PROTECTED]> wrote: > With Cloudera hat on, I agree with Eli's assessment. > > With Apache hat on, I don't see how this is at all relevant to the task at > hand. I would make the same arguments against taking CDH3 and releasing it > as an ASF artifact -- we'd also have a certain amount of work to do to make > sure that all of the patches are in trunk, first. Additionally, I'd want to > outline what the inclusion criteria would be for that branch. > > -Todd > > On Wed, May 4, 2011 at 3:24 PM, Eli Collins <[EMAIL PROTECTED]> wrote: > >> With my Cloudera hat on.. >> >> When we went through the 10x and 20x patches we only pulled a subset >> of them, primarily for security and the general improvements that we >> thought were good. We found both incompatible changes and some >> sketchy changes that we did not pull in from a quality perspective. >> There is a big difference between a patch set that's acceptable for >> Yahoo!'s user base and one that's a more general artifact. >> >> When we evaluated the YDH patch sets we were using that frame of mind. >> I'm now looking it in terms of an Apache release. And the place to >> review changes for an Apache release is on jira. >> >> CDH3 is based on the latest stable Apache release (20.2) so it doesn't >> regress against it. I'm nervous about rebasing future releases on 203 >> because of the compatibility and quality implications. >> >> Thanks, >> Eli >> >> >> On Wed, May 4, 2011 at 3:06 PM, Suresh Srinivas <[EMAIL PROTECTED]> >> wrote: >>> Eli, >>> >>> How many of these patches that you find troublesome are in CDH already? >>> >>> Regards, >>> Suresh >>> >>> >>> On 5/4/11 3:03 PM, "Eli Collins" <[EMAIL PROTECTED]> wrote: >>> >>>> On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> >> wrote: >>>>> Here's an updated release candidate for 0.20.203.0. I've incorporated >> the >>>>> feedback and included all of the patches from 0.20.2, which is the last >>>>> stable release. I also fixed the eclipse-plugin problem. >>>>> >>>>> The candidate is at: >> http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ >>>>> >>>>> Please download it, inspect it, compile it, and test it. Clearly, I'm >> +1. >>>>> >>>>> -- Owen >>>> >>>> While rc2 is an improvement on rc1, I am -1 on this particular rc. >> Rationale: >>>> >>>> This rc contains many patches not yet committed to trunk. This would >>>> cause the next major release (0.22) to be a feature regression against >>>> our latest stable release (203), were 0.22 released soon. >>>> >>>> This rc contains many patches not yet reviewed by the community via >>>> the normal process (jira, patch against trunk, merge to a release >>>> branch). I think we should respect the existing community process that >>>> has been used for all previous releases. >>>> >>>> This rc introduces a new development and braching model (new feature >>>> development outside trunk) and Hadoop versioning scheme without >>>> sufficient discussion or proposal of these changes with the community. >>>> >>>> We should establish new process before the release, a release is not >>>> the appropriate mechanism for changing our review and development
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-04, 22:51
> Your -1 vote essentially blocks the changes that are already available in
> CDH to be available from Apache open source! As Eric mentioned, this thread is about an Apache release, not CDH. My -1 vote does not block these changes from being released via Apache. You can not veto a release. Releases are lazy majority, the release is only blocked if there are more -1 votes than +1 votes. If these changes are contributed on jira, discussed and reviewed, and committed to trunk I'm happy to support the release. There's a big difference between asking that a release respect the Apache community process and blocking it. If you want to get the release out how about contributing the work via the normal means so the community can review it like we review all other code changes. Thanks, Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Tsz Wo \ 2011-05-04, 23:09
The list seems highly inaccurate. Checked the first few N/A items. All are
false positives. < HADOOP-6304 N/A -- fixed in trunk via HADOOP-7110 (Todd, it was fixed by you. Forgot?) < HADOOP-6598 N/A -- moved to HADOOP-6763 and committed to trunk < HADOOP-6653 N/A -- not applicable in trunk < HADOOP-6716 N/A -- as part of HADOOP-6815 which was committed to trunk < HADOOP-6718 N/A -- Incorporated in HADOOP-6706 for 0.22. < HADOOP-6776 N/A -- Tom White said "This is fixed in trunk, so can be closed." Regards, Nicholas ________________________________ From: Eli Collins <[EMAIL PROTECTED]> To: [EMAIL PROTECTED] Sent: Wed, May 4, 2011 3:36:16 PM Subject: Re: [VOTE] Release candidate 0.20.203.0-rc1 On Wed, May 4, 2011 at 3:29 PM, Jakob Homan <[EMAIL PROTECTED]> wrote: > @Eli >> This rc contains many patches not yet committed to trunk. > If you've compiled this list, can you post it? > Here's the list Todd posted yesterday: http://mail-archives.apache.org/mod_mbox/hadoop-general/201105.mbox/%[EMAIL PROTECTED]%3E Thanks, Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-04, 23:11
On Wed, May 4, 2011 at 4:09 PM, Tsz Wo (Nicholas), Sze
<[EMAIL PROTECTED]> wrote: > The list seems highly inaccurate. Checked the first few N/A items. All are > false positives. Yes, that's why those are marked N/A ie "Not applicable". Check out the non N/A ones. Thanks, Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Arun C Murthy 2011-05-04, 23:11
On May 4, 2011, at 4:09 PM, Tsz Wo (Nicholas), Sze wrote:
> The list seems highly inaccurate. Checked the first few N/A items. > All are > false positives. > Also, can you please provide a list on features which are not related to gridmix benchmarks or herriot tests? Please remember, and I have said this on list and off-list, that many of the forward ports obviated the need for multiple patches which show up in the commit logs. thanks, Arun > < HADOOP-6304 N/A -- fixed in trunk via HADOOP-7110 (Todd, it was > fixed by you. > Forgot?) > < HADOOP-6598 N/A -- moved to HADOOP-6763 and committed to trunk > < HADOOP-6653 N/A -- not applicable in trunk > < HADOOP-6716 N/A -- as part of HADOOP-6815 which was committed to > trunk > < HADOOP-6718 N/A -- Incorporated in HADOOP-6706 for 0.22. > < HADOOP-6776 N/A -- Tom White said "This is fixed in trunk, so can > be closed." > > Regards, > Nicholas > > > > > > ________________________________ > From: Eli Collins <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED] > Sent: Wed, May 4, 2011 3:36:16 PM > Subject: Re: [VOTE] Release candidate 0.20.203.0-rc1 > > On Wed, May 4, 2011 at 3:29 PM, Jakob Homan <[EMAIL PROTECTED]> wrote: >> @Eli >> This rc contains many patches not yet committed to trunk. >> If you've compiled this list, can you post it? >> > > Here's the list Todd posted yesterday: > > http://mail-archives.apache.org/mod_mbox/hadoop-general/201105.mbox/%[EMAIL PROTECTED]%3E > > > Thanks, > Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Owen O'Malley 2011-05-04, 23:19
On May 4, 2011, at 1:17 PM, Allen Wittenauer wrote: > Am I misreading this, or are the MR protocols out of sync between 0.20.203 and 0.21? It would also appear that this is marked stable in 0.21. What is the user impact? The names of the protocols were changed, but the names of the protocols aren't user-facing. The protocols themselves also changed, as with all Hadoop major versions. (We need to switch to protobuf or something for RPC to provide wire compatibility.) -- Owen
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Mahadev Konar 2011-05-04, 23:29
+1 for the release.
I downloaded the release, verified checksums, built and deployed. Ran randomwriter jobs on it. Everything passes. -- thanks mahadev @mahadevkonar On Wed, May 4, 2011 at 3:05 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote: > On May 4, 2011, at 10:31 AM, Owen O'Malley wrote: > >> Here's an updated release candidate for 0.20.203.0. I've incorporated the >> feedback and included all of the patches from 0.20.2, which is the last >> stable release. I also fixed the eclipse-plugin problem. >> >> The candidate is at: >> http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ >> >> Please download it, inspect it, compile it, and test it. Clearly, I'm +1. >> > > +1 > > Downloaded release, checked checksums, built, deployed single-node cluster. > > Arun > >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Konstantin Boudnik 2011-05-04, 23:32
On Wed, May 4, 2011 at 15:06, Suresh Srinivas <[EMAIL PROTECTED]> wrote:
> Eli, > > How many of these patches that you find troublesome are in CDH already? How is that relevant to the release vote and discrepancies listed in Eli's email? > Regards, > Suresh > > > On 5/4/11 3:03 PM, "Eli Collins" <[EMAIL PROTECTED]> wrote: > >> On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: >>> Here's an updated release candidate for 0.20.203.0. I've incorporated the >>> feedback and included all of the patches from 0.20.2, which is the last >>> stable release. I also fixed the eclipse-plugin problem. >>> >>> The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ >>> >>> Please download it, inspect it, compile it, and test it. Clearly, I'm +1. >>> >>> -- Owen >> >> While rc2 is an improvement on rc1, I am -1 on this particular rc. Rationale: >> >> This rc contains many patches not yet committed to trunk. This would >> cause the next major release (0.22) to be a feature regression against >> our latest stable release (203), were 0.22 released soon. >> >> This rc contains many patches not yet reviewed by the community via >> the normal process (jira, patch against trunk, merge to a release >> branch). I think we should respect the existing community process that >> has been used for all previous releases. >> >> This rc introduces a new development and braching model (new feature >> development outside trunk) and Hadoop versioning scheme without >> sufficient discussion or proposal of these changes with the community. >> >> We should establish new process before the release, a release is not >> the appropriate mechanism for changing our review and development >> process or versioning . >> >> I do support a release from branch-0.20-security that follows the >> existing, established community process. >> >> Thanks, >> Eli > >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Todd Lipcon 2011-05-04, 23:44
On Wed, May 4, 2011 at 4:11 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote:
> On May 4, 2011, at 4:09 PM, Tsz Wo (Nicholas), Sze wrote: > > The list seems highly inaccurate. Checked the first few N/A items. All >> are >> false positives. >> >> > Also, can you please provide a list on features which are not related to > gridmix benchmarks or herriot tests? > Here are a few I quickly pulled up: MAPREDUCE-2316 (docs for improved capacity scheduler) MAPREDUCE-2355 (adds new config for heartbeat dampening in MR) " BZ-4182948. Add statistics logging to Fred for better visibility into startup time costs. (Matt Foley)" - I believe I saw a note from Matt on the JIRA yesterday about this feature, where he decided that the version done in 203 wasn't a good approach, and it's done differently in trunk (not sure if done yet). MAPREDUCE-2364 (important bug fix for localization) - in fact most of localization is different in this branch compared to trunk due to inclusion of MAPREDUCE-2378, the trunk version of which is still on the "yahoo-merge" branch,. "New cunters for FileInput/OutputFormat. New Counter MAP_OUTPUT_MATERIALZIED_BYTES. Related bugs: 4241034, 3418543, 4217546" - not sure which JIRA this is, I think I've seen a JIRA for trunk, but not committed. - MAPREDUCE-1904, committed without JIRA as: " . Reducing new Path(), RawFileStatus() creation overhead in LocalDirAllocator" not in trunk + BZ4101537 . When a queue is built without any access rights we explain the + problem. (dking, rvw ramach) [attachment of 2010-11-24] seems to be on trunk as MR-2411, but not committed, best I can tell, despite the JIRA there being resolved (based on looking at QueueManager in trunk) " . Remove unnecessary reference to user configuration from TaskDistributedCacheManager causing memory leaks" Not in trunk, not sure which JIRA it might be.. probably part of 2178. Major new feature: MAPREDUCE-323 - very large rework of how job history files are managed Major change: MAPREDUCE-1100/MAPREDUCE-1176: unresolved on trunk, though probably will be attacked by different JIRAs Major new ops-visible feature: "metrics2" system Major new ops-visible feature: MAPREDUCE-291 job history can be viewed from a separate server Major new set of user-visible configurations: MAPREDUCE-1943 and friends which implement new limits in MapReduce (eg MAPREDUCE-1872 as well) I have code to work on, so I won't keep going, but this is from looking at the last couple months of 203. -Todd -- Todd Lipcon Software Engineer, Cloudera
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Doug Cutting 2011-05-04, 23:46
On 05/03/2011 06:01 PM, Arun C Murthy wrote:
> On May 3, 2011, at 5:17 PM, "Doug Cutting" <[EMAIL PROTECTED]> wrote: > >> On 05/02/2011 02:33 PM, Arun C Murthy wrote: >>> Are you simply asking for someone to go through the 450 odd jiras and >>> set 'fix-for' fields? >> >> Every other release we've made is well-correlated with Jira. It should >> not be difficult to achieve that for this one. We could write a script >> to take all 450 bug IDs from the change log and use Jira's command-line >> tool to set the "fix-for" to be this 0.20+security release. Would you >> like help with that? >> > > Yes please, that would be great. Thanks! Please find below a script that will add a fix-version to issues. Doug #!/bin/bash # reads bug ids from standard input # and adds the fixVersion named on command line if [ $# -eq 0 ] then echo "Usage: $0 bugid" exit 1 fi fix=$1 echo Setting fix version to $fix. server=https://issues.apache.org/jira jira=./jira-cli-2.0.0/jira.sh set -e echo -n "Jira username: " read user echo -n "Jira password: " stty -echo read password stty echo while read issue do # first read the old fix versions old=`$jira -a getFieldValue --server $server \ --password $password --user $user \ --issue $issue --field fixVersions | \ tail -n 1 | sed 's/([0-9]*)//g' | sed s/\'//g` # now update, adding new value # jira will ignore if this value is already present $jira -a updateIssue --server $server \ --password $password --user $user \ --issue $issue --fixVersions "${old},${fix}" done
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Arun C Murthy 2011-05-04, 23:55
On May 4, 2011, at 4:44 PM, Todd Lipcon wrote:
> On Wed, May 4, 2011 at 4:11 PM, Arun C Murthy <[EMAIL PROTECTED]> > wrote: > >> On May 4, 2011, at 4:09 PM, Tsz Wo (Nicholas), Sze wrote: >> >> The list seems highly inaccurate. Checked the first few N/A >> items. All >>> are >>> false positives. >>> >>> >> Also, can you please provide a list on features which are not >> related to >> gridmix benchmarks or herriot tests? >> > > Here are a few I quickly pulled up: So, it's around 10? Approximately? Also, the ones you put up were reviewed via jira. Please note that several of the ones you are pointing out are already in y-merge branch which is nearly trunk. including MR-2378 as you pointed out. Thanks for the list, I'll ensure we work on forward porting them. Arun
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eric Baldeschwieler 2011-05-05, 00:05
Hi folks,
Let's stay focused. Let's take the other threads onto other threads. This is a vote. To the extent naming is a problem, let's take that to a thread and find an acceptable proposal. To the extent folks want to collaborate on certifying the release for total lack of regression or collaborate on the cleanest possible merge, I think all interested parties should take these topics to another thread and divide up the work. If you've voted, you don't need to comment further on this thread, no matter what company you work for! Thanks, --- E14 - typing on glass On May 4, 2011, at 4:46 PM, "Todd Lipcon" <[EMAIL PROTECTED]> wrote: > On Wed, May 4, 2011 at 4:11 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote: > >> On May 4, 2011, at 4:09 PM, Tsz Wo (Nicholas), Sze wrote: >> >> The list seems highly inaccurate. Checked the first few N/A items. All >>> are >>> false positives. >>> >>> >> Also, can you please provide a list on features which are not related to >> gridmix benchmarks or herriot tests? >> > > Here are a few I quickly pulled up: > MAPREDUCE-2316 (docs for improved capacity scheduler) > MAPREDUCE-2355 (adds new config for heartbeat dampening in MR) > > " BZ-4182948. Add statistics logging to Fred for better visibility into > startup time costs. (Matt Foley)" > - I believe I saw a note from Matt on the JIRA yesterday about this feature, > where he decided that the version done in 203 wasn't a good approach, and > it's done differently in trunk (not sure if done yet). > > MAPREDUCE-2364 (important bug fix for localization) > - in fact most of localization is different in this branch compared to trunk > due to inclusion of MAPREDUCE-2378, the trunk version of which is still on > the "yahoo-merge" branch,. > > "New cunters for FileInput/OutputFormat. New Counter > MAP_OUTPUT_MATERIALZIED_BYTES. Related bugs: 4241034, 3418543, > 4217546" > - not sure which JIRA this is, I think I've seen a JIRA for trunk, but not > committed. > > - MAPREDUCE-1904, committed without JIRA as: > " . Reducing new Path(), RawFileStatus() creation overhead in > LocalDirAllocator" > not in trunk > > + BZ4101537 . When a queue is built without any access rights we explain > the > + problem. (dking, rvw ramach) [attachment of 2010-11-24] > seems to be on trunk as MR-2411, but not committed, best I can tell, despite > the JIRA there being resolved (based on looking at QueueManager in trunk) > > " . Remove unnecessary reference to user configuration from > TaskDistributedCacheManager causing memory leaks" > Not in trunk, not sure which JIRA it might be.. probably part of 2178. > > Major new feature: MAPREDUCE-323 - very large rework of how job history > files are managed > Major change: MAPREDUCE-1100/MAPREDUCE-1176: unresolved on trunk, though > probably will be attacked by different JIRAs > Major new ops-visible feature: "metrics2" system > Major new ops-visible feature: MAPREDUCE-291 job history can be viewed from > a separate server > Major new set of user-visible configurations: MAPREDUCE-1943 and friends > which implement new limits in MapReduce (eg MAPREDUCE-1872 as well) > > I have code to work on, so I won't keep going, but this is from looking at > the last couple months of 203. > > -Todd > -- > Todd Lipcon > Software Engineer, Cloudera
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-05, 00:22
Good suggestion, it would be helpful to hash out the issues around
compatibility, feature branches, version numbers, how to contribute at Apache before putting up new votes that would be helpful, ie the vote would go much smoother if all the issues with the previous vote were addressed before starting a new one. Thanks, Eli On Wed, May 4, 2011 at 5:05 PM, Eric Baldeschwieler <[EMAIL PROTECTED]> wrote: > Hi folks, > > Let's stay focused. Let's take the other threads onto other threads. This is a vote. > > To the extent naming is a problem, let's take that to a thread and find an acceptable proposal. > > To the extent folks want to collaborate on certifying the release for total lack of regression or collaborate on the cleanest possible merge, I think all interested parties should take these topics to another thread and divide up the work. > > If you've voted, you don't need to comment further on this thread, no matter what company you work for! > > Thanks, > > --- > E14 - typing on glass > > On May 4, 2011, at 4:46 PM, "Todd Lipcon" <[EMAIL PROTECTED]> wrote: > >> On Wed, May 4, 2011 at 4:11 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote: >> >>> On May 4, 2011, at 4:09 PM, Tsz Wo (Nicholas), Sze wrote: >>> >>> The list seems highly inaccurate. Checked the first few N/A items. All >>>> are >>>> false positives. >>>> >>>> >>> Also, can you please provide a list on features which are not related to >>> gridmix benchmarks or herriot tests? >>> >> >> Here are a few I quickly pulled up: >> MAPREDUCE-2316 (docs for improved capacity scheduler) >> MAPREDUCE-2355 (adds new config for heartbeat dampening in MR) >> >> " BZ-4182948. Add statistics logging to Fred for better visibility into >> startup time costs. (Matt Foley)" >> - I believe I saw a note from Matt on the JIRA yesterday about this feature, >> where he decided that the version done in 203 wasn't a good approach, and >> it's done differently in trunk (not sure if done yet). >> >> MAPREDUCE-2364 (important bug fix for localization) >> - in fact most of localization is different in this branch compared to trunk >> due to inclusion of MAPREDUCE-2378, the trunk version of which is still on >> the "yahoo-merge" branch,. >> >> "New cunters for FileInput/OutputFormat. New Counter >> MAP_OUTPUT_MATERIALZIED_BYTES. Related bugs: 4241034, 3418543, >> 4217546" >> - not sure which JIRA this is, I think I've seen a JIRA for trunk, but not >> committed. >> >> - MAPREDUCE-1904, committed without JIRA as: >> " . Reducing new Path(), RawFileStatus() creation overhead in >> LocalDirAllocator" >> not in trunk >> >> + BZ4101537 . When a queue is built without any access rights we explain >> the >> + problem. (dking, rvw ramach) [attachment of 2010-11-24] >> seems to be on trunk as MR-2411, but not committed, best I can tell, despite >> the JIRA there being resolved (based on looking at QueueManager in trunk) >> >> " . Remove unnecessary reference to user configuration from >> TaskDistributedCacheManager causing memory leaks" >> Not in trunk, not sure which JIRA it might be.. probably part of 2178. >> >> Major new feature: MAPREDUCE-323 - very large rework of how job history >> files are managed >> Major change: MAPREDUCE-1100/MAPREDUCE-1176: unresolved on trunk, though >> probably will be attacked by different JIRAs >> Major new ops-visible feature: "metrics2" system >> Major new ops-visible feature: MAPREDUCE-291 job history can be viewed from >> a separate server >> Major new set of user-visible configurations: MAPREDUCE-1943 and friends >> which implement new limits in MapReduce (eg MAPREDUCE-1872 as well) >> >> I have code to work on, so I won't keep going, but this is from looking at >> the last couple months of 203. >> >> -Todd >> -- >> Todd Lipcon >> Software Engineer, Cloudera >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Mahadev Konar 2011-05-05, 00:30
Eli,
I think the intent from the email was to just vote on this thread, which I agree with. Discussions should be done in a separate threads. Hopefully we can all stick to just voting! thanks mahadev On Wed, May 4, 2011 at 5:22 PM, Eli Collins <[EMAIL PROTECTED]> wrote: > Good suggestion, it would be helpful to hash out the issues around > compatibility, feature branches, version numbers, how to contribute at > Apache before putting up new votes that would be helpful, ie the vote > would go much smoother if all the issues with the previous vote were > addressed before starting a new one. > > Thanks, > Eli > > On Wed, May 4, 2011 at 5:05 PM, Eric Baldeschwieler > <[EMAIL PROTECTED]> wrote: >> Hi folks, >> >> Let's stay focused. Let's take the other threads onto other threads. This is a vote. >> >> To the extent naming is a problem, let's take that to a thread and find an acceptable proposal. >> >> To the extent folks want to collaborate on certifying the release for total lack of regression or collaborate on the cleanest possible merge, I think all interested parties should take these topics to another thread and divide up the work. >> >> If you've voted, you don't need to comment further on this thread, no matter what company you work for! >> >> Thanks, >> >> --- >> E14 - typing on glass >> >> On May 4, 2011, at 4:46 PM, "Todd Lipcon" <[EMAIL PROTECTED]> wrote: >> >>> On Wed, May 4, 2011 at 4:11 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote: >>> >>>> On May 4, 2011, at 4:09 PM, Tsz Wo (Nicholas), Sze wrote: >>>> >>>> The list seems highly inaccurate. Checked the first few N/A items. All >>>>> are >>>>> false positives. >>>>> >>>>> >>>> Also, can you please provide a list on features which are not related to >>>> gridmix benchmarks or herriot tests? >>>> >>> >>> Here are a few I quickly pulled up: >>> MAPREDUCE-2316 (docs for improved capacity scheduler) >>> MAPREDUCE-2355 (adds new config for heartbeat dampening in MR) >>> >>> " BZ-4182948. Add statistics logging to Fred for better visibility into >>> startup time costs. (Matt Foley)" >>> - I believe I saw a note from Matt on the JIRA yesterday about this feature, >>> where he decided that the version done in 203 wasn't a good approach, and >>> it's done differently in trunk (not sure if done yet). >>> >>> MAPREDUCE-2364 (important bug fix for localization) >>> - in fact most of localization is different in this branch compared to trunk >>> due to inclusion of MAPREDUCE-2378, the trunk version of which is still on >>> the "yahoo-merge" branch,. >>> >>> "New cunters for FileInput/OutputFormat. New Counter >>> MAP_OUTPUT_MATERIALZIED_BYTES. Related bugs: 4241034, 3418543, >>> 4217546" >>> - not sure which JIRA this is, I think I've seen a JIRA for trunk, but not >>> committed. >>> >>> - MAPREDUCE-1904, committed without JIRA as: >>> " . Reducing new Path(), RawFileStatus() creation overhead in >>> LocalDirAllocator" >>> not in trunk >>> >>> + BZ4101537 . When a queue is built without any access rights we explain >>> the >>> + problem. (dking, rvw ramach) [attachment of 2010-11-24] >>> seems to be on trunk as MR-2411, but not committed, best I can tell, despite >>> the JIRA there being resolved (based on looking at QueueManager in trunk) >>> >>> " . Remove unnecessary reference to user configuration from >>> TaskDistributedCacheManager causing memory leaks" >>> Not in trunk, not sure which JIRA it might be.. probably part of 2178. >>> >>> Major new feature: MAPREDUCE-323 - very large rework of how job history >>> files are managed >>> Major change: MAPREDUCE-1100/MAPREDUCE-1176: unresolved on trunk, though >>> probably will be attacked by different JIRAs >>> Major new ops-visible feature: "metrics2" system >>> Major new ops-visible feature: MAPREDUCE-291 job history can be viewed from >>> a separate server >>> Major new set of user-visible configurations: MAPREDUCE-1943 and friends >>> which implement new limits in MapReduce (eg MAPREDUCE-1872 as well) thanks mahadev @mahadevkonar
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-05, 00:39
The point is that these discussion should be sorted out, ie you don't
change your development and release model on a release VOTE thread, you change it on a DISCUSSION thread. Ie before we release this we should understand what that means. What is being proposed is not just another release from branch-0.20 or branch-0.22. Thanks, Eli On Wed, May 4, 2011 at 5:30 PM, Mahadev Konar <[EMAIL PROTECTED]> wrote: > Eli, > I think the intent from the email was to just vote on this thread, > which I agree with. > Discussions should be done in a separate threads. Hopefully we can > all stick to just voting! > > thanks > mahadev > > On Wed, May 4, 2011 at 5:22 PM, Eli Collins <[EMAIL PROTECTED]> wrote: >> Good suggestion, it would be helpful to hash out the issues around >> compatibility, feature branches, version numbers, how to contribute at >> Apache before putting up new votes that would be helpful, ie the vote >> would go much smoother if all the issues with the previous vote were >> addressed before starting a new one. >> >> Thanks, >> Eli >> >> On Wed, May 4, 2011 at 5:05 PM, Eric Baldeschwieler >> <[EMAIL PROTECTED]> wrote: >>> Hi folks, >>> >>> Let's stay focused. Let's take the other threads onto other threads. This is a vote. >>> >>> To the extent naming is a problem, let's take that to a thread and find an acceptable proposal. >>> >>> To the extent folks want to collaborate on certifying the release for total lack of regression or collaborate on the cleanest possible merge, I think all interested parties should take these topics to another thread and divide up the work. >>> >>> If you've voted, you don't need to comment further on this thread, no matter what company you work for! >>> >>> Thanks, >>> >>> --- >>> E14 - typing on glass >>> >>> On May 4, 2011, at 4:46 PM, "Todd Lipcon" <[EMAIL PROTECTED]> wrote: >>> >>>> On Wed, May 4, 2011 at 4:11 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote: >>>> >>>>> On May 4, 2011, at 4:09 PM, Tsz Wo (Nicholas), Sze wrote: >>>>> >>>>> The list seems highly inaccurate. Checked the first few N/A items. All >>>>>> are >>>>>> false positives. >>>>>> >>>>>> >>>>> Also, can you please provide a list on features which are not related to >>>>> gridmix benchmarks or herriot tests? >>>>> >>>> >>>> Here are a few I quickly pulled up: >>>> MAPREDUCE-2316 (docs for improved capacity scheduler) >>>> MAPREDUCE-2355 (adds new config for heartbeat dampening in MR) >>>> >>>> " BZ-4182948. Add statistics logging to Fred for better visibility into >>>> startup time costs. (Matt Foley)" >>>> - I believe I saw a note from Matt on the JIRA yesterday about this feature, >>>> where he decided that the version done in 203 wasn't a good approach, and >>>> it's done differently in trunk (not sure if done yet). >>>> >>>> MAPREDUCE-2364 (important bug fix for localization) >>>> - in fact most of localization is different in this branch compared to trunk >>>> due to inclusion of MAPREDUCE-2378, the trunk version of which is still on >>>> the "yahoo-merge" branch,. >>>> >>>> "New cunters for FileInput/OutputFormat. New Counter >>>> MAP_OUTPUT_MATERIALZIED_BYTES. Related bugs: 4241034, 3418543, >>>> 4217546" >>>> - not sure which JIRA this is, I think I've seen a JIRA for trunk, but not >>>> committed. >>>> >>>> - MAPREDUCE-1904, committed without JIRA as: >>>> " . Reducing new Path(), RawFileStatus() creation overhead in >>>> LocalDirAllocator" >>>> not in trunk >>>> >>>> + BZ4101537 . When a queue is built without any access rights we explain >>>> the >>>> + problem. (dking, rvw ramach) [attachment of 2010-11-24] >>>> seems to be on trunk as MR-2411, but not committed, best I can tell, despite >>>> the JIRA there being resolved (based on looking at QueueManager in trunk) >>>> >>>> " . Remove unnecessary reference to user configuration from >>>> TaskDistributedCacheManager causing memory leaks" >>>> Not in trunk, not sure which JIRA it might be.. probably part of 2178. >>>> >>>> Major new feature: MAPREDUCE-323 - very large rework of how job history
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Konstantin Boudnik 2011-05-05, 00:43
I tend to agree. Changing release model of Apache Hadoop train isn't
something that should be done in a hassle or as a part of release voting. If these questions aren't addressed - let's postpone the vote and discuss all the complications or implications until they sorted out or the consensus/compromise is reached. Cos On Wed, May 4, 2011 at 17:39, Eli Collins <[EMAIL PROTECTED]> wrote: > The point is that these discussion should be sorted out, ie you don't > change your development and release model on a release VOTE thread, > you change it on a DISCUSSION thread. > > Ie before we release this we should understand what that means. What > is being proposed is not just another release from branch-0.20 or > branch-0.22. > > Thanks, > Eli > > On Wed, May 4, 2011 at 5:30 PM, Mahadev Konar <[EMAIL PROTECTED]> wrote: >> Eli, >> I think the intent from the email was to just vote on this thread, >> which I agree with. >> Discussions should be done in a separate threads. Hopefully we can >> all stick to just voting! >> >> thanks >> mahadev >> >> On Wed, May 4, 2011 at 5:22 PM, Eli Collins <[EMAIL PROTECTED]> wrote: >>> Good suggestion, it would be helpful to hash out the issues around >>> compatibility, feature branches, version numbers, how to contribute at >>> Apache before putting up new votes that would be helpful, ie the vote >>> would go much smoother if all the issues with the previous vote were >>> addressed before starting a new one. >>> >>> Thanks, >>> Eli >>> >>> On Wed, May 4, 2011 at 5:05 PM, Eric Baldeschwieler >>> <[EMAIL PROTECTED]> wrote: >>>> Hi folks, >>>> >>>> Let's stay focused. Let's take the other threads onto other threads. This is a vote. >>>> >>>> To the extent naming is a problem, let's take that to a thread and find an acceptable proposal. >>>> >>>> To the extent folks want to collaborate on certifying the release for total lack of regression or collaborate on the cleanest possible merge, I think all interested parties should take these topics to another thread and divide up the work. >>>> >>>> If you've voted, you don't need to comment further on this thread, no matter what company you work for! >>>> >>>> Thanks, >>>> >>>> --- >>>> E14 - typing on glass >>>> >>>> On May 4, 2011, at 4:46 PM, "Todd Lipcon" <[EMAIL PROTECTED]> wrote: >>>> >>>>> On Wed, May 4, 2011 at 4:11 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote: >>>>> >>>>>> On May 4, 2011, at 4:09 PM, Tsz Wo (Nicholas), Sze wrote: >>>>>> >>>>>> The list seems highly inaccurate. Checked the first few N/A items. All >>>>>>> are >>>>>>> false positives. >>>>>>> >>>>>>> >>>>>> Also, can you please provide a list on features which are not related to >>>>>> gridmix benchmarks or herriot tests? >>>>>> >>>>> >>>>> Here are a few I quickly pulled up: >>>>> MAPREDUCE-2316 (docs for improved capacity scheduler) >>>>> MAPREDUCE-2355 (adds new config for heartbeat dampening in MR) >>>>> >>>>> " BZ-4182948. Add statistics logging to Fred for better visibility into >>>>> startup time costs. (Matt Foley)" >>>>> - I believe I saw a note from Matt on the JIRA yesterday about this feature, >>>>> where he decided that the version done in 203 wasn't a good approach, and >>>>> it's done differently in trunk (not sure if done yet). >>>>> >>>>> MAPREDUCE-2364 (important bug fix for localization) >>>>> - in fact most of localization is different in this branch compared to trunk >>>>> due to inclusion of MAPREDUCE-2378, the trunk version of which is still on >>>>> the "yahoo-merge" branch,. >>>>> >>>>> "New cunters for FileInput/OutputFormat. New Counter >>>>> MAP_OUTPUT_MATERIALZIED_BYTES. Related bugs: 4241034, 3418543, >>>>> 4217546" >>>>> - not sure which JIRA this is, I think I've seen a JIRA for trunk, but not >>>>> committed. >>>>> >>>>> - MAPREDUCE-1904, committed without JIRA as: >>>>> " . Reducing new Path(), RawFileStatus() creation overhead in >>>>> LocalDirAllocator" >>>>> not in trunk >>>>> >>>>> + BZ4101537 . When a queue is built without any access rights we explain
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Devaraj Das 2011-05-05, 01:10
+1 based on some single node tests I did (with security ON).
On 5/4/11 10:31 AM, "Owen O'Malley" <[EMAIL PROTECTED]> wrote: Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem. The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ Please download it, inspect it, compile it, and test it. Clearly, I'm +1. -- Owen
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Chris Douglas 2011-05-05, 01:12
I'm +1 on releasing rc1. The signature and hashes match on the
artifact, ran some of the more aggressive MR tests. Reviewed changes from rc0. It looks like we need a FAQ for this release, if only to prevent the same questions from being asked and answered across different threads and lists. Reservations, regressions, and pending work can also be documented there. Right now, Apache Hadoop releases are not recommended by its community. Instead, not only our end users, but other Apache projects run Cloudera's distribution. From all those wearing their Apache hat, I would like to see more effort directed toward a release that we can recommend soon and less time spent compiling tasks to delay it. Releasing this will complicate the documented process. However, that process *has not produced a usable release* for the last two out of six years. This is failure. Entertaining concerns like a one-to-one correspondence between commits and JIRA issues is bizarre in this context. Let's find a way to make progress instead of tossing pharisaic accusations of illegitimacy. -C On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: > Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem. > > The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > Please download it, inspect it, compile it, and test it. Clearly, I'm +1. > > -- Owen
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eric Baldeschwieler 2011-05-05, 01:18
Ok. I'll bite.
The point of a vote is to learn what everyone thinks. So far we have learned: 1 - the team that is trying to contribute code and do a release thinks it is ready. 2 - Cloudera does not think the release is a good idea. No more talk between the Team contributing code and cloudera will educate us further Let's hear from the rest of the community. In parallel on other threads, let's work out how to address concerns. That will be useful however the vote goes. I promise to continue to work with everyone to help drive releases. We've called a vote, so let it proceed. That is how apache works. Thanks! --- E14 - typing on glass PS this is my last comment on this thread. Start new ones if you are not casting a vote. On May 4, 2011, at 5:45 PM, "Konstantin Boudnik" <[EMAIL PROTECTED]> wrote: > I tend to agree. Changing release model of Apache Hadoop train isn't > something that should be done in a hassle or as a part of release > voting. > > If these questions aren't addressed - let's postpone the vote and > discuss all the complications or implications until they sorted out or > the consensus/compromise is reached. > > Cos > > On Wed, May 4, 2011 at 17:39, Eli Collins <[EMAIL PROTECTED]> wrote: >> The point is that these discussion should be sorted out, ie you don't >> change your development and release model on a release VOTE thread, >> you change it on a DISCUSSION thread. >> >> Ie before we release this we should understand what that means. What >> is being proposed is not just another release from branch-0.20 or >> branch-0.22. >> >> Thanks, >> Eli >> >> On Wed, May 4, 2011 at 5:30 PM, Mahadev Konar <[EMAIL PROTECTED]> wrote: >>> Eli, >>> I think the intent from the email was to just vote on this thread, >>> which I agree with. >>> Discussions should be done in a separate threads. Hopefully we can >>> all stick to just voting! >>> >>> thanks >>> mahadev >>> >>> On Wed, May 4, 2011 at 5:22 PM, Eli Collins <[EMAIL PROTECTED]> wrote: >>>> Good suggestion, it would be helpful to hash out the issues around >>>> compatibility, feature branches, version numbers, how to contribute at >>>> Apache before putting up new votes that would be helpful, ie the vote >>>> would go much smoother if all the issues with the previous vote were >>>> addressed before starting a new one. >>>> >>>> Thanks, >>>> Eli >>>> >>>> On Wed, May 4, 2011 at 5:05 PM, Eric Baldeschwieler >>>> <[EMAIL PROTECTED]> wrote: >>>>> Hi folks, >>>>> >>>>> Let's stay focused. Let's take the other threads onto other threads. This is a vote. >>>>> >>>>> To the extent naming is a problem, let's take that to a thread and find an acceptable proposal. >>>>> >>>>> To the extent folks want to collaborate on certifying the release for total lack of regression or collaborate on the cleanest possible merge, I think all interested parties should take these topics to another thread and divide up the work. >>>>> >>>>> If you've voted, you don't need to comment further on this thread, no matter what company you work for! >>>>> >>>>> Thanks, >>>>> >>>>> --- >>>>> E14 - typing on glass >>>>> >>>>> On May 4, 2011, at 4:46 PM, "Todd Lipcon" <[EMAIL PROTECTED]> wrote: >>>>> >>>>>> On Wed, May 4, 2011 at 4:11 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote: >>>>>> >>>>>>> On May 4, 2011, at 4:09 PM, Tsz Wo (Nicholas), Sze wrote: >>>>>>> >>>>>>> The list seems highly inaccurate. Checked the first few N/A items. All >>>>>>>> are >>>>>>>> false positives. >>>>>>>> >>>>>>>> >>>>>>> Also, can you please provide a list on features which are not related to >>>>>>> gridmix benchmarks or herriot tests? >>>>>>> >>>>>> >>>>>> Here are a few I quickly pulled up: >>>>>> MAPREDUCE-2316 (docs for improved capacity scheduler) >>>>>> MAPREDUCE-2355 (adds new config for heartbeat dampening in MR) >>>>>> >>>>>> " BZ-4182948. Add statistics logging to Fred for better visibility into >>>>>> startup time costs. (Matt Foley)" >>>>>> - I believe I saw a note from Matt on the JIRA yesterday about this feature,
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-05, 01:19
> Entertaining concerns like a one-to-one
> correspondence between commits and JIRA issues is bizarre in this > context. It's not about whether there's a jira, it's about whether the code was reviewed. We think code should be reviewed and vote on by the community before releasing it. That's how we've always rolled. Everyone agrees releases are too infrequent, that's not an excuse for steam rolling the community. Thanks, Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eli Collins 2011-05-05, 01:24
On Wed, May 4, 2011 at 6:18 PM, Eric Baldeschwieler
<[EMAIL PROTECTED]> wrote: > Ok. I'll bite. > > The point of a vote is to learn what everyone thinks. So far we have learned: > > 1 - the team that is trying to contribute code and do a release thinks it is ready. > > 2 - Cloudera does not think the release is a good idea. > I don't think that's true. There's a difference between not supporting a given rc and not supporting a release from this branch in general. With both of my hats on, I want code to be reviewed before being release, I want releases to not regress against previous releases, I don't want the next major release to regress against a stable release, I want the community to discuss new version schemes and development models vs adopting them by accident just because we voted on a particular release. Thanks, Eli
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Jean-Daniel Cryans 2011-05-05, 01:24
Non-biding -1.
I did download it and checked it out, but when I look at the documentation I see it says "Hadoop 0.20 documentation" in the tab on top. From what I can tell this isn't the branch 0.20 so I think it's an error and from a user point of view this looks more like something I would call 0.22 (although yes I understand this is 0.20 +security +whatever). Why would a single company push so hard to go against the "normal" release process just for "the benefit of putting our work in the hands of all hadoop users" is beyond me. It's not like people were begging on the mailing lists to be able to get their hands on such a release to the point where an emergency point release including tons of new features is needed. So to me the more logical reason would be monetary gains, that I would understand better from a for-profit company. But then why go through the hurdles of having such an ASF release when Y! isn't even selling anything remotely related to Hadoop services? And why now? But then there's this spinoff thing and it suddenly makes a lot more sense. E14 said earlier that "That is how apache works." I would say yes, maybe this is how it works, but I'm not sure I want to see it working like _that_. The ASF shouldn't be the vehicle for a single (future) company's wishes. J-D On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: > Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem. > > The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > Please download it, inspect it, compile it, and test it. Clearly, I'm +1. > > -- Owen
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Milind Bhandarkar 2011-05-05, 01:30
My (non-binding) vote for 0.20.203.0-rc1 is +1.
I downloaded, compiled, ran tests, ran my bigrams example, all ran perfectly. (I did a single node test without security on.) The voting criteria I used are: 1. Is this a working release? : Yes 2. Does it take the codebase forward? : Yes 3. Does it have features that the user community might find valuable? : Yes - milind -- Milind Bhandarkar [EMAIL PROTECTED] +1-650-776-3167 On 5/4/11 6:10 PM, "Devaraj Das" <[EMAIL PROTECTED]> wrote: >+1 based on some single node tests I did (with security ON). > > >On 5/4/11 10:31 AM, "Owen O'Malley" <[EMAIL PROTECTED]> wrote: > >Here's an updated release candidate for 0.20.203.0. I've incorporated the >feedback and included all of the patches from 0.20.2, which is the last >stable release. I also fixed the eclipse-plugin problem. > >The candidate is at: >http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > >Please download it, inspect it, compile it, and test it. Clearly, I'm +1. > >-- Owen >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Roy T. Fielding 2011-05-05, 01:56
On May 4, 2011, at 5:39 PM, Eli Collins wrote:
> The point is that these discussion should be sorted out, ie you don't > change your development and release model on a release VOTE thread, > you change it on a DISCUSSION thread. That is no different than saying you have a right to veto a release until the issue is addressed, which you don't have. A release vote is a majority decision. If the majority decides to release, then whatever gets released will define the new norm by which policies are assumed. If not released, then I suggest collaborating more on the policies before trying to vote again. Either way, we don't hold up a vote for the sake of a policy discussion because voting is a more efficient means of discovering if the policy really matters. ....Roy
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Andrew Purtell 2011-05-05, 02:12
Speculation either on the motives of those objecting to a release or of those making contributions or proposing a release does not advance progress. The accusations and counter-accusations seen on this thread are regrettable and I feel less and less confident in the future of Apache Hadoop as time goes on. As a strong believer in and advocate of open source as an answer to technical and architectural challenges, I am pained to see the members of what should be a vibrant community litigating in an ultimately self-defeating way. If only this energy put into argument could be channeled into code or patches...
In open source, if opinions were code we would rule the world. So what of this candidate? Artifact looks good, DFS tests are good, MR tests are good. Looked over some of the documentation and found no errors. To my knowledge this is now a superset of branch-0.20, addressing the reasonably determined deficit of rc0. There seems no reason other issues cannot be addressed subsequently. There has not been a release of Apache Hadoop 0.20 since at least Feb 6 2010 yet since this time important security enhancements have been contributed, but in the form of an Apache product these are only available as patches on a non-release branch. Forward progress of the Apache product seems more important than achieving the perfect release in all eyes. For example, append features remain on a non-release branch. I would really have liked to see the append changes included in this candidate, but this is not grounds for objection merely regret, and I hope this can be covered by a subsequent release, perhaps soon. After security and append features are in 0.20, in my personal humble opinion the 0.20 release in total is sufficient and all attention should be paid to the next release (0.22 or whatever), except for critical bug fixes. +1 Best regards, - Andy Problems worthy of attack prove their worth by hitting back. - Piet Hein (via Tom White)
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Dhruba Borthakur 2011-05-05, 02:20
+1.
I downloaded the bits, compiled and ran unit tests. Also, looked at the source code to some extent. Looks good. -dhruba On Wed, May 4, 2011 at 6:56 PM, Roy T. Fielding <[EMAIL PROTECTED]> wrote: > On May 4, 2011, at 5:39 PM, Eli Collins wrote: > > > The point is that these discussion should be sorted out, ie you don't > > change your development and release model on a release VOTE thread, > > you change it on a DISCUSSION thread. > > That is no different than saying you have a right to veto a > release until the issue is addressed, which you don't have. > > A release vote is a majority decision. If the majority > decides to release, then whatever gets released will define > the new norm by which policies are assumed. If not released, > then I suggest collaborating more on the policies before > trying to vote again. > > Either way, we don't hold up a vote for the sake of a > policy discussion because voting is a more efficient > means of discovering if the policy really matters. > > ....Roy > > -- Connect to me at http://www.facebook.com/dhruba
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Roy T. Fielding 2011-05-05, 02:22
On May 4, 2011, at 6:24 PM, Jean-Daniel Cryans wrote:
> Non-biding -1. > > I did download it and checked it out, but when I look at the > documentation I see it says "Hadoop 0.20 documentation" in the tab on > top. From what I can tell this isn't the branch 0.20 so I think it's > an error and from a user point of view this looks more like something > I would call 0.22 (although yes I understand this is 0.20 +security > +whatever). > > Why would a single company push so hard to go against the "normal" > release process just for "the benefit of putting our work in the hands > of all hadoop users" is beyond me. It's not like people were begging > on the mailing lists to be able to get their hands on such a release > to the point where an emergency point release including tons of new > features is needed. > > So to me the more logical reason would be monetary gains, that I would > understand better from a for-profit company. But then why go through > the hurdles of having such an ASF release when Y! isn't even selling > anything remotely related to Hadoop services? And why now? > > But then there's this spinoff thing and it suddenly makes a lot more sense. > > E14 said earlier that "That is how apache works." > > I would say yes, maybe this is how it works, but I'm not sure I want > to see it working like _that_. The ASF shouldn't be the vehicle for a > single (future) company's wishes. The ASF is a vehicle for whomever wishes to collaborate on a given project. Collaboration means helping do the work. Those who do the work may do so for whatever reasons that they think are good, whether it is because they feel like being charitable today, they get paid a salary and the big boss said "work on this part", or because they just have an itch worth scratching. Apache does not care why people choose to collaborate or how they choose to apply their own intellectual efforts. We welcome all forms of contribution under the terms of our license. What we do require is a certain amount of civility regarding our voting procedures and an emphasis on individual responsibility for your votes. Anyone caught *voting* a particular way just because the boss says so will be dealt with severely. Votes are how we do quality control and make decisions, and no other company can be allowed to make decisions for our non-profit. ....Roy
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Ian Holsman 2011-05-05, 03:34
just as a Tally
we have 6+1's (andy.. is yours binding?? if so 7) and 3 -1's. so according to the votes so far we are releasing.. but according to our bylaws.. we need to wait 7 days for everyone to chime in. --I On May 5, 2011, at 12:22 PM, Roy T. Fielding wrote: > On May 4, 2011, at 6:24 PM, Jean-Daniel Cryans wrote: > >> Non-biding -1. >> >> I did download it and checked it out, but when I look at the >> documentation I see it says "Hadoop 0.20 documentation" in the tab on >> top. From what I can tell this isn't the branch 0.20 so I think it's >> an error and from a user point of view this looks more like something >> I would call 0.22 (although yes I understand this is 0.20 +security >> +whatever). >> >> Why would a single company push so hard to go against the "normal" >> release process just for "the benefit of putting our work in the hands >> of all hadoop users" is beyond me. It's not like people were begging >> on the mailing lists to be able to get their hands on such a release >> to the point where an emergency point release including tons of new >> features is needed. >> >> So to me the more logical reason would be monetary gains, that I would >> understand better from a for-profit company. But then why go through >> the hurdles of having such an ASF release when Y! isn't even selling >> anything remotely related to Hadoop services? And why now? >> >> But then there's this spinoff thing and it suddenly makes a lot more sense. >> >> E14 said earlier that "That is how apache works." >> >> I would say yes, maybe this is how it works, but I'm not sure I want >> to see it working like _that_. The ASF shouldn't be the vehicle for a >> single (future) company's wishes. > > The ASF is a vehicle for whomever wishes to collaborate on a > given project. Collaboration means helping do the work. Those > who do the work may do so for whatever reasons that they think > are good, whether it is because they feel like being charitable > today, they get paid a salary and the big boss said "work on > this part", or because they just have an itch worth scratching. > > Apache does not care why people choose to collaborate or > how they choose to apply their own intellectual efforts. We > welcome all forms of contribution under the terms of our license. > > What we do require is a certain amount of civility regarding > our voting procedures and an emphasis on individual responsibility > for your votes. Anyone caught *voting* a particular way just > because the boss says so will be dealt with severely. Votes > are how we do quality control and make decisions, and no other > company can be allowed to make decisions for our non-profit. > > ....Roy
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Nigel Daley 2011-05-05, 04:17
I'm really not sure yet how to vote here. I was going to vote +1 for what I was told by a number of Yahoo! committers would be a one time release as Yahoo! "comes back to Apache" after a hiatus last fall/winter and ended their own distribution. Clearly this code was not all developed as a community process, but I was going to support a one time release of what they had developed in exclusion.
Then I read Roy's email, which confused me. We would he or I or anyone else support this release setting precedent or policy since it would walk all over our bylaws, community process, and the consensus nature of our foundation? This release vote is a lazy majority of the PMC, but other decisions rolled up in this are supposed to be lazy majority of active committers or, in the case of code changes, a lazy consensus. Setting policy by this release means any sufficiently large group of committers could go off and develop on their own and then commit it to a branch and call a release. Furthermore, it now sounds like this is possibly the first in a line of feature releases off this branch. Bug fixes releases, sure. But feature releases? What's wrong with trunk? Nige On May 4, 2011, at 6:56 PM, Roy T. Fielding wrote: > On May 4, 2011, at 5:39 PM, Eli Collins wrote: > >> The point is that these discussion should be sorted out, ie you don't >> change your development and release model on a release VOTE thread, >> you change it on a DISCUSSION thread. > > That is no different than saying you have a right to veto a > release until the issue is addressed, which you don't have. > > A release vote is a majority decision. If the majority > decides to release, then whatever gets released will define > the new norm by which policies are assumed. If not released, > then I suggest collaborating more on the policies before > trying to vote again. > > Either way, we don't hold up a vote for the sake of a > policy discussion because voting is a more efficient > means of discovering if the policy really matters. > > ....Roy >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Jeff Hammerbacher 2011-05-05, 04:54
-1.
As Roy says, "whatever gets released will define the new norm by which policies are assumed", and I certainly don't want this project to change its norms to accommodate bad practices. In particular, Eli presented three very reasonable technical objections to this release. To summarize: 1) Let's get the JIRAs that are going into this release into trunk first. 2) Let's create a JIRA for each issue in the release. 3) Let's stick to the release numbering conventions established for this project. I know the folks at Yahoo! are all professional engineers and done tremendous work to help get the project to this point. There's no doubt in my mind they understand the validity of the above three technical objections. In fact, many of them helped author our "How to Contribute" page, which established these conventions: wiki.apache.org/hadoop/HowToContribute. We develop new features against trunk, we create JIRAs for each issue, we review code before it goes into trunk, and we only update old releases with bug fixes. I couldn't be more excited to have Yahoo! once again doing development in Apache, and I hope that we can work together to get the work that you've done in this branch into one of our upcoming feature releases. I hope those who voted +1 before Roy clarified what a release vote will mean for future project norms will reconsider their votes. While there may be many competing agendas in this community, we all wish to see Apache Hadoop releases of the highest quality. Changing our norms to allow huge, unreviewed patch sets introducing new features into a past release is a step in the wrong direction. With a little bit of elbow grease, we can get the work done in this branch into trunk, get 0.22 out the door, and be ready for a great 0.23 release. Later, Jeff On Wed, May 4, 2011 at 9:17 PM, Nigel Daley <[EMAIL PROTECTED]> wrote: > I'm really not sure yet how to vote here. I was going to vote +1 for what > I was told by a number of Yahoo! committers would be a one time release as > Yahoo! "comes back to Apache" after a hiatus last fall/winter and ended > their own distribution. Clearly this code was not all developed as a > community process, but I was going to support a one time release of what > they had developed in exclusion. > > Then I read Roy's email, which confused me. We would he or I or anyone > else support this release setting precedent or policy since it would walk > all over our bylaws, community process, and the consensus nature of our > foundation? This release vote is a lazy majority of the PMC, but other > decisions rolled up in this are supposed to be lazy majority of active > committers or, in the case of code changes, a lazy consensus. Setting > policy by this release means any sufficiently large group of committers > could go off and develop on their own and then commit it to a branch and > call a release. > > Furthermore, it now sounds like this is possibly the first in a line of > feature releases off this branch. Bug fixes releases, sure. But feature > releases? What's wrong with trunk? > > Nige > > On May 4, 2011, at 6:56 PM, Roy T. Fielding wrote: > > > On May 4, 2011, at 5:39 PM, Eli Collins wrote: > > > >> The point is that these discussion should be sorted out, ie you don't > >> change your development and release model on a release VOTE thread, > >> you change it on a DISCUSSION thread. > > > > That is no different than saying you have a right to veto a > > release until the issue is addressed, which you don't have. > > > > A release vote is a majority decision. If the majority > > decides to release, then whatever gets released will define > > the new norm by which policies are assumed. If not released, > > then I suggest collaborating more on the policies before > > trying to vote again. > > > > Either way, we don't hold up a vote for the sake of a > > policy discussion because voting is a more efficient > > means of discovering if the policy really matters. > > > > ....Roy > > >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Eric Sammer 2011-05-05, 05:47
(non-binding) -1 for similar reasons to what Jeff and others have laid out,
and certainly if we're going to change the development process as a side effect of a release vote. On Wed, May 4, 2011 at 9:54 PM, Jeff Hammerbacher <[EMAIL PROTECTED]>wrote: > -1. > > As Roy says, "whatever gets released will define the new norm by which > policies are assumed", and I certainly don't want this project to change > its > norms to accommodate bad practices. In particular, Eli presented three very > reasonable technical objections to this release. To summarize: > > 1) Let's get the JIRAs that are going into this release into trunk first. > 2) Let's create a JIRA for each issue in the release. > 3) Let's stick to the release numbering conventions established for this > project. > > I know the folks at Yahoo! are all professional engineers and done > tremendous work to help get the project to this point. There's no doubt in > my mind they understand the validity of the above three technical > objections. In fact, many of them helped author our "How to Contribute" > page, which established these conventions: > wiki.apache.org/hadoop/HowToContribute. We develop new features against > trunk, we create JIRAs for each issue, we review code before it goes into > trunk, and we only update old releases with bug fixes. > > I couldn't be more excited to have Yahoo! once again doing development in > Apache, and I hope that we can work together to get the work that you've > done in this branch into one of our upcoming feature releases. > > I hope those who voted +1 before Roy clarified what a release vote will > mean > for future project norms will reconsider their votes. > > While there may be many competing agendas in this community, we all wish to > see Apache Hadoop releases of the highest quality. Changing our norms to > allow huge, unreviewed patch sets introducing new features into a past > release is a step in the wrong direction. > > With a little bit of elbow grease, we can get the work done in this branch > into trunk, get 0.22 out the door, and be ready for a great 0.23 release. > > Later, > Jeff > > On Wed, May 4, 2011 at 9:17 PM, Nigel Daley <[EMAIL PROTECTED]> wrote: > > > I'm really not sure yet how to vote here. I was going to vote +1 for > what > > I was told by a number of Yahoo! committers would be a one time release > as > > Yahoo! "comes back to Apache" after a hiatus last fall/winter and ended > > their own distribution. Clearly this code was not all developed as a > > community process, but I was going to support a one time release of what > > they had developed in exclusion. > > > > Then I read Roy's email, which confused me. We would he or I or anyone > > else support this release setting precedent or policy since it would walk > > all over our bylaws, community process, and the consensus nature of our > > foundation? This release vote is a lazy majority of the PMC, but other > > decisions rolled up in this are supposed to be lazy majority of active > > committers or, in the case of code changes, a lazy consensus. Setting > > policy by this release means any sufficiently large group of committers > > could go off and develop on their own and then commit it to a branch and > > call a release. > > > > Furthermore, it now sounds like this is possibly the first in a line of > > feature releases off this branch. Bug fixes releases, sure. But feature > > releases? What's wrong with trunk? > > > > Nige > > > > On May 4, 2011, at 6:56 PM, Roy T. Fielding wrote: > > > > > On May 4, 2011, at 5:39 PM, Eli Collins wrote: > > > > > >> The point is that these discussion should be sorted out, ie you don't > > >> change your development and release model on a release VOTE thread, > > >> you change it on a DISCUSSION thread. > > > > > > That is no different than saying you have a right to veto a > > > release until the issue is addressed, which you don't have. > > > > > > A release vote is a majority decision. If the majority > > > decides to release, then whatever gets released will define Eric Sammer twitter: esammer data: www.cloudera.com
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Matei Zaharia 2011-05-05, 06:21
I'm not going to cast a vote, but I'm concerned about this for the same reasons Eli brought up -- in particular, compatibility with 0.22. I'm an author of several patches that have gone into 0.21 and trunk, only to stay on hiatus for 2 years because the project hasn't made a stable release since 0.20. (Today, many of these patches are being used through CDH, which is great, but it would be nice to see them in an Apache release too.) This push of features into 0.20.203 makes a widely used 0.22 seem even more distant. Can we at least get a confirmation that these changes will be included in 0.22, as well as a timeline?
To support a vibrant developer community, Apache Hadoop should not just be a mechanism for Yahoo and Cloudera to publish patches. It should include a well-defined process for smaller third-party contributors to push changes that will make it into a stable release within a reasonable time horizon. The lack of such a process has been a major cause for the slowdown in the project in my perspective. Matei On May 4, 2011, at 10:47 PM, Eric Sammer wrote: > (non-binding) -1 for similar reasons to what Jeff and others have laid out, > and certainly if we're going to change the development process as a side > effect of a release vote. > > On Wed, May 4, 2011 at 9:54 PM, Jeff Hammerbacher <[EMAIL PROTECTED]>wrote: > >> -1. >> >> As Roy says, "whatever gets released will define the new norm by which >> policies are assumed", and I certainly don't want this project to change >> its >> norms to accommodate bad practices. In particular, Eli presented three very >> reasonable technical objections to this release. To summarize: >> >> 1) Let's get the JIRAs that are going into this release into trunk first. >> 2) Let's create a JIRA for each issue in the release. >> 3) Let's stick to the release numbering conventions established for this >> project. >> >> I know the folks at Yahoo! are all professional engineers and done >> tremendous work to help get the project to this point. There's no doubt in >> my mind they understand the validity of the above three technical >> objections. In fact, many of them helped author our "How to Contribute" >> page, which established these conventions: >> wiki.apache.org/hadoop/HowToContribute. We develop new features against >> trunk, we create JIRAs for each issue, we review code before it goes into >> trunk, and we only update old releases with bug fixes. >> >> I couldn't be more excited to have Yahoo! once again doing development in >> Apache, and I hope that we can work together to get the work that you've >> done in this branch into one of our upcoming feature releases. >> >> I hope those who voted +1 before Roy clarified what a release vote will >> mean >> for future project norms will reconsider their votes. >> >> While there may be many competing agendas in this community, we all wish to >> see Apache Hadoop releases of the highest quality. Changing our norms to >> allow huge, unreviewed patch sets introducing new features into a past >> release is a step in the wrong direction. >> >> With a little bit of elbow grease, we can get the work done in this branch >> into trunk, get 0.22 out the door, and be ready for a great 0.23 release. >> >> Later, >> Jeff >> >> On Wed, May 4, 2011 at 9:17 PM, Nigel Daley <[EMAIL PROTECTED]> wrote: >> >>> I'm really not sure yet how to vote here. I was going to vote +1 for >> what >>> I was told by a number of Yahoo! committers would be a one time release >> as >>> Yahoo! "comes back to Apache" after a hiatus last fall/winter and ended >>> their own distribution. Clearly this code was not all developed as a >>> community process, but I was going to support a one time release of what >>> they had developed in exclusion. >>> >>> Then I read Roy's email, which confused me. We would he or I or anyone >>> else support this release setting precedent or policy since it would walk >>> all over our bylaws, community process, and the consensus nature of our
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Sanjay Radia 2011-05-05, 06:32
+1
downloaded, built, deployed on one node cluster. sanjay On May 4, 2011, at 10:31 AM, Owen O'Malley wrote: > Here's an updated release candidate for 0.20.203.0. I've > incorporated the feedback and included all of the patches from > 0.20.2, which is the last stable release. I also fixed the eclipse- > plugin problem. > > The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > Please download it, inspect it, compile it, and test it. Clearly, > I'm +1. > > -- Owen
-
RE: [VOTE] Release candidate 0.20.203.0-rc1Jane Chen 2011-05-05, 06:49
Agree. As a new comer, I had trouble figuring out which version to adopt -- 0.20.2 vs. 0.21. This new release candidate seems to add more confusion to general users.
Jane -----Original Message----- From: Matei Zaharia [mailto:[EMAIL PROTECTED]] Sent: Wednesday, May 04, 2011 11:21 PM To: [EMAIL PROTECTED] Subject: Re: [VOTE] Release candidate 0.20.203.0-rc1 I'm not going to cast a vote, but I'm concerned about this for the same reasons Eli brought up -- in particular, compatibility with 0.22. I'm an author of several patches that have gone into 0.21 and trunk, only to stay on hiatus for 2 years because the project hasn't made a stable release since 0.20. (Today, many of these patches are being used through CDH, which is great, but it would be nice to see them in an Apache release too.) This push of features into 0.20.203 makes a widely used 0.22 seem even more distant. Can we at least get a confirmation that these changes will be included in 0.22, as well as a timeline? To support a vibrant developer community, Apache Hadoop should not just be a mechanism for Yahoo and Cloudera to publish patches. It should include a well-defined process for smaller third-party contributors to push changes that will make it into a stable release within a reasonable time horizon. The lack of such a process has been a major cause for the slowdown in the project in my perspective. Matei On May 4, 2011, at 10:47 PM, Eric Sammer wrote: > (non-binding) -1 for similar reasons to what Jeff and others have laid out, > and certainly if we're going to change the development process as a side > effect of a release vote. > > On Wed, May 4, 2011 at 9:54 PM, Jeff Hammerbacher <[EMAIL PROTECTED]>wrote: > >> -1. >> >> As Roy says, "whatever gets released will define the new norm by which >> policies are assumed", and I certainly don't want this project to change >> its >> norms to accommodate bad practices. In particular, Eli presented three very >> reasonable technical objections to this release. To summarize: >> >> 1) Let's get the JIRAs that are going into this release into trunk first. >> 2) Let's create a JIRA for each issue in the release. >> 3) Let's stick to the release numbering conventions established for this >> project. >> >> I know the folks at Yahoo! are all professional engineers and done >> tremendous work to help get the project to this point. There's no doubt in >> my mind they understand the validity of the above three technical >> objections. In fact, many of them helped author our "How to Contribute" >> page, which established these conventions: >> wiki.apache.org/hadoop/HowToContribute. We develop new features against >> trunk, we create JIRAs for each issue, we review code before it goes into >> trunk, and we only update old releases with bug fixes. >> >> I couldn't be more excited to have Yahoo! once again doing development in >> Apache, and I hope that we can work together to get the work that you've >> done in this branch into one of our upcoming feature releases. >> >> I hope those who voted +1 before Roy clarified what a release vote will >> mean >> for future project norms will reconsider their votes. >> >> While there may be many competing agendas in this community, we all wish to >> see Apache Hadoop releases of the highest quality. Changing our norms to >> allow huge, unreviewed patch sets introducing new features into a past >> release is a step in the wrong direction. >> >> With a little bit of elbow grease, we can get the work done in this branch >> into trunk, get 0.22 out the door, and be ready for a great 0.23 release. >> >> Later, >> Jeff >> >> On Wed, May 4, 2011 at 9:17 PM, Nigel Daley <[EMAIL PROTECTED]> wrote: >> >>> I'm really not sure yet how to vote here. I was going to vote +1 for >> what >>> I was told by a number of Yahoo! committers would be a one time release >> as >>> Yahoo! "comes back to Apache" after a hiatus last fall/winter and ended >>> their own distribution. Clearly this code was not all developed as a
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Jean-Daniel Cryans 2011-05-05, 06:52
Roy,
On Wed, May 4, 2011 at 7:22 PM, Roy T. Fielding <[EMAIL PROTECTED]> wrote: > The ASF is a vehicle for whomever wishes to collaborate on a > given project. Collaboration means helping do the work. Those > who do the work may do so for whatever reasons that they think > are good, whether it is because they feel like being charitable > today, they get paid a salary and the big boss said "work on > this part", or because they just have an itch worth scratching. > > Apache does not care why people choose to collaborate or > how they choose to apply their own intellectual efforts. We > welcome all forms of contribution under the terms of our license. I don't think I was arguing against the contribution of the code in that branch, it's very welcome, but I'm questioning (and ranting about) the motivation for releasing a version that even just by name is a weird hulla-hoop around the usual development practices that Hadoop has had in the past (not that it's set in stone). So I wanted to contribute my negative non-binding vote to highlight that this release is probably very confusing for the general user. This is 0.20, but it's not. Also it has more numbers, and it starts at 203. Why doing this at all instead of just moving on with 0.22? Or is 0.22 bound to be like 0.21? It almost begs the question if this should be called 0.22.0 then. > > What we do require is a certain amount of civility regarding > our voting procedures and an emphasis on individual responsibility > for your votes. Anyone caught *voting* a particular way just > because the boss says so will be dealt with severely. Votes > are how we do quality control and make decisions, and no other > company can be allowed to make decisions for our non-profit. Yeah I don't think that's a problem here, everyone seem to have their very own strong opinions.
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Andrew Purtell 2011-05-05, 16:18
Non binding.
- Andy --- On Wed, 5/4/11, Ian Holsman <[EMAIL PROTECTED]> wrote: > just as a Tally we have > 6+1's (andy.. is yours binding?? if so 7) > and 3 -1's. > > so according to the votes so far we are releasing.. but > according to our bylaws.. we need to wait 7 days for > everyone to chime in. > > --I
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Stack 2011-05-05, 18:33
I abstain: +/- 0.
I can't vote against the good work done by the crew at Y! But I can't vote for a 0.20.clusterbomb release that railroads over precedent compounding further the existing confusion that already exists around the state of Hadoop. Thanks, St.Ack On Wed, May 4, 2011 at 10:31 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: > Here's an updated release candidate for 0.20.203.0. I've incorporated the feedback and included all of the patches from 0.20.2, which is the last stable release. I also fixed the eclipse-plugin problem. > > The candidate is at: http://people.apache.org/~omalley/hadoop-0.20.203.0-rc1/ > > Please download it, inspect it, compile it, and test it. Clearly, I'm +1. > > -- Owen
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Suresh Srinivas 2011-05-05, 19:02
+1 Downloaded the release, validated checksums, deployed a single-node cluster, and ran some HDFS sanity tests, Web UI tests and mapreduce examples. Regards, Suresh
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Jakob Homan 2011-05-05, 20:56
+1
Downloaded, verified, tested on single node cluster to my satisfaction. We've also brought this release up on a sizable cluster and checked its basic sanity. Regardless of the difficult path we've had over the past year, this is a good chunk of code to get out to the community. I'd much rather explain a convoluted numbering system or what is or isn't in this release than continue to apologize for having no release at all. -Jakob On Thu, May 5, 2011 at 12:02 PM, Suresh Srinivas <[EMAIL PROTECTED]> wrote: > > +1 > > Downloaded the release, validated checksums, deployed a single-node > cluster, and ran some HDFS sanity tests, Web UI tests and mapreduce > examples. > > Regards, > Suresh > > > >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Roy T. Fielding 2011-05-05, 21:10
On May 4, 2011, at 11:52 PM, Jean-Daniel Cryans wrote:
> Roy, > > On Wed, May 4, 2011 at 7:22 PM, Roy T. Fielding <[EMAIL PROTECTED]> wrote: >> The ASF is a vehicle for whomever wishes to collaborate on a >> given project. Collaboration means helping do the work. Those >> who do the work may do so for whatever reasons that they think >> are good, whether it is because they feel like being charitable >> today, they get paid a salary and the big boss said "work on >> this part", or because they just have an itch worth scratching. >> >> Apache does not care why people choose to collaborate or >> how they choose to apply their own intellectual efforts. We >> welcome all forms of contribution under the terms of our license. > > I don't think I was arguing against the contribution of the code in > that branch, it's very welcome, but I'm questioning (and ranting > about) the motivation for releasing a version that even just by name > is a weird hulla-hoop around the usual development practices that > Hadoop has had in the past (not that it's set in stone). Yes, and I said that kind of questioning is not appropriate. You are not responsible for other peoples' motivation. > So I wanted to contribute my negative non-binding vote to highlight > that this release is probably very confusing for the general user. > This is 0.20, but it's not. Also it has more numbers, and it starts at > 203. Why doing this at all instead of just moving on with 0.22? Or is > 0.22 bound to be like 0.21? It almost begs the question if this should > be called 0.22.0 then. Yes, I already made that same point. You don't need to talk about motivation in order to do so. If I had a vote, I would have voted -1 just because version numbers do matter to users, the three number form is well-known, and minting new versions is far cheaper than adding extra numbers. I'd have cut the release candidate as 0.30. However, I did not do that work. The person who did the work chose 0.20.203.0. Anyone who doesn't like that should vote accordingly and, preferably, make your communication about such things more open in the future so that you don't waste others' time on extra builds. And if the majority thinks releasing these bits are more important than my concerns, then I have to accept that as the will of the project. Please note also that policies are not technical discussions. Likewise, version numbers are not technical. If those were technical changes then anyone on the PMC could veto them, which would effectively mean anyone could veto a release and the project would quickly devolve into tyranny by minority. Likewise, just because I said that a successful release defines its own set of precedents (and therein policy), that doesn't mean the project can't vote on a new policy the next day or make another release that sets it again moving forward. Progress is in the doing. ....Roy
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Tsz Wo \ 2011-05-06, 00:42
+1 on 0.20.203.0-rc1.
I downloaded the release, verified signature, verified some message digests, started a one-node cluster and tested it with my MapReduce Math Library. In particular, I have run *DistMpMultTest* It runs 5 jobs: - 2 forward FFT jobs for calculating two 4086-dimensional DFTs. - 1 backward FFT job for calculating componentwise multiplication using a c program with GMP lib, and a 4086-dimensional inverse DFTs. - 1 componentwise summation job. - 1 job for carrying. *DistBbp* It runs 14 jobs for computing the 1024 bits of pi right after the billionth bit positions. Please see the details below. Regards, Nicholas ****** DistMpMultTest ****************************************************** distfft.VERSION = 20110124 distmpmult.VERSION = 20110307 distmpmult.LOCAL_THRESHOLD = 2^16 (=65536) args = [#=2 0: 18 1: gen ] dir = DistMpMultTest-20110505-222621889 NEW: Zahlen(bitsPerDigit=32, digitsPerArray=4096, numArrays=128) SchonhageStrassen.Factory.valueOf(numDigits=262144): digitsPerOperand = 2^18 (=262144) (highest=262144) bitsPerElement = 2^12 (=4096) D = 2^12 (=4096) modulusExponent = 10240 (ss_e=8224) efficiency = 0.801171875 (N=2^24 (=16777216), n=10240) NEW: SchonhageStrassen[modulas=2^10240 + 1, D=2^12 (=4096), bitsPerElement=2^12 (=4096), Z=Zahlen(bitsPerDigit=2^5, digitLimit=2^32, digitsPerArray=2^12, numArrays=128)] NEW: Zahlen(bitsPerDigit=32, digitsPerArray=4096, numArrays=1) NEW: SchonhageStrassen[modulas=2^10240 + 1, D=2^12 (=4096), bitsPerElement=2^12 (=4096), Z=Zahlen(bitsPerDigit=2^5, digitLimit=2^32, digitsPerArray=2^12, numArrays=1)] numDigits = 2^18 (=262144) (e=18) digitsPerOperand = 262144 J = 2^6 (=64) K = 2^6 (=64) Verifier: STARTED WorkGroup: Created DistMpMultTest with 2 threads. multiply_mapreduce: c; x = [a, b] distfft: forward inverse = false distmpbase: Z = Zahlen bitsPerDigit = 2^5 (=32) digitLimit = 2^32 (=4294967296) digitsPerArray = 2^12 (=4096) numArrays = 1 digitsSupported = 4096 schonhagestrassen = SchonhageStrassen D = 2^12 (=4096) D^(-1) = -2^10228 modulus = 2^10240 + 1 bitsPerElement = 2^12 (=4096) zeta = [2^0, 2^5, 2^10, 2^15, 2^20, 2^25, 2^30, 2^35, 2^40, 2^45, ...] J = 2^6 (=64) K = 2^6 (=64) dir = DistMpMultTest-20110505-222621889 digitsPerOperand = 262144 descriptor = FunctionDescriptor(input=a, output=a') descriptor = FunctionDescriptor(input=b, output=b') forward a: DistFft(J=2^6 (=64), K=2^6 (=64), dir=DistMpMultTest-20110505-222621889, inverse=false) GmpMultiplier(Thread-260, count=0): cmd=[./gmp_mult, 16] GmpMultiplier(Thread-260, count=1): messager started. GmpMultiplier(Thread-260, count=1): GmpMultiplier(Thread-260, count=1): START: Thu May 5 22:26:28 2011 GmpMultiplier(Thread-260, count=1): VERSION=20110122b, GMP4.1.4 GmpMultiplier(Thread-260, count=1): GmpMultiplier(Thread-260, count=1): argc=2, argv=[./gmp_mult, 16] SUBMIT JOB: DistMpMultTest-20110505-222621889: a' = dft(a) forward b: DistFft(J=2^6 (=64), K=2^6 (=64), dir=DistMpMultTest-20110505-222621889, inverse=false) SUBMIT JOB: DistMpMultTest-20110505-222621889: b' = dft(b) Verifier: expected = + 0129ED2D B8CF7E1D A9BF33D8 9A294511 8508413C ... 4EA85F30 EF79A552 1C080D0C 82F6882C 8CE9F355 (524,288 digits, 16,777,209 bits) descriptor = FunctionDescriptor(input=a' ** b', output=c') backward: DistFft(J=2^6 (=64), K=2^6 (=64), dir=DistMpMultTest-20110505-222621889, inverse=true) SUBMIT JOB: DistMpMultTest-20110505-222621889: c' = dft^-1(a' ** b') 2272875ms (=37:52.875) : backward, DistMpMultTest-20110505-222621889: c' = dft^-1(a' ** b') ----------------------------------------------------- descriptor = FunctionDescriptor(input=c'_0 + c'_1 + c'_2, output=c'') summation: DistMpSum(J=2^6 (=64), K=2^6 (=64), dir=DistMpMultTest-20110505-222621889/c') SUBMIT JOB: DistMpMultTest-20110505-222621889/c': c'' = c'_0 + c'_1 + c'_2 2805177ms (=46:45.177) : summation, DistMpMultTest-20110505-222621889/c': c'' = c'_0 + c'_1 + c'_2 ----------------------------------------------------- descriptor = FunctionDescriptor(input=c'', output=c) carrying: DistCarrying(J=2^6 (=64), K=2^6 (=64), dir=DistMpMultTest-20110505-222621889/c') SUBMIT JOB: DistMpMultTest-20110505-222621889/c': c = carry(c'') 3456765ms (=57:36.765) : carrying, DistMpMultTest-20110505-222621889/c': c = carry(c'') ----------------------------------------------------- multiply_mapreduce: returns ZahlenDescriptor(numParts=2^6 (=64), elementsPerPart=64) c = ZahlenDescriptor(numParts=2^6 (=64), elementsPerPart=64) 3456769ms (=57:36.769) : DistMpMult maxMemory = 963.00 MB totalMemory = 64.88 MB freeMemory = 26.32 MB computed = + 0129ED2D B8CF7E1D A9BF33D8 9A294511 8508413C ... 4EA85F30 EF79A552 1C080D0C 82F6882C 8CE9F355 (524,288 digits, 16,777,209 bits) 3457374ms (=57:37.374) : DONE (dir = DistMpMultTest-20110505-222621889) ****** DistBbp ************************************************************* Create file b1000000000-p1024-20110506-000707662.log STARTUP Fri May 06 00:07:07 UTC 2011 Started at org.apache.hadoop.util.RunJar.main(RunJar.java:156) Printed at org.apache.hadoop.mp.pi.DistSum.<init>(DistSum.java:428) WorkGroup: Created DistSum.OutputProcessor with 1 threads. DistBbp.VERSION = 20100731 b = 1,000,000,000 (bits skipped) precision = 1,024 nWorkers = 14 nJobs = 1 machine = MapSide(p1,t1), m1t1 remoteDir = 100b-1024-test tmpDir = ../100b-1024-test DistBbp.processExistingJobOutputs = true Check existing job outputs from hdfs://NAMENODE/user/100b-1024-test ... Read existing results f
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Lars Francke 2011-05-06, 10:07
> " BZ-4182948. Add statistics logging to Fred for better visibility into
> startup time costs. (Matt Foley)" > - I believe I saw a note from Matt on the JIRA yesterday about this feature, > where he decided that the version done in 203 wasn't a good approach, and > it's done differently in trunk (not sure if done yet). Could anyone elaborate on what this "Fred" is that has been coming up on these threads a few times now? And is there something like a RELEASE NOTES draft that I could look over? I try to follow these mailing lists as best as I can but I have lost track of all the branches and features being worked on and I can't imagine I'm the only one. It would be nice to get an overview of what this release is all about where work is being done etc. Thanks! Cheers, Lars
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Steve Loughran 2011-05-06, 11:52
Vote: +1 I'm a committer not PMC, so it's non binding. Why: -looks and works OK on my desktop -we've been using Y! releases, and this brings their branch back into the apache fold, meaning we can say "the official Apache release of Apache Hadoop is something you can use in production". -I'm confident that the Y! team have tested this well. I understand Eli's concerns that putting stuff in there that hasn't gone into trunk yet is danger. However, as the team makes no guarantees of 100% compatibility between releases, I don't think it's critical. It's just something that needs to be addressed -which can be done after this release has shipped. -Steve
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Andrew Purtell 2011-05-06, 18:30
> -we've been using Y! releases, and this brings their branch
> back into the apache fold, meaning we can say "the official > Apache release of Apache Hadoop is something you can use in > production". And it is worth consideration that there is some measure of competition between Apache Hadoop and other projects, for example: http://twitter.com/#!/CurtMonash/status/66343706743160832 "@SethGrimes The @DataStax story is "Cassandra is mature, unlike Apache Hadoop, and hence will rescue Hadoop"" I've requested proof points..." and they are happy to exploit any perception of division, lack of forward progress, and such. Perhaps this is additional, and I pray sufficient, motivation to cease the proxy battles, bury the hatchet, etc. - Andy --- On Fri, 5/6/11, Steve Loughran <[EMAIL PROTECTED]> wrote: > From: Steve Loughran <[EMAIL PROTECTED]> > Subject: Re: [VOTE] Release candidate 0.20.203.0-rc1 > To: [EMAIL PROTECTED] > Date: Friday, May 6, 2011, 4:52 AM > > Vote: +1 > > I'm a committer not PMC, so it's non binding. > > > Why: > -looks and works OK on my desktop > -we've been using Y! releases, and this brings their branch > back into the apache fold, meaning we can say "the official > Apache release of Apache Hadoop is something you can use in > production". > -I'm confident that the Y! team have tested this well. > > I understand Eli's concerns that putting stuff in there > that hasn't gone into trunk yet is danger. However, as the > team makes no guarantees of 100% compatibility between > releases, I don't think it's critical. It's just something > that needs to be addressed -which can be done after this > release has shipped. > > > -Steve > >
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Allen Wittenauer 2011-05-07, 00:49
On May 5, 2011, at 1:56 PM, Jakob Homan wrote: > +1 > > Downloaded, verified, tested on single node cluster to my > satisfaction. We've also brought this release up on a sizable cluster > and checked its basic sanity. All of you people doing single node tests are missing stuff. For example, the regression in how the secondary namenode addr stuff works vs. 0.20. By far, the biggest problem we've found is that the capacity scheduler documentation doesn't actually match what the code does. I have a hunch that the unit tests were written/change to match the outcome, rather than test what is supposed to happen. For us, this breakage makes it unusable out of the box and we'll likely either go back to our (relatively stable) backport of 0.21's cap sched, try to fix the 0.20.203 code, or maybe even switch to a completely different scheduler.
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Todd Papaioannou 2011-05-07, 01:43
Allen,
Can you provide some more details into what issues you are seeing with the capacity scheduler? Is it just the docs don't match the code, or are you seeing real issues with job scheduling? Thanks ToddP On 5/6/11 5:49 PM, "Allen Wittenauer" <[EMAIL PROTECTED]> wrote: > >On May 5, 2011, at 1:56 PM, Jakob Homan wrote: > >> +1 >> >> Downloaded, verified, tested on single node cluster to my >> satisfaction. We've also brought this release up on a sizable cluster >> and checked its basic sanity. > > All of you people doing single node tests are missing stuff. For >example, the regression in how the secondary namenode addr stuff works >vs. 0.20. > > By far, the biggest problem we've found is that the capacity scheduler >documentation doesn't actually match what the code does. I have a hunch >that the unit tests were written/change to match the outcome, rather than >test what is supposed to happen. For us, this breakage makes it unusable >out of the box and we'll likely either go back to our (relatively stable) >backport of 0.21's cap sched, try to fix the 0.20.203 code, or maybe even >switch to a completely different scheduler.
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Allen Wittenauer 2011-05-07, 02:31
On May 6, 2011, at 6:43 PM, Todd Papaioannou wrote: > Allen, > > Can you provide some more details into what issues you are seeing with the > capacity scheduler? Is it just the docs don't match the code, or are you > seeing real issues with job scheduling? Jobs are definitely not getting the maximum number of task slots they should be getting. I'm suspecting a bug with how max-limit of -1 queues are handled. I'll actually be in the office next week to try and see if I can figure out where things are going haywire. [I filed a bug on this a few weeks ago before I left for vacation. It was basically ignored.]
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Milind Bhandarkar 2011-05-07, 06:18
Allen, there are per job limits, and per user limits in this branch. (So,
max capacity of -1 is for the queue, but within the queue, the per user limits come into picture.) If I remember right, the defaults were based on a certain assumption of how many users would be on a queue simultaneously. Of course this would need to be set in the site-specific config. - milind -- Milind Bhandarkar [EMAIL PROTECTED] +1-650-776-3167 On 5/6/11 7:31 PM, "Allen Wittenauer" <[EMAIL PROTECTED]> wrote: > >On May 6, 2011, at 6:43 PM, Todd Papaioannou wrote: > >> Allen, >> >> Can you provide some more details into what issues you are seeing with >>the >> capacity scheduler? Is it just the docs don't match the code, or are you >> seeing real issues with job scheduling? > > Jobs are definitely not getting the maximum number of task slots they >should be getting. I'm suspecting a bug with how max-limit of -1 queues >are handled. I'll actually be in the office next week to try and see if >I can figure out where things are going haywire. > > [I filed a bug on this a few weeks ago before I left for vacation. It >was basically ignored.]
-
Re: [VOTE] Release candidate 0.20.203.0-rc1Allen Wittenauer 2011-05-07, 19:06
On May 6, 2011, at 11:18 PM, Milind Bhandarkar wrote: > Allen, there are per job limits, and per user limits in this branch. (So, > max capacity of -1 is for the queue, but within the queue, the per user > limits come into picture.) If I remember right, the defaults were based on > a certain assumption of how many users would be on a queue simultaneously. > Of course this would need to be set in the site-specific config. Yes, I'm aware of the changes. What I'm basically saying is that even with those new limits taken into consideration, the math doesn't seem to hold up.
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Konstantin Shvachko 2011-05-08, 05:41
-1 for rc1
I downloaded and ran the test target 3 times. First run failed because my umask is defaulted to 0002, which is a known problem HADOOP-5050 committed to 0.21 but not 0.20. Set umask to 0022 and re-ran test twice. Both resulted in failure. Here is the list of failed tests: [junit] Test org.apache.hadoop.mapred.TestJobTrackerRestart FAILED (timeout) [junit] Test org.apache.hadoop.mapred.TestJobTrackerRestartWithLostTracker FAILED [junit] Test org.apache.hadoop.mapred.TestJobTrackerSafeMode FAILED [junit] Test org.apache.hadoop.mapred.TestMiniMRMapRedDebugScript FAILED [junit] Test org.apache.hadoop.mapred.TestRecoveryManager FAILED [junit] Test org.apache.hadoop.mapred.TestTTMemoryReporting FAILED [junit] Test org.apache.hadoop.mapred.TestTaskTrackerLocalization FAILED [junit] Test org.apache.hadoop.hdfsproxy.TestHdfsProxy FAILED I am in favor of releasing hadoop-0.20.203. And we run a version of this release on a large cluster at eBay. I know it works. I understand the controversy behind it. I regret it hasn't been developed in a true community way. I think it nevertheless adds value to Apache Hadoop. Lets just make sure it passes the tests. Thanks, --Konstantin On Tue, May 3, 2011 at 1:39 AM, Konstantin Shvachko <[EMAIL PROTECTED]>wrote: > I think its a good idea to release hadoop-0.20.203. It moves Apache Hadoop > a step forward. > > Looks like the technical difficulties are resolved now with latest Arun's > commits. > Being a superset of hadoop-0.20.2 it can be considered based on one of the > official Apache releases. > I don't think there was a lack of discussions on the lists about the issues > included in the release candidate. Todd did a thorough review of the entire > security branch. Many developers participated in discussions. > Agreeing with Stack I wish HBase was considered a primary target for Hadoop > support. But it is not realistic to have it in hadoop-0.20.203. > I have some experience running a version of this release candidate on a > large cluster. It works. I would add a couple of patches, which make it run > on Windows for me like HADOOP-7110, HADOOP-7126. But those are not blockers. > > Thanks, > --Konstantin > > > On Mon, May 2, 2011 at 5:12 PM, Ian Holsman <[EMAIL PROTECTED]> wrote: > >> >> On May 3, 2011, at 9:58 AM, Arun C Murthy wrote: >> >> >> >> >> Owen, Suresh and I have committed everything on this list except >> >> HADOOP-6386 and HADOOP-6428. Not sure which of the two are relevant/ >> >> necessary, I'll check with Cos. Other than that hadoop-0.20.203 now a >> >> superset of hadoop-0.20.2. >> >> >> > >> > Missed adding HADOOP-5759 to that list, I'll check with Amareshwari >> before committing. >> > >> > Arun >> >> Thanks for doing this so fast Arun. >> >> >
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Owen O'Malley 2011-05-11, 17:36
The vote on 0.20.203.0-rc1 passes. We have our first stable release in a
year. The votes were: 13 +1's (10 binding: Arun, Chris, Devaraj, Dhruba, Jakob, Mahadev, Nicholas, Owen, Sanjay, Suresh) 7 -1's (4 binding: Doug, Eli, Konstantin, Todd) Konstantin, your issue with the test cases requiring a umask 02 is a good point. I'll patch it and can roll a 0.20.203.1 release candidate. -- Owen
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Ian Holsman 2011-05-11, 17:38
congratulations guys!
On May 12, 2011, at 3:36 AM, Owen O'Malley wrote: > The vote on 0.20.203.0-rc1 passes. We have our first stable release in a > year. The votes were: > > 13 +1's (10 binding: Arun, Chris, Devaraj, Dhruba, Jakob, Mahadev, Nicholas, > Owen, Sanjay, > Suresh) > 7 -1's (4 binding: Doug, Eli, Konstantin, Todd) > > Konstantin, your issue with the test cases requiring a umask 02 is a good > point. I'll patch it and can roll a 0.20.203.1 release candidate. > > -- Owen
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Konstantin Shvachko 2011-05-11, 18:40
> Konstantin, your issue with the test cases requiring a umask 02 is a good
> point. I'll patch it and can roll a 0.20.203.1 release candidate. umask is not a big concern. I reset it to standard 0022. Still there were 8 other test failures: 7 in mapred, and 1 hdfsproxy. Stable release should pass unit tests. Lets make 0.20.203.1 stable. Thanks, --Konstantin On Wed, May 11, 2011 at 10:36 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: > The vote on 0.20.203.0-rc1 passes. We have our first stable release in a > year. The votes were: > > 13 +1's (10 binding: Arun, Chris, Devaraj, Dhruba, Jakob, Mahadev, > Nicholas, > Owen, Sanjay, > Suresh) > 7 -1's (4 binding: Doug, Eli, Konstantin, Todd) > > Konstantin, your issue with the test cases requiring a umask 02 is a good > point. I'll patch it and can roll a 0.20.203.1 release candidate. > > -- Owen >
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Raghu Angadi 2011-05-11, 19:09
Congrats anad thanks to all the developers for such passion and hard work
they they put into a very important project. +1 for getting new 0.20 release out with security. It is great news for users for that new apache release is finally coming out. I hope most of the technical as well as non-technical issues will resolved this or next release. There are many important issues raised in this thread and these are crucial for future of Apache Hadoop. wanted to echo one of them in particular : community is the most important aspect of the project and triumphs over the rest. I am sure the process would be much smoother going forward as we have more frequent releases. This is probably the first real test for the develop-a-large-feature-on-a-branch-and-merge process. Discussion here would certainly lead to important improvements to the process. Looking at from a different angle, Hadoop has a very enviable problem : there is so much development it is very hard to co-ordinate and scale. It has already scalled up a few times before, and with the leaders it has, it is doing it again. Raghu. On Wed, May 11, 2011 at 10:36 AM, Owen O'Malley <[EMAIL PROTECTED]> wrote: > The vote on 0.20.203.0-rc1 passes. We have our first stable release in a > year. The votes were: > > 13 +1's (10 binding: Arun, Chris, Devaraj, Dhruba, Jakob, Mahadev, > Nicholas, > Owen, Sanjay, > Suresh) > 7 -1's (4 binding: Doug, Eli, Konstantin, Todd) > > Konstantin, your issue with the test cases requiring a umask 02 is a good > point. I'll patch it and can roll a 0.20.203.1 release candidate. > > -- Owen >
-
Re: [VOTE] Release candidate 0.20.203.0-rc0Owen O'Malley 2011-05-11, 22:10
On May 11, 2011, at 11:40 AM, Konstantin Shvachko wrote: > umask is not a big concern. I reset it to standard 0022. It still should be fixed. > Still there were 8 other test failures: 7 in mapred, and 1 hdfsproxy. > Stable release should pass unit tests. All unit tests pass for me. Others have also run the unit tests and had them pass. If you can figure out what is going wrong, that would be great. > Lets make 0.20.203.1 stable. +1 -- Owen |