|
|
-
Homestretch for hadoop-0.22
Konstantin Shvachko 2011-10-28, 09:29
Hi everybody,
We reached the homestretch for hadoop-0.22 release. The testing proceeds very well. - The builds are building - HDFS and MR are running under load well now. - We applied pretty simple custom patches to make Hive (7) and Pig (8) work with 0.22. - HBase (92), which is expected to be released in the near future as I hear, is fully compatible with 0.22.
There are 4 main items remaining on my plate.
1. We are down to four blockers, which represent three issues: HADOOP-7035. Document incompatible API changes between releases MAPREDUCE-1100 / 1716. User's task-logs filling up local disks on the TaskTrackers HDFS-2498. TestParallelRead times out
I'd like also to include HDFS-1900. Clean up config key collision between common and HDFS and potentially further bug fixes.
2. Assemble build is needed. By assemble build I mean a Jenkins target which produces a tarball that includes all three sub-projects. It is like a daily build. One of this builds is assigned to be release a candidate and then the release. We can do it in traditional way, based on scripts developed in HADOOP-6846. Or we can try to do it with BigTop.
3. Jetty upgrade. 0.22 branch is on Jetty 6.1.14, while everything else moved to 6.1.26. Based on experiments I ran with Jetty 6.1.14 tasks hang forever if things go wrong, while with Jetty 6.1.26 you get a lot of "fetch failure"s. The latter at least fails fast, and does not need manual intervention, so I'd go for the upgrade.
4. Documentation. In addition to HADOOP-7035, which according to Tom needs a rerun of the tool, we need to confirm the documentation is up to date. I think it is, but re-checking would be good.
I seek help from the good people of Hadoopland on these last steps. If you have suggestions for inclusion or plan to help with the items, please reply to the thread or drop me a line. I understand the next two weeks will be busy with conferences. I plan to release after Apachecon by mid November. If the assemble build is ready soon people will be able to try it early.
Thanks, --Konstantin
+
Konstantin Shvachko 2011-10-28, 09:29
-
Re: Homestretch for hadoop-0.22
Harsh J 2011-10-28, 10:26
Hey Konstantin, On Fri, Oct 28, 2011 at 2:59 PM, Konstantin Shvachko <[EMAIL PROTECTED]> wrote: > 3. Jetty upgrade. > 0.22 branch is on Jetty 6.1.14, while everything else moved to 6.1.26. > Based on experiments I ran with Jetty 6.1.14 tasks hang forever if > things go wrong, while with Jetty 6.1.26 you get a lot of "fetch > failure"s. The latter at least fails fast, and does not need manual > intervention, so I'd go for the upgrade. I believe https://issues.apache.org/jira/browse/MAPREDUCE-2980 would help here. -- Harsh J
+
Harsh J 2011-10-28, 10:26
-
Re: Homestretch for hadoop-0.22
Konstantin Shvachko 2011-10-28, 20:51
For upgrade we can directly (or indirectly) use HADOOP-7450 It's pre-mavenization. Cos, you might want to use it. --Konstantin On Fri, Oct 28, 2011 at 3:26 AM, Harsh J <[EMAIL PROTECTED]> wrote: > Hey Konstantin, > > On Fri, Oct 28, 2011 at 2:59 PM, Konstantin Shvachko > <[EMAIL PROTECTED]> wrote: >> 3. Jetty upgrade. >> 0.22 branch is on Jetty 6.1.14, while everything else moved to 6.1.26. >> Based on experiments I ran with Jetty 6.1.14 tasks hang forever if >> things go wrong, while with Jetty 6.1.26 you get a lot of "fetch >> failure"s. The latter at least fails fast, and does not need manual >> intervention, so I'd go for the upgrade. > > I believe https://issues.apache.org/jira/browse/MAPREDUCE-2980 would help here. > > -- > Harsh J >
+
Konstantin Shvachko 2011-10-28, 20:51
-
Re: Homestretch for hadoop-0.22
Konstantin Boudnik 2011-10-28, 21:02
Yes, this is exactly what I took as the template last week ;( Thanks. On Fri, Oct 28, 2011 at 01:51PM, Konstantin Shvachko wrote: > For upgrade we can directly (or indirectly) use HADOOP-7450 > It's pre-mavenization. > Cos, you might want to use it. > --Konstantin > > > On Fri, Oct 28, 2011 at 3:26 AM, Harsh J <[EMAIL PROTECTED]> wrote: > > Hey Konstantin, > > > > On Fri, Oct 28, 2011 at 2:59 PM, Konstantin Shvachko > > <[EMAIL PROTECTED]> wrote: > >> 3. Jetty upgrade. > >> 0.22 branch is on Jetty 6.1.14, while everything else moved to 6.1.26. > >> Based on experiments I ran with Jetty 6.1.14 tasks hang forever if > >> things go wrong, while with Jetty 6.1.26 you get a lot of "fetch > >> failure"s. The latter at least fails fast, and does not need manual > >> intervention, so I'd go for the upgrade. > > > > I believe https://issues.apache.org/jira/browse/MAPREDUCE-2980 would help here. > > > > -- > > Harsh J > >
+
Konstantin Boudnik 2011-10-28, 21:02
-
Re: Homestretch for hadoop-0.22
Roman Shaposhnik 2011-10-28, 15:27
On Fri, Oct 28, 2011 at 2:29 AM, Konstantin Shvachko <[EMAIL PROTECTED]> wrote: > Hi everybody, > > We reached the homestretch for hadoop-0.22 release. > The testing proceeds very well. > - The builds are building > - HDFS and MR are running under load well now. > - We applied pretty simple custom patches to make Hive (7) and Pig (8) > work with 0.22. > - HBase (92), which is expected to be released in the near future as I > hear, is fully compatible with 0.22.
Yes, HBase is the only model citizen here. As for the rest of component I really would like to figure out what the plan is. Given that the patches are not big, is there any way Pig and Hive guys can be convinced to spin a maintenance release? I can certainly help with the logistics.
Right now, Bigtop is at a point where an entire stack based on Hadoop .22 can be deployed (via puppet) as a single step. That helps a great deal in validating the release. The problem is, that give Bigtop policies of not releasing with custom patches (we only package straight Apache releases) it would be a shame not to have.22 stack available for a single click install/deployment.
> 2. Assemble build is needed. > By assemble build I mean a Jenkins target which produces a tarball > that includes all three sub-projects. It is like a daily build. One of > this builds is assigned to be release a candidate and then the > release. We can do it in traditional way, based on scripts developed > in HADOOP-6846. Or we can try to do it with BigTop.
I can help with that. I was playing with the .22 build assembly yesterday and it seems that it shouldn't be a problem. I can also test it as a tarball once it is assembled.
Thanks, Roman.
+
Roman Shaposhnik 2011-10-28, 15:27
-
Re: Homestretch for hadoop-0.22
Konstantin Boudnik 2011-10-28, 17:53
Konstantin,
Great news! The progress is pretty spectacular I'd say.
I am slowly working on Jetty back-port. Unfortunately, other things require my attention too. Hopefully, it will be finished over next a couple days or so.
Cos
On Fri, Oct 28, 2011 at 02:29AM, Konstantin Shvachko wrote: > Hi everybody, > > We reached the homestretch for hadoop-0.22 release. > The testing proceeds very well. > - The builds are building > - HDFS and MR are running under load well now. > - We applied pretty simple custom patches to make Hive (7) and Pig (8) > work with 0.22. > - HBase (92), which is expected to be released in the near future as I > hear, is fully compatible with 0.22. > > There are 4 main items remaining on my plate. > > 1. We are down to four blockers, which represent three issues: > HADOOP-7035. Document incompatible API changes between releases > MAPREDUCE-1100 / 1716. User's task-logs filling up local disks on the > TaskTrackers > HDFS-2498. TestParallelRead times out > > I'd like also to include > HDFS-1900. Clean up config key collision between common and HDFS > and potentially further bug fixes. > > 2. Assemble build is needed. > By assemble build I mean a Jenkins target which produces a tarball > that includes all three sub-projects. It is like a daily build. One of > this builds is assigned to be release a candidate and then the > release. We can do it in traditional way, based on scripts developed > in HADOOP-6846. Or we can try to do it with BigTop. > > 3. Jetty upgrade. > 0.22 branch is on Jetty 6.1.14, while everything else moved to 6.1.26. > Based on experiments I ran with Jetty 6.1.14 tasks hang forever if > things go wrong, while with Jetty 6.1.26 you get a lot of "fetch > failure"s. The latter at least fails fast, and does not need manual > intervention, so I'd go for the upgrade. > > 4. Documentation. > In addition to HADOOP-7035, which according to Tom needs a rerun of > the tool, we need to confirm the documentation is up to date. I think > it is, but re-checking would be good. > > I seek help from the good people of Hadoopland on these last steps. > If you have suggestions for inclusion or plan to help with the items, > please reply to the thread or drop me a line. I understand the next > two weeks will be busy with conferences. I plan to release after > Apachecon by mid November. If the assemble build is ready soon people > will be able to try it early. > > Thanks, > --Konstantin
+
Konstantin Boudnik 2011-10-28, 17:53
-
Re: Homestretch for hadoop-0.22
Konstantin Boudnik 2011-10-29, 23:25
Konstantin, HADOOP-7450 (and its subs for HDFS and MR) have patches for 0.22 now.
On Fri, Oct 28, 2011 at 10:53AM, Konstantin Boudnik wrote: > Konstantin, > > Great news! The progress is pretty spectacular I'd say. > > I am slowly working on Jetty back-port. Unfortunately, other things require my > attention too. Hopefully, it will be finished over next a couple days or so. > > Cos > > On Fri, Oct 28, 2011 at 02:29AM, Konstantin Shvachko wrote: > > Hi everybody, > > > > We reached the homestretch for hadoop-0.22 release. > > The testing proceeds very well. > > - The builds are building > > - HDFS and MR are running under load well now. > > - We applied pretty simple custom patches to make Hive (7) and Pig (8) > > work with 0.22. > > - HBase (92), which is expected to be released in the near future as I > > hear, is fully compatible with 0.22. > > > > There are 4 main items remaining on my plate. > > > > 1. We are down to four blockers, which represent three issues: > > HADOOP-7035. Document incompatible API changes between releases > > MAPREDUCE-1100 / 1716. User's task-logs filling up local disks on the > > TaskTrackers > > HDFS-2498. TestParallelRead times out > > > > I'd like also to include > > HDFS-1900. Clean up config key collision between common and HDFS > > and potentially further bug fixes. > > > > 2. Assemble build is needed. > > By assemble build I mean a Jenkins target which produces a tarball > > that includes all three sub-projects. It is like a daily build. One of > > this builds is assigned to be release a candidate and then the > > release. We can do it in traditional way, based on scripts developed > > in HADOOP-6846. Or we can try to do it with BigTop. > > > > 3. Jetty upgrade. > > 0.22 branch is on Jetty 6.1.14, while everything else moved to 6.1.26. > > Based on experiments I ran with Jetty 6.1.14 tasks hang forever if > > things go wrong, while with Jetty 6.1.26 you get a lot of "fetch > > failure"s. The latter at least fails fast, and does not need manual > > intervention, so I'd go for the upgrade. > > > > 4. Documentation. > > In addition to HADOOP-7035, which according to Tom needs a rerun of > > the tool, we need to confirm the documentation is up to date. I think > > it is, but re-checking would be good. > > > > I seek help from the good people of Hadoopland on these last steps. > > If you have suggestions for inclusion or plan to help with the items, > > please reply to the thread or drop me a line. I understand the next > > two weeks will be busy with conferences. I plan to release after > > Apachecon by mid November. If the assemble build is ready soon people > > will be able to try it early. > > > > Thanks, > > --Konstantin
+
Konstantin Boudnik 2011-10-29, 23:25
-
Re: Homestretch for hadoop-0.22
Owen O'Malley 2011-10-28, 23:43
On Fri, Oct 28, 2011 at 2:29 AM, Konstantin Shvachko <[EMAIL PROTECTED]>wrote:
> Hi everybody, > > We reached the homestretch for hadoop-0.22 release. >
How are you going to deal with MAPREDUCE-2178, which we should mark as resolved? It fixes a critical security hole in the task-controller and is fixed in both 0.20.2xx and 0.23.
The two reasonable options:
* Remove the task-controller. (This was the fix that was used in 0.21.)
* Forward port the patch from 2xx.
-- Owen
+
Owen O'Malley 2011-10-28, 23:43
-
Re: Homestretch for hadoop-0.22
Konstantin Shvachko 2011-10-29, 18:23
MAPREDUCE-2767 removed LinuxTaskController as discussed earlier. The idea is to port changes from 20.security in the subsequent release.
Thanks, --Konstantin
On Fri, Oct 28, 2011 at 4:43 PM, Owen O'Malley <[EMAIL PROTECTED]> wrote: > On Fri, Oct 28, 2011 at 2:29 AM, Konstantin Shvachko > <[EMAIL PROTECTED]>wrote: > >> Hi everybody, >> >> We reached the homestretch for hadoop-0.22 release. >> > > How are you going to deal with MAPREDUCE-2178, which we should mark as > resolved? It fixes a critical security hole in the task-controller and is > fixed in both 0.20.2xx and 0.23. > > The two reasonable options: > > * Remove the task-controller. (This was the fix that was used in 0.21.) > > * Forward port the patch from 2xx. > > -- Owen >
+
Konstantin Shvachko 2011-10-29, 18:23
+
Roman Shaposhnik 2011-11-02, 05:12
-
Re: Homestretch for hadoop-0.22
Arun C Murthy 2011-11-24, 22:24
Konstantin, On Oct 28, 2011, at 2:29 AM, Konstantin Shvachko wrote: > We reached the homestretch for hadoop-0.22 release. > > There are 4 main items remaining on my plate. > > 1. We are down to four blockers, which represent three issues: > HADOOP-7035. Document incompatible API changes between releases > MAPREDUCE-1100 / 1716. User's task-logs filling up local disks on the > TaskTrackers > HDFS-2498. TestParallelRead times out > ... > I seek help from the good people of Hadoopland on these last steps. > If you have suggestions for inclusion or plan to help with the items, > please reply to the thread or drop me a line. I understand the next > two weeks will be busy with conferences. I plan to release after > Apachecon by mid November. If the assemble build is ready soon people > will be able to try it early. I was looking through some issues and have some suggestions for inclusion (primarily MR) which should help 0.22, for your consideration. I've created a label in jira which should help track them, please take a look: http://s.apache.org/critical-0.22.0thanks, Arun
+
Arun C Murthy 2011-11-24, 22:24
-
Re: Homestretch for hadoop-0.22
Konstantin Shvachko 2011-11-29, 09:43
Great list Arun. I agree with most of them and will ad more. Thanks, --Konstantin On Thu, Nov 24, 2011 at 2:24 PM, Arun C Murthy <[EMAIL PROTECTED]> wrote: > Konstantin, > > On Oct 28, 2011, at 2:29 AM, Konstantin Shvachko wrote: > >> We reached the homestretch for hadoop-0.22 release. >> >> There are 4 main items remaining on my plate. >> >> 1. We are down to four blockers, which represent three issues: >> HADOOP-7035. Document incompatible API changes between releases >> MAPREDUCE-1100 / 1716. User's task-logs filling up local disks on the >> TaskTrackers >> HDFS-2498. TestParallelRead times out >> > ... > >> I seek help from the good people of Hadoopland on these last steps. >> If you have suggestions for inclusion or plan to help with the items, >> please reply to the thread or drop me a line. I understand the next >> two weeks will be busy with conferences. I plan to release after >> Apachecon by mid November. If the assemble build is ready soon people >> will be able to try it early. > > I was looking through some issues and have some suggestions for inclusion (primarily MR) which should help 0.22, for your consideration. > > I've created a label in jira which should help track them, please take a look: > http://s.apache.org/critical-0.22.0> > thanks, > Arun > >
+
Konstantin Shvachko 2011-11-29, 09:43
|
|