This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net>
		
			
				
	
	
		
			460 lines
		
	
	
		
			22 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
			
		
		
	
	
			460 lines
		
	
	
		
			22 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
| 2: HOW THE DEVELOPMENT PROCESS WORKS
 | |
| 
 | |
| Linux kernel development in the early 1990's was a pretty loose affair,
 | |
| with relatively small numbers of users and developers involved.  With a
 | |
| user base in the millions and with some 2,000 developers involved over the
 | |
| course of one year, the kernel has since had to evolve a number of
 | |
| processes to keep development happening smoothly.  A solid understanding of
 | |
| how the process works is required in order to be an effective part of it.
 | |
| 
 | |
| 
 | |
| 2.1: THE BIG PICTURE
 | |
| 
 | |
| The kernel developers use a loosely time-based release process, with a new
 | |
| major kernel release happening every two or three months.  The recent
 | |
| release history looks like this:
 | |
| 
 | |
| 	2.6.26	July 13, 2008
 | |
| 	2.6.25	April 16, 2008
 | |
| 	2.6.24	January 24, 2008
 | |
| 	2.6.23	October 9, 2007
 | |
| 	2.6.22	July 8, 2007
 | |
| 	2.6.21	April 25, 2007
 | |
| 	2.6.20	February 4, 2007
 | |
| 
 | |
| Every 2.6.x release is a major kernel release with new features, internal
 | |
| API changes, and more.  A typical 2.6 release can contain over 10,000
 | |
| changesets with changes to several hundred thousand lines of code.  2.6 is
 | |
| thus the leading edge of Linux kernel development; the kernel uses a
 | |
| rolling development model which is continually integrating major changes.
 | |
| 
 | |
| A relatively straightforward discipline is followed with regard to the
 | |
| merging of patches for each release.  At the beginning of each development
 | |
| cycle, the "merge window" is said to be open.  At that time, code which is
 | |
| deemed to be sufficiently stable (and which is accepted by the development
 | |
| community) is merged into the mainline kernel.  The bulk of changes for a
 | |
| new development cycle (and all of the major changes) will be merged during
 | |
| this time, at a rate approaching 1,000 changes ("patches," or "changesets")
 | |
| per day.
 | |
| 
 | |
| (As an aside, it is worth noting that the changes integrated during the
 | |
| merge window do not come out of thin air; they have been collected, tested,
 | |
| and staged ahead of time.  How that process works will be described in
 | |
| detail later on).
 | |
| 
 | |
| The merge window lasts for two weeks.  At the end of this time, Linus
 | |
| Torvalds will declare that the window is closed and release the first of
 | |
| the "rc" kernels.  For the kernel which is destined to be 2.6.26, for
 | |
| example, the release which happens at the end of the merge window will be
 | |
| called 2.6.26-rc1.  The -rc1 release is the signal that the time to merge
 | |
| new features has passed, and that the time to stabilize the next kernel has
 | |
| begun.
 | |
| 
 | |
| Over the next six to ten weeks, only patches which fix problems should be
 | |
| submitted to the mainline.  On occasion a more significant change will be
 | |
| allowed, but such occasions are rare; developers who try to merge new
 | |
| features outside of the merge window tend to get an unfriendly reception.
 | |
| As a general rule, if you miss the merge window for a given feature, the
 | |
| best thing to do is to wait for the next development cycle.  (An occasional
 | |
| exception is made for drivers for previously-unsupported hardware; if they
 | |
| touch no in-tree code, they cannot cause regressions and should be safe to
 | |
| add at any time).
 | |
| 
 | |
| As fixes make their way into the mainline, the patch rate will slow over
 | |
| time.  Linus releases new -rc kernels about once a week; a normal series
 | |
| will get up to somewhere between -rc6 and -rc9 before the kernel is
 | |
| considered to be sufficiently stable and the final 2.6.x release is made.
 | |
| At that point the whole process starts over again.
 | |
| 
 | |
| As an example, here is how the 2.6.25 development cycle went (all dates in
 | |
| 2008): 
 | |
| 
 | |
| 	January 24	2.6.24 stable release
 | |
| 	February 10	2.6.25-rc1, merge window closes
 | |
| 	February 15	2.6.25-rc2
 | |
| 	February 24	2.6.25-rc3
 | |
| 	March 4	 	2.6.25-rc4
 | |
| 	March 9		2.6.25-rc5
 | |
| 	March 16	2.6.25-rc6
 | |
| 	March 25	2.6.25-rc7
 | |
| 	April 1		2.6.25-rc8
 | |
| 	April 11	2.6.25-rc9
 | |
| 	April 16	2.6.25 stable release
 | |
| 
 | |
| How do the developers decide when to close the development cycle and create
 | |
| the stable release?  The most significant metric used is the list of
 | |
| regressions from previous releases.  No bugs are welcome, but those which
 | |
| break systems which worked in the past are considered to be especially
 | |
| serious.  For this reason, patches which cause regressions are looked upon
 | |
| unfavorably and are quite likely to be reverted during the stabilization
 | |
| period. 
 | |
| 
 | |
| The developers' goal is to fix all known regressions before the stable
 | |
| release is made.  In the real world, this kind of perfection is hard to
 | |
| achieve; there are just too many variables in a project of this size.
 | |
| There comes a point where delaying the final release just makes the problem
 | |
| worse; the pile of changes waiting for the next merge window will grow
 | |
| larger, creating even more regressions the next time around.  So most 2.6.x
 | |
| kernels go out with a handful of known regressions though, hopefully, none
 | |
| of them are serious.
 | |
| 
 | |
| Once a stable release is made, its ongoing maintenance is passed off to the
 | |
| "stable team," currently comprised of Greg Kroah-Hartman and Chris Wright.
 | |
| The stable team will release occasional updates to the stable release using
 | |
| the 2.6.x.y numbering scheme.  To be considered for an update release, a
 | |
| patch must (1) fix a significant bug, and (2) already be merged into the
 | |
| mainline for the next development kernel.  Continuing our 2.6.25 example,
 | |
| the history (as of this writing) is:
 | |
| 
 | |
| 	May 1		2.6.25.1
 | |
| 	May 6		2.6.25.2 
 | |
| 	May 9		2.6.25.3 
 | |
| 	May 15		2.6.25.4
 | |
| 	June 7		2.6.25.5
 | |
| 	June 9		2.6.25.6
 | |
| 	June 16		2.6.25.7
 | |
| 	June 21		2.6.25.8
 | |
| 	June 24		2.6.25.9
 | |
| 
 | |
| Stable updates for a given kernel are made for approximately six months;
 | |
| after that, the maintenance of stable releases is solely the responsibility
 | |
| of the distributors which have shipped that particular kernel.
 | |
| 
 | |
| 
 | |
| 2.2: THE LIFECYCLE OF A PATCH
 | |
| 
 | |
| Patches do not go directly from the developer's keyboard into the mainline
 | |
| kernel.  There is, instead, a somewhat involved (if somewhat informal)
 | |
| process designed to ensure that each patch is reviewed for quality and that
 | |
| each patch implements a change which is desirable to have in the mainline.
 | |
| This process can happen quickly for minor fixes, or, in the case of large
 | |
| and controversial changes, go on for years.  Much developer frustration
 | |
| comes from a lack of understanding of this process or from attempts to
 | |
| circumvent it.  
 | |
| 
 | |
| In the hopes of reducing that frustration, this document will describe how
 | |
| a patch gets into the kernel.  What follows below is an introduction which
 | |
| describes the process in a somewhat idealized way.  A much more detailed
 | |
| treatment will come in later sections.
 | |
| 
 | |
| The stages that a patch goes through are, generally:
 | |
| 
 | |
|  - Design.  This is where the real requirements for the patch - and the way
 | |
|    those requirements will be met - are laid out.  Design work is often
 | |
|    done without involving the community, but it is better to do this work
 | |
|    in the open if at all possible; it can save a lot of time redesigning
 | |
|    things later.
 | |
| 
 | |
|  - Early review.  Patches are posted to the relevant mailing list, and
 | |
|    developers on that list reply with any comments they may have.  This
 | |
|    process should turn up any major problems with a patch if all goes
 | |
|    well.
 | |
| 
 | |
|  - Wider review.  When the patch is getting close to ready for mainline
 | |
|    inclusion, it will be accepted by a relevant subsystem maintainer -
 | |
|    though this acceptance is not a guarantee that the patch will make it
 | |
|    all the way to the mainline.  The patch will show up in the maintainer's
 | |
|    subsystem tree and into the staging trees (described below).  When the
 | |
|    process works, this step leads to more extensive review of the patch and
 | |
|    the discovery of any problems resulting from the integration of this
 | |
|    patch with work being done by others.
 | |
| 
 | |
|  - Merging into the mainline.  Eventually, a successful patch will be
 | |
|    merged into the mainline repository managed by Linus Torvalds.  More
 | |
|    comments and/or problems may surface at this time; it is important that
 | |
|    the developer be responsive to these and fix any issues which arise.
 | |
| 
 | |
|  - Stable release.  The number of users potentially affected by the patch
 | |
|    is now large, so, once again, new problems may arise.
 | |
| 
 | |
|  - Long-term maintenance.  While it is certainly possible for a developer
 | |
|    to forget about code after merging it, that sort of behavior tends to
 | |
|    leave a poor impression in the development community.  Merging code
 | |
|    eliminates some of the maintenance burden, in that others will fix
 | |
|    problems caused by API changes.  But the original developer should
 | |
|    continue to take responsibility for the code if it is to remain useful
 | |
|    in the longer term.
 | |
| 
 | |
| One of the largest mistakes made by kernel developers (or their employers)
 | |
| is to try to cut the process down to a single "merging into the mainline"
 | |
| step.  This approach invariably leads to frustration for everybody
 | |
| involved.
 | |
| 
 | |
| 
 | |
| 2.3: HOW PATCHES GET INTO THE KERNEL
 | |
| 
 | |
| There is exactly one person who can merge patches into the mainline kernel
 | |
| repository: Linus Torvalds.  But, of the over 12,000 patches which went
 | |
| into the 2.6.25 kernel, only 250 (around 2%) were directly chosen by Linus
 | |
| himself.  The kernel project has long since grown to a size where no single
 | |
| developer could possibly inspect and select every patch unassisted.  The
 | |
| way the kernel developers have addressed this growth is through the use of
 | |
| a lieutenant system built around a chain of trust.
 | |
| 
 | |
| The kernel code base is logically broken down into a set of subsystems:
 | |
| networking, specific architecture support, memory management, video
 | |
| devices, etc.  Most subsystems have a designated maintainer, a developer
 | |
| who has overall responsibility for the code within that subsystem.  These
 | |
| subsystem maintainers are the gatekeepers (in a loose way) for the portion
 | |
| of the kernel they manage; they are the ones who will (usually) accept a
 | |
| patch for inclusion into the mainline kernel.
 | |
| 
 | |
| Subsystem maintainers each manage their own version of the kernel source
 | |
| tree, usually (but certainly not always) using the git source management
 | |
| tool.  Tools like git (and related tools like quilt or mercurial) allow
 | |
| maintainers to track a list of patches, including authorship information
 | |
| and other metadata.  At any given time, the maintainer can identify which
 | |
| patches in his or her repository are not found in the mainline.
 | |
| 
 | |
| When the merge window opens, top-level maintainers will ask Linus to "pull"
 | |
| the patches they have selected for merging from their repositories.  If
 | |
| Linus agrees, the stream of patches will flow up into his repository,
 | |
| becoming part of the mainline kernel.  The amount of attention that Linus
 | |
| pays to specific patches received in a pull operation varies.  It is clear
 | |
| that, sometimes, he looks quite closely.  But, as a general rule, Linus
 | |
| trusts the subsystem maintainers to not send bad patches upstream.
 | |
| 
 | |
| Subsystem maintainers, in turn, can pull patches from other maintainers.
 | |
| For example, the networking tree is built from patches which accumulated
 | |
| first in trees dedicated to network device drivers, wireless networking,
 | |
| etc.  This chain of repositories can be arbitrarily long, though it rarely
 | |
| exceeds two or three links.  Since each maintainer in the chain trusts
 | |
| those managing lower-level trees, this process is known as the "chain of
 | |
| trust." 
 | |
| 
 | |
| Clearly, in a system like this, getting patches into the kernel depends on
 | |
| finding the right maintainer.  Sending patches directly to Linus is not
 | |
| normally the right way to go.
 | |
| 
 | |
| 
 | |
| 2.4: STAGING TREES
 | |
| 
 | |
| The chain of subsystem trees guides the flow of patches into the kernel,
 | |
| but it also raises an interesting question: what if somebody wants to look
 | |
| at all of the patches which are being prepared for the next merge window?
 | |
| Developers will be interested in what other changes are pending to see
 | |
| whether there are any conflicts to worry about; a patch which changes a
 | |
| core kernel function prototype, for example, will conflict with any other
 | |
| patches which use the older form of that function.  Reviewers and testers
 | |
| want access to the changes in their integrated form before all of those
 | |
| changes land in the mainline kernel.  One could pull changes from all of
 | |
| the interesting subsystem trees, but that would be a big and error-prone
 | |
| job.
 | |
| 
 | |
| The answer comes in the form of staging trees, where subsystem trees are
 | |
| collected for testing and review.  The older of these trees, maintained by
 | |
| Andrew Morton, is called "-mm" (for memory management, which is how it got
 | |
| started).  The -mm tree integrates patches from a long list of subsystem
 | |
| trees; it also has some patches aimed at helping with debugging.  
 | |
| 
 | |
| Beyond that, -mm contains a significant collection of patches which have
 | |
| been selected by Andrew directly.  These patches may have been posted on a
 | |
| mailing list, or they may apply to a part of the kernel for which there is
 | |
| no designated subsystem tree.  As a result, -mm operates as a sort of
 | |
| subsystem tree of last resort; if there is no other obvious path for a
 | |
| patch into the mainline, it is likely to end up in -mm.  Miscellaneous
 | |
| patches which accumulate in -mm will eventually either be forwarded on to
 | |
| an appropriate subsystem tree or be sent directly to Linus.  In a typical
 | |
| development cycle, approximately 10% of the patches going into the mainline
 | |
| get there via -mm.
 | |
| 
 | |
| The current -mm patch can always be found from the front page of
 | |
| 
 | |
| 	http://kernel.org/
 | |
| 
 | |
| Those who want to see the current state of -mm can get the "-mm of the
 | |
| moment" tree, found at:
 | |
| 
 | |
| 	http://userweb.kernel.org/~akpm/mmotm/
 | |
| 
 | |
| Use of the MMOTM tree is likely to be a frustrating experience, though;
 | |
| there is a definite chance that it will not even compile.
 | |
| 
 | |
| The other staging tree, started more recently, is linux-next, maintained by
 | |
| Stephen Rothwell.  The linux-next tree is, by design, a snapshot of what
 | |
| the mainline is expected to look like after the next merge window closes.
 | |
| Linux-next trees are announced on the linux-kernel and linux-next mailing
 | |
| lists when they are assembled; they can be downloaded from:
 | |
| 
 | |
| 	http://www.kernel.org/pub/linux/kernel/people/sfr/linux-next/
 | |
| 
 | |
| Some information about linux-next has been gathered at:
 | |
| 
 | |
| 	http://linux.f-seidel.de/linux-next/pmwiki/
 | |
| 
 | |
| How the linux-next tree will fit into the development process is still
 | |
| changing.  As of this writing, the first full development cycle involving
 | |
| linux-next (2.6.26) is coming to an end; thus far, it has proved to be a
 | |
| valuable resource for finding and fixing integration problems before the
 | |
| beginning of the merge window.  See http://lwn.net/Articles/287155/ for
 | |
| more information on how linux-next has worked to set up the 2.6.27 merge
 | |
| window.
 | |
| 
 | |
| Some developers have begun to suggest that linux-next should be used as the
 | |
| target for future development as well.  The linux-next tree does tend to be
 | |
| far ahead of the mainline and is more representative of the tree into which
 | |
| any new work will be merged.  The downside to this idea is that the
 | |
| volatility of linux-next tends to make it a difficult development target.
 | |
| See http://lwn.net/Articles/289013/ for more information on this topic, and
 | |
| stay tuned; much is still in flux where linux-next is involved.
 | |
| 
 | |
| 
 | |
| 2.5: TOOLS
 | |
| 
 | |
| As can be seen from the above text, the kernel development process depends
 | |
| heavily on the ability to herd collections of patches in various
 | |
| directions.  The whole thing would not work anywhere near as well as it
 | |
| does without suitably powerful tools.  Tutorials on how to use these tools
 | |
| are well beyond the scope of this document, but there is space for a few
 | |
| pointers.
 | |
| 
 | |
| By far the dominant source code management system used by the kernel
 | |
| community is git.  Git is one of a number of distributed version control
 | |
| systems being developed in the free software community.  It is well tuned
 | |
| for kernel development, in that it performs quite well when dealing with
 | |
| large repositories and large numbers of patches.  It also has a reputation
 | |
| for being difficult to learn and use, though it has gotten better over
 | |
| time.  Some sort of familiarity with git is almost a requirement for kernel
 | |
| developers; even if they do not use it for their own work, they'll need git
 | |
| to keep up with what other developers (and the mainline) are doing.
 | |
| 
 | |
| Git is now packaged by almost all Linux distributions.  There is a home
 | |
| page at 
 | |
| 
 | |
| 	http://git.or.cz/
 | |
| 
 | |
| That page has pointers to documentation and tutorials.  One should be
 | |
| aware, in particular, of the Kernel Hacker's Guide to git, which has
 | |
| information specific to kernel development:
 | |
| 
 | |
| 	http://linux.yyz.us/git-howto.html
 | |
| 
 | |
| Among the kernel developers who do not use git, the most popular choice is
 | |
| almost certainly Mercurial:
 | |
| 
 | |
| 	http://www.selenic.com/mercurial/
 | |
| 
 | |
| Mercurial shares many features with git, but it provides an interface which
 | |
| many find easier to use.
 | |
| 
 | |
| The other tool worth knowing about is Quilt:
 | |
| 
 | |
| 	http://savannah.nongnu.org/projects/quilt/
 | |
| 
 | |
| Quilt is a patch management system, rather than a source code management
 | |
| system.  It does not track history over time; it is, instead, oriented
 | |
| toward tracking a specific set of changes against an evolving code base.
 | |
| Some major subsystem maintainers use quilt to manage patches intended to go
 | |
| upstream.  For the management of certain kinds of trees (-mm, for example),
 | |
| quilt is the best tool for the job.
 | |
| 
 | |
| 
 | |
| 2.6: MAILING LISTS
 | |
| 
 | |
| A great deal of Linux kernel development work is done by way of mailing
 | |
| lists.  It is hard to be a fully-functioning member of the community
 | |
| without joining at least one list somewhere.  But Linux mailing lists also
 | |
| represent a potential hazard to developers, who risk getting buried under a
 | |
| load of electronic mail, running afoul of the conventions used on the Linux
 | |
| lists, or both.
 | |
| 
 | |
| Most kernel mailing lists are run on vger.kernel.org; the master list can
 | |
| be found at:
 | |
| 
 | |
| 	http://vger.kernel.org/vger-lists.html
 | |
| 
 | |
| There are lists hosted elsewhere, though; a number of them are at
 | |
| lists.redhat.com.
 | |
| 
 | |
| The core mailing list for kernel development is, of course, linux-kernel.
 | |
| This list is an intimidating place to be; volume can reach 500 messages per
 | |
| day, the amount of noise is high, the conversation can be severely
 | |
| technical, and participants are not always concerned with showing a high
 | |
| degree of politeness.  But there is no other place where the kernel
 | |
| development community comes together as a whole; developers who avoid this
 | |
| list will miss important information.
 | |
| 
 | |
| There are a few hints which can help with linux-kernel survival:
 | |
| 
 | |
| - Have the list delivered to a separate folder, rather than your main
 | |
|   mailbox.  One must be able to ignore the stream for sustained periods of
 | |
|   time.
 | |
| 
 | |
| - Do not try to follow every conversation - nobody else does.  It is
 | |
|   important to filter on both the topic of interest (though note that
 | |
|   long-running conversations can drift away from the original subject
 | |
|   without changing the email subject line) and the people who are
 | |
|   participating.  
 | |
| 
 | |
| - Do not feed the trolls.  If somebody is trying to stir up an angry
 | |
|   response, ignore them.
 | |
| 
 | |
| - When responding to linux-kernel email (or that on other lists) preserve
 | |
|   the Cc: header for all involved.  In the absence of a strong reason (such
 | |
|   as an explicit request), you should never remove recipients.  Always make
 | |
|   sure that the person you are responding to is in the Cc: list.  This
 | |
|   convention also makes it unnecessary to explicitly ask to be copied on
 | |
|   replies to your postings.
 | |
| 
 | |
| - Search the list archives (and the net as a whole) before asking
 | |
|   questions.  Some developers can get impatient with people who clearly
 | |
|   have not done their homework.
 | |
| 
 | |
| - Avoid top-posting (the practice of putting your answer above the quoted
 | |
|   text you are responding to).  It makes your response harder to read and
 | |
|   makes a poor impression.
 | |
| 
 | |
| - Ask on the correct mailing list.  Linux-kernel may be the general meeting
 | |
|   point, but it is not the best place to find developers from all
 | |
|   subsystems.
 | |
| 
 | |
| The last point - finding the correct mailing list - is a common place for
 | |
| beginning developers to go wrong.  Somebody who asks a networking-related
 | |
| question on linux-kernel will almost certainly receive a polite suggestion
 | |
| to ask on the netdev list instead, as that is the list frequented by most
 | |
| networking developers.  Other lists exist for the SCSI, video4linux, IDE,
 | |
| filesystem, etc. subsystems.  The best place to look for mailing lists is
 | |
| in the MAINTAINERS file packaged with the kernel source.
 | |
| 
 | |
| 
 | |
| 2.7: GETTING STARTED WITH KERNEL DEVELOPMENT
 | |
| 
 | |
| Questions about how to get started with the kernel development process are
 | |
| common - from both individuals and companies.  Equally common are missteps
 | |
| which make the beginning of the relationship harder than it has to be.
 | |
| 
 | |
| Companies often look to hire well-known developers to get a development
 | |
| group started.  This can, in fact, be an effective technique.  But it also
 | |
| tends to be expensive and does not do much to grow the pool of experienced
 | |
| kernel developers.  It is possible to bring in-house developers up to speed
 | |
| on Linux kernel development, given the investment of a bit of time.  Taking
 | |
| this time can endow an employer with a group of developers who understand
 | |
| the kernel and the company both, and who can help to train others as well.
 | |
| Over the medium term, this is often the more profitable approach.
 | |
| 
 | |
| Individual developers are often, understandably, at a loss for a place to
 | |
| start.  Beginning with a large project can be intimidating; one often wants
 | |
| to test the waters with something smaller first.  This is the point where
 | |
| some developers jump into the creation of patches fixing spelling errors or
 | |
| minor coding style issues.  Unfortunately, such patches create a level of
 | |
| noise which is distracting for the development community as a whole, so,
 | |
| increasingly, they are looked down upon.  New developers wishing to
 | |
| introduce themselves to the community will not get the sort of reception
 | |
| they wish for by these means.
 | |
| 
 | |
| Andrew Morton gives this advice for aspiring kernel developers
 | |
| 
 | |
| 	The #1 project for all kernel beginners should surely be "make sure
 | |
| 	that the kernel runs perfectly at all times on all machines which
 | |
| 	you can lay your hands on".  Usually the way to do this is to work
 | |
| 	with others on getting things fixed up (this can require
 | |
| 	persistence!) but that's fine - it's a part of kernel development.
 | |
| 
 | |
| (http://lwn.net/Articles/283982/).
 | |
| 
 | |
| In the absence of obvious problems to fix, developers are advised to look
 | |
| at the current lists of regressions and open bugs in general.  There is
 | |
| never any shortage of issues in need of fixing; by addressing these issues,
 | |
| developers will gain experience with the process while, at the same time,
 | |
| building respect with the rest of the development community.
 |