Summary
Morphogen gradients provide embryonic tissues with positional information by inducing target genes at different concentration thresholds and thus at different positions. The Bicoid morphogen gradient in Drosophila melanogaster embryos has recently been analysed quantitatively, yet how it forms remains a matter of controversy. Several biophysical models that rely on production, diffusion and degradation have been formulated to account for the observed dynamics of the Bicoid gradient, but no one model can account for all its characteristics. Here, we discuss how existing data on this gradient fit the various proposed models and what aspects of gradient formation these models fail to explain. We suggest that knowing a few additional parameters, such as the lifetime of Bicoid, would help to identify and develop better models of Bicoid gradient formation.
Introduction
A fundamental question in developmental biology is how naïve tissue is provided with positional information. One answer to this question posed the idea of morphogens (Turing, 1952), patterngenerating substances that form concentration gradients across fields of cells (Wolpert, 1969; Crick, 1970; Gierer and Meinhardt, 1972). Simple molecular mechanisms, such as localised protein synthesis, diffusion and degradation, can yield nonuniform distributions of morphogens that determine pattern formation. In the modern formulation, cells or nuclei would respond to different concentration thresholds of the morphogen by the expression of specific target genes, thereby translating the differential morphogen concentration into patterns of gene expression (Wolpert, 1969; Crick, 1970).
The simplicity of the morphogen gradient model has been attractive to developmental biologists in that it reduces the problem of specifying positional information to measuring quantitative differences in a single molecule. It is relatively easy to envision cell biological processes and signalling events that could localise a morphogen source or break symmetry. The final informationrich distribution of a morphogen is controlled by measurable physical parameters, such as diffusion and lifetime.
bicoid, which encodes a homeobox transcription factor, was the first gene to be identified that fulfils the criteria of such a morphogen. The Bicoid gradient arises from maternally deposited mRNA at the anterior pole of the syncytial Drosophila embryo (St Johnston et al., 1989). It is translated at the start of egg deposition, and its protein gradient is established in less than 3 hours and is subsequently read by ~20 target genes, which are expressed in discrete patterns along the anteriorposterior (AP) axis (Jaeger et al., 2004; Schroeder et al., 2004; OchoaEspinosa et al., 2005; Manu et al., 2009a; Manu et al., 2009b; OchoaEspinosa et al., 2009).
A number of other morphogens were identified following the discovery of Bicoid. These include members of the Fibroblast growth factor (FGF), Transforming growth factor β (TGFβ), Hedgehog and Wingless (Wg) families, which pattern a wide variety of tissues in many different animals (Tabata and Takei, 2004). The biophysical parameters of several of these morphogens have also been measured; for example, for Decapentaplegic (Dpp; which belongs to the TGFβ family) and for Wg in the wing disk epithelium of Drosophila (Kicheva et al., 2007), for Fgf8 in the zebrafish embryo (Yu et al., 2009) and for the transcription factor Dorsal (Dl; the homologue of nuclear factor kappa B, NFκB) and Mitogenactivated protein kinase (MAPK) in the syncytial Drosophila embryo (DeLotto et al., 2007; Coppey et al., 2008; Kanodia et al., 2009). Lessons obtained from any of these gradients are potentially informative about the formation of morphogen gradients in general, but each gradient is distinguished by its own special features. In this review, we focus on the Bicoid gradient, which is the most thoroughly studied morphogen gradient. Our aim is to introduce the major classes of models and to discuss how quantitative data recently obtained from living embryos (Box 1) fit with each model, and which features of the Bicoid gradient are inconsistent with each model's predictions. We conclude by suggesting that knowing more about additional parameters of this gradient would help us to understand which of the models, if any, most accurately describes the formation of the Bicoid gradient and how future models could be improved.
Analysing the Bicoid gradient
Since its discovery in 1988, the Bicoid gradient has been analysed by a multitude of methods (Ephrussi and St Johnston, 2004). For the purpose of this review, we focus on those findings that directly relate to the biophysical models that we discuss.
bicoid RNA begins to be translated at egg deposition at a rate that is generally believed to be constant, and the protein gradient is established in less than 3 hours in the syncytial embryo, which is ~500 μm in length (Fig. 1, Fig. 2A). The protein is thought to move from its site of production to form a nuclear concentration gradient: nuclei closer to the source at the anterior contain higher concentrations of Bicoid than nuclei posterior to this position (Fig. 1, Fig. 2A,B). The environment in which the Bicoid gradient forms is highly dynamic and heterogeneous (Fig. 1). Following fertilisation, the embryo undergoes 13 synchronous rounds of nuclear division, such that within 2 hours the number of nuclei in the embryo increases by three orders of magnitude, from 1 to ~6000 nuclei [the nuclear diameter is ~10 μm until cycle 10, and then decreases to ~6 μm in cycle 14 (Gregor et al., 2007a)] (Fig. 1). In cycle 14, nuclei occupy ~30% of the cortical area that is
Box 1. Quantitative measurements of the Bicoid gradient
The recent reanalysis of the Bicoid gradient using live imaging of an enhanced green fluorescent protein 〈EGFP〉tagged form of Bicoid 〈Gregor et al., 2005; Gregor et al., 2007a; Gregor et al., 2007b〉 has allowed the dynamics of gradient formation to be followed quantitatively in living Drosophila embryos. The major findings from these and other recent studies that must be accounted for in the models of Bicoid gradient formation are as follows:
The intranuclear concentration gradient of Bicoid is well approximated by an exponentially decaying concentration profile 〈Driever and NussleinVolhard, 1988a; Houchmandzadeh et al., 2002; Gregor et al., 2007a〉. This gradient is established within 90 minutes or less 〈Gregor et al., 2007a〉.
Bicoid diffuses relatively slowly 〈D=0.3 μm^{2}/second〉 in the cortical cytoplasm on a time scale of minutes 〈Gregor et al., 2007a〉.
The nuclear Bicoid gradient is stable over cell cycles 1014. In particular, the initial concentration in nuclei in successive cycles is constant to within 10%, even though it decays by ~30% during the subsequent interphase period as nuclear volume and surface area change 〈Gregor et al., 2007a〉.
Nuclear Bicoid is in a dynamic equilibrium, which is brought about by its rapid import and export between the cytoplasm and the nucleus 〈Gregor et al., 2007a〉.
The spatial shape of the Bicoid gradient 〈the length constant for an exponential decay〉 scales with embryo length over a factor of five in different dipteran species with different egg sizes 〈Gregor et al., 2005〉.
Once the Bicoid gradient is established, target genes, such as the hunchback and Kruppel gap genes, are transcribed at different concentrations of Bicoid and thus at different positions in the embryo (Fig. 2C,D) (Driever and NussleinVolhard, 1988a; Driever and NussleinVolhard, 1988b; Driever and NussleinVolhard, 1989; Driever et al., 1989; Struhl et al., 1989; Hoch et al., 1991; Driever, 2004; Ephrussi and St Johnston, 2004; NussleinVolhard, 2004). Bicoid concentration is not the only information read by these target genes. Although altering the gene copy number of bicoid induces shifts in the boundaries of target gene expression domains, consistent with a concentrationdependent transcriptional response, the magnitude of these shifts is smaller than would be predicted from a concentration threshold mechanism alone (Driever and NussleinVolhard, 1988b; Struhl et al., 1989). Instead, the exact position of each boundary depends on the interactions that occur among the protein products of these target genes, several of which encode transcriptional repressors that are capable of repressing each other (Hoch and Jackle, 1993; Jaeger et al., 2004; Schroeder et al., 2004; OchoaEspinosa et al., 2005; OchoaEspinosa et al., 2009).
The Bicoid gradient was initially characterised using antibody staining and detection by enzymatic reaction, and images were acquired by brightfield microscopy (Driever and NussleinVolhard, 1988a). The concentration of Bicoid in nuclei decreases with the distance from the anterior pole and could be fitted to an exponential decay (Fig. 2E), an observation consistent with the possibility that the gradient arises by diffusion from its local source with spatially uniform degradation. Several models were subsequently proposed to explain the formation of the Bicoid gradient, which we discuss below.
Main models of Bicoid gradient formation
Central to all the models of Bicoid gradient formation is the constant synthesis of Bicoid protein at the anterior pole of the embryo (from an mRNA source that is either pointlike or graded), diffusion from its site of production, and its uniform degradation (Houchmandzadeh et al., 2002; Lander et al., 2002; Bergmann et al., 2007; Coppey et al., 2007; Bergmann et al., 2008; Spirov et al., 2009). Within each model, specific assumptions regarding the values of relevant variables (e.g. long or short lifetime) lead to differing interpretations of how the Bicoid gradient forms and whether and when it will be stabilised. The first two models rely solely on production, diffusion (see Box 2) and lifetime (see Box 3).
In mathematical terms, all models of Bicoid gradient formation rely on the following diffusion reaction differential equation, which is written for a semiinfinite onedimensional system (Rice, 1985):
Box 2. Diffusion coefficient
When a particle diffuses, its mean displacement is zero because diffusion is an isotropic process 〈it is uniform in all directions〉. However, its mean square displacement, which characterises the spread of the particle's position, is a linear function of time. The coefficient of proportionality is called the diffusion coefficient 〈the unit of which is length^{2}/time〉. A larger coefficient reflects a greater spread in position during a given time interval. The significance of an experimentally determined diffusion coefficient depends on how similar the probe used to measure this value is to the actual molecule of interest. For example, an inert probe, such as Dextran, will not localise to nuclei as Bicoid does.
In addition, the temporal window of observation can drastically influence an experimentally determined diffusion coefficient because the effective movement of a probe can include a mixture of mechanistically distinct transport events. On short time scales, such as a few seconds, measurements will reflect the movement of proteins within a cellular compartment. Over the course of minutes, measurements will include the effective diffusion within a compartment together with shuttling of molecules between neighbouring compartments. On the order of hours, molecular transport results from a number of events, including diffusion inside compartments, shuttling between them, and excursions into the deep cytoplasm and possibly the yolk. Moreover, a given time scale is intrinsically linked to a corresponding spatial scale, where short time scales correspond to short spatial scales and long time scales correspond to large spatial scales. Obviously, as the Bicoid gradient forms over a ~2 hour period the relevant diffusion coefficient should be measured on that time scale.
Given the heterogeneity and dynamics of the syncytial embryo, a modelling approach based on partial differential equations might appear to be inappropriate because this approach would need to be complemented by a large set of boundary conditions that describe the exchanges between compartments. Moreover, the parameters D, α and j(x,t) could be different for each compartment, and thus an analytical solution cannot be derived and only numerical simulations would be tractable.
However, the aim of the models presented in this review is to describe the establishment of the gradient over the time and length scales of the embryo without including all the details of the substructures. In this case, the parameters in equation (1) characterise the macroscopic properties of the syncytial embryo and are an average of the reaction and transport events, which include, for example, shuttling of Bicoid between cytoplasmic islands. An explicit formulation of the link between macroscopic parameters and detailed substructures can be obtained through the application of homogenisation routines (Sample and Shvartsman, 2010). The important point when using macroscopic parameters is that great care has to be taken when comparing the experimental values to the parameters of the model as they might not represent directly comparable quantities (see Box 2).
Model 1: the steadystate approximation
The most widely cited model (Wolpert, 1969; Crick, 1970) was formulated before the identification of Bicoid. Wolpert and Crick showed that a simple physical diffusion and reaction model from a localised source can account for most of the phenomenological observations of development known at the time, i.e. that a gradient could be established and naïve tissues patterned over time and length scales (the size of the patterned tissue) that are compatible with diffusion coefficients believed to be relevant for biologically active molecules such as proteins.
The observed exponentially decaying concentration gradient of Bicoid has been taken to mean that the Bicoid gradient arises by local constant protein synthesis, diffusion and spatially uniform degradation (referred to as the SDD model) (Driever and NussleinVolhard, 1988a; Houchmandzadeh et al., 2002). If the rate of protein degradation is high (i.e. if its lifetime is short) with respect to the time scale over which the gradient forms, then its distribution approaches a steady state because production is balanced by degradation. The consequence of the balance between synthesis and degradation is that both the total concentration of Bicoid in the embryo and the spatial concentration profile remain stable (Fig. 3A). Mathematically, at steady state, the diffusion reaction equation (1) becomes time independent and allows an exact solution. The concentration of Bicoid at a given position now depends only on the diffusion and lifetime of the protein.
A solution to equation (1) where the source is a point j(x,t)=Qδ(x)H(t) is obtained by setting the time derivative to zero ∂C(x,t)/∂t=0 (Rice, 1985) so that:
Box 3. Protein lifetime and steady state
The lifetime of a protein, that is, its characteristic survival time, sets a limit as to whether and when a steady state is reached. For firstorder chemical reactions, such as the unimolecular degradation scheme , the time evolution of the decaying chemical species is exponential: [A]〈t〉=[A]〈t=0〉exp〈–αt〉, where [A]〈t〉 is the concentration of the chemical species A, and α is the degradation rate 〈the amount of material being degraded per unit of time〉.
It is possible to define the lifetime τ as the mean survival time of a particle before it is degraded. For an exponential decay 〈as above〉, the lifetime is the inverse of the degradation rate: τ=1/α. Some studies refer to halflife h 〈the time at which the concentration of the chemical species has dropped to one half of its initial value〉 rather than to lifetime. The relation between lifetime and halflife is τ=hlog〈2〉.
The notion of steady state originates from the field of mathematics. The steady state is the timeindependent asymptotic solution of a differential equation. In reallife experimental conditions, a true steady state cannot be observed as it would require an infinite time of observation. However, most chemical systems relax towards their steady state exponentially quickly. This means that even if the steady state is never reached, an observed quantity can appear to be almost stable, and this stable state will be extremely close to the theoretical steady state. Therefore, an experimental quantity, such as the concentration of a protein, can be described as being at steady state if this quantity is within X% of its steadystate value (known theoretically), where X is a number that is defined arbitrarily.
With respect to the Bicoid gradient, numerous discussions have centred on whether or not the gradient is at steady state. Before asking this question, one should define what steady state means for such an experimentally observed quantity. Furthermore, the Bicoid gradient can appear to be stable without being close to any kind of steady state. Everything depends on what we see as being stable. For example, measurements of the nuclear concentrations of Bicoid may suggest that the total gradient does not change over the last five cell cycles and therefore that the gradient is at steady state. But, as we discuss below, this might or might not be true. It is worth pointing out
Box 4. The synthesis, diffusion and degradation (SDD) equation
The mathematical models presented in this review all rely on partial differential equations that describe Bicoid synthesis, diffusion, degradation and shuttling. The central quantity from which the equations are derived is the concentration of the protein: C〈x,t〉. This is the local concentration of Bicoid at position x at time t and follows equation 〈1〉. Below, we explain this equation.
Assuming given time intervals Δt and space intervals Δx, the equation takes the following form:
Model 2: the presteadystate hypothesis
Over the timecourse of the syncytial blastoderm (the first 3 hours of Drosophila development), the Bicoid gradient may never approach its steady state if the lifetime of Bicoid were long with respect to the time scale over which the gradient forms (Bergmann et al., 2007). The total levels of Bicoid would continue to rise, and target genes would see a continuously changing concentration profile (Fig. 3B). The reading of a dynamic gradient would be expected to cause a gradual posterior shift in the expression boundaries of target genes. Such a shift could be avoided if a precise timing mechanism determined a single point in development when the Bicoid gradient was utilised. In the model
Box 5. Length constant
The length constant 〈x〉, or characteristic length, is a quantitative estimate of the spread of a spatial profile 〈for example, of Bicoid molecules〉. Mathematically, it is defined as the mean position of a particle that makes up a concentration profile. Let C〈x〉 be a concentration profile of a molecular gradient in which all molecules are independent, meaning they do not interact and therefore do not influence the distribution of each other. The concentration profile can be expressed as:
The characteristic length, explicitly the first moment of a molecule's distribution, is then given by:
In the simple case of an exponentially decaying concentration profile C〈x〉=exp〈−x/λ〉, the characteristic length is the decay length of the exponent. Indeed, in this case, N_{tot}=λ, p〈x〉=1/λexp〈−x/λ〉, and the characteristic length is 〈x〉=1/λ∫exp〈−x/λ〉dx=λ. In other words, for an exponential profile, the length constant is the position along the AP axis at which the concentration of Bicoid has dropped to 1/exp 〈about onethird〉 of its maximal value at the anterior 〈see also Fig. 1E〉.
Experimentally, the length constant of the Bicoid gradient was principally obtained by fitting an exponential to the measured intensity profile 〈using an EGFPtagged form of Bicoid or antibody staining; see Box 1 for more information〉 to extract the length constant of the exponent 〈Fig. 2E〉. Ideally, the length constant is calculated by the integration of the spatial profile 〈x〉=∫xC〈x〉dx/∫C〈x′〉dx′ without assuming an exponentially decaying profile. Note that both methods of computing the length constant strongly depend on estimates of the background level of fluorescence.
Mathematically, the solution of equation (1) is now time dependent, meaning that the concentration of the morphogen at a given position is no longer stable but changes with time (Bergmann et al., 2007). As a consequence, both the total concentration and the spatial profile of Bicoid keep changing (Fig. 3B). Both changes become attenuated with increasing time, as can be seen graphically in Fig. 3B, when the curve symbolising the amount of morphogen at a given time asymptotically approaches a limit.
Model 2 is equivalent to model 1, but now the full timedependent solution of equation (1) is considered (Bergmann et al., 2007):
The length constant λ of the gradient is now dependent on diffusion, degradation and time, and it increases with time (unlike at steady state, where the length constant is time independent and
stable). The shape of the gradient is not a simple exponential and the mathematical expression of the length constant cannot be explicitly given. However, it can be approximated by λ(t)=√4Dt for times such that t<<1/α, whereas it tends to λ_{eq} for long time periods (Bergmann et al., 2007). At intermediate times, the length constant is always less than the length constant λ_{eff}(t) with no degradation (see model 3):
Model 3: the nuclear trapping model
During the period when the Bicoid gradient is being established in the early Drosophila embryo, the number of nuclei changes by three orders of magnitude. Because Bicoid, as a transcription factor, accumulates in nuclei, their changing number might affect the gradient locally and/or globally (Lander et al., 2002; Coppey et al., 2007). Such local and global effects have been detected for the gradient of phosphorylated MAPK, which is activated by the receptor tyrosine kinase Torso at the termini of the Drosophila embryo (Coppey et al., 2008) and could play a role in both shaping and stabilising the Bicoid gradient. In a simple view, the increasing number of nuclei could balance the increase in the total amount of Bicoid, resulting in a gradient in which nuclear concentrations at a given position remain stable even if Bicoid degradation is too low to balance synthesis or to achieve a stable total concentration. Target genes would read stable nuclear concentrations of Bicoid (as in the steadystate model) even though the total gradient is not at steady state. The most detailed version of this model (Coppey et al., 2007) assumes no degradation of Bicoid (Fig. 3C). It incorporates the constant production of Bicoid at the anterior, diffusion from this source, and the nucleocytoplasmic shuttling of Bicoid. Bicoid is presumed to be bound/immobile when in nuclei and free/diffusible when in the cytoplasm.
Mathematically, a term that describes nucleocytoplasmic shuttling of Bicoid is added to equation (1) (see Box 6). This term does not affect the total concentration of Bicoid in the embryo (which in this particular model depends only on production because there is no degradation) but it does affect the local distribution of Bicoid. As with the previous model, the solution is time dependent, yet the result of nucleocytoplasmic shuttling is that (given the right parameters) a stable, timeindependent state is mimicked.
Box 6. Nucleocytoplasmic shuttling
The physical properties of nucleocytoplasmic shuttling are of particular importance as they set the relationship between the concentration of Bicoid in the nuclei, where the developmental decision takes place, and its cytoplasmic concentration, where the gradient is established. The characteristic shuttling time is relatively fast compared with the duration of a nuclear cycle 〈1 minute versus 820 minutes〉 and allows the nuclear concentration, N〈x,t〉, to be expressed as a simple function of the cytoplasmic concentration C〈x,t〉:
Eventually, the mathematical quantity that best reflects the experimentally measured Bicoid concentration per nucleus, n〈x,t〉, is obtained by dividing the modelled nuclear concentration by the nuclear density ρ: n〈x,t〉=N〈x,t〉/ρ. Assuming that the nuclear import k_{in} is proportional to the nuclear density, n〈x,t〉 would be proportional to C_{tot}〈x,t〉/1+K, so that the increase in C_{tot}〈x,t〉 due to continuous production of Bicoid could be balanced by the increase of K.
The varying total Bicoid concentration has the following timedependent expression (Rice, 1985):
In this model, there is no steady state and total Bicoid concentration increases infinitely (Fig. 3C).
Model 4: a gradient of bicoid mRNA
The RNA that encodes the Bicoid protein is synthesized during oogenesis and remains tightly localised to the anterior pole of the egg until fertilisation (St Johnston et al., 1989; St Johnston et al., 1991). At that point, it assumes a more diffuse distribution (Weil et al., 2008) and its translation is presumed to begin. Model 4 incorporates a graded distribution of the bicoid RNA source (in contrast to the first three models, see Fig. 1F) (Spirov et al., 2009). How the RNA distribution affects the protein gradient will depend on the distribution of the source RNA and the diffusion/degradation parameters of the protein. If the length constant √D/α of the protein is very low compared with the length scale of the mRNA source (slow diffusion and short lifetime), then Bicoid will accumulate where it is synthesized and the protein distribution will mirror the source RNA distribution (Spirov et al., 2009). However, if √D/α is very large compared with the length scale of the source (if there is no degradation and/or a large diffusion coefficient), then Bicoid will accumulate in the embryo exactly as it would for a pointlike source.
As all the mRNA is provided by the mother, if the mRNA diffuses from its initial site at the anterior tip, then it will result in a Gaussian rather than exponential distribution (because there is no production of mRNA). Then, the source term of equation (1) is:
In conclusion, although all models assume constant production of Bicoid, its diffusion and spatially uniform degradation, and, in one case, nucleocytoplasmic shuttling, the difference in parameter choice (e.g. long or short lifetime) yields Bicoid gradients with very different spatiotemporal characteristics, which would then affect the subsequent interpretation of the gradient by its target genes.
Discussion of the models
The length scale of the gradient and diffusion
The length scale has been the most experimentally measured parameter of the Bicoid gradient (Houchmandzadeh et al., 2002; Crauk and Dostatni, 2005; Gregor et al., 2005; Gregor et al., 2007a; Gregor et al., 2007b; He et al., 2008), as it represents the most obvious quantitative characteristic of the gradient and it is easy to measure both in fixed and living embryos (Fig. 2E). Experimental studies have shown that the length constant λ_{obs} of the exponential fit of the gradient (Fig. 2E, Box 5) is ~100 μm over the 500 μm total egg length.
In all models, the length scale of the Bicoid gradient depends critically on the diffusivity of Bicoid protein. The larger the diffusion coefficient, the more Bicoid reaches positions far from its source. However, the diffusion coefficient that was measured in live embryos is ~0.3 μm^{2}/second. With this value of diffusivity, models 13 cannot explain the experimentally observed length scale of the gradient, as explained below.
The steadystate model (model 1) predicts a length constant of λ_{eq}=√D/α. Given the low diffusivity of Bicoid protein, for the gradient to achieve 95% of the theoretical steady state after 90 minutes (which corresponds to cycle 10, when nuclear stability is first observed) [Q(t)=1–e^{t/τ}, then 0.95Q=1–e^{90/30}], the lifetime of Bicoid would have to be less than 30 minutes (α≥5.5×10^{−4} seconds). With such numerical values, the maximal length scale of the gradient λ_{eq} would be less than 25 μm, a value far from the observed 100 μm (see Fig. 3D).
The nonstationary models (models 2 and 3), in which the lifetime of Bicoid is assumed to be long or infinite and a steady state is not approached, predict a length constant λ_{eff}(t)=2√4Dt/3√π, which gives a maximal length scale of 35 μm when the considered time is maximal, namely at cycle 14 (t=7200 seconds). At shorter times (after 90 minutes at cycle 10, when nuclear stability is already observed) t=4800 seconds and the length scale is even smaller at λ_{eff}(t)=28 μm.
For the RNA gradient model (model 4), the situation is different as the length scale of the mRNA source can play a major role in setting the final length scale of the protein gradient. A recent analysis of the mRNA gradient suggests a value of λ_{RNA}/L≈0.2 (L being the length of the embryo along the AP axis) at cycle 14 (Spirov et al., 2009), which corresponds to λ_{RNA}≈100 μm and is sufficient by itself to explain the length constant of the protein. However, for the formation of the Bicoid gradient, the 2 hours prior to cycle 14 matter the most. During this time, the length scale of the RNA source is ~50 μm (observed for early cycles 912) and is probably less than this at earlier times. If the RNA distribution has a length constant of 50 μm, then the gain in the length scale of the protein gradient due to diffusion would be a mere 20 μm, which is insufficient to explain the actual protein distribution.
In conclusion, given the measured length constant and diffusion coefficient, none of the models predicts the gradient as it is actually observed, although an RNA gradient model is potentially more compatible with the observed low diffusion coefficient.
Stability of nuclear concentration
By the time nuclei reach the surface of the embryo during cycle 10 (90 minutes after fertilisation), until cycle 14, the nuclear concentration of Bicoid at a given position remains stable (Gregor et al., 2007a). How can nuclear stability be achieved when nuclei continue proliferating (from cycle 10 to 14 the number of nuclei increases by a factor of 16)?
In the steadystate model (model 1), the total amount of Bicoid in the embryo is fixed at the same time as its spatial concentration profile (Fig. 3A). Under these conditions, stable nuclear concentration can be achieved if nuclear import is proportional to nuclear volume and if the nuclear Bicoid represents only a small fraction of the total pool in the cytoplasm. This is probably the case during cycles 10 through 12, but by cycle 1314 nuclei begin to represent a significant fraction of the cortical volume.
In the presteadystate models (models 2 and 3, Fig. 3B,C), the total concentration of Bicoid in the embryo is predicted to rise and the total gradient continues to change its shape. A balance between total Bicoid levels and the number of nuclei can be achieved only if the total nuclear volume increases linearly with time, assuming that rates of synthesis are constant and accumulation is linear. Although the number of nuclei increases exponentially, their size decreases with each cycle (Gregor et al., 2007a), such that the total increase in nuclear volume is close to linear from cycles 1013. However, it is unclear how the total amount and spatial distribution of Bicoid protein change over time.
The RNA gradient model (model 4) predicts a short lifetime. In this model, to obtain nuclear stability, the same conditions as those for the steadystate model (model 1) apply: the increasing nuclear volume samples only a small fraction of a constant protein gradient.
In conclusion, to determine which of the models might satisfy the nuclear stability condition we need to know additional parameters, namely the total concentration of Bicoid protein in the egg, its rates of nuclear import and the lifetime of the protein.
Scaling of the gradient
The Bicoid gradient in eggs of different sizes scales such that the length constant is larger in longer eggs (Fig. 1H) (Gregor et al., 2005). It is not obvious how this can be achieved.
One possible mechanism for scaling in model 1, where the length constant is λ_{eq}=√D/α, is nuclear degradation of Bicoid. If degradation takes place inside (and only inside) nuclei, then the rate of degradation α would be proportional to the nuclear density ρ. This nuclear density is given by ρ=N/L^{2}, where N is the number of nuclei and L the length of the embryo. Because the number of nuclei remains the same in embryos of different lengths, the degradation rate would be inversely proportional to L^{2}, so that the length constant would be proportional to L, thus achieving scaling.
However, scaling that is mediated by nuclear degradation seems to be in conflict with the stability of the nuclear gradient during successive cleavage divisions if increasing numbers of nuclei increased the overall rate of degradation. The length constant of the gradient λ_{eq}=√D/α(ρ) would change at each cycle with respect to the nuclear density. Thus, scaling and stability of the gradient cannot be achieved simultaneously in the steadystate model (model 1).
As noted above, for the gradient to scale, the length constant of the gradient has to be a function of egg length. In the presteadystate models, the length constant is time dependent. If the time at which the gradient is utilised were dependent on the length of the embryo, then the gradient would not scale but the pattern of its responses would. To determine whether such a timing mechanism could account for scaling requires a detailed analysis of the developmental timing in different species. As a formal alternative, the embryo could alter the effective diffusion coefficient to scale with egg length. Finally, if the RNA gradient scales with embryo length, then the protein gradient would do so as well. Any potential mechanism(s) responsible for scaling of either the RNA gradient or the effective diffusion coefficient remain unknown.
The unknowns
As discussed above, none of these models can satisfy all of the criteria listed in Box 1. To understand the formation of the Bicoid gradient, we need to add new parameters to these models and to reevaluate some of the previously measured parameters, as we discuss below.
How does Bicoid move?
All measurements of Bicoid diffusivity so far have resulted in values (D≈0.3 μm^{2}/second) (Gregor et al., 2007a) that are an order of magnitude too small to explain the length scale of the gradient. This is a problem for all except the RNA gradient model (Spirov et al., 2009), in which the distribution of Bicoid protein is controlled by the as yet unmeasured diffusion of the RNA. So how does Bicoid get to where we see it?
With certainty, these measurements show that there is a fraction of cytoplasmic Bicoid that diffuses slowly. But one cannot exclude several additional uncertainties. (1) There could be another fraction of cytoplasmic Bicoid that diffuses much faster. All recovery curves from photobleaching experiments start at a level that seems to be much higher than the background, suggesting that a fraction of Bicoid is already reequilibrated. (2) Fast and effective diffusion events may be visible over longer time scales, perhaps owing to bulk movement or to the ‘stirring’ of the cytoplasm during cleavage divisions (Hecht et al., 2009). (3) The diffusion coefficient might be much higher in other compartments of the embryo; for example, within the centre of the embryo where diffusion cannot be observed. (4) Bicoid might diffuse faster during early cycles (cycles 19), a time during which the Bicoid protein cannot be seen by live imaging, and therefore the diffusion coefficient of Bicoid cannot be measured directly.
How much Bicoid is there?
One prediction of the steadystate model (model 1) is that the total amount of Bicoid in the embryo does not change with time (Fig. 3A). If we knew this value, we could distinguish between the gradient at steady state (constant amount) (Fig. 3A) or at presteady state (increasing amount) (Fig. 3B,C). The optical approach (using confocal or twophoton microscopes) only tells us about the surface concentration of Bicoid (which keeps rising and is therefore consistent with the presteadystate models), and as we do not know how Bicoid distributes in the various compartments, we cannot infer its total amount in the embryo.
How is Bicoid distributed in each compartment at any stage?
Given the total amount of Bicoid at a given time, we need to know how this amount is partitioned between the physical structures of the embryo. There are several compartments in the syncytial embryo, which are dynamic during the first 3 hours of development when the Bicoid gradient is established (Mavrakis et al., 2009a; Mavrakis et al., 2009b). We do not know the exact volumes of these compartments, except for the nuclei (which are well characterised with respect to their diameter) (Gregor et al., 2007a), and therefore we cannot infer the total amounts of Bicoid in each compartment and have to use relative concentrations. But the geometry and physical properties of these compartments strongly influence the shuttling of Bicoid between them and, as a consequence, the overall transport of Bicoid on the scale of the embryo, as well as its nuclear localisation.
Lifetime
Another way of distinguishing between the steadystate and presteadystate models requires us to know the lifetime of Bicoid protein. If the lifetime is short with respect to developmental time scales, then the gradient would approach a steady state (Fig. 3A), whereas with a long or infinite lifetime the gradient would keep changing (Fig. 3B,C).
In addition, the degradation of Bicoid sets a limit to how far the protein can move over a given time. In mathematical terms, lifetime and diffusion control the length scale of the gradient (λ=√Dτ). The higher the degradation rate, the sharper the profile of the resulting gradient will be.
RNA localisation
Although bicoid RNA is tightly bound to the anterior cortex of the developing oocyte during Drosophila oogenesis, it has been known for some time that its localisation becomes more diffuse after fertilisation (St Johnston et al., 1989). Part of this displacement involves microtubular transport (Weil et al., 2008). How this dispersion affects the ultimate distribution of Bicoid protein is still a matter of some controversy. Two issues seem important here. First, as the nuclear protein gradient is stable from cycle 10/11, the relevant RNA distribution must precede that point. Second, Bicoid protein is detected in nuclei at the surface and in the centre of the egg. Both these issues point to the importance of measuring RNA distribution early during development, prior to cycle 10 when nuclei are not yet at the surface. So far, published quantifications of RNA have been performed by wholemount in situ hybridisation, from which only the surface level of bicoid mRNA can be reliably measured (Spirov et al., 2009).
Production
Translation of Bicoid protein is assumed to be constant during the first 3 hours of development, when the gradient is established. The major argument supporting constant production is the fact that bicoid mRNA is stable for the first 3 hours, before it starts to decay (Surdej and JacobsLorena, 1998). However, the length of the bicoid poly(A) tail is dynamic, reaching a maximum length at ~1.5 hours of development, suggesting that Bicoid protein production might not be constant (Salles et al., 1994). In addition, Bicoid protein is undetectable by western blotting in embryos during the first hour of development (Driever and NussleinVolhard, 1988a), again suggesting that protein production is not constant. This evidence is indirect, however, and we need a direct measurement of the rate of Bicoid synthesis.
If production turns out to be nonlinear, all the models we have discussed (which rely on the constant production assumption) become much more complicated.
Nuclear transport
Nuclear import and export determine the relative cytoplasmic and nuclear concentrations of Bicoid. The experimental observations of this are mainly based on the measured nuclear concentrations of Bicoid. However, the models we discuss predict total Bicoid concentrations. One way to compare the predicted and measured values is by assessing the nuclear transport rates of Bicoid.
At steady state, the relative nuclear and cytoplasmic Bicoid concentrations reveal the import to export ratio, K. If K remains constant over nuclear cycles, then the Bicoid concentration per nucleus reflects the cytoplasmic Bicoid concentration in that these values will differ only by a proportionality constant. By contrast, if K changes with time, then there will be a timedependent filter between the cytoplasmic and nuclear concentrations.
K can be a function of the nuclear volume, the nuclear surface or the number of nuclei. First, nuclear volume could dictate this ratio if the import rate were controlled by a firstorder chemical reaction between cytoplasmic Bicoid proteins and the nuclear binding sites. Second, if the import rate were limited by the diffusive capture of cytoplasmic Bicoid by the nucleus (the time it takes to reach the nuclear surface) or by the number of nuclear pores present on the nuclear membrane, K would be a linear function of the nuclear surface area. Third, if nuclear accumulation were controlled by the nonspecific binding of Bicoid to DNA and not by the time needed to enter the nucleus, then K would be proportional to the amount of DNA and thus to the number of nuclei.
The volume of the cytoplasm will be the most relevant physical factor determining the value of K. Potentially, subcompartmentalisation of the cytoplasm could also be a critical feature; for example, if the shuttling of Bicoid within or between cytoplasmic islands reflects an additional set of importexport rates.
When is the gradient utilised?
The transcription of Bicoid target genes has been observed as early as cycle 10 (Knipple et al., 1985; Tautz et al., 1987; Jaeger et al., 2007). At this stage, for certain Bicoid lifetime values, the gradient might be far from its steady state (Fig. 3B,C). If the relevant reading of the Bicoid gradient by target genes occurs at cycle 10, as has been suggested by Bergmann et al. (Bergmann et al., 2007), they would have to read a dynamic gradient and a precise timing mechanism becomes crucial. However, if target genes read the Bicoid gradient only in cycle 14, when it is closer to its steady state, there would be no need for such a mechanism. Therefore, it would be interesting to determine when the Bicoid gradient is actually utilised, or, more precisely, when does the relevant interpretation of the Bicoid gradient occur? For example, Bicoid might induce a low level of hunchback and of other gap gene expression at cycle 10, but if the gradient is read continuously in nuclei between cycles 10 and 14, only the highlevel response at cycle 14 might be relevant for the next step in patterning and in the establishment of pairrule gene expression.
The temporal requirement for Bicoid has been tested by shifting a temperaturesensitive bicoid allele (called E3) to the restrictive temperature at various developmental stages. Starting at polecell formation (cycle 9), Drosophila embryos display a high sensitivity to the temperature shift, and this sensitivity lasts until cycle 14, suggesting that Bicoid does matter as early as cycle 9/10 (Frohnhoefer and NuessleinVolhard, 1987). In another experiment, anterior cytoplasm (which contains Bicoid) was transferred to hosts of different stages. A strong response (scored by phenotype) was only induced in hosts of ~1.5 hours of age (corresponding approximately to cycle 10), and the strength of the response fell sharply afterwards (Frohnhoefer and NuessleinVolhard, 1986).
Lastly, the Bicoid gradient has been perturbed using a microfluidic device, which allows the embryo to be exposed to large temperature differences. Here, the critical time for Bicoiddependent patterning was found to be between 65 and 100 minutes of development, corresponding to cycles 811 (Lucchetta et al., 2005). Together, these experiments support the idea that Bicoid is needed early, around cycle 10/11, consistent with the model proposed by Bergmann et al. (Bergmann et al., 2007). However, if the Bicoid concentration were measured only at cycle 10 and then stably ‘remembered’ during the subsequent proliferation of nuclei, it is hard to explain the tight correlation that is observed between Bicoid and Hunchback levels at cycle 14, because under these circumstances, despite the smooth variation in Bicoid, Hunchback would be expected to show a coarse, patchy distribution in nuclei with similar cycle10 lineages. The similarly graded distribution of both proteins that is actually observed at cycle 14 (Gregor et al., 2007b) suggests that the Hunchback response continues to be adjusted throughout the syncytial cycles (Gregor et al., 2007b).
Are there different Bicoids?
The models assume the presence of a single Bicoid activity in the embryo, which might not be correct. Western blots demonstrate the presence of differently modified forms of Bicoid in the embryo (Driever and NussleinVolhard, 1988a; Ronchi et al., 1993; Janody et al., 2000), and each modification might alter its specific activity. Phosphorylation by the receptor tyrosine kinase Torso affects the transcriptional activity of Bicoid (Ronchi et al., 1993; Janody et al., 2000). Bicoid might be modified by other pathways; removal of a sumoylation enzyme affects its nuclear localisation (Epps and Tanda, 1998), but it is not known whether Bicoid is a direct target of this modification. Bicoid eventually decays, and it is tempting to speculate that this occurs by the ubiquitin proteasome pathway. So even if the total amount of Bicoid were consistent with one or all of the models, the population of Bicoid that matters for providing positional information, i.e. that of the transcriptionally active form, might support a different model. Also, different forms of Bicoid might move at different rates through the embryo. So it will be important to learn more about these modifications and their consequences, and to adjust the models of gradient formation accordingly.
Comparison with other morphogen gradients
For three other morphogen gradients – Dpp and Wg in the fly wing disk (Kicheva et al., 2007) and Fgf8 in the zebrafish embryo (Yu et al., 2009) – the measured key physical parameters, i.e. the diffusion coefficient and the lifetime, are consistent with the SDD model (in which the constant synthesis, diffusion and spatially uniform degradation of the morphogen is assumed). Notably, both Dpp and Wg diffuse at roughly similar rates as Bicoid (<1 μm^{2}/second), whereas the diffusion coefficient of Fgf8 in the zebrafish embryo is two orders of magnitude higher, similar to the diffusion of a GFPsized protein in water (Swaminathan et al., 1997; Petrasek and Schwille, 2008). The high Fgf8 diffusion coefficient may reflect properties of the extracellular space in zebrafish (which might, for example, feature fewer binding sites for Fgf8 or low viscosity) or it might be an effect of the measurement method used [fluorescence correlation spectroscopy (FCS)]. In all cases, the lifetimes of these morphogens were found to be short enough for the gradient to approach steady state, given the time available for the formation of these gradients (τ_{Fgf8}≈30 minutes, τ_{Dpp}≈60 minutes, τ_{Wg}≈15 minutes) (Kicheva et al., 2007; Yu et al., 2009).
In the early Drosophila embryo, two other morphogen gradients, Dl and MAPK, form within the same environment as the Bicoid gradient, yet the resulting gradients differ in important aspects (DeLotto et al., 2007; Coppey et al., 2008; Kanodia et al., 2009). Although the Bicoid gradient retains both its shape and amplitude (when nuclear Bicoid is observed), the MAPK gradient sharpens with time while retaining its amplitude, and the Dl gradient increases its amplitude and keeps its shape (DeLotto et al., 2007; Coppey et al., 2008; Kanodia et al., 2009). The time at which each morphogen gradient is established is thought to play a role in their different characteristics (Bicoid is assumed to reach its final shape before nuclei reach the surface of the embryo and before nuclei constitute a significant fraction of the embryonic volume), as are the lifetime of the morphogen (the short lifetime of active MAPK) and the spatial extent of the sources that activate Dl (broad source) and MAPK (restricted source) (Kanodia et al., 2009).
Conclusion
Several models have been proposed to account for the establishment of the Bicoid gradient in the Drosophila embryo. However, none of those that we have discussed is fully consistent with all available experimental observations; each can only partially account for and explain experimental findings on the Bicoid gradient. Thus, one can assert that there is no consensus on the biophysical mechanisms that underpin Bicoid gradient formation and there is still much to understand about this process. Beyond this fact, we have shown that these simple models raise fundamental questions that highlight the need for new quantitative experiments. It is very likely that this constructive interplay between models and experiments will soon lead to a unified picture of Bicoid gradient formation. In particular, whether or not future models will be more complex with regard to the number of biophysical parameters that they include, there must be a simple ‘coarsegrained’ model that can explain the main features of the dynamics of Bicoid and its gradient based on experimentally measured parameters.
Acknowledgements
We are grateful to Stas Shvartsman, Stefano Di Talia, Shawn Little and other members of the E.W. laboratory for helpful suggestions and discussions. M.C. was supported by the Human Frontier Science Program. This work was supported by the Howard Hughes Medical Institute and the NIH. Deposited in PMC for release after 12 months.
Footnotes

Competing interests statement
The authors declare no competing financial interests.
 © 2010.
References
Summary
Morphogen gradients provide embryonic tissues with positional information by inducing target genes at different concentration thresholds and thus at different positions. The Bicoid morphogen gradient in Drosophila melanogaster embryos has recently been analysed quantitatively, yet how it forms remains a matter of controversy. Several biophysical models that rely on production, diffusion and degradation have been formulated to account for the observed dynamics of the Bicoid gradient, but no one model can account for all its characteristics. Here, we discuss how existing data on this gradient fit the various proposed models and what aspects of gradient formation these models fail to explain. We suggest that knowing a few additional parameters, such as the lifetime of Bicoid, would help to identify and develop better models of Bicoid gradient formation.
 Bicoid
 Morphogen
 Pattern formation
Introduction
A fundamental question in developmental biology is how naïve tissue is provided with positional information. One answer to this question posed the idea of morphogens (Turing, 1952), patterngenerating substances that form concentration gradients across fields of cells (Wolpert, 1969; Crick, 1970; Gierer and Meinhardt, 1972). Simple molecular mechanisms, such as localised protein synthesis, diffusion and degradation, can yield nonuniform distributions of morphogens that determine pattern formation. In the modern formulation, cells or nuclei would respond to different concentration thresholds of the morphogen by the expression of specific target genes, thereby translating the differential morphogen concentration into patterns of gene expression (Wolpert, 1969; Crick, 1970).
The simplicity of the morphogen gradient model has been attractive to developmental biologists in that it reduces the problem of specifying positional information to measuring quantitative differences in a single molecule. It is relatively easy to envision cell biological processes and signalling events that could localise a morphogen source or break symmetry. The final informationrich distribution of a morphogen is controlled by measurable physical parameters, such as diffusion and lifetime.
bicoid, which encodes a homeobox transcription factor, was the first gene to be identified that fulfils the criteria of such a morphogen. The Bicoid gradient arises from maternally deposited mRNA at the anterior pole of the syncytial Drosophila embryo (St Johnston et al., 1989). It is translated at the start of egg deposition, and its protein gradient is established in less than 3 hours and is subsequently read by ~20 target genes, which are expressed in discrete patterns along the anteriorposterior (AP) axis (Jaeger et al., 2004; Schroeder et al., 2004; OchoaEspinosa et al., 2005; Manu et al., 2009a; Manu et al., 2009b; OchoaEspinosa et al., 2009).
A number of other morphogens were identified following the discovery of Bicoid. These include members of the Fibroblast growth factor (FGF), Transforming growth factor β (TGFβ), Hedgehog and Wingless (Wg) families, which pattern a wide variety of tissues in many different animals (Tabata and Takei, 2004). The biophysical parameters of several of these morphogens have also been measured; for example, for Decapentaplegic (Dpp; which belongs to the TGFβ family) and for Wg in the wing disk epithelium of Drosophila (Kicheva et al., 2007), for Fgf8 in the zebrafish embryo (Yu et al., 2009) and for the transcription factor Dorsal (Dl; the homologue of nuclear factor kappa B, NFκB) and Mitogenactivated protein kinase (MAPK) in the syncytial Drosophila embryo (DeLotto et al., 2007; Coppey et al., 2008; Kanodia et al., 2009). Lessons obtained from any of these gradients are potentially informative about the formation of morphogen gradients in general, but each gradient is distinguished by its own special features. In this review, we focus on the Bicoid gradient, which is the most thoroughly studied morphogen gradient. Our aim is to introduce the major classes of models and to discuss how quantitative data recently obtained from living embryos (Box 1) fit with each model, and which features of the Bicoid gradient are inconsistent with each model's predictions. We conclude by suggesting that knowing more about additional parameters of this gradient would help us to understand which of the models, if any, most accurately describes the formation of the Bicoid gradient and how future models could be improved.
Analysing the Bicoid gradient
Since its discovery in 1988, the Bicoid gradient has been analysed by a multitude of methods (Ephrussi and St Johnston, 2004). For the purpose of this review, we focus on those findings that directly relate to the biophysical models that we discuss.
bicoid RNA begins to be translated at egg deposition at a rate that is generally believed to be constant, and the protein gradient is established in less than 3 hours in the syncytial embryo, which is ~500 μm in length (Fig. 1, Fig. 2A). The protein is thought to move from its site of production to form a nuclear concentration gradient: nuclei closer to the source at the anterior contain higher concentrations of Bicoid than nuclei posterior to this position (Fig. 1, Fig. 2A,B). The environment in which the Bicoid gradient forms is highly dynamic and heterogeneous (Fig. 1). Following fertilisation, the embryo undergoes 13 synchronous rounds of nuclear division, such that within 2 hours the number of nuclei in the embryo increases by three orders of magnitude, from 1 to ~6000 nuclei [the nuclear diameter is ~10 μm until cycle 10, and then decreases to ~6 μm in cycle 14 (Gregor et al., 2007a)] (Fig. 1). In cycle 14, nuclei occupy ~30% of the cortical area that is
Box 1. Quantitative measurements of the Bicoid gradient
The recent reanalysis of the Bicoid gradient using live imaging of an enhanced green fluorescent protein 〈EGFP〉tagged form of Bicoid 〈Gregor et al., 2005; Gregor et al., 2007a; Gregor et al., 2007b〉 has allowed the dynamics of gradient formation to be followed quantitatively in living Drosophila embryos. The major findings from these and other recent studies that must be accounted for in the models of Bicoid gradient formation are as follows:
The intranuclear concentration gradient of Bicoid is well approximated by an exponentially decaying concentration profile 〈Driever and NussleinVolhard, 1988a; Houchmandzadeh et al., 2002; Gregor et al., 2007a〉. This gradient is established within 90 minutes or less 〈Gregor et al., 2007a〉.
Bicoid diffuses relatively slowly 〈D=0.3 μm^{2}/second〉 in the cortical cytoplasm on a time scale of minutes 〈Gregor et al., 2007a〉.
The nuclear Bicoid gradient is stable over cell cycles 1014. In particular, the initial concentration in nuclei in successive cycles is constant to within 10%, even though it decays by ~30% during the subsequent interphase period as nuclear volume and surface area change 〈Gregor et al., 2007a〉.
Nuclear Bicoid is in a dynamic equilibrium, which is brought about by its rapid import and export between the cytoplasm and the nucleus 〈Gregor et al., 2007a〉.
The spatial shape of the Bicoid gradient 〈the length constant for an exponential decay〉 scales with embryo length over a factor of five in different dipteran species with different egg sizes 〈Gregor et al., 2005〉.
Once the Bicoid gradient is established, target genes, such as the hunchback and Kruppel gap genes, are transcribed at different concentrations of Bicoid and thus at different positions in the embryo (Fig. 2C,D) (Driever and NussleinVolhard, 1988a; Driever and NussleinVolhard, 1988b; Driever and NussleinVolhard, 1989; Driever et al., 1989; Struhl et al., 1989; Hoch et al., 1991; Driever, 2004; Ephrussi and St Johnston, 2004; NussleinVolhard, 2004). Bicoid concentration is not the only information read by these target genes. Although altering the gene copy number of bicoid induces shifts in the boundaries of target gene expression domains, consistent with a concentrationdependent transcriptional response, the magnitude of these shifts is smaller than would be predicted from a concentration threshold mechanism alone (Driever and NussleinVolhard, 1988b; Struhl et al., 1989). Instead, the exact position of each boundary depends on the interactions that occur among the protein products of these target genes, several of which encode transcriptional repressors that are capable of repressing each other (Hoch and Jackle, 1993; Jaeger et al., 2004; Schroeder et al., 2004; OchoaEspinosa et al., 2005; OchoaEspinosa et al., 2009).
The Bicoid gradient was initially characterised using antibody staining and detection by enzymatic reaction, and images were acquired by brightfield microscopy (Driever and NussleinVolhard, 1988a). The concentration of Bicoid in nuclei decreases with the distance from the anterior pole and could be fitted to an exponential decay (Fig. 2E), an observation consistent with the possibility that the gradient arises by diffusion from its local source with spatially uniform degradation. Several models were subsequently proposed to explain the formation of the Bicoid gradient, which we discuss below.
Main models of Bicoid gradient formation
Central to all the models of Bicoid gradient formation is the constant synthesis of Bicoid protein at the anterior pole of the embryo (from an mRNA source that is either pointlike or graded), diffusion from its site of production, and its uniform degradation (Houchmandzadeh et al., 2002; Lander et al., 2002; Bergmann et al., 2007; Coppey et al., 2007; Bergmann et al., 2008; Spirov et al., 2009). Within each model, specific assumptions regarding the values of relevant variables (e.g. long or short lifetime) lead to differing interpretations of how the Bicoid gradient forms and whether and when it will be stabilised. The first two models rely solely on production, diffusion (see Box 2) and lifetime (see Box 3).
In mathematical terms, all models of Bicoid gradient formation rely on the following diffusion reaction differential equation, which is written for a semiinfinite onedimensional system (Rice, 1985):
Box 2. Diffusion coefficient
When a particle diffuses, its mean displacement is zero because diffusion is an isotropic process 〈it is uniform in all directions〉. However, its mean square displacement, which characterises the spread of the particle's position, is a linear function of time. The coefficient of proportionality is called the diffusion coefficient 〈the unit of which is length^{2}/time〉. A larger coefficient reflects a greater spread in position during a given time interval. The significance of an experimentally determined diffusion coefficient depends on how similar the probe used to measure this value is to the actual molecule of interest. For example, an inert probe, such as Dextran, will not localise to nuclei as Bicoid does.
In addition, the temporal window of observation can drastically influence an experimentally determined diffusion coefficient because the effective movement of a probe can include a mixture of mechanistically distinct transport events. On short time scales, such as a few seconds, measurements will reflect the movement of proteins within a cellular compartment. Over the course of minutes, measurements will include the effective diffusion within a compartment together with shuttling of molecules between neighbouring compartments. On the order of hours, molecular transport results from a number of events, including diffusion inside compartments, shuttling between them, and excursions into the deep cytoplasm and possibly the yolk. Moreover, a given time scale is intrinsically linked to a corresponding spatial scale, where short time scales correspond to short spatial scales and long time scales correspond to large spatial scales. Obviously, as the Bicoid gradient forms over a ~2 hour period the relevant diffusion coefficient should be measured on that time scale.
Given the heterogeneity and dynamics of the syncytial embryo, a modelling approach based on partial differential equations might appear to be inappropriate because this approach would need to be complemented by a large set of boundary conditions that describe the exchanges between compartments. Moreover, the parameters D, α and j(x,t) could be different for each compartment, and thus an analytical solution cannot be derived and only numerical simulations would be tractable.
However, the aim of the models presented in this review is to describe the establishment of the gradient over the time and length scales of the embryo without including all the details of the substructures. In this case, the parameters in equation (1) characterise the macroscopic properties of the syncytial embryo and are an average of the reaction and transport events, which include, for example, shuttling of Bicoid between cytoplasmic islands. An explicit formulation of the link between macroscopic parameters and detailed substructures can be obtained through the application of homogenisation routines (Sample and Shvartsman, 2010). The important point when using macroscopic parameters is that great care has to be taken when comparing the experimental values to the parameters of the model as they might not represent directly comparable quantities (see Box 2).
Model 1: the steadystate approximation
The most widely cited model (Wolpert, 1969; Crick, 1970) was formulated before the identification of Bicoid. Wolpert and Crick showed that a simple physical diffusion and reaction model from a localised source can account for most of the phenomenological observations of development known at the time, i.e. that a gradient could be established and naïve tissues patterned over time and length scales (the size of the patterned tissue) that are compatible with diffusion coefficients believed to be relevant for biologically active molecules such as proteins.
The observed exponentially decaying concentration gradient of Bicoid has been taken to mean that the Bicoid gradient arises by local constant protein synthesis, diffusion and spatially uniform degradation (referred to as the SDD model) (Driever and NussleinVolhard, 1988a; Houchmandzadeh et al., 2002). If the rate of protein degradation is high (i.e. if its lifetime is short) with respect to the time scale over which the gradient forms, then its distribution approaches a steady state because production is balanced by degradation. The consequence of the balance between synthesis and degradation is that both the total concentration of Bicoid in the embryo and the spatial concentration profile remain stable (Fig. 3A). Mathematically, at steady state, the diffusion reaction equation (1) becomes time independent and allows an exact solution. The concentration of Bicoid at a given position now depends only on the diffusion and lifetime of the protein.
A solution to equation (1) where the source is a point j(x,t)=Qδ(x)H(t) is obtained by setting the time derivative to zero ∂C(x,t)/∂t=0 (Rice, 1985) so that:
Box 3. Protein lifetime and steady state
The lifetime of a protein, that is, its characteristic survival time, sets a limit as to whether and when a steady state is reached. For firstorder chemical reactions, such as the unimolecular degradation scheme , the time evolution of the decaying chemical species is exponential: [A]〈t〉=[A]〈t=0〉exp〈–αt〉, where [A]〈t〉 is the concentration of the chemical species A, and α is the degradation rate 〈the amount of material being degraded per unit of time〉.
It is possible to define the lifetime τ as the mean survival time of a particle before it is degraded. For an exponential decay 〈as above〉, the lifetime is the inverse of the degradation rate: τ=1/α. Some studies refer to halflife h 〈the time at which the concentration of the chemical species has dropped to one half of its initial value〉 rather than to lifetime. The relation between lifetime and halflife is τ=hlog〈2〉.
The notion of steady state originates from the field of mathematics. The steady state is the timeindependent asymptotic solution of a differential equation. In reallife experimental conditions, a true steady state cannot be observed as it would require an infinite time of observation. However, most chemical systems relax towards their steady state exponentially quickly. This means that even if the steady state is never reached, an observed quantity can appear to be almost stable, and this stable state will be extremely close to the theoretical steady state. Therefore, an experimental quantity, such as the concentration of a protein, can be described as being at steady state if this quantity is within X% of its steadystate value (known theoretically), where X is a number that is defined arbitrarily.
With respect to the Bicoid gradient, numerous discussions have centred on whether or not the gradient is at steady state. Before asking this question, one should define what steady state means for such an experimentally observed quantity. Furthermore, the Bicoid gradient can appear to be stable without being close to any kind of steady state. Everything depends on what we see as being stable. For example, measurements of the nuclear concentrations of Bicoid may suggest that the total gradient does not change over the last five cell cycles and therefore that the gradient is at steady state. But, as we discuss below, this might or might not be true. It is worth pointing out
Box 4. The synthesis, diffusion and degradation (SDD) equation
The mathematical models presented in this review all rely on partial differential equations that describe Bicoid synthesis, diffusion, degradation and shuttling. The central quantity from which the equations are derived is the concentration of the protein: C〈x,t〉. This is the local concentration of Bicoid at position x at time t and follows equation 〈1〉. Below, we explain this equation.
Assuming given time intervals Δt and space intervals Δx, the equation takes the following form:
Model 2: the presteadystate hypothesis
Over the timecourse of the syncytial blastoderm (the first 3 hours of Drosophila development), the Bicoid gradient may never approach its steady state if the lifetime of Bicoid were long with respect to the time scale over which the gradient forms (Bergmann et al., 2007). The total levels of Bicoid would continue to rise, and target genes would see a continuously changing concentration profile (Fig. 3B). The reading of a dynamic gradient would be expected to cause a gradual posterior shift in the expression boundaries of target genes. Such a shift could be avoided if a precise timing mechanism determined a single point in development when the Bicoid gradient was utilised. In the model
Box 5. Length constant
The length constant 〈x〉, or characteristic length, is a quantitative estimate of the spread of a spatial profile 〈for example, of Bicoid molecules〉. Mathematically, it is defined as the mean position of a particle that makes up a concentration profile. Let C〈x〉 be a concentration profile of a molecular gradient in which all molecules are independent, meaning they do not interact and therefore do not influence the distribution of each other. The concentration profile can be expressed as:
The characteristic length, explicitly the first moment of a molecule's distribution, is then given by:
In the simple case of an exponentially decaying concentration profile C〈x〉=exp〈−x/λ〉, the characteristic length is the decay length of the exponent. Indeed, in this case, N_{tot}=λ, p〈x〉=1/λexp〈−x/λ〉, and the characteristic length is 〈x〉=1/λ∫exp〈−x/λ〉dx=λ. In other words, for an exponential profile, the length constant is the position along the AP axis at which the concentration of Bicoid has dropped to 1/exp 〈about onethird〉 of its maximal value at the anterior 〈see also Fig. 1E〉.
Experimentally, the length constant of the Bicoid gradient was principally obtained by fitting an exponential to the measured intensity profile 〈using an EGFPtagged form of Bicoid or antibody staining; see Box 1 for more information〉 to extract the length constant of the exponent 〈Fig. 2E〉. Ideally, the length constant is calculated by the integration of the spatial profile 〈x〉=∫xC〈x〉dx/∫C〈x′〉dx′ without assuming an exponentially decaying profile. Note that both methods of computing the length constant strongly depend on estimates of the background level of fluorescence.
Mathematically, the solution of equation (1) is now time dependent, meaning that the concentration of the morphogen at a given position is no longer stable but changes with time (Bergmann et al., 2007). As a consequence, both the total concentration and the spatial profile of Bicoid keep changing (Fig. 3B). Both changes become attenuated with increasing time, as can be seen graphically in Fig. 3B, when the curve symbolising the amount of morphogen at a given time asymptotically approaches a limit.
Model 2 is equivalent to model 1, but now the full timedependent solution of equation (1) is considered (Bergmann et al., 2007):
The length constant λ of the gradient is now dependent on diffusion, degradation and time, and it increases with time (unlike at steady state, where the length constant is time independent and
stable). The shape of the gradient is not a simple exponential and the mathematical expression of the length constant cannot be explicitly given. However, it can be approximated by λ(t)=√4Dt for times such that t<<1/α, whereas it tends to λ_{eq} for long time periods (Bergmann et al., 2007). At intermediate times, the length constant is always less than the length constant λ_{eff}(t) with no degradation (see model 3):
Model 3: the nuclear trapping model
During the period when the Bicoid gradient is being established in the early Drosophila embryo, the number of nuclei changes by three orders of magnitude. Because Bicoid, as a transcription factor, accumulates in nuclei, their changing number might affect the gradient locally and/or globally (Lander et al., 2002; Coppey et al., 2007). Such local and global effects have been detected for the gradient of phosphorylated MAPK, which is activated by the receptor tyrosine kinase Torso at the termini of the Drosophila embryo (Coppey et al., 2008) and could play a role in both shaping and stabilising the Bicoid gradient. In a simple view, the increasing number of nuclei could balance the increase in the total amount of Bicoid, resulting in a gradient in which nuclear concentrations at a given position remain stable even if Bicoid degradation is too low to balance synthesis or to achieve a stable total concentration. Target genes would read stable nuclear concentrations of Bicoid (as in the steadystate model) even though the total gradient is not at steady state. The most detailed version of this model (Coppey et al., 2007) assumes no degradation of Bicoid (Fig. 3C). It incorporates the constant production of Bicoid at the anterior, diffusion from this source, and the nucleocytoplasmic shuttling of Bicoid. Bicoid is presumed to be bound/immobile when in nuclei and free/diffusible when in the cytoplasm.
Mathematically, a term that describes nucleocytoplasmic shuttling of Bicoid is added to equation (1) (see Box 6). This term does not affect the total concentration of Bicoid in the embryo (which in this particular model depends only on production because there is no degradation) but it does affect the local distribution of Bicoid. As with the previous model, the solution is time dependent, yet the result of nucleocytoplasmic shuttling is that (given the right parameters) a stable, timeindependent state is mimicked.
Box 6. Nucleocytoplasmic shuttling
The physical properties of nucleocytoplasmic shuttling are of particular importance as they set the relationship between the concentration of Bicoid in the nuclei, where the developmental decision takes place, and its cytoplasmic concentration, where the gradient is established. The characteristic shuttling time is relatively fast compared with the duration of a nuclear cycle 〈1 minute versus 820 minutes〉 and allows the nuclear concentration, N〈x,t〉, to be expressed as a simple function of the cytoplasmic concentration C〈x,t〉:
Eventually, the mathematical quantity that best reflects the experimentally measured Bicoid concentration per nucleus, n〈x,t〉, is obtained by dividing the modelled nuclear concentration by the nuclear density ρ: n〈x,t〉=N〈x,t〉/ρ. Assuming that the nuclear import k_{in} is proportional to the nuclear density, n〈x,t〉 would be proportional to C_{tot}〈x,t〉/1+K, so that the increase in C_{tot}〈x,t〉 due to continuous production of Bicoid could be balanced by the increase of K.
The varying total Bicoid concentration has the following timedependent expression (Rice, 1985):
In this model, there is no steady state and total Bicoid concentration increases infinitely (Fig. 3C).
Model 4: a gradient of bicoid mRNA
The RNA that encodes the Bicoid protein is synthesized during oogenesis and remains tightly localised to the anterior pole of the egg until fertilisation (St Johnston et al., 1989; St Johnston et al., 1991). At that point, it assumes a more diffuse distribution (Weil et al., 2008) and its translation is presumed to begin. Model 4 incorporates a graded distribution of the bicoid RNA source (in contrast to the first three models, see Fig. 1F) (Spirov et al., 2009). How the RNA distribution affects the protein gradient will depend on the distribution of the source RNA and the diffusion/degradation parameters of the protein. If the length constant √D/α of the protein is very low compared with the length scale of the mRNA source (slow diffusion and short lifetime), then Bicoid will accumulate where it is synthesized and the protein distribution will mirror the source RNA distribution (Spirov et al., 2009). However, if √D/α is very large compared with the length scale of the source (if there is no degradation and/or a large diffusion coefficient), then Bicoid will accumulate in the embryo exactly as it would for a pointlike source.
As all the mRNA is provided by the mother, if the mRNA diffuses from its initial site at the anterior tip, then it will result in a Gaussian rather than exponential distribution (because there is no production of mRNA). Then, the source term of equation (1) is:
In conclusion, although all models assume constant production of Bicoid, its diffusion and spatially uniform degradation, and, in one case, nucleocytoplasmic shuttling, the difference in parameter choice (e.g. long or short lifetime) yields Bicoid gradients with very different spatiotemporal characteristics, which would then affect the subsequent interpretation of the gradient by its target genes.
Discussion of the models
The length scale of the gradient and diffusion
The length scale has been the most experimentally measured parameter of the Bicoid gradient (Houchmandzadeh et al., 2002; Crauk and Dostatni, 2005; Gregor et al., 2005; Gregor et al., 2007a; Gregor et al., 2007b; He et al., 2008), as it represents the most obvious quantitative characteristic of the gradient and it is easy to measure both in fixed and living embryos (Fig. 2E). Experimental studies have shown that the length constant λ_{obs} of the exponential fit of the gradient (Fig. 2E, Box 5) is ~100 μm over the 500 μm total egg length.
In all models, the length scale of the Bicoid gradient depends critically on the diffusivity of Bicoid protein. The larger the diffusion coefficient, the more Bicoid reaches positions far from its source. However, the diffusion coefficient that was measured in live embryos is ~0.3 μm^{2}/second. With this value of diffusivity, models 13 cannot explain the experimentally observed length scale of the gradient, as explained below.
The steadystate model (model 1) predicts a length constant of λ_{eq}=√D/α. Given the low diffusivity of Bicoid protein, for the gradient to achieve 95% of the theoretical steady state after 90 minutes (which corresponds to cycle 10, when nuclear stability is first observed) [Q(t)=1–e^{t/τ}, then 0.95Q=1–e^{90/30}], the lifetime of Bicoid would have to be less than 30 minutes (α≥5.5×10^{−4} seconds). With such numerical values, the maximal length scale of the gradient λ_{eq} would be less than 25 μm, a value far from the observed 100 μm (see Fig. 3D).
The nonstationary models (models 2 and 3), in which the lifetime of Bicoid is assumed to be long or infinite and a steady state is not approached, predict a length constant λ_{eff}(t)=2√4Dt/3√π, which gives a maximal length scale of 35 μm when the considered time is maximal, namely at cycle 14 (t=7200 seconds). At shorter times (after 90 minutes at cycle 10, when nuclear stability is already observed) t=4800 seconds and the length scale is even smaller at λ_{eff}(t)=28 μm.
For the RNA gradient model (model 4), the situation is different as the length scale of the mRNA source can play a major role in setting the final length scale of the protein gradient. A recent analysis of the mRNA gradient suggests a value of λ_{RNA}/L≈0.2 (L being the length of the embryo along the AP axis) at cycle 14 (Spirov et al., 2009), which corresponds to λ_{RNA}≈100 μm and is sufficient by itself to explain the length constant of the protein. However, for the formation of the Bicoid gradient, the 2 hours prior to cycle 14 matter the most. During this time, the length scale of the RNA source is ~50 μm (observed for early cycles 912) and is probably less than this at earlier times. If the RNA distribution has a length constant of 50 μm, then the gain in the length scale of the protein gradient due to diffusion would be a mere 20 μm, which is insufficient to explain the actual protein distribution.
In conclusion, given the measured length constant and diffusion coefficient, none of the models predicts the gradient as it is actually observed, although an RNA gradient model is potentially more compatible with the observed low diffusion coefficient.
Stability of nuclear concentration
By the time nuclei reach the surface of the embryo during cycle 10 (90 minutes after fertilisation), until cycle 14, the nuclear concentration of Bicoid at a given position remains stable (Gregor et al., 2007a). How can nuclear stability be achieved when nuclei continue proliferating (from cycle 10 to 14 the number of nuclei increases by a factor of 16)?
In the steadystate model (model 1), the total amount of Bicoid in the embryo is fixed at the same time as its spatial concentration profile (Fig. 3A). Under these conditions, stable nuclear concentration can be achieved if nuclear import is proportional to nuclear volume and if the nuclear Bicoid represents only a small fraction of the total pool in the cytoplasm. This is probably the case during cycles 10 through 12, but by cycle 1314 nuclei begin to represent a significant fraction of the cortical volume.
In the presteadystate models (models 2 and 3, Fig. 3B,C), the total concentration of Bicoid in the embryo is predicted to rise and the total gradient continues to change its shape. A balance between total Bicoid levels and the number of nuclei can be achieved only if the total nuclear volume increases linearly with time, assuming that rates of synthesis are constant and accumulation is linear. Although the number of nuclei increases exponentially, their size decreases with each cycle (Gregor et al., 2007a), such that the total increase in nuclear volume is close to linear from cycles 1013. However, it is unclear how the total amount and spatial distribution of Bicoid protein change over time.
The RNA gradient model (model 4) predicts a short lifetime. In this model, to obtain nuclear stability, the same conditions as those for the steadystate model (model 1) apply: the increasing nuclear volume samples only a small fraction of a constant protein gradient.
In conclusion, to determine which of the models might satisfy the nuclear stability condition we need to know additional parameters, namely the total concentration of Bicoid protein in the egg, its rates of nuclear import and the lifetime of the protein.
Scaling of the gradient
The Bicoid gradient in eggs of different sizes scales such that the length constant is larger in longer eggs (Fig. 1H) (Gregor et al., 2005). It is not obvious how this can be achieved.
One possible mechanism for scaling in model 1, where the length constant is λ_{eq}=√D/α, is nuclear degradation of Bicoid. If degradation takes place inside (and only inside) nuclei, then the rate of degradation α would be proportional to the nuclear density ρ. This nuclear density is given by ρ=N/L^{2}, where N is the number of nuclei and L the length of the embryo. Because the number of nuclei remains the same in embryos of different lengths, the degradation rate would be inversely proportional to L^{2}, so that the length constant would be proportional to L, thus achieving scaling.
However, scaling that is mediated by nuclear degradation seems to be in conflict with the stability of the nuclear gradient during successive cleavage divisions if increasing numbers of nuclei increased the overall rate of degradation. The length constant of the gradient λ_{eq}=√D/α(ρ) would change at each cycle with respect to the nuclear density. Thus, scaling and stability of the gradient cannot be achieved simultaneously in the steadystate model (model 1).
As noted above, for the gradient to scale, the length constant of the gradient has to be a function of egg length. In the presteadystate models, the length constant is time dependent. If the time at which the gradient is utilised were dependent on the length of the embryo, then the gradient would not scale but the pattern of its responses would. To determine whether such a timing mechanism could account for scaling requires a detailed analysis of the developmental timing in different species. As a formal alternative, the embryo could alter the effective diffusion coefficient to scale with egg length. Finally, if the RNA gradient scales with embryo length, then the protein gradient would do so as well. Any potential mechanism(s) responsible for scaling of either the RNA gradient or the effective diffusion coefficient remain unknown.
The unknowns
As discussed above, none of these models can satisfy all of the criteria listed in Box 1. To understand the formation of the Bicoid gradient, we need to add new parameters to these models and to reevaluate some of the previously measured parameters, as we discuss below.
How does Bicoid move?
All measurements of Bicoid diffusivity so far have resulted in values (D≈0.3 μm^{2}/second) (Gregor et al., 2007a) that are an order of magnitude too small to explain the length scale of the gradient. This is a problem for all except the RNA gradient model (Spirov et al., 2009), in which the distribution of Bicoid protein is controlled by the as yet unmeasured diffusion of the RNA. So how does Bicoid get to where we see it?
With certainty, these measurements show that there is a fraction of cytoplasmic Bicoid that diffuses slowly. But one cannot exclude several additional uncertainties. (1) There could be another fraction of cytoplasmic Bicoid that diffuses much faster. All recovery curves from photobleaching experiments start at a level that seems to be much higher than the background, suggesting that a fraction of Bicoid is already reequilibrated. (2) Fast and effective diffusion events may be visible over longer time scales, perhaps owing to bulk movement or to the ‘stirring’ of the cytoplasm during cleavage divisions (Hecht et al., 2009). (3) The diffusion coefficient might be much higher in other compartments of the embryo; for example, within the centre of the embryo where diffusion cannot be observed. (4) Bicoid might diffuse faster during early cycles (cycles 19), a time during which the Bicoid protein cannot be seen by live imaging, and therefore the diffusion coefficient of Bicoid cannot be measured directly.
How much Bicoid is there?
One prediction of the steadystate model (model 1) is that the total amount of Bicoid in the embryo does not change with time (Fig. 3A). If we knew this value, we could distinguish between the gradient at steady state (constant amount) (Fig. 3A) or at presteady state (increasing amount) (Fig. 3B,C). The optical approach (using confocal or twophoton microscopes) only tells us about the surface concentration of Bicoid (which keeps rising and is therefore consistent with the presteadystate models), and as we do not know how Bicoid distributes in the various compartments, we cannot infer its total amount in the embryo.
How is Bicoid distributed in each compartment at any stage?
Given the total amount of Bicoid at a given time, we need to know how this amount is partitioned between the physical structures of the embryo. There are several compartments in the syncytial embryo, which are dynamic during the first 3 hours of development when the Bicoid gradient is established (Mavrakis et al., 2009a; Mavrakis et al., 2009b). We do not know the exact volumes of these compartments, except for the nuclei (which are well characterised with respect to their diameter) (Gregor et al., 2007a), and therefore we cannot infer the total amounts of Bicoid in each compartment and have to use relative concentrations. But the geometry and physical properties of these compartments strongly influence the shuttling of Bicoid between them and, as a consequence, the overall transport of Bicoid on the scale of the embryo, as well as its nuclear localisation.
Lifetime
Another way of distinguishing between the steadystate and presteadystate models requires us to know the lifetime of Bicoid protein. If the lifetime is short with respect to developmental time scales, then the gradient would approach a steady state (Fig. 3A), whereas with a long or infinite lifetime the gradient would keep changing (Fig. 3B,C).
In addition, the degradation of Bicoid sets a limit to how far the protein can move over a given time. In mathematical terms, lifetime and diffusion control the length scale of the gradient (λ=√Dτ). The higher the degradation rate, the sharper the profile of the resulting gradient will be.
RNA localisation
Although bicoid RNA is tightly bound to the anterior cortex of the developing oocyte during Drosophila oogenesis, it has been known for some time that its localisation becomes more diffuse after fertilisation (St Johnston et al., 1989). Part of this displacement involves microtubular transport (Weil et al., 2008). How this dispersion affects the ultimate distribution of Bicoid protein is still a matter of some controversy. Two issues seem important here. First, as the nuclear protein gradient is stable from cycle 10/11, the relevant RNA distribution must precede that point. Second, Bicoid protein is detected in nuclei at the surface and in the centre of the egg. Both these issues point to the importance of measuring RNA distribution early during development, prior to cycle 10 when nuclei are not yet at the surface. So far, published quantifications of RNA have been performed by wholemount in situ hybridisation, from which only the surface level of bicoid mRNA can be reliably measured (Spirov et al., 2009).
Production
Translation of Bicoid protein is assumed to be constant during the first 3 hours of development, when the gradient is established. The major argument supporting constant production is the fact that bicoid mRNA is stable for the first 3 hours, before it starts to decay (Surdej and JacobsLorena, 1998). However, the length of the bicoid poly(A) tail is dynamic, reaching a maximum length at ~1.5 hours of development, suggesting that Bicoid protein production might not be constant (Salles et al., 1994). In addition, Bicoid protein is undetectable by western blotting in embryos during the first hour of development (Driever and NussleinVolhard, 1988a), again suggesting that protein production is not constant. This evidence is indirect, however, and we need a direct measurement of the rate of Bicoid synthesis.
If production turns out to be nonlinear, all the models we have discussed (which rely on the constant production assumption) become much more complicated.
Nuclear transport
Nuclear import and export determine the relative cytoplasmic and nuclear concentrations of Bicoid. The experimental observations of this are mainly based on the measured nuclear concentrations of Bicoid. However, the models we discuss predict total Bicoid concentrations. One way to compare the predicted and measured values is by assessing the nuclear transport rates of Bicoid.
At steady state, the relative nuclear and cytoplasmic Bicoid concentrations reveal the import to export ratio, K. If K remains constant over nuclear cycles, then the Bicoid concentration per nucleus reflects the cytoplasmic Bicoid concentration in that these values will differ only by a proportionality constant. By contrast, if K changes with time, then there will be a timedependent filter between the cytoplasmic and nuclear concentrations.
K can be a function of the nuclear volume, the nuclear surface or the number of nuclei. First, nuclear volume could dictate this ratio if the import rate were controlled by a firstorder chemical reaction between cytoplasmic Bicoid proteins and the nuclear binding sites. Second, if the import rate were limited by the diffusive capture of cytoplasmic Bicoid by the nucleus (the time it takes to reach the nuclear surface) or by the number of nuclear pores present on the nuclear membrane, K would be a linear function of the nuclear surface area. Third, if nuclear accumulation were controlled by the nonspecific binding of Bicoid to DNA and not by the time needed to enter the nucleus, then K would be proportional to the amount of DNA and thus to the number of nuclei.
The volume of the cytoplasm will be the most relevant physical factor determining the value of K. Potentially, subcompartmentalisation of the cytoplasm could also be a critical feature; for example, if the shuttling of Bicoid within or between cytoplasmic islands reflects an additional set of importexport rates.
When is the gradient utilised?
The transcription of Bicoid target genes has been observed as early as cycle 10 (Knipple et al., 1985; Tautz et al., 1987; Jaeger et al., 2007). At this stage, for certain Bicoid lifetime values, the gradient might be far from its steady state (Fig. 3B,C). If the relevant reading of the Bicoid gradient by target genes occurs at cycle 10, as has been suggested by Bergmann et al. (Bergmann et al., 2007), they would have to read a dynamic gradient and a precise timing mechanism becomes crucial. However, if target genes read the Bicoid gradient only in cycle 14, when it is closer to its steady state, there would be no need for such a mechanism. Therefore, it would be interesting to determine when the Bicoid gradient is actually utilised, or, more precisely, when does the relevant interpretation of the Bicoid gradient occur? For example, Bicoid might induce a low level of hunchback and of other gap gene expression at cycle 10, but if the gradient is read continuously in nuclei between cycles 10 and 14, only the highlevel response at cycle 14 might be relevant for the next step in patterning and in the establishment of pairrule gene expression.
The temporal requirement for Bicoid has been tested by shifting a temperaturesensitive bicoid allele (called E3) to the restrictive temperature at various developmental stages. Starting at polecell formation (cycle 9), Drosophila embryos display a high sensitivity to the temperature shift, and this sensitivity lasts until cycle 14, suggesting that Bicoid does matter as early as cycle 9/10 (Frohnhoefer and NuessleinVolhard, 1987). In another experiment, anterior cytoplasm (which contains Bicoid) was transferred to hosts of different stages. A strong response (scored by phenotype) was only induced in hosts of ~1.5 hours of age (corresponding approximately to cycle 10), and the strength of the response fell sharply afterwards (Frohnhoefer and NuessleinVolhard, 1986).
Lastly, the Bicoid gradient has been perturbed using a microfluidic device, which allows the embryo to be exposed to large temperature differences. Here, the critical time for Bicoiddependent patterning was found to be between 65 and 100 minutes of development, corresponding to cycles 811 (Lucchetta et al., 2005). Together, these experiments support the idea that Bicoid is needed early, around cycle 10/11, consistent with the model proposed by Bergmann et al. (Bergmann et al., 2007). However, if the Bicoid concentration were measured only at cycle 10 and then stably ‘remembered’ during the subsequent proliferation of nuclei, it is hard to explain the tight correlation that is observed between Bicoid and Hunchback levels at cycle 14, because under these circumstances, despite the smooth variation in Bicoid, Hunchback would be expected to show a coarse, patchy distribution in nuclei with similar cycle10 lineages. The similarly graded distribution of both proteins that is actually observed at cycle 14 (Gregor et al., 2007b) suggests that the Hunchback response continues to be adjusted throughout the syncytial cycles (Gregor et al., 2007b).
Are there different Bicoids?
The models assume the presence of a single Bicoid activity in the embryo, which might not be correct. Western blots demonstrate the presence of differently modified forms of Bicoid in the embryo (Driever and NussleinVolhard, 1988a; Ronchi et al., 1993; Janody et al., 2000), and each modification might alter its specific activity. Phosphorylation by the receptor tyrosine kinase Torso affects the transcriptional activity of Bicoid (Ronchi et al., 1993; Janody et al., 2000). Bicoid might be modified by other pathways; removal of a sumoylation enzyme affects its nuclear localisation (Epps and Tanda, 1998), but it is not known whether Bicoid is a direct target of this modification. Bicoid eventually decays, and it is tempting to speculate that this occurs by the ubiquitin proteasome pathway. So even if the total amount of Bicoid were consistent with one or all of the models, the population of Bicoid that matters for providing positional information, i.e. that of the transcriptionally active form, might support a different model. Also, different forms of Bicoid might move at different rates through the embryo. So it will be important to learn more about these modifications and their consequences, and to adjust the models of gradient formation accordingly.
Comparison with other morphogen gradients
For three other morphogen gradients – Dpp and Wg in the fly wing disk (Kicheva et al., 2007) and Fgf8 in the zebrafish embryo (Yu et al., 2009) – the measured key physical parameters, i.e. the diffusion coefficient and the lifetime, are consistent with the SDD model (in which the constant synthesis, diffusion and spatially uniform degradation of the morphogen is assumed). Notably, both Dpp and Wg diffuse at roughly similar rates as Bicoid (<1 μm^{2}/second), whereas the diffusion coefficient of Fgf8 in the zebrafish embryo is two orders of magnitude higher, similar to the diffusion of a GFPsized protein in water (Swaminathan et al., 1997; Petrasek and Schwille, 2008). The high Fgf8 diffusion coefficient may reflect properties of the extracellular space in zebrafish (which might, for example, feature fewer binding sites for Fgf8 or low viscosity) or it might be an effect of the measurement method used [fluorescence correlation spectroscopy (FCS)]. In all cases, the lifetimes of these morphogens were found to be short enough for the gradient to approach steady state, given the time available for the formation of these gradients (τ_{Fgf8}≈30 minutes, τ_{Dpp}≈60 minutes, τ_{Wg}≈15 minutes) (Kicheva et al., 2007; Yu et al., 2009).
In the early Drosophila embryo, two other morphogen gradients, Dl and MAPK, form within the same environment as the Bicoid gradient, yet the resulting gradients differ in important aspects (DeLotto et al., 2007; Coppey et al., 2008; Kanodia et al., 2009). Although the Bicoid gradient retains both its shape and amplitude (when nuclear Bicoid is observed), the MAPK gradient sharpens with time while retaining its amplitude, and the Dl gradient increases its amplitude and keeps its shape (DeLotto et al., 2007; Coppey et al., 2008; Kanodia et al., 2009). The time at which each morphogen gradient is established is thought to play a role in their different characteristics (Bicoid is assumed to reach its final shape before nuclei reach the surface of the embryo and before nuclei constitute a significant fraction of the embryonic volume), as are the lifetime of the morphogen (the short lifetime of active MAPK) and the spatial extent of the sources that activate Dl (broad source) and MAPK (restricted source) (Kanodia et al., 2009).
Conclusion
Several models have been proposed to account for the establishment of the Bicoid gradient in the Drosophila embryo. However, none of those that we have discussed is fully consistent with all available experimental observations; each can only partially account for and explain experimental findings on the Bicoid gradient. Thus, one can assert that there is no consensus on the biophysical mechanisms that underpin Bicoid gradient formation and there is still much to understand about this process. Beyond this fact, we have shown that these simple models raise fundamental questions that highlight the need for new quantitative experiments. It is very likely that this constructive interplay between models and experiments will soon lead to a unified picture of Bicoid gradient formation. In particular, whether or not future models will be more complex with regard to the number of biophysical parameters that they include, there must be a simple ‘coarsegrained’ model that can explain the main features of the dynamics of Bicoid and its gradient based on experimentally measured parameters.
Acknowledgements
We are grateful to Stas Shvartsman, Stefano Di Talia, Shawn Little and other members of the E.W. laboratory for helpful suggestions and discussions. M.C. was supported by the Human Frontier Science Program. This work was supported by the Howard Hughes Medical Institute and the NIH. Deposited in PMC for release after 12 months.
Footnotes

Competing interests statement
The authors declare no competing financial interests.