Lianxing Wen -> Research -> Geodynamics -> Residual Tomography


Slabs, hotspots, cratons and mantle convection revealed from residual seismic tomography in the upper mantle

abstract

Upper mantle seismic tomography correlates with tectonic features on the surface of the Earth. Cratonic "roots", thickening oceanic plates and subducting slabs are the first order contributors to the tomography, at least above about 400 km depth. In order to further investigate the fate of slabs, the structure of hotspots and the style of mantle convection, we calculate residual maps by excluding from the tomography the first order effects of conductive cooling of oceanic plates, deep craton "roots", and partial melting or cooling caused by subducted lithosphere. The good correlations between residual tomography in the transition zone (400-650 km) with 0-30 Ma subduction, at degree l=2 can be explained by slab accumulation in this region. The correlations between residual tomography in the transition region and the subduction during earlier periods are poor. This may indicate that slabs reside near the 670 km discontinuity for only a certain period of time. Hotspots correlate with the residual tomography in the shallow mantle (100-400 km). Correlations decrease rapidly with depth. Good correlations occur at degree l=2, between hotspots and residual tomography in the shallow mantle and between hotspots and tomography in the lowermost mantle. Various correlations suggest that the correlations between hotspots and the lowermost mantle are intriguing. At degree 6, cratons correlate with residual topography, hotspots, and upper mantle seismic tomography. Cratonic "roots" may affect, or modulate, upper mantle convection. Subduction may affect the locations of both upwellings and downwelling, and may control mantle convection. The 670 km discontinuity may not be the place where long-lived mantle convective stratification takes place. Mantle convection may be decorrelated closer to 900 km, near a recently rediscovered mantle discontinuity.

Introduction

The relationship between subduction and seismic tomography has been studied widely. Richards and Engebretson (1992) interpreted the good correlations between the large-scale seismic heterogeneity , averaged over the whole lower mantle, and subduction during the Cenozoic and Mesozoic as the result of the cooling effects of the subduction. Scrivner and Anderson (1992) correlated subduction positions since the breakup of Pangea, with seismic tomography depth by depth throughout the whole mantle. They found correlations in the transition zone region. Ray and Anderson (1994) found good correlations between integrated slab locations since Pangea breakup and fast velocities in the depth range 220-1022 km. Wen and Anderson (1995) quantified the slab flux by estimating the subducted volume in the hotspot reference frame and correlated it with seismic tomography throughout the mantle. They found significant correlations in the depth interval 900-1100 km and attributed these to the accumulation of subducted lithosphere in this region. Correlations were also found in the upper mantle and transition zone for recent subduction. The relationship between hotspots and seismic tomography has also been investigated. Excellent correlations at degree l=2 were found in the lower mantle and, at degree l=6 in the upper mantle (Richards et al., 1988; Cazenave, et al., 1989; Kedar et al., 1992). Cazenave et al. (1989) interpreted their results in terms of degree 2 convection in the lower mantle and degree 6 dominated convection in the upper mantle. Richards et al. (1988) hypothesized that hotspots originated in the deep lower mantle, based on the good correlations at degree 2, whereas Montagner and Romanowicz (1993) speculated that hotspots came from the transition zone, based on the dramatic decrease in the correlation below the transition zone, at degree 6. Ray and Anderson (1994) pointed out that hotspot locations were no better correlated with lower mantle tomography than were ridge locations. Subduction history provides a powerful constrain on the fate of the slabs in the deep mantle, particularly as seismic tomography provides more realistic images of the interior of the Earth. One of the difficulties in relating tomography with subduction and hotspots in the upper mantle, is the complexity of the tomography in this region. There are large contributions from the cooling of the oceanic plates and craton "roots" and this makes it difficult to get meanful results from the hotspot and slab correlations with the seismic tomography. The role of deep craton "roots" in the upper mantle has been extensively discussed (Lerner-Lam and Jordan, 1987; Hara and Geller, 1994; Polet and Anderson, 1995). The contribution from downgoing slabs or possible stagnant slabs, may also prevent one from correctly relating the seismic tomography to hotspots. Some information about mantle convection may be revealed by studying the correlation between surface tectonic features and geophysical observables (e.g. topography, hotspots etc.). In this paper we construct residual tomography for the upper mantle by excluding effects from conductive cooling of oceanic plates, craton "roots", and the partial melting or cooling induced by downgoing slabs. The residual tomography is compared with the pattern of subduction history in the past 130 Ma (Wen and Anderson, 1995) and with the distribution of hotspots. The procedures for excluding these effects are given in the first section. The correlation between residual tomography and past subduction is presented in the second section. In the third section we use residual tomography to help constrain the origin of hotspots. In the fourth section, the degree 6 significance in geodynamics is discussed. Finally, the style of mantle convection will be discussed.

Residual upper mantle tomographic models

The first order contributors to seismic tomography in the upper mantle are assumed to be oceanic plates, craton "roots", and on-going subduction. Other possible contributors, such as hotspots, stagnant slabs, and small scale mantle convection are assumed to be second order. If they are important, they may show up in the residual maps. The residual tomography is defined as the seismic tomography excluding the effects from those first order contributors.

Oceanic Plates

Our first job is to remove from the tomographic models the effects of the cooling oceanic plates. Velocity heterogeneities can be related to the temperature of the mantle by the temperature derivatives for the minerals in the mantle. The temperature distribution beneath oceans can be calculated from the age of oceanic lithosphere and thermal cooling models (de Jonge et al., 1994; Nataf and Ricard, 1995). We use the digital age map of M\"{u}ller et al. (1993). Several thermal models have been established in order to explain the bathymetry and heat flow in the ocean. Of particular importance are the half space cooling model (Turcotte and Schubert, 1982) and plate model ( Parsons and McKenzie, 1978). The plate model gives an adequate fit to these observations (e.g. Stein and Stein, 1992). However, the flattening of the oceanic age-depth relation for sea floor older than 70 Ma may be attributed to other effects, such as hotspots (Heestand and Crough, 1981; Schoreder, 1984) and other geodynamic process (Davies, 1988a, b; Cazenave and Lago, 1991; Morgan and Smith, 1992). Recent seismic studies also indicate that the oceanic lithosphere continues to cool after 70 Ma (e.g. Zhang and Tanimoto, 1991; Woodward and Masters, 1991). 3D global seismic tomography can also be used to constrain the oceanic thermal model. Fig. 1 shows the velocity perturbation variation with the age of the oceanic lithosphere at depths from 100 to 650 km, based on the seismic tomographic model SH12WM13 ( Su et al., 1994). The velocity perturbations are calculated by averaging the velocity perturbation along the positions corresponding to each isochron on the surface of the Earth. The velocity perturbation vs. age curves at different depths have the same characteristics; they are flat over a certain period of age, then increase rapidly with age. The "turning points" of these curves are strongly dependent on the thermal diffusivity. The traditional value of diffusivity used in the geodynamical literature is about $\kappa=1.0 mm^{2}s^{-1}$. With this value, the conduction is primarily confined to about the top 150 km of the mantle. Based on the tomographic models, cooling may extend deeper. The characteristics of the velocity-age curves at 200-450 km depth indicate that the cooling, possibly due to conduction, is still happening at these depths. The "turning points" at 250-300 km are almost the same as that of 200 km. This indicates there might be a diffusivity "jump" in this region near the 210 km discontinuity, which is possibly a chemical boundary. The pressure also plays an important role. We assume, for our present purposes, that the tomographic models are "exact" and do not suffer from any smearing. We are trying to remove the near-surface effects than interpret them. The thermal diffusivity is very anisotropic for olivine. We use the mean diffusivity for olivine, $\kappa_{1}=1.65 mm^{2}s^{-1}$ (Kobayashi, 1974), from 0-210 km. The diffusivity below 210 km is assumed to be doubled, possibly due to pressure and chemical differences. A cooling model with one layer (0-210 km) over a half space is used to explain the velocity-age curves at various depths. The Appendix gives analytical solutions for the temperature distribution. There is still one unknown, the conductivity ratio between that of the layer and that of the half space. The temperature derivatives which relate the temperature perturbation to velocity perturbation are uncertain. The experimental results are also uncertain (Estey and Douglas, 1986; Karato, 1993). Pressure and chemical differences would make this parameter even more uncertain. The conductivity ratio controls the temperature distribution as well. Since our purpose is to find the best fit model and remove it rather than to find the conductivity ratio or temperature derivatives, We therefore make no effort in guessing these parameters in the mantle, but fix the conductivity ratio and and find the best temperature derivatives at various depths. The best fit velocity models, in the oceanic regions, are found by adjusting the temperature derivatives, at various depths, in order to minimizing the difference between the thermal velocity model and SH12WM13 in the oceanic regions. Fig. 1 also shows the predicted velocity-age relation and comparisons with the SH12WM13. Any deviations from this predicted model, based on the age of the oceanic lithosphere on the surface will be regarded as anomalies. The predicted models are expanded into spherical harmonics and are truncated at degree 12, in order to compare with SH12WM13. The first residual tomography (RES1) is obtained by excluding the oceanic plate component from the seismic tomography (SH12WM13). It should be mentioned that tomographic modeling often use simplified assumptions about near surface condition and there can also be a problem with vertical smearing. We ignore these complications.

Cratons

Polet and Anderson (1995) divided the cratons into two provinces, based on Sclater et al. (1981). Province 1 contains continents older than 1700 Ma (Archean and Early Proterozoic) and province 2 includes continents between 800 and 1700 Ma (Middle Proterozoic). We classify the cratons according to their geographic locations and ages. The cratons are divided into 13 groups; six for cratons between 800-1700 Ma, and seven for those older than 1700 Ma. Each group is related to its age and a major plate. For instance, cratons, between 800-1700 Ma, on the South American plate are placed in the same group. Cratons older than 1700 Ma in Eurasia fall into two groups. Based on the residual tomography (RES1), we calculate the average velocity perturbation beneath each group of cratons. Fig. 2 shows the velocity perturbations beneath each group of cratons vs. depth. The heavy lines are the velocity perturbations for cratons older than 1700 Ma. The light lines are for cratons between 800-1700 Ma. The contribution of craton "roots" to seismic velocity variation in spherical harmonic space is obtained by expanding the function, which, in cratonic regions for each group of craton, has the value of the average velocity perturbations from the residual tomography (RES1), and zero outside, into spherical harmonics. This set of spherical harmonic coefficients can be multiplied by an arbitrary constant ($ C $). This proportionality constant ($ C $) is the second parameter to be determined.

On-going Subduction

We assume that the subducting plates sink vertically into the upper mantle at the velocity of the plate at the trench. The ages of the slabs are reconstructed at every depth. We assume that the seismic velocity perturbation within the slab is constant ($\delta V_{s}$) at a certain depth. The width of a particular slab segment is equal to its thickness. Thickness is calculated from the age of the oceanic lithosphere at the time of subduction (Wen and Anderson, 1995). On-going subduction can cause low-velocities in the shallow mantle, because of volatile fluxed melting in the mantle wedge (Anderson et al., 1992), and high-velocities at greater depth due to low-temperatures in the slab. We permit the $\delta V_{s}$ to take on negative or positive values (negative values imply partial melting).

Residual tomographic Models

Synthetic Models are obtained by linear superposition of the contributions from slabs, oceanic plates, and cratons at various depths. The two parameters ($\delta V_{s}$ and $C$) at each depth are chosen by minimizing the quantity: $\sum \sqrt{V_{syn}^{2}-V_{tomo}^{2}}$ Where $V_{syn}$ and $V_{tomo}$ are the velocity perturbations of synthetic model and SH12WM13 at certain depth respectively. The summation is over every $1^{\circ} \times 1^{\circ}$ cell in a global grid. The residual models are obtained by subtracting synthetic tomography from SH12WM13. The residual models from 100-650 km depths are plotted in Fig. 3.

Implications from the correlations between residual tomographic models with the subduction history

The subduction history for the past 130 Ma has been reconstructed in the hotspot reference frame, based on plate tectonic models (Wen and Anderson, 1995). We correlate the residual tomographic models, at various depths in the upper mantle, with subduction history. Since no significance correlations are found at other degrees, only correlations at degree 2 and 6 are plotted in Fig. 4. Positive correlations mean that the subduction material corresponds with high velocities. The good correlations between 0-30 Ma subduction and residual tomography, in the transition region, may imply that some slabs are trapped in this region. There are no significant correlations between the 30-130 Ma subduction history and residual tomography at degree 2. Significant negative correlations are found for 60-90 Ma subduction in the transition region (400-650 km) at degree 6. These results confirm the previous results of Wen and Anderson (1995). Slabs subducted prior to 30 Ma may have sunk into the lower mantle. It is surprising that good correlations occur for the recently subducted slabs, because the on-going subduction has been subtracted out of the seismic tomography. One possibility is that we didn't subtract out the effects of on-going subduction efficiently. In order to check this possibility, we subtract an additional slab effect, in order to make the correlation coefficient, at degree l=2, between resultant residual tomography and 0-30 Ma subduction, just below the 50\% confidence level. The resultant residual tomography shows a very low velocity ring along the subduction zones except the Kurile, Japan, Izu-Bonin, Mariana, New Hebrides and Philippine trenchs. This suggests that the effects of on-going subduction have been removed efficiently and also reveals that the trapping of slabs at the 670 km discontinuity can be localized. Different slabs in different subduction zones may have different behaviors in the transition zone. Some slabs may be stopped at the 670 km discontinuity for a period of time, while some slabs may penetrate into the lower mantle. The residual tomography shows high velocity beneath the Kurile, Japan, Izu-Bonin, Mariana, New Hebrides and Philippine trenchs. This implies that the subducted slabs may accumulate beneath these trenches at the 670 km discontinuity. We test this possibility by dividing the subduction in the past 30 Ma into two groups. Group 1 contains only the 0-30 Ma subduction in the Kurile, Japan, Izu-Bonin, Mariana, New Hebrides and Philippine trenches; group 2 includes subduction in the other convergence regions (e.g. Aleutian, Chile-Peru, Tonga-Fiju, Java trenches etc). We found excellent correlations at degree 2 and 3 for group 1 subduction and no correlation for group 2 subduction (Table 1). Correlations which are significant at greater than 90 \% confidence level are underlined. The good correlations between the 0-30 Ma subduction and residual tomography can be explained by the accumulation of slabs beneath the Kurile, Japan, Izu-Bonin, Mariana, New Hebrides and Philippine trenches in the transition zone. It should be pointed out that we cannot resolve individual subducted slabs in this study. Slabs beneath group 1 trenches have relatively shallow dip (except Mariana). The degree 2 heterogeneity is the most important component for many geophysical observables, such as seismic velocity (e.g. Masters et al., 1982; Nakanishi and Anderson, 1983; Woodhouse and Dziewonski, 1989) and geoid (Lerch et al., 1979). However, the origin of this degree is still controversial. Scrivner and Anderson (1992) suggested that the degree 2 heterogeneity might be caused by the history of subduction since the breakup of Pangea. In particular they suggested the presence of stagnant slabs in the mesosphere, based on the good correlations between time-integrated slab positions and seismic tomography. Our analysis supports their conclusion. We subtract effect of the stagnant slabs from the residual tomography in the transition zone (450-650 km), assuming that stagnant slabs are only beneath group 1 trenches. Fig. 5 shows the power spectra of the seismic tomography (SH12WM13), cratonic roots, on-going subduction and stagnant slabs, at degree l=2. In the shallow mantle (above 200 km ), the oceanic lithosphere, cratons and subducting slabs contribute most of the power at degree 2. In the transition zone, stagnant slabs are apparently responsible for the degree 2 lateral variation. Seismology suggests that the behavior of the slabs at the 670 km discontinuity is highly variable (e.g. Jordan and Lynn, 1974; Zhou and Anderson, 1989; van der Hist et al., 1991; Fukao, et al., 1992). Our results indicate that, indeed, the behavior of the slab at the 670 km discontinuity is region- dependent. But the 670 km discontinuity may not be the final destination of the slabs based on the poor correlations between the residual tomography and 30-130 Ma subduction. Stagnant slabs may sink into the lower mantle after a period of time of accumulation. Geodynamical stimulations indicate that trench migration and the motion of overriding plates are closely related to the behavior of slabs at the 670 km discontinuity (Gurnis and Zhong, 1995). Gurnis and Zhong (1995) also indicated that the 670-km phase change does not significantly influence the ultimate ability of subducted slabs to penetrate into the lower mantle, assuming a homogeneous mantle.

Hotspots and mantle convection

Residual tomography, which excludes the near-surface features, may provide a constraint on convection in the mantle. Passive ridges are the first-order upwellings on the surface of the Earth. They possibly represent normal (uncooled) mantle. Hotspots are generally considered to be caused by deep, narrow, active upwellings. In this section, we use seismic tomography to constrain the characteristics of hotspots throughout the mantle. We use seismic tomography (SH12WM13) in the lower mantle and residual tomography in the upper mantle. The list of 47 hotspots, complied by Morgan (1981) and Crough and Jurdy (1980), is used. The overall distribution of hotspots correlates very well with low seismic velocities in the deep lower mantle (1700 km-CMB) at degree l=2. There is some correlation with l=3 as well in the lower mantle. However, the degree 2 correlations are poor in the depth region 700-1700 km (except 900-1000 km). This is consistent with the results of previous authors (e.g. Kedar et al., 1992). Table 2 gives correlation coefficients between hotspot distribution and residual tomography in the upper mantle. Positive values mean that hotspot positions favor low velocity regions. Correlations which are significant greater that 90 % confidence level are underlined. Hotspots correlate with slow seismic velocities at degree l=2 to 400 km depth and at degree l=4 to 150 km depth. The degree 2 correlations decrease very rapidly into the transition zone. Also, at l=4, hotspots correlate with fast velocities in part of the transition region ( 400-500 km) (Table 2). The good correlations between hotspots and low seismic velocities at degree l=2 in the top of the upper mantle have two possible interpretations. One interpretation is that the good correlations in the top 400 km and in the deep lower mantle may have the same cause. Residual tomography, which may be a reflection of the mantle convection pattern, has a pattern which is similar to the seismic tomography in the deep mantle. Hotspots may connect the thermal structures in the deep lower mantle and the shallow mantle. On the other hand, the good correlations at degree l=2, 4 between residual tomography and hotspots in the top 400 km, together with those at l=6 of tomography, hotspots and geoid (Tanimoto and Anderson, 1985; Cazenave and Thoraval, 1994) suggest that hotspots might originate in the top 400 km. The good correlations between hotspots and lower mantle heterogeneities at degree l=2 are intriguing, but are not necessarily evidence for a deep mantle origin for hotspots. It does suggest, however, that lower mantle convection may modulate, or control, upper mantle behavior, or vice versa. In order to check the correlations between hotspots and residual tomography, it is useful to discuss hotspots in the context of the three dimensional residual velocity structure of the upper mantle. Most of the hotspots in the Pacific and circum-Pacific area are in low velocity regions of the upper mantle. Some hotspots are also in low velocity regions in Africa, the Indian ocean and the North Atlantic. Most hotspots in the south Atlantic, around South America and near South Africa are in high velocity regions. The high velocities in some hotspots in Africa could be because that they are close to high-velocity cratonic "roots". This spatial visual check plus the good correlations at degree l=4 and some correlation at degree l=6 suggest that the correlations, at degree l=2, between hotspots and residual seismic tomography in the upper 400 km are meaningful. One way to check if the hotspot correlations with residual tomography are related to those in the deep mantle is to correlate the residual tomography with the seismic tomography in the deep lower mantle. Fig. 6 shows the correlations between seismic tomography at 2500 km and residual tomographic models, at degree l=2. No significant correlation is found. The correlation even becomes negative for residual tomography at 650 km. Tomographic models from 1700 km to CMB have similar correlations. However, residual tomographic models between 200-500 km depth correlate with the seismic tomography model in the 900-1000 km depth region. This is the only region in the lower mantle that correlates with residual tomography. Direct comparison among the degree 2 patterns of hotspots, residual tomography and lower mantle tomography is shown in Fig. 7. They all have low values in the central Pacific and Africa, but the residual tomography and hotspot highs (devoid of hotspots) extend to the south of South America and east of Asia. The lower mantle is affected by subduction. It is still not clear that the lowermost mantle tomography will correlate with the residual tomography and hotspots, when the subduction effects are excluded. The poor correlations between seismic tomography in the deep lower mantle and residual tomography, plus the cautions of Ray and Anderson (1994), suggest that the hotspot-lower mantle connection is intriguing. Other seismic models and other hotspot lists give different results (Kedar et al., 1992; Ray and Anderson, 1994). Scrivner and Anderson (1992) found strong negative correlations between hotspot positions and the 0-180 Ma slab locations. Hotspots do not originate in mantle that has been cooled or blocked by slab. Normal mantle, cooled by the subduction will, of course, correlate with the subduction history. But this mantle will also correlate with hotspots, even if only "normal" mantle is present. Downwellings in the deep lower mantle, whether they are caused by slab accumulations, or are indirectly related to present or past subduction, will make the seismic patterns in this region correlate with hotspots. Do slabs cool the mantle or do plumes heat the mantle, or both? Is normal mantle uncooled or not heated? The cause and the effect must still be disentangled.

Degree 6 in Geodynamics

The l=6 component is particularly important in geodynamics and tomography. Tanimoto and Anderson (1985) showed an excellent correlation between Love wave phase velocities and the geoid, suggesting that this part of the geoid originates in the upper mantle. Some hotspot lists exhibit a peak at l=6 (e.g. Richards, et al., 1988; Kedar et al., 1992). Residual topography at l=6 correlates very well with seismic velocities between 300 and 500 km depth (Cazenave and Thoraval, 1994). Highs of the degree 6 residual topography coincide with the south Pacific Superswell, the Afar region, the North Atlantic, the equatorial Atlantic and south of New Zealand (Cazenave and Thoraval, 1994). The l=6 hotspot map (Richards et al., 1988) is similar, as are l=6 upper mantle tomographic maps in the upper mantle. We have expanded our craton function into spherical harmonics. Fig. 8 shows the degree 6 map of the craton function, which has unit value in cratonic regions and zero outside. The l=6 expansion picks up most of the cratons, as might be expected (as fast regions) but also has low velocities in the North Atlantic, equatorial Atlantic, the Pacific superswell, south of New Zealand and the Afar; i.e., it looks very much like the l=6 hotspots and tomographic maps. This has several alternative explanations: \begin{enumerate} \item{Hotspots, swells and hotter-than-average mantle occur where there are no cratons and these regions are generally antipodal to cratons} \item{Hot upwelling mantle tends to drive cratons away and they settle in areas of colder mantle.} \item{l=6 convection is intrinsic to the upper mantle (Tackley et al., 1993) and cratons establish the phase of the pattern; upwellings occur in complementary locations.} \item{The pattern of convection in the mantle is controlled by cratons, and their associated subduction zones (and the history of subduction); hot upwellings are not particularly fundamental but occur in "normal" mantle.} \end{enumerate} Cratons probably have long-lived "roots" extending to about 200 km and associated high-velocity material, perhaps a thermal boundary layer, extends somewhat deeper. This is a large fraction of the depth of the upper mantle and the presence of thick cratons must influence mantle convection. In addition, a moving craton overrides cold oceanic lithosphere, placing a cold slab about 100 km thick under the craton. This cold dense downwelling also affects mantle convection, even after it settles "on the bottom." The upper mantle can be viewed as a system that is differentially cooled from below, cooled and dragged down at subduction zones and affected by lateral temperature gradients at the surface, with subcratonic isotherms at a deeper level than isotherms elsewhere, except in slabs. This general type of convection was treated by Pekeris (1935), Allan et al. (1967) and King and Anderson (1994). It is quite distinct from the Rayleigh-B\'{e}nard convection in a fluid system heated from below which forms the basis for most discussions of mantle convection. Of course, the mantle is, to some extent, heated from below and from within, and all these factors must be considered. The correlations discussed here and by Cazenave and Thoraval (1994) suggest that swells and high-temperature mantle are part of a global circulation pattern, rather than localized plumes that are independent of plate tectonics and background mantle convection. Previous authors have suggested that degree 6 is a dominant wavelength of plume formation. Our results, plus those of Tackley et al. (1993), suggest that l=6 convection could be intrinsic to the background flow, modulated by surface boundary conditions. It is of interest that two of the most intense thermal anomalies on Earth (as judged by tomographic maps) are in the Afar and South Pacific superswell regions. These are also two of the "anti-craton" maxima on the l=6 craton map. These maxima are even more intense if the actual tomographic craton function are expanded (i.e. giving different weights to the cratons, depending on the magnitude of their seismic anomalies). These l=6 maxima are in phase with, and reinforced by, the l=2 pattern. Other anti-craton maxima are in the western Pacific, the south equatorial Atlantic, the region between New Zealand, Australia and Antarctica, the North Atlantic, the central Indian Ocean, Asia, and the NE Pacific. These are areas that, in general, are not cratons (except Siberia) and often not subduction zones (except Chile-Peru, New Zealand and Sumatra). This again suggests that the intermediate wavelength convective field in the mantle, typified by l=6, whose pattern may be controlled by surface (cratons) and edge (subduction) boundary conditions, is controlling the locations of features which have been attributed to local and independent deep mantle plumes from an unstable lower thermal boundary layer.

The stratification of mantle convection

The thick craton "roots" might have strong feedback on mantle flow, even if they don't have a major thermal effect on driving the flow. However, subduction is probably the most important control on mantle convection. Mantle convection can be viewd as a system, which is, at first order, mainly cooled by the subduction inside and modulated by the thick craton "roots" in the top of the mantle. That only some slabs are trapped at the 670-km discontinuity for a period of time indicates that the 670-km phase change may not preclude penetrative convection. However, this doesn't mean that we are ruling out layered mantle convection. Evidence for a deeper boundary has accumulated. The degree 2 patterns of the shallow and deep mantle seismic tomography are spatially shifted across a depth of about 800 km (Tanimoto, 1990). Kawakatsu and Niu (1994) presented seismic evidence for a 920 km discontinuity. Ritzwoller and Lavely (1995) showed that there is a significant decorrelation of mantle tomography at a depth of about 1000 km. Wen and Anderson (1995) found good correlations between the subducted slabs and seismic tomography in the 900-1100 depth region. The mantle does not become radially homogeneous and adiabatic until about 800 km depth (Dziewonski and Anderson, 1981). The velocity discontinuity at 670 km is primarily due to a phase change and the chemical change at this depth may be small. If so, the endothermic nature of the phase change may delay slab penetration but cannot prevent large masses of cold material from episodically cascading across the boundary (Honda et al., 1993; Tackley et al., 1993). Chemical discontinuities in the mantle are harder to detect if they are not associated with changes in mineralogy. In a convecting mantle chemical interfaces will tend to have high relief and will therefore be difficult to detect by standard seismological means. However, a small change in density can effectively suppress penetration of slabs. A variety of evidence suggests that there might be a barrier to convection at a depth of about 900-1000 km. The existence of a chemical boundary at this depth might induce convective stratification.

Conclusion

The residual tomography, which excludes the effects of oceanic lithosphere cooling, craton "roots" and partial melting or cooling related to subducted slabs, correlates with 0-30 Ma subduction, especially the subduction in the Kurile, Japan, Izu-Bonin, Mariana New Hebrides and Philippine trenchs. Some slabs may be trapped at the 670 km discontinuity for a period of time due to the endothermal phase change. Many areas of past subduction do not have evidence for cold slab at the base of the transition region. Slabs in these areas may penetrate the 670 km discontinuity into the top part of the lower mantle. Most hotspots appear to be in hot, or uncooled, regions of the upper mantle. The overall pattern of hotspots correlates with residual tomography in the top 400 km at degree l=2 and 4. Correlations decrease very rapidly in the transition zone region. Some hotspots occur in regions of the mantle that are slower than average down to 650 km depth. Hotspots also correlate with seismic tomography in the 900-1000 km region and the deep lower mantle at degree l=2. However the correlation between residual tomography in the upper mantle and seismic tomography in the deep lower mantle is poor. Subduction is probably the most important control on mantle convection. The thick "roots" of craton may have strong feedback to mantle convection in the upper mantle. Mantle convective stratification does not happen at the 670 km discontinuity. However, the possibility that mantle convection is stratified at deeper chemical boundary cannot be ruled out at this stage.

Acknowledgements: We thank Jascha Polet for providing digital data of cratons. This work was funded by NSF grant EAR 92-18390. Contribution No. 5554, Division of Geological and Planetary Sciences, California Institute of Technology.