The full text on this page is automatically extracted from the file linked above and may contain errors and inconsistencies.
Working Paper 9611 SECTORAL WAGE CONVERGENCE: A NONPARAMETRIC DISTRIBUTIONAL ANALYSIS by Mark E. Schweitzer and Max Dupuy Mark E. Schweitzer is an economist at the Federal Reserve Bank of Cleveland, and Max Dupuy is a graduate student at the Woodrow Wilson School of Public and International Affairs, Princeton University. For helpful comments and suggestions, the authors would like to thank seminar participants at the Humboldt University Conference on Smoothing and Resampling in Economics and at the Federal Reserve Banks of Cleveland, Philadelphia, and San Francisco. Particularly useful suggestions were offered by Eric Severance-Lossin, J.S. Marron, and Randy Wright. Correspondence may be sent to Mark E. Schweitzer, Federal Reserve Bank of Cleveland, Research Department, P.O. Box 6387, Cleveland, Ohio, 44101. Phone: (216) 579-2014. Fax: (216) 579-3050. Email: mschweitzer@clev.frb.org. Working Papers of the Federal Reserve Bank of Cleveland are preliminary materials circulated to stimulate discussion and critical comment. The views contained herein are those of the authors and not necessarily those of the Federal Reserve Bank of Cleveland or of the Board of Governors of the Federal Reserve System. Working Papers are now available electronically through the Cleveland Feds home page on the World Wide Web: http://www.clev.frb.org. December 1996 Abstract The large shift of U.S. employment from goods producers to service producers has generated concern over future income distribution, because of perceived large relative pay differences. This paper applies a nonparametric density overlap statistic to compare the sectors distribution of full-time, weekly wages at all wage levels. To counter problematic features of Current Population Survey data--sampling variation at infrequent wage rates and extensive rounding at common wage rates--we employ nonparametric density-estimation procedures to isolate the underlying shapes of the densities. The validity and accuracy of these two approaches when combined is supported by Monte Carlo simulations. Standard errors and confidence intervals indicate that our results are statistically significant. Broad similarity between goods and services wage distributions is found throughout the period from 1969 to 1993; however, the densities slowly diverge until 1980, after which they tend to converge. By the 1990s, the estimated densities are more than 95 percent identical. The breadth of this similarity and steady recent convergence are not easily identified by typical comparison statistics. Furthermore, the wage densities are most comparable in the central deciles, a finding that disputes the bimodal characterization of service-sector wages. Two potential explanations for the time pattern of the overlapping coefficient are considered by forming hypothetical distributions, but neither of these explanations removes the pattern. I. Introduction The dramatic expansion of the share of U.S. workers employed in serviceproducing industries has provoked much controversy.1 Judgments regarding the desirability of this transformation often imply assumptions about the relative distribution of wages in the two sectors, and about changes in the nature of the distributions over time. The shift toward service-producing employment is often credited with changing certain features of the overall wage distribution -- Bluestone and Harris (1988)stress this explanation for growing wage inequality. One version of this story contends that the service-sector wage distribution is somewhat bimodal relative to the goods-producing distribution.2 Consequently, the growing service sector is blamed for a perceived replacement of manufacturing and construction jobs at the middle of the overall wage distribution with low-wage and high-wage service positions.3 Furthermore, differences in industry wage structures represent an important element in foreign trade explanations of rising earnings inequality, like Borjas and Ramey (1995). Academic research on industry relative wage levels has mainly focused on differentials between narrow industries. While the research in this area (for example, Krueger and Summers [1987, 1989] and Helwege [1992]) does not necessarily contradict the results of our analysis because of methodology differences, it reinforced the belief of 1 Barlett and Steele (1992) and Bernstein (1994) are two recent books which warn about wage consequences of the shift away from goods-producing employment. Newspapers and other popular publications are also a recurring source of similar opinions, for example, Johnson (New York Times, 1994) and Hoagland (Washington Post, 1993). The 1994 Federal Reserve Bank of Dallas annual report, titled The Service Sector: Give It Some Respect is fairly representative of the other side of the debate. 2 See Kassab, 1992, p. 4. This view also crops up in newspapers: according to Johnson (New York Times, 1994), As the Millers [a family supported until recently by manufacturing jobs] gaze into the future...they see an employment landscape shaped like a barbell. At one end are bankers and lawyers...; at the other end are countermen at fast-food franchises.... 3 Barlett and Steele (1992) stress this thesis. 1 large sectoral wage differences. Most analyses of sectoral wage differences focus on averages (perhaps derived from a regression with controlling variables), which ignores differences in wage distribution to the extent that they fail to alter the mean difference.4 The statistical tools available to compare two unknown distributions typically rely either on strong distributional assumptions (for example, equivalence of parameters for a normal or lognormal distribution), or do not provide estimates of the level of similarity between nonequivalent distributions (such as the Kolmogorov-Smirnov equality-ofdistributions test). These tests also require exacting confidence levels to reject the hypothesis that the distributions are distinct when sample sizes reach the thousands of observations available in the Current Population Survey (CPS). In order to examine the relative shapes of the sectoral wage distributions, this paper uses a nonparametric measure of density overlap to examine wage differences between the two sectors over time. While the statistic has long existed in the literature, it rigorously developed only with respect to normal distributions and has rarely been applied. We also modify this statistic in order to identify the locations within the distribution that account for the nonoverlap in each year. The statistical significance of all overlapping statistics in this analysis is evaluated using bootstrapping techniques. This statistic is applied both to empirical densities and to smooth densities estimated using a kernel density estimation procedure. The estimated densities have the advantage of reflecting the shape of the densities without the large amount of rounding 4 An exception is Lawrence (1984) who compares distributions, in terms of low, middle and high income ranges at two points of time. This approach would be complicated if extended over the full time period as sectors because of need to choose appropriate cutoffs and differing business cycles in the sectors. While similar in focus, this analysis is much less general in its coverage and lacks measures of the statistical significance of the results. 2 evident in the raw data. Rounding lowers the apparent overlap of densities by allowing economically insignificant variations in pay levels to lead to substantial nonoverlap at clustered wage levels. Monte Carlo simulations demonstrate two major advantages of applying smoothing prior to calculating the overlapping coefficient: sharply reduced bias in the measure and improved small sample estimation. These simulations are based on controlled samples from commonly used CPS wage data. The combination of these two techniques enables us to reliably measure broad sectoral wage differences and should be broadly applicable to other comparisons of wages between groups. Our results chronicle substantial sectoral wage convergence over the last decade, and also indicate that overlap has been consistently strongest over the middle quantiles of the distributions These results reflect secular trends easily distinguished from sampling variation. We extend these results to show that more narrowly-defined industries and a hypothetically constant education distribution do not disrupt these trends. II. The Data The results in this paper are based on weekly wage data drawn from 25 years of the March CPS--1970 to 1994. Our weekly wages are constructed from weeks worked the previous year and total earnings from the previous year, resulting in wage data that span the period from 1969 to 1993. Annual earnings are corrected for Census Bureau topcoding procedures that cap reported annual wage and salary earnings at $50,000 to $199,998, depending on the year.5 While not necessary for most of the analysis in the 5 The topcoding correction assigns all topcoded wage observations the mean of a Pareto distribution truncated at the topcode, according to the formula reported in Shryock, et al. (1971). The steepness of the distribution prior to the topcode is measured from the 90th percentile to the topcode. 3 paper, wages are inflated (using the GDP Personal Consumption Expenditures Deflator) into constant 1993 dollars to allow readers to compare figures across years. Our sample includes noninstitutional civilian adults who usually worked full time (at least 35 hours per week) for at least 39 weeks in the previous year. Part-time workers are not considered, partially because hourly wage data are not available prior to 1985, but also because we want to consider comparable workers and jobs in each sector. The differences between full-time and part-time wages, while potentially relevant due to the higher part-time employment rates in the service sector, reflect a wide variety of factors (many of them unrelated to employment opportunities) that are not the focus of this study. The majority of part-time workers choose their hours for noneconomic reasons (see Dupuy and Schweitzer [1995]). Furthermore, Blank (1990) finds that the lower pay accorded to part-time positions primarily reflects the workers lower observed and unobserved skills. We exclude workers listed as reporting less than half of the real 1993 minimum wage to avoid a small number of problematically low wage observations.6 For the sake of comparison with published figures, the difference between sectoral median weekly wages for our full-time sample are presented in Figure 1. The most striking feature is the convergence of median wages between 1979 and the early 1990s. In 1993, the median service job paid $19 per week less than the median goods-producing job -- down from a 1979 difference of $83. The relatively small differences between sectors throughout the period are due to focusing on full-time workers. 6 The minimum full-time workweek of 35 hours is used to calculate the weekly earnings implied by this cutoff. 4 However, even for 1993, the wage distributions for the two sectors are statistically distinguishable from each other. Kolmogorov-Smirnov tests indicate that the null hypothesis of equal sectoral wage distributions can be rejected with great confidence (higher than 99.9 percent) for each year in the sample. Furthermore, for both sectors in each year, Kolmogorov-Smirnov tests reject the hypothesis that wages are distributed lognormally (again with greater than 99.9 percent confidence). III. Measuring the Closeness of Distributions While any number of summary statistics can be used to compare distributions, our approach focuses on comparisons of probability density functions. The overlapping coefficient (OVL) compares the frequencies throughout the range of a variable between two samples. Direct application of the OVL provides an easily interpreted, substantive measure of the closeness of two samples, drawn from a population of an arbitrary functional form, when a suitably defined histogram is an adequate representation of the populations. The OVL is a straightforward, but seldom used, measure. Bradley (1985) and Inman and Bradley (1989) promote the use of OVL as an intuitive measure of the substantive similarity between two probability distributions. Graphically, OVL is the area where the densities of the two distributions overlap when plotted on the same axes (see Figure 2). This representation allows a simple hypothesis--that workers in one group are more likely to earn a particular wage than workers in another--to be expanded across all possible wage levels. 5 In the discrete case, appropriate for empirical distributions, OVL is formally defined as OVL = ∑ min[ f1( X ), f2 ( X )] , X where f 1(X) and f 2(X) are the empirical probability density functions or simply proportions of the sample. With continuous distributions, OVL is defined analogously with integration replacing the summation.7 While Inman and Bradleys (1989) development of OVL focuses on the coefficients estimation and properties assuming normal distributions, the value of the OVL in this application is due to the fact that OVL is defined without regard to any distributional assumptions. Furthermore, OVL is invariant to transformations that are one-for-one and order-preserving (like a price deflator), when applied to both distributions. One limitation of OVL was noted by Gastwirth (1975) in the case of income comparisons: Potentially meaningful changes in income for individuals do not necessarily alter OVL. In particular, referring again to Figure 2, if one of the observations beyond the intersection of the densities (v) is given more X (which could be wages), OVL is unchanged. More generally, for xi the value of X for observation i adding or subtracting D to is holdings of X such that sign[ f1(xi ) - f2 (xi )] = sign [ f1 (xi + ∆ ) - f2 (xi + ∆ )] leaves OVL unchanged. While Gastwirth considers this a serious problem for evaluating the effects of affirmative-action programs on the wages of whites and minorities, in comparing the wage distributions of industries there is no sense in which it is preferable for particular workers in one industry to get larger salary increases than in another. 7 Inman and Bradley (1989). 6 On the other hand, we may wish to know what wage ranges cause the distributions to differ substantially. An example of a hypothesis easily framed in this context is the following: While wages are quite similar for top earners in both sectors, the service sector is dominated by good jobs and bad jobs, lacking the midlevel wage opportunities available in goods production. To address these issues using OVL, we can split OVL into the overlap associated with a range of wages. Defining qa as the wage rate at the ath percentile of the full sample (both sectors) and g as a constant percentage, OVL can be split into quantile ranges: ∑ {min[ f ( X ), f OVLQ α = X ∈( qα ,qα + γ ] 1 2 ( X )]} γ ∈[0,1]. For the same reason that OVL is generally unaffected by changes in wages for specific observations (location doesnt matter), the choice of a does not alter the possible values that OVLQa may take. In the case where at each wage level between qa and qa+g the observed frequencies f 1(x) and f 2(x) are always equal, OVLQa equals the sum of the frequencies of f(x) (the density of the full sample) between qa and qa+g, which by definition of the percentiles equals g divided by g, or one. The other extreme is defined by the case where wages in the two sectors are completely disjoint in the range defined by qa and qa+g; thus the minimum of the two densities is always zero in this range. This could occur in a variety of ways; for example, when no workers in a sector are paid wages in the range, or when workers in one sector are paid in even dollar amounts while the other sector pays in odd dollar amounts. 7 OVL allows intuitive comparisons of the degree of similarity between empirical distributions across years. OVLQ allows the similarity or dissimilarity to be located within the distribution of wages. IV. Nonparametric Density Estimation In cases where the discrete jumps of frequency (a feature of histograms) are not an acceptable description of the underlying density, a nonparametric estimate of the empirical density may be favored. Nonparametric density estimation has been recommended for exploratory data analysis in the statistics literature because features of the distribution are often readily visible in the density (Fox [1990] and Révész [1984]). Nonparametric density estimation can easily be thought of as sophisticated histograms. The appearance and implicit interpretation of histograms are strongly dependent on the number of bins. As their binwidth increases (the number of bins is reduced), potentially interesting details of distribution are lost. However as the binwidth is decreased, discontinuities due to sampling may arise. Nonparametric density estimation attempts to strike a balance between these effects when the underlying density is assumed to be smooth. In the case of U.S. wage data there are two clear reasons to believe that some smoothing may be needed: sampling and rounding. The CPS, while an unusually large survey, is still subject to noticeable sampling errors at the level of detail needed for empirical density functions. For example, at the fairly common wage of $400 ($10/hour for 40 hours) only 294 goods-producing workers were surveyed in 1993. Year-to-year variation in the sample could lead to surprising differences between sectors at a given 8 wage level. If the underlying densities of wages are smooth, then the surrounding wage rates may yield information that ameliorates this phenomenon. A very prominent feature of CPS wage data is the high frequency of wage observations at round numbers. This could be due to recall bias favoring round numbers on the part of survey respondents or a tendency for employers to round pay to round numbers. Regardless, the spikes evident in the raw data may not be relevant features for the purposes of the comparison. For example, a smaller tendency to round in one industry would alter the measured OVL without implying large or relevant differences in the underlying wage densities.8 A kernel density estimator smoothes out the discrete jumps in the histogram by applying a kernel function in place of the frequency of observations at each wage level. Kernel functions, K(z), are simply probability density functions integrating to one, so a variety of options exist. Given a selected kernel, the estimated density function is: ) 1 n x − Xi f K (x) = ∑ K nh i =1 h where n is the number of observations in the sample and h is the bandwidth, which corresponds to half of the range observations assumed relevant for frequency at x. The choice of a bandwidth can greatly alter the apparent features of the estimated density, much as the number of bins alters the characteristics of the histogram. 8 Actually, tendencies to round that vary differently over the wage distributions could be equally damaging. 9 A variety of bandwidth selection rules exist in the kernel-density estimation literature (Jones, Marron, and Sheather, 1994). These rules are typically implementations of minimizing the Mean Integrated Squared Error, ( ) MISE(h) = E ∫ f$h - f , where f is the actual density estimated by f$h , which is dependent on the bandwidth h. While this approach has yielded some interesting new bandwidth rules, it does not address directly the critical need of this analysis--removal of the spikes caused by rounded wage rates. Further, a single bandwidth is needed for each sector in all years because a given bandwidth implies a degree of smoothness for the estimated density. OVL estimates can depend on the degree to which spikes are smoothed, as noted in section II. In this light, we applied three rules of thumb to provide guidance on what ranges of bandwidths might be reasonable, but based our final choice on visual inspection. A critical variable in all bandwidth rules is the number of observations: As observations rise, the bandwidth goes to zero. Table 1 shows the results of our three rules of thumb for both sectors in three years: an early year with a small sample with nearly equal sectoral employment levels (1969); a middle year with a larger sample size, but a smaller goods sector (1980); and the last year (1993). These rules vary substantially, with Scotts (1992) oversmoothing rule, designed to be conservative in finding potential modes, always the largest. The visually selected bandwidth turns out to be in the middle of the bandwidth rules of thumb across all of these classes. Specifically, we found that the Gaussian kernel with a bandwidth of $50 yielded the most complete reduction in rounding without 10 smoothing out local frequency differences in the wage distributions.9 Other bandwidths were explored with little change in the qualitative results. Figure 3 shows the remarkable degree to which the CPS data are clustered. The smooth plot is the Gaussian kernel estimate, which on this scale shows little of the shape of the kernel (see Figure 7 for a clearer view of this estimate). In this particular case (the goods sector in 1993), over 77 percent of the weight of the histogram is in spikes above the smooth density, which represent about 22 percent of the possible wage rates. Once the densities have been estimated using these techniques, the estimates may be used to calculate OVL. In this case, OVL is a function of the estimation procedure and reflects the degree of similarity of the two densities, given underlying densities that are believed to be smooth. Even without assuming that the population densities are smooth, the OVL applied to the smooth density indicates the degree of similarity evident in basic shape of density. This number will typically be higher than the OVL calculated from the raw sample, due to reduced sampling variation and rounding differences which can increase the estimated OVL. OVLQ can also be calculated, although the quantile estimates for the full sample should reflect the same procedure applied to sector distributions. V. Diagnostics of the OVL Measures OVL is a straightforward, visually oriented statistic that we augment with a wellestablished technique for estimating densities; however, the statistical characteristics of 9 Other popular kernels tended to reproduce discrete jumps associated with larger wage clusters at all but the largest bandwidths. OVL estimates based on these estimated densities would continue to reflect differences in the rates of clustering between the comparison groups. A similar problem with non-Gaussian kernels was noted by Minotte and Scott (1993) in a similar context. 11 this combined measure as applied to earnings data are not known. We approach this issue by simulating direct analogues of characteristics of interest using samples based on the dataset used in this analysis. Bias of the Overlapping Coefficient As a statistical measure, OVL is fundamentally biased. This is because any sampling variation in the two density estimates results in the statistic being strictly less than one, even when the samples are actually drawn from the same population. Thus, OVL estimates near 1.0 may indicate that the densities actually are drawn from the same population. The most obvious solution is to apply an unbiased test like KolmogorovSmirnov, to determine whether the samples are potentially drawn from the same population. However, this test does not inform us on the closeness. To address the issue of bias in OVL, we estimate that bias in the context of CPS earnings data by fabricating samples that are drawn from the same population. Two basic tests are applied: 1) The actual wage density for one industry is sampled with replacement to simulate a population with substantial rounding of earnings levels, and 2) Samples are drawn from a lognormal distribution with the empirical mean and variance of the wages used in the first test, which eliminates the rounding in the CPS data. These tests are applied at both large (»25,000 per sector) and small (»10,000-13,000 per sector) sample sizes. These simulations are repeated a thousand times to estimate the distribution of bias for each case. Table 2 presents the results of the simulations for both the OVL as applied to the empirical density and the estimated OVL along with its quantiles for each scenario. The 12 starkest conclusion of this analysis is the large degree to which OVL as applied to empirical density (OVL [raw]) is biased away from 1.0. The OVL of the kernel density estimates (OVL [sm]) is biased much less (1.0 to 1.6 percent on average), but still noticeably. The simulations underlying Table 2 also indicate that the bias does not vary substantially relative to its average level in any given sample: For either OVL, the standard deviation of the bias simulations is always under 0.5 percent. In all cases, reducing the sample size increases the bias; however, the bias estimates for OVL (sm) are increased only by about half a percentage point for a sample reduction of approximately 50 percent. The quantile bias measures indicate that the bias in the estimated density OVL are concentrated in the tails of the density. These differential biases must be accounted for when the OVL is broken into OVLQ. These biases blunt one conclusion of our analysis, but having been recognized, they can be easily accounted for without losing the ability to address the location of the differences in the densities. The Role of Sample Size OVL being calculated at all wage rates implies that reducing even the large CPS sample can increase the measured overlap. To estimate the role of sample size across a broad range of samples, simulations on the 1993 data are run for both OVL measures with sample sizes from 4,907 to 196,270. In the smaller samples, 90 to 10 percent samples were drawn from both sectors wage distributions, prior to estimating the full set of overlapping coefficients. A new sample is drawn for each sample size. Larger sample sizes are created by adding samples drawn with replacement of the size of the original 13 dataset to yield datasets from double to quadruple the size (49,069) of the original 1993 sample. In order to estimate the sampling distributions of the simulations, these procedures are repeated 100 times. The results of the sample-size simulations are shown in Figure 4. OVL (sm) is the mean of the simulations on the OVL of the estimated density, and OVL (raw) is the mean of the simulations for the empirical density. The dotted lines indicate one-standarddeviation bounds around the simulation means. The key conclusion is that OVL (sm) are roughly constant at any sample size. On the other hand, OVL applied to the raw data deteriorates rapidly. A 90 percent reduction in the sample lowers the OVL estimate from the raw data from almost 0.85 to 0.69, while the OVL of the estimated densities declines only a third as much, from 0.95 to 0.93. This characteristic is very important, because the CPS sample size has nearly doubled over the period, and some of the comparisons that will be made in the extensions section involve even smaller samples. Both statistics are only slightly affected by expanding their sample size through sampling with replacement. VI. The Evidence for Convergence since the Early 1980s The substantial amount of wage variation in any year is evident from the estimated densities, shown in Figures 5 to 7. Further, while the distributions of earnings have changed over time, the two sectors earnings distributions have generally been reasonably similar. The most notable distinction between the wage distributions is the higher frequencies of goods workers in the range from $700 to $1,100 in 1980. The sectoral densities are visually more similar in 1969 and 1993 than in 1980. These 14 qualitative dimensions of relative earnings, while potentially derivable in a more traditional approach, are obvious from the estimated density. Quantifying these comparisons with OVL allows fine distinctions to be identified and the statistical reliability of these observations to be tested. As section III showed, both OVL and OVLQ estimates are bounded by zero and one. The perfect overlap bound of one is approached in certain ranges of Figure 7, but can only be obtained if the employment frequencies in the two sectors are identical at every wage rate. Because both the calculated statistics and the bootstrapped confidence intervals reflect these bounds (they never equal one), it is useful to keep a level of effective equivalence in mind. Given estimated distributions that reflect only variation in the location and the general shape of the distributions, this level should be high: we will use 0.95 (nearly equivalent) and 0.98 (effectively equivalent). These numbers imply that, for wages in the relevant range, 100 workers in the more prevalent sector would typically be matched with at least 95 or 98 workers in the other. It is helpful to keep cutoffs (though not necessarily ours) in mind, but the actual estimates are, of course, reported. While the nonparametric density estimates do not alter the basic character of the wage distributions, they do significantly alter the implied OVL. Figure 8 shows that the gap between OVL (sm) and OVL (raw) is substantial, sometimes exceeding 0.1. As noted above, sampling variation and differences in rounding would tend to increase the OVL measured in raw data. The other factor in the gap between the two measures is the summarization of wages implied by the smooth density. To counter the potential problem of variation in smoothness driving our results, we have also varied the parameters which affect the smoothness and found similar qualitative results. It should be noted that the 15 estimated densities do show notable features after smoothing, and that the estimated densities are easily rejected as normal or lognormal.10 The upward trend in OVL since around 1980 is visible in either OVL (sm) or OVL (raw), although the estimated densities show more convergence. That these trends are statistically significant can easily be verified in the first two columns of Table 3. The standard errors derived from a thousand repetitions of the bootstrapping algorithm described in the appendix are reported in the parentheses for each of the statistics. The standard errors for both of the OVLs of both the empirical and estimated densities are quite small--generally less than 0.005; thus, the larger changes of both OVLs are typically statistically significant. Unfortunately, the bootstrapped standard errors cannot be taken to imply exact hypothesis tests in this case. One bias already discussed and estimated is the degree to which the OVL estimates differ from 1.0 when the populations are, in fact, identical. This bias is not picked up in the bootstrap because each bootstrap sample yields estimates which also have the same problem. The other bias to be concerned with is the tradeoff between estimator variability and bias in kernel-density estimates. While this bias is also picked up by all bootstrap samples, the OVL (raw) estimates give us reason to suspect that this bias is small, because their standard-error estimates should overstate the ideal smoothed density errors by virtue of being undersmoothed. Given the known bias, estimated in Table 2, we expect that the confidence intervals reported here are conservative reflecting the unconstrained side, with no bias adjustment applied to the mean, and that the standard errors may be somewhat underestimated. 10 While visual features of these estimates appear to violate the parametric densities, we applied both Kolmogorov-Smirnov tests and a test based on skewness and kurtosis to verify this statement. 16 In the most recent years, OVL (sm) is approaching levels where we could easily question the importance of the distinction; however, the choice of cutoffs between substantial and trivial differences depends on personal interpretations. While the bootstrapped standard errors are useful for characterizing the variability of our estimators, we apply bootstrapped confidence intervals to test whether these estimates pass our hypothetical cutoffs.11 The confidence interval approach is favored, because bounded statistics tend to result in asymmetric estimation errors as the bound is approached. Again in Table 3, estimated OVLs that exceed, with 90 percent certainty, the 0.95 cutoff are indicated by one asterisk, 0.98 by two asterisks, and 0.99 by three asterisks. No fulldensity OVLs exceed the cutoffs with this degree of confidence, but they certainly are getting close. As measured by our bootstrap analysis, the OVL (sm) estimates in 1993 exceed 0.95 with a probability of almost 0.5. One of the advantages we noted for OVL is that it can be easily split into quantile components. Table 3 also shows the decile OVLQs for the estimated densities. While only in recent years has the convergence of wages for the full distributions reached the nearly identical cutoff, the middle deciles have frequently exceeded this and higher cutoffs. Even when the wage distributions were most distinct (1980), the sixth and seventh deciles qualify as at least 95 percent overlapped, with 90 percent confidence. These decile OLVQ statistics clearly demonstrate that the wage distributions in the goods and services sectors of the economy have always been closest in the middle ranges, belying the oft-made criticism that the services provide only high- and low-paid work 11 estimation. We follow the approach and guidance of Efron and Tibshirani (1993) on applying bootstrap techniques to confidence-interval 17 relative to goods production. The reality is that the frequencies of middle salary deciles in the two sectors are highly similar in most years. The growing convergence in wage distribution in the 1980s and 1990s can also be allocated according to deciles by the same statistics, because the components average to the overall.12 Comparing 1980 with 1993, virtually every decile is more similar in 1993, but the largest changes have been in the second through the fourth deciles and in the top two deciles. These increases put the fourth through eighth quantiles beyond the 95 percent level of comparability. Wage frequencies are substantially different only in the lowest two deciles, where service-sector jobs continue to be more frequent, and in the topmost decile. What wage ranges led to the peak disparity between distributions seen in 1980? Again, wages were much more similar in the second through the fourth deciles, along with the top two deciles, in the early 1970s relative to the early 1980s. In the second through fourth deciles, it is generally service-sector jobs that are more frequent, whereas the upper deciles have greater frequencies of goods-sector jobs. Thus, the late seventies and early eighties were a period when the relative frequencies of employment in the two sectors became more distinct by shifting towards the wages that are viewed as conventional for each sector. But the surrounding periods show that the more typical wage patterns in the two sectors might be more equal. 12 The reported statistics do not average exactly, because the discrete approximation implies variability in the realized quantile sizes, which are adjusted for in the formula. 18 VII. Further Comparisons The preceding analysis takes an extreme view of wage comparability that runs counter to regression analysis: Wages reflect a mixture of investments and compensating differentials that, while not controlled for, are largely offsetting. While this assumption has allowed the analysis to focus on the full distribution in ways that are not possible in a regression framework, this technique does not necessitate a complete lack of controls. In this section, we consider two simple hypotheses that can be analyzed in the same framework: 1) that the very broad sectors used in the analysis hide the real wage differences; and 2) that wages are converging because service-sector workers have pursued more education, which is rising in value. Narrower Industries At the limit, it is self-evident that narrower industries should be more distinct: Wages in transportation equipment (which includes both automobile and airplane manufacturers) must be and are different from fast food-restaurants. The workers employed by the industries are clearly different. Nonetheless, comparisons may be made at the intermediate categories; for example, manufacturing and narrow services.13 This particular comparison is relevant because much of the sectoral shift has occurred in these divisions. Manufacturing employment has been shrinking rapidly, while the narrow services have been among the most rapidly expanding industries. 13 Manufacturing includes both durable and nondurable components. Narrow services includes: Hotels and Other Lodging; Personal Services; Business Services; Auto Services; Repair Services; Motion Pictures; Amusement and Recreation Services; Health Services; Legal Services; Educational Services; Social Services; Museums; Membership Organizations; Engineering and Management Services; and Private Household Employment. 19 Figure 7 shows that these narrower industries have paralleled the development of the broader sectors.14 After starting at a relatively high overlap (and with more workers in manufacturing) wages become more dissimilar, until they reach a minimum in 1980. By the 1990s wages are nearly as similar in these narrower industries as they are in the broader sectors. The change is all the sharper in the narrow services, because OVL for the narrower industries started lower in the early years. For the sake of brevity we did not report the quantile estimates, but they also repeat the patterns seen in the broader sectors: Wage frequencies have typically been comparable in the middle deciles, and the convergence has occurred in the surrounding deciles. Education Formal (that is, reported) education levels are higher in the service sector and have been rising. This fact, combined with the widely observed rising returns to education, suggests another interpretation of the convergence. Rising education levels have pushed up the wages of service-sector workers as workers have chosen more formal education in lieu of high-paying jobs in goods production. While the structural details of this description are not easily described in the framework, a modified shift-share analysis is possible. We can ask, What might wages look like if the distributions in both sectors reflected the education levels of an earlier base year?15 Without the regression analysis to summarize education returns, the hypothesis must be built in by adjusting the observed frequencies to the base year frequencies. A 14 1969 is not shown because substantial changes in industry coding disrupt comparisons to 1970 and later at this level of disaggregation. 15 The groups are: Less than a high school diploma, high school diploma, some college but no four-year degree, four-year college degree, and some graduate school. We use these rough categories in order to compare education over the entire sample. 20 simple approach is to modify the population weights already used in the CPS to reflect the education distribution of the base year: edfri y edwgt iy = wgt i * , y edfri where wgti is the CPS supplement weight assigned to the individual, and the education frequency terms (edfri) refer to the population frequency of the individuals education level in the base and current years. This reweighting implies an assumption that lower education levels for an individual result in pay comparable to that of current workers at that education level. Unlike a regression shift-share analysis, it does not assume that returns to education can be summarized by a single figure for each education level. While the hypothesis is limited by its assumptions, the results should indicate the direction of these effects. Even though the education shifts are large in wage distributions, altering the composition of the labor force to reflect lower education levels in both sectors affects wages in the sectors fairly evenly. Only in the latest years does any real distinction develop between the previously estimated OVL and the OVL constrained to early education levels (see Figure 10). This startling result negates what seemed to be a fairly credible hypothesis. VIII. Conclusion Wages in the goods- and service-producing sectors are much more comparable than the existing policy literature suspects. The broad-based similarity of wage frequencies in the two sectors has not previously been examined; rather, economists have focused on statistically significant average differences, typically in a regression setting 21 with a variety of controls. For many policy applications these controls may not be relevant (for example, in estimates of the increase in the tax base implied by recruiting firms from a particular sector). Similarly, our results suggest that policies intended to shift employment back to goods production from services will not meaningfully alter the overall distribution of earnings. In fact our results indicate that there is less and less room for income inequality in the economy to increase due to the goods sector employing a smaller fraction of the workforce. This paper proposes and applies an alternative approach to comparing a variable in two sub-populations that focuses on the similarity of the frequencies over the full distribution. While we clearly want to support an approach that does not focus so heavily on the central tendencies of variables, as both means and regressions tend to do, this is not to suggest that regressions have little value in comparing variables like wages in subpopulations. Regressions allow the simultaneous summarization of varied controls which can become impractical in our approach. Nonetheless, we strongly recommend the use of our techniques to clarify the nature of differences or the location of diminished differences between wages in related sectors. Our approach allows detailed, statistical comparisons without making distributional assumptions which are not generally supported by wage data. 22 References Barlett, Donald L. and James B. Steele, America: What Went Wrong? (Kansas City: Andrews and McMeel, 1992). Bernstein, Michael A. Understanding American Economic Decline: The Contours of the Late-Twentieth-Century Experience, in M. A. Bernstein and D. E. Adler (eds.), Understanding American Economic Decline, (Cambridge, England: Cambridge University Press, 1994). Blank, Rebecca M., Are Part-time Jobs Bad Jobs? in Gary T. Burtless (ed.), A Future of Lousy Jobs? The Changing Structure of U.S. Wages, (Washington, D. C.: Brookings Institution, 1990). Borjas, George J. and Valerie A. Ramey, "Foreign Competition, Market Power, and Wage Inequality," The Quarterly Journal of Economics, 110 (1995), 1075-1110. Bluestone, Barry, and Bennet Harrison, The Great U-Turn: Corporate Restructuring and the Polarizing of America (New York: Basic Books, 1988). Bradley, Edwin. L., Jr., Overlapping Coefficient, in S. Kotz and N. L. Johnson (eds.), Encyclopedia of Statistical Sciences, 6 (1985), 546-547. Dupuy, Max and Mark E. Schweitzer, Are Service-Sector Jobs Inferior? Federal Reserve Bank of Cleveland Economic Commentary, (Feb. 1, 1994). Dupuy, Max and Mark E. Schweitzer, Another Look at Part-time Employment, Federal Reserve Bank of Cleveland Economic Commentary, (Feb. 1, 1995). Efron, Bradley and Robert J. Tibshirani, An Introduction to the Bootstrap, (New York: Chapman & Hall, 1993). Federal Reserve Bank of Dallas, The Service Sector: Give It Some Respect, 1994 Annual Report. Fox, John, Describing Univariate Distributions, in John Fox and J. Scott Long (eds.), Modern Methods of Data Analysis, (Newbury Park, CA: Sage Publications, 1990), 58-125. Gastwirth, Joseph L., Statistical Measures of Earnings Differentials, The American Statistician 29 (1975), 32-35. Helwege, Jean, "Sectoral Shifts and Interindustry Wage Differentials," Journal of Labor Economics, 10 (1992), 55-84. Hoagland, Jim, Its Jobs, Remember? The Washington Post, May 13, 1993, p. A27. 23 Inman, Henry F. and Edwin L. Bradley, Jr., The Overlapping Coefficient as a Measure of Agreement between Two Probability Distributions and Point Estimation of the Overlap of Two Normal Densities, Communications in Statistics--Theory and Methodology 18 (1989), 3852-3874. Johnson, Dirk, Family Struggles To Make Do after Fall from Middle Class, The New York Times, March 11, 1994, A1. Jones, M. C., J. S. Marron, and S. J. Sheather, A Brief Survey of Bandwidth Selection for Density Estimation, unpublished manuscript (1994). Kassab, Cathy, Income and Inequality: The Role of the Service Sector in the Changing Distribution of Income (New York: Greenwood Press, 1992). Krueger, Alan and Lawrence Summers, "Reflections on the Inter-Industry Wage Structure," in Unemployment and the Structure of Labor Markets, Kevin Lang and Jonathan Leonard, eds. (New York: Basil Blackwell, 1987). Krueger, Alan and Lawrence Summers, "Effeciency Wages and the Inter-Industry Wage Structure," Econometrica, 56 (1988), 259-94. Lawrence, Robert Z. Sectoral Shifts and the Size of the Middle Class The Brookings Review, (Fall 1994), 3-11. Minotte, Michael C. and David W. Scott, The Mode Tree: A Tool for Visualization of Nonparametric Density Features, Journal of Computational and Graphical Statistics 2 (1993), 51-68. Révész, P., Density Estimation, in P. R. Krishnaiah and P. K. Sen (eds.), Handbook of Statistics 4 (Amsterdam: North Holland, 1984), 531-549. Scott, David W., Multivariate Density Estimation: Theory, Practice, and Visualization (New York: John Wiley and Sons, 1992). Shryock, Henry S., Jacob S. Siegel and Associates, U.S. Bureau of the Census, The Methods and Materials of Demography, (Washington, D.C.: U.S. G.P.O., 1971). Silverman, B. W., Density Estimation for Statistics and Data Analysis (London: Chapman & Hall, 1986). 24 Technical Appendix: Algorithms The Overlap Statistic by Quantiles This algorithm is exact, given a rounding factor and a smoothing algorithm. While exact, the choice of these components can alter the estimates. Larger bin sizes increase the measures overlap. Smoothing can reduce the impact of the rounding factor by limiting the discrete jumps that typically occur with greater regularity with narrow bins. 1. Collect data into bins according to the rounding factor, R. 2. Assure that within the range of wages in the full sample, frequencies exist for each bin for both sectors, by assigning zeroes where necessary. 3. Smooth frequency distributions for both sectors, if desired. 4. Calculate and identify the quantiles associated with each wage bin, from the weighted sum of the sectoral densities. 5. Calculate the overlap at each wage rate, then sum by quantile and over the full distribution, according to equation . 6. Adjust quantile overlaps for size variation in the quantiles. Bootstrapped Standard Errors and Confidence Intervals We apply simple bootstrapping wherever standard errors or hypothesis tests are reported for overlap coefficients. Most estimates are constructed from a thousand bootstrap replications to allow reasonably exact confidence intervals. 1. Resample, with replacement from the original dataset, a bootstrap sample of equal size. 2. Calculate the overlap statistics (smoothed or raw) from the beginning. Store the results. 3. Repeat steps 1 and 2, until the replication dataset reaches the desired size. 4. Calculate the standard errors from the standard deviations of this dataset, and confidence intervals from the percentiles of this replications dataset. 25 Figure 1: Difference Between Goods- and Service-Producing Median Weekly Wages 90 80 70 CWW93 60 50 40 30 20 10 0 1969 1972 1975 1978 1981 1984 1987 1990 1993 Year SOURCE: Authors calculations from Current Population Survey data. Figure 2: Graphic Representation of Overlapping Coefficient SOURCE: Authors drawing. Figure 3: Extreme Rounding Reduced by Kernel Density Estimation SOURCE: Authors calculations from Current Population Survey data. Figure 4: The Effect of Sample Size on OVL Measures 1 0.95 0.9 OVL 0.85 OVL (sm) OVL (raw) 0.8 0.75 0.7 0.65 0.6 10 20 30 40 50 60 70 80 Percent of 1993 Sample SOURCE: Authors calculations from Current Population Survey data. 90 100 200 300 400 Figure 5: 1969 Estimated Wage Densities SOURCE: Authors calculations from Current Population Survey data. Figure 6: 1980 Estimated Wage Densities SOURCE: Authors calculations from Current Population Survey data. Figure 7: 1993 Estimated Wage Densities SOURCE: Authors calculations from Current Population Survey data. 0.90 Estimated Density Raw Data 0.80 0.85 OVL 0.95 1.00 Figure 8: Overlapping Coefficients 1969 1972 1975 1978 1981 1984 1987 1990 1993 Year SOURCE: Authors calculations from Current Population Survey data. Figure 9: OVL for Narrower Industries OVL (Est. Densities) 0.96 0.94 0.92 Broad Sectors Narrow Industries 0.9 0.88 0.86 0.84 1969 1972 1975 1978 1981 1984 Year SOURCE: Authors calculations from Current Population Survey data. 1987 1990 1993 Figure 10: OVL When Workforce Education Composition Is Held Constant 0.96 Smoothed Overlap 0.95 0.94 0.93 0.92 Education Groups Vary 0.91 Education Groups Constant 0.90 0.89 0.88 1969 1972 1975 1978 1981 1984 1987 1990 1993 Year NOTE: Base year is 1972. SOURCE: Authors calculations from Current Population Survey data. Table 1: Bandwidth Selection Rules Goods Number of Observations Silvermans Härdles Better Scotts Oversmoothing 1969 1980 Services Goods Services 13702 42.2 49.7 76 15191 41.1 48.4 75.2 19116 42.7 50.3 72.1 36583 31.5 37.1 51.9 Goods 1993 Services 13484 50 58.9 82.4 35644 37 43.6 61 SOURCE: Authors calculations from Current Population Survey data. Table 2: Bias Simulation Results Distributions Lognormal 1994 Goods Sector Large Sample Small Sample Large Sample Small Sample Avg. Observations per sector OVL (raw) OVL (sm) OVLQ (sm) 10 20 30 40 50 60 70 80 90 100 24915 0.862 0.990 9966 0.788 0.984 25000 0.893 0.987 13484 0.880 0.985 0.990 0.993 0.994 0.994 0.994 0.993 0.993 0.990 0.988 0.973 0.984 0.990 0.991 0.991 0.991 0.989 0.987 0.985 0.981 0.955 0.988 0.992 0.991 0.993 0.992 0.992 0.988 0.982 0.985 0.967 0.985 0.990 0.991 0.991 0.990 0.989 0.988 0.986 0.981 0.961 SOURCE: Authors calculations from Current Population Survey data. Table 3: Estimated Overlapping Coefficients YR 69 70 71 72 73 74 75 76 77 78 79 80 Raw Overlap 0.84189 (0.0048) 0.84833 (0.0041) 0.84983 (0.0042) 0.85061 (0.0043) 0.86147 (0.0044) 0.84496 (0.0043) 0.81478 (0.0045) 0.82309 (0.0045) 0.83285 (0.0043) 0.82796 (0.0045) 0.83214 (0.0042) 0.82422 (0.0042) Estimated Overlap 0.92242 (0.0055) 0.93989 (0.0044) 0.93113 (0.0047) 0.92256 (0.0049) 0.92575 (0.0049) 0.92177 (0.0049) 0.91748 (0.0048) 0.91464 (0.0048) 0.91376 (0.0048) 0.90666 (0.0047) 0.89732 (0.0047) 0.89642 (0.0045) First Decile 0.83134 (0.0147) 0.76956 (0.0140) 0.75886 (0.0150) 0.77289 (0.0148) 0.80608 (0.0152) 0.78113 (0.0154) 0.74071 (0.0145) 0.76109 (0.0147) 0.76308 (0.0147) 0.77577 (0.0138) 0.77139 (0.0137) 0.74909 (0.0137) Second Decile 0.90418 (0.0110) 0.91962 (0.0109) 0.90440 (0.0114) 0.90306 (0.0116) 0.90018 (0.0118) 0.87077 (0.0117) 0.88423 (0.0117) 0.87912 (0.0111) 0.86711 (0.0113) 0.86862 (0.0111) 0.84616 (0.0104) 0.83831 (0.0104) SOURCE: Authors calculations from Current Population Survey data. Third Decile 0.93550 (0.0101) 0.97382 * (0.0096) 0.96332 * (0.0100) 0.94133 (0.0104) 0.93624 (0.0108) 0.93602 (0.0108) 0.94822 (0.0107) 0.92665 (0.0102) 0.90888 (0.0107) 0.90674 (0.0102) 0.88190 (0.0098) 0.88134 (0.0092) Fourth Fifth Sixth Decile Decile Decile 0.95562 0.98433 * 0.95910 (0.0093) (0.0052) (0.0098) 0.99587 *** 0.99153 ** 0.98371 ** (0.0043) (0.0055) (0.0068) 0.99556 ** 0.99790 *** 0.98570 * (0.0058) (0.0048) (0.0059) 0.96244 0.98389 * 0.97005 * (0.0103) (0.0083) (0.0068) 0.94873 0.97471 * 0.97313 * (0.0108) (0.0094) (0.0067) 0.96738 * 0.98482 * 0.97244 * (0.0102) (0.0079) (0.0070) 0.95583 0.97444 * 0.97356 * (0.0107) (0.0094) (0.0066) 0.94440 0.96037 0.98508 ** (0.0105) (0.0107) (0.0038) 0.93481 0.98297 * 0.97404 * (0.0104) (0.0057) (0.0069) 0.91944 0.94708 0.98896 ** (0.0103) (0.0107) (0.0037) 0.91086 0.94671 0.98604 ** (0.0098) (0.0107) (0.0031) 0.91747 0.95746 0.99021 ** (0.0097) (0.0103) (0.0040) Seventh Decile 0.92184 (0.0101) 0.96230 * (0.0077) 0.95129 (0.0079) 0.93365 (0.0076) 0.93158 (0.0075) 0.94071 (0.0078) 0.94975 (0.0076) 0.95690 (0.0074) 0.95695 (0.0074) 0.96228 * (0.0079) 0.95834 (0.0070) 0.96256 * (0.0068) Eighth Decile 0.90620 (0.0120) 0.93118 (0.0087) 0.92387 (0.0089) 0.92092 (0.0086) 0.92503 (0.0090) 0.92710 (0.0088) 0.92807 (0.0085) 0.93329 (0.0083) 0.93395 (0.0083) 0.92738 (0.0078) 0.91912 (0.0081) 0.93490 (0.0074) Ninth Decile 0.91575 (0.0137) 0.93353 (0.0105) 0.92132 (0.0105) 0.93839 (0.0102) 0.92857 (0.0105) 0.91085 (0.0101) 0.90569 (0.0098) 0.89577 (0.0100) 0.90168 (0.0096) 0.87332 (0.0094) 0.84852 (0.0092) 0.87368 (0.0093) Tenth Decile 0.90991 (0.0151) 0.93648 (0.0119) 0.90904 (0.0126) 0.89887 (0.0132) 0.93241 (0.0120) 0.92545 (0.0115) 0.91315 (0.0111) 0.90168 (0.0113) 0.91260 (0.0109) 0.89539 (0.0108) 0.90410 (0.0109) 0.85778 (0.0097) Table 3 (continued): Estimated Overlapping Coefficients YR 81 82 83 84 85 86 87 88 89 90 91 92 93 Raw Overlap 0.82744 (0.0044) 0.82059 (0.0043) 0.82813 (0.0045) 0.84207 (0.0044) 0.83519 (0.0046) 0.84146 (0.0045) 0.85491 (0.0043) 0.84514 (0.0048) 0.85023 (0.0044) 0.84752 (0.0047) 0.85320 (0.0045) 0.84708 (0.0044) 0.84848 (0.0046) Estimated Overlap 0.91072 (0.0048) 0.90795 (0.0048) 0.92179 (0.0047) 0.93812 (0.0045) 0.92795 (0.0047) 0.92657 (0.0048) 0.93622 (0.0050) 0.93603 (0.0048) 0.94056 (0.0047) 0.94935 (0.0049) 0.95108 (0.0050) 0.95525 (0.0046) 0.94949 (0.0050) First Decile 0.75720 (0.0146) 0.79182 (0.0149) 0.81349 (0.0156) 0.82018 (0.0155) 0.76212 (0.0150) 0.81874 (0.0156) 0.82613 (0.0162) 0.81095 (0.0166) 0.82125 (0.0164) 0.82713 (0.0164) 0.83366 (0.0170) 0.83426 (0.0175) 0.80731 (0.0176) Second Decile 0.86065 (0.0113) 0.85896 (0.0109) 0.88049 (0.0121) 0.92411 (0.0122) 0.86795 (0.0119) 0.87455 (0.0122) 0.90135 (0.0131) 0.90585 (0.0131) 0.89311 (0.0129) 0.90458 (0.0131) 0.90985 (0.0128) 0.91591 (0.0135) 0.92038 (0.0139) SOURCE: Authors calculations from Current Population Survey data. Third Decile 0.91322 (0.0104) 0.89329 (0.0102) 0.89926 (0.0111) 0.94633 (0.0107) 0.93105 (0.0114) 0.90842 (0.0114) 0.92847 (0.0116) 0.93325 (0.0125) 0.94566 (0.0122) 0.95985 (0.0125) 0.96489 (0.0124) 0.97210 * (0.0122) 0.96772 (0.0124) Fourth Decile 0.93798 (0.0103) 0.91778 (0.0104) 0.91747 (0.0110) 0.95219 (0.0108) 0.96323 (0.0113) 0.94615 (0.0122) 0.93723 (0.0119) 0.96935 (0.0122) 0.97787 (0.0112) 0.98594 (0.0094) 0.97598 (0.0117) 0.99570 (0.0068) 0.98009 (0.0124) Fifth Decile 0.96208 (0.0110) 0.94246 (0.0112) 0.96332 (0.0106) 0.97129 (0.0114) 0.99197 (0.0042) 0.97463 (0.0117) 0.97747 (0.0102) * 0.99513 (0.0082) * 0.99000 (0.0089) * 0.99356 (0.0041) * 0.99428 (0.0044) ** 0.99606 (0.0044) * 0.99249 (0.0068) Sixth Decile 0.98904 (0.0035) 0.98712 (0.0045) 0.98481 (0.0054) * 0.98254 (0.0108) ** 0.98937 (0.0055) * 0.99657 (0.0055) * 0.99193 (0.0048) ** 0.99061 (0.0099) * 0.99716 (0.0049) ** 0.98860 (0.0055) ** 0.98465 (0.0058) *** 0.99642 (0.0047) ** 0.98510 (0.0054) Seventh Decile * 0.96670 (0.0069) * 0.96985 (0.0069) ** 0.98384 (0.0065) *** 0.98924 (0.0042) ** 0.98941 (0.0055) ** 0.99441 (0.0050) * 0.98516 (0.0062) ** 0.99013 (0.0059) * 0.98335 (0.0061) * 0.98633 (0.0060) * 0.98148 (0.0064) * 0.98676 (0.0055) * 0.97186 (0.0065) * * * ** ** ** * ** * * * * * Eighth Decile 0.93265 (0.0080) 0.93895 (0.0081) 0.96363 (0.0077) 0.95226 (0.0075) 0.95457 (0.0076) 0.94915 (0.0080) 0.96154 (0.0073) 0.95304 (0.0078) 0.97096 (0.0074) 0.97077 (0.0074) 0.98003 (0.0069) 0.97405 (0.0074) 0.97315 (0.0074) * * * * * * * Ninth Decile 0.89831 (0.0096) 0.88389 (0.0093) 0.90463 (0.0089) 0.90992 (0.0090) 0.91087 (0.0089) 0.90549 (0.0089) 0.92595 (0.0090) 0.92855 (0.0094) 0.92345 (0.0086) 0.94375 (0.0085) 0.95068 (0.0082) 0.95241 (0.0082) 0.95660 (0.0083) Tenth Decile 0.88825 (0.0104) 0.89454 (0.0106) 0.90463 (0.0089) 0.93242 (0.0097) 0.91744 (0.0106) 0.89662 (0.0118) 0.92616 (0.0100) 0.88192 (0.0126) 0.90172 (0.0115) 0.93207 (0.0110) 0.93415 (0.0131) 0.92845 (0.0120) 0.93899 (0.0131)