 Methodology
 Open Access
 Published:
Trend tests for the evaluation of exposureresponse relationships in epidemiological exposure studies
Epidemiologic Perspectives & Innovations volume 6, Article number: 1 (2009)
Abstract
One possibility for the statistical evaluation of trends in epidemiological exposure studies is the use of a trend test for data organized in a 2 × k contingency table. Commonly, the exposure data are naturally grouped or continuous exposure data are appropriately categorized. The trend test should be sensitive to any shape of the exposureresponse relationship. Commonly, a global trend test only determines whether there is a trend or not. Once a trend is seen it is important to identify the likely shape of the exposureresponse relationship. This paper introduces a best contrast approach and an alternative approach based on orderrestricted information criteria for the model selection of a particular exposureresponse relationship. For the simple change point alternative H _{1} : π _{1} = ...= π _{ q } <π _{ q+1} = ... = π _{ k } an appropriate approach for the identification of a global trend as well as for the most likely shape of that exposureresponse relationship is characterized by simulation and demonstrated for real data examples. Power and simultaneous confidence intervals can be estimated as well. If the conditions are fulfilled to transform the exposureresponse data into a 2 × k table, a simple approach for identification of a global trend and its elementary shape is available for epidemiologists.
Introduction
Statistical trend analysis is an important component of epidemiological exposure studies. Here, "trend" simply means the demonstration of any monotone relationship between the response rate and the continuous exposure. For example, the association between all major types of childhood cancer and exposure to magnetic fields from high voltage installations was analyzed by Lausen et al. [1] using the data shown in Table 1, where the original continuous exposure data (Olsen et al., [2]) were categorized.
Although this example is seriously unbalanced, real epidemiological exposure studies with many unexposed or lowexposure cases but few highexposure cases can be found. The appropriate evaluation of such epidemiological exposure studies is a statistical challenge. Many similar examples can be found in the literature, e.g. a casecontrol study for respiratory cancer possibly caused by longterm exposure to coke oven emissions [3].
In exposure studies, an unexposed group, E_{1}, is commonly compared with several exposure groups, E_{2},..., E_{k}. The outcome of the study is the number of cases suffering from the disease being investigated, such as a specific tumor, and the number of observations without the disease (controls), i.e. the risk of disease in each category of exposure. One important objective in exposure epidemiology is causation; the demonstration of a global exposureresponse relationship represents one of the causation criteria, according to Hill [4]. A global trend test leads to identification of a trend, whereas model selection allows inference of the likelihood of a particular elementary model.
The sampling strategy of epidemiological exposure studies is either a cohort study, in which a 2 × k contingency table represents the data, or a casecontrol study, in which two multinomial distributions are compared. However, the likelihood ratio test of identical multinomials against the elementary odds ratios alternative, for a sufficient total number of observations, is equivalent to the comparison of the k independent binomial proportions against a simple ordered alternative (Agresti and Coull, [5]; Hothorn et al., [6]). Therefore, it is appropriate to evaluate both designs by means of an asymptotic trend test for a 2 × k contingency table.
Numerous methods, including modelbased (e.g. Royston et al., [7]) and testbased approaches (e.g. Dosemeci and Benichou, [8]), are used to analyze exposureresponse relationships. A basic problem is that the shape of the exposureresponse is unknown a priori and is an outcome of the study. However, the choice of model or test greatly depends on the shape of the exposureresponse. Therefore, a broad class of models or tests should be used, but that, in turn, leads to a model selection dilemma. Model selection is an intricate component of statistical problems. Model selection in this case is not the objective, but is only a tool for identifying the correct trend from several possible elementary alternatives. An alternative hypothesis can be decomposed into its underlying elementary alternatives, e.g. the simple order alternative H _{1}: π _{1} ≤ π _{2} ≤ π _{3} can be decomposed into the three elementary hypotheses : π _{1} = π _{2} <π _{3}, : π _{1} <π _{2} = π _{3}, : π _{1} <π _{2} <π _{3}.
The pvalue, a commonly used outcome of a trend test, is frequently insufficient for epidemiological studies. Information concerning the shape of the exposureresponse and/or a measure of the magnitude of the effect, such as relative risks or odds ratios, is desirable for a significant trend. Thus, the level of the false positive decision rate (α) should be controlled. In addition, an approach with a minimum false negative decision rate (β) (respective maximum power π = 1  β) for the global test decision and a maximum correct decision rate for the selected model should be identified. The correct classification rate, the proportion of correctly identified elementary alternatives, is used as a major performance measure later on.
The exposure in casecontrol studies is frequently measured on a continuous scale. Categorization at preselected cutoff points of a small number of ordered categories is common; for example, four categories of trihalomethane exposure (Jones et al., [9]), or three categories of lifetime dose of hair dye (Benavente et al., [10]). Inappropriately chosen cutoff points dramatically reduce the power of the trend test (Greenland, [11]). Some exposures are naturally grouped, for example 2–3 cups of coffee per day, by the impreciseness of the definitions, such as "cup" and "coffee" (Ascherio et al., [12]). An example of ordinal definition of the exposure is given in a casecontrol study of Norwegian nickel refinery workers (Grimsrud et al., [13]). The exposurerelated associations between smokingadjusted lung cancer rates and cumulative exposure to different forms of nickel used the categories "low," "medium," and "high."
The best approach, in terms of both power and interpretation, occurs when a single cutoff point exists and is known a priori, resulting in a twosample test "above" vs. "below" the cutoff point. This is because an odds ratio and its onesided confidence interval can be estimated. The trend test approach discussed here is designed for naturally grouped exposure with a single change point. For continuous exposure models a continuous covariate can be used. However, the choice of an appropriate model – such as linear, logistic, or other – remains open and model selection influences the inference.
In this paper, a trend test for the comparison of k ordered binomial proportions using a change point alternative is presented. Either a single change point is directly of interest or the change point alternative is pivotal, i.e. many other elementary monotone alternatives can be generated from it. The concept of multiple contrasts is used because of the simplicity and the availability of the distribution under the alternative. After a significant trend test, information is provided that determines which contrast was the "best," and therefore, which exposureresponse shape describes the data most accurately. Alternatively, an information criterionbased approach for the likelihood ratio test under monotone orderrestriction according to Anraku [14] is examined.
Therefore, the primary objective of this paper is not just describing the exposureresponse relationship but also identifying the most likely elementary exposureresponse model with a control of the false model classification rate.
Analysis
Global tests on exposureresponse relationships
The number of diseased and healthy persons for each exposure group, Ej, are organized in the following 2 × k table, where Index 1 denotes the group without exposure.
The estimator for the proportions per exposure group is p _{ j } = n _{ j1}/n _{ j.} j = 1,..., k, the total is p = n _{.1}/n.., and the expected values for the proportions are denoted as π _{ j }. The hypotheses system for a monotone order is:
H _{0}: π _{1} = π _{2} = ... = π _{ k } against
H _{1}: π _{1} ≤ π _{2} ≤ ... ≤ π _{ k } with at least one strict inequality.
For simplicity, assume increasing effects with increased exposure; analogously, a directional decision for a decrease is possible.
There are an extensive number of publications concerning orderrestricted tests, including the analysis of 2 × k contingency tables (e.g. Agresti and Coull, [5]; Leuraud and Benichou, [15]). However, no uniformly powerful trend test exists for all possible alternative shapes. The possible shapes can be seen as different equalityinequality patterns of H_{1}. This can be seen for an extreme convex shape {0, 0, 0, π}. Clearly, the "Helmert's contrast" is most powerful because of the optimal pooling of all the lower exposures and the comparison with the high exposure: p _{4}  (p _{1} + p _{2} + p _{3})/3. However, power for Helmert's contrast is greatly reduced for the extreme concave shape {0, π, π, π}. The shape of the exposureresponse relationship is unknown a priori. Irrespective of numerous recent alternative proposals, the likelihood ratio test represents an appropriate solution for this situation. This test is numerically complicated, particularly concerning its distribution under the alternative, which is needed for power/sample size calculations (Robertson et al., [16]). The multiple contrast test according to Bretz and Hothorn [17] approximates its power and is simpler. There are 2 ^{k} 1 different shapes for k exposure groups, and for each shape a contrast with a minimum false negative rate (β) can be defined. The idea is to select the best contrast, which is sensitive for a certain shape. The best contrast is simply tested by a maximum test. Because the proportions p _{ j } are asymptotically normally distributed, their linear combination (denoted as contrast) is also normally distributed, and therefore, the single contrast test statistic is asymptotically normally distributed, where ∑_{ j } c _{ j } = 0 guarantees a level α test under the null hypothesis. Different variance estimators can be used, but to keep the problem simple, the commonly used pooled estimator p is used here. Asymptotic test versions are used throughout. The contrast coefficients, c _{ j }, are specific for each contrast test; for example the Helmert's contrast [c _{ i } = 1; j = 1,..., k  1 and c _{ k } = k]. A multiple contrast test is the maximum of s predefined single contrast tests , i = 1,..., s where c _{ i } = (c _{ i1},..., c _{ ik }) is a k vector of contrasts. Under the null hypotheses, the joint distribution of the linear contrast tests t _{ SingleC }(c _{ i }) i = 1,..., s is an svariate normal distribution with a zero vector of means and a nonproductmoment correlation matrix. The correlation between two arbitrary contrasts, a = (a _{1},..., a _{ k }) and b = (b _{1},..., b _{ k }), is .
This socalled isotonic contrast approach, based on s = 7 contrasts, for the balanced design with four exposure groups is demonstrated in Table 3.
However, the correct classification rates for the most likely elementary alternative (shape of the exposureresponse) were found to be unsatisfactory for isotonic contrasts (Hothorn et al., [6]). Therefore, a special case of orderrestricted inference is considered for step shapes only and denoted as a change point alternative (Hirotsu and Marumo, [18]). Two situations should be considered: i) threshold level studies assuming that an exposureresponse reveals a single change point, which can be characterized by a lower part, an upper part, and an abrupt change between both; and ii) exposureresponse studies with continuous exposure data where the change point alternative is a special and substantial component of the allpattern alternative, which can simplify the evaluation. In some epidemiological problems this question arises. An example of a threshold level study is a diabetes study (PastorBarriuso et al., [19]) with the relationship between 2hour plasma glucose and mortality, where the following questions were formulated: i) Does a certain glucose level exists that markedly increases the mortality risk? ii) Can this change point be estimated? Proposals in the literature are directed only at proof of the existence of such a change point. However, epidemiologists not only want to know that such a change exists, but also where this change is located. Here it is demonstrated that the estimation of the change point q is characterized by its correct classification rate by means of multiple contrast tests, that is, in a testing framework. The hypotheses system for a change from q to q+1 is:H _{0}: π _{1} = π _{2} = ... = π _{ k } H _{1}: π _{1} = ... = π _{ q } <π _{ q+1} = ... π _{ k } q ∈ (1,..., k  1)
The above hypotheses system can be tested by multiple step contrasts. Exactly (k1) step contrasts are appropriate for testing the above hypothesis:
Exactly three possible change points, q, exist for the simple design with one unexposed and three exposure groups. Exactly one contrast is poweroptimal for the balanced design of each change point:
"Poweroptimal" simply means the maximum test statistics because the is normally distributed, and therefore, standardized. The t _{ MultipleC } is qvariate normally distributed. The contrast coefficients, c, for q contrasts are defined for the general unbalanced design (Hirotsu et al., [20]):
These step contrasts reveal a nice ability to transform the ksample problem into an unbalanced twosample problem, which can be used later for estimation of the unadjusted relative risk (or odds ratio) "above/below" the change point. Moreover, the step contrasts belong to a broader class of multiple contrasts. Isotonic contrasts approximate the power of the likelihood ratio test for the monotone ordered hypothesis. The bivariate up/down proposals (Neuhaeuser and Hothorn, [21]; Stewart and Ruberg, [22]) only use the two extreme contrasts (Table 3). Therefore, the change point alternative represents a compromise for testing trends. It is much less dependent on the power of the shape compared with the frequently used single linear contrast test, although only k instead of 2^{k}  1 isotonic contrasts were used. The multiple contrast test (above) is defined for differences of proportions, but can be reformulated for the relative risk, commonly used in epidemiology (see Appendix A).
It seems that a multiple contrast test may be a different approach to the commonly used logistic model. However, a strong relationship between the multiple contrast test and the score test in a logistic model exists, which allows the correction for additional confounders (Hothorn et al., [6]).
Identification of the exposureresponse shape
The trend tests distinguish only globally between the null hypothesis and alternative hypotheses, based on the asymptotic distribution of the test statistics under the null hypothesis. That is, either a trend exists or it does not. However, the alternative hypothesis is not unique. For example, the following three hypotheses are possible for the change point alternative for a design with one unexposed and three exposure groups:
However, the global trend tests provide no answer as to which particular alternative exists. Two different approaches can be used to answer this question: i) the best contrast approach; and ii) a model selection approach based on the information criterion for order restriction. This paper explores the identification of one of the possible k  1 elementary alternatives; that is, a classification into . Consequently, the correct classification rate, or the proportion of correctly identified elementary alternatives, is used as a performance measure later on.
The global test decision for the multiple contrast approach is based on the maximum of all included single contrasts , i = 1,..., s, where each single contrast is power optimal for a particular type of alternative (Table 3). Therefore, this maximum contrast approach can be used as an estimator for the exposureresponse shape, where the classification is performed after a significant trend test for control α. For example, two alternatives are possible for a design with three exposure groups: π _{1} = π _{2} <π _{3} or : π _{1} <π _{2} = π _{3}. Assume that the number of diseased cases, n _{11},..., n _{ k1}, is drawn from k binomial random variables with parameters π _{ j } and n _{j.} A possible exposureresponse is described by a contrast vector, c = (c _{1},..., c _{ k }). The problem is to estimate the underlying exposureresponse relationship when s contrast vectors are given. A simple estimator is the function Ψ : (n _{11},..., n _{ k1}) → {1,..., s} which can be derived from the associated contrast test, i.e. . Then explore variability of the simple estimator, Ψ_{1}. How likely is each of the s possible values under the observed data? This question can be addressed via the parametric bootstrap. Repeated realizations from k binomial distributions with sample sizes n _{ j }. and the estimated success parameter p _{ j } = n _{ j1}/n _{ j.} for j = 1,..., k are drawn.

Draw B bootstrap samples

Compute

Compute the relative frequency of each possible value from 1,..., s
This is a measure for the variance of the estimator. Under special circumstances, an improved estimator can be computed by a majority voting according to Breiman [23] over , where I denotes the indicator function. This approach is designated the "parametric bootstrap best contrast" approach.
The model selection approach, based on the information criterion for orderrestriction of normally distributed variables according to Anraku [14], can be modified for proportions and the change point alternative. The AIC criterion for the unrestricted maximum likelihood estimator : (with l() = loglikelihood, p = dimension of θ) was modified for orderrestricted maximum likelihood estimators: . The penalty term is calculated for each model using the level of probabilities under an orderrestriction. The explicit formulas for a design with three exposure groups, such as the nullmodel M_{0} and the two change point models M_{1} and M_{2}, are given in Appendix B. The ORICapproach represents a model estimation approach, where model M _{0}{H _{0}: π _{1} = π _{2} = π _{3}}, model M _{1} { : π _{1} = π _{2} <π _{3}}, or model M _{2} { : π _{1} = π _{2} <π _{3}} will be estimated as a "best fitted" model.
Simulation study
The simulation study is structured in two parts: i) empirical comparison between the bestcontrast approach and the ORIC approach for a design with three groups; and ii) investigation of the best contrast approach for more general designs. Fifty thousand pseudorandom 2 × k tables (k ranging from 3 to 7) were generated and 10,000 bootstrap samples were drawn. Two criteria are used, the correct classification rate – the empirical decision rate for the correct model – and the power.
Part I
The correct classification rates for the ORIC approach, ORIC (M_{0}, M_{1}, M_{2}), and the parametric bootstrap best contrast approach, Max(H^{1}, H^{2}), were compared for a design with three exposure groups (in Table 4) for the change point alternatives with different unexposed rates, π _{1}. From the first row in Table 4, where no differences between the proportions were investigated, the main difference between both approaches becomes clear. The ORIC approach, as an estimation approach, did not control for α. Only in 76% of the cases, not 95%, was M_{0} selected under the null hypothesis. On the other hand, the best contrast test approach does control for α. Both approaches reveal high correct classification rates, greater than 90%, as long as the power is sufficient: either small unexposed rates, π _{1}, or large noncentrality parameters Δ (Table I in Appendix C (available as additional file 1) and larger sample sizes in Table II in Appendix C). This behavior is similar to the power of trend tests of proportions (Bretz and Hothorn, [17]). Due to the fact that the correct classification rates of the best contrast approach are similar or superior to those of the ORIC approach with decreasing π _{1}, increasing Δ, and n_{j}, the best contrast approach is recommended because of its simplicity and generalizability for use within the generalized linear model.
Part II
For one selected change point alternative {π _{1}, π _{1}, π _{1}, π _{1}, π _{1} + Δ } the best contrast approach was investigated for the different dimensions k, different unexposed rates π _{1}, and several noncentrality parameters Δ, shown in Table 5. With an increasing number of exposure groups, a slight decrease of the correct classification rate occurs where the power is slightly increasing. With a decreasing sample size, a slight decrease of the correct classification rate occurs where the power is substantially decreasing. The wellknown decrease of sensitivity with an increasing unexposed rate from 2 × 2 table analysis holds true for power and, less markedly, for the correct classification rate. The effect size (noncentrality Δ) has much less impact on the correct classification rate compared with its wellknown impact on power.
Table 6 demonstrates the decreasing correct classification rate for change points q <<k. More important, from an epidemiological point of view, are the asymmetrical cumulative false classification rates. False classification is primarily from an overestimation instead of an underestimation of the true change point, that is, it is very unlikely to mistake a lower change point for the true one.
Extreme unbalanced exposure data
Particularly for environmental studies, much of the data is for unexposed and lowtomedium exposures; only rarely does data for high exposure exist. This is quite fortunate from an ethical point of view. However, this results in extremely unbalanced 2 × k tables and the statistical outcome depends on the rare, highlevel exposure data. In a casecontrol study for respiratory cancer possibly caused by longterm exposure to coke oven emissions, the sample size was 10,198 in the unexposed group, but only 487 were in the highest exposure group (Costantino et al., [3]). A more extreme example was the study evaluating the connection between childhood cancer and magnetic fields from high voltage installations. The sample size was 2 in the highest exposure group, but 6,457 in the unexposed group (Table 1). The power decreases greatly for extremely unbalanced designs and accordingly the correct classification rate also decreases. If the total sample size is increased to achieve the same power, then the correct classification would be of the same magnitude as the balanced case, see Table 7. The identification of a trend in such a highly unbalanced design is complicated. A significant trend may depend on only these few cases, and the size and power of unbalanced designs differ greatly from those in balanced designs. In unbalanced designs with smaller change points, the correct classification rate increases if the resulting twosample test is less unbalanced (as a result of the related step contrast). A change point at a high exposure that is based on rare data is very vague, however it becomes more stable when mediumtohigh exposure from additional data are obtained.
Unbalanced designs, where the smallest sample size occurs in the informative groups (large change point s), reveal a clearly reduced classification rate. However, that decrease, compared with the balanced design, is much weaker than the related power loss. A further reduction occurs for the "inbetween" change points as long as the sample size of the pooled informative groups is still smaller than the lower exposure groups. A further substantial increase of the sample size for the unexposed group had almost no influence on the classification rate.
Since a sample size of n _{ j } = 1 is possible, in principle, for this approach, the impact of the continuous exposure categorization can be demonstrated quantitatively with respect to power and classification rate. When a single change point exists, the best approach is the categorization below or above this change point. The true alternative is never known a priori when dealing with real data. Therefore, appropriate categorization may be helpful and inappropriate categorization can greatly reduce the sensitivity.
The asymptotic power for the change point alternative is available (Bretz and Hothorn, [17]). Based on an Rcode, the power can be calculated for an arbitrary sample size pattern, which shapes the exposure response and dimensions k. Power estimation for unbalanced designs can be found in [6] whereas a serious power loss can be observed when the sample size in the informative high exposure groups is very small compared with the sample size in the unexposed or low exposure groups.
Evaluation of the example
The pvalue for the global trend test (change point alternative) and the classification rate of the best contrast approach is determined using an implementation of the proposed procedures in R (R Development Core Team, [24]). The most likely change point, q, and simultaneous confidence intervals for the related change point contrasts can be calculated for the 2 × k contingency table data. A marginal confidence interval can be estimated for each elementary contrast because it represents a linear combination of the proportions p _{ j }. Simultaneous confidence intervals for the maximum of several contrasts can be estimated using a multivariate normal distribution. A detailed description for the estimation of simultaneous confidence intervals for several multiple contrast tests can be found in [25] where the particular problems for binomial data were described recently [26]. The software is available as the R library bindosres as additional file 2. This file can be installed in the private R program via "Install packages from local zip files",
The magnet field cancer data in Table 8 revealed a change point q = 8 with a classification rate of 0.74 (pvalue for a global trend = 0.002). The cumulative false classification of 0.26 is nearly concentrated on q = 7. The maximum simultaneous lower confidence limit is for the subset [10 vs. {1, 2, 3, 4, 5, 6, 7, 8, 9}] and seems to be medically relevant with 0.563, but differs only a little from that of subset {10, 9} vs. {1, 2, 3, 4, 5, 6, 7, 8} that is related to the change point. The analysis of the continuous data using maximally selected rank statistics gave a cutpoint of 0.45 μTesla^{1}. However, above this cutpoint only six cancer cases with an exposure of 0.51, 0.73, 1.0, 1.59, 1.66, and 1.72, and two cases without cancer with exposures 0.73 and 0.83 μ Tesla were available. A careful interpretation is recommended: i) the correct classification rate is not high, ii) a high change point was identified, iii) above the change point are only 4 of 6,491 cases, and iv) the spontaneous rate of 0.263 is rather high. More examples and their interpretation can be found in Hothorn et al., [6].
Conclusion
Trend tests for the analysis of 2 × k tables using epidemiological exposure data are described to identify the change point alternatives. Not only is the identification of a trend of interest important, but also the information regarding the particular types of alternatives. The best contrast approach for the multiple contrast test is useful for identifying the type of alternative or the change point, whereas a parametric bootstrap is suitable for an assessment of the variability. Both the bootstrapped best contrast and the ORIC approach are appropriate for different dimensions, noncentralities, sample sizes, and the unexposed group rates (due to the asymmetry in binomial testing). The consequences of unbalanced designs – of a large number in the unexposed or low exposure groups and a small number in the high exposure groups – can be calculated depending on the expected shape. Simultaneous confidence intervals for the change point alternative are also available.
Approaches that test a global trend in epidemiological exposure data and also provide information on the pattern of the exposureresponse relationship are rare. The most competitive approach is the fractional polynomials model [7], which is a specific multivariable regression approach.
Most epidemiological studies are characterized not only by the primary exposure factor but also by several covariates, such as gender, age, occupational status, and competing risk characteristics. Therefore, the best contrast approach within the framework of the generalized linear model is recently available [27]. Using the related R library (multcomp), real data can be evaluated using the contrast option "Changepoint" [28].
The suitability of such a simple change point alternative in epidemiological exposure studies should be critically discussed and some real data examples tested. Clearly, such a change point test describes the exposureresponse of the population only. Further investigations are required to demonstrate that this simple approach can be utilized to estimate the center of the individuallevel change point distribution. Moreover, the above approach is not limited to change point alternatives: other trend alternatives, such as Williamstype trends [29], can be assumed as well.
Appendix A
Formulation contrast tests for the relative risk
The estimators for the relative risk (RR) of each exposure group versus unexposed (j = 1) are: The single contrast tests can be formulated for relative risks, for example for the reverse Helmert's contrast:
For general contrasts hold true
Appendix B
The ORIC approach for three binomials and the change point alternative. The three models are:
M _{0}{H _{0}: π _{1} = π _{2} = π _{3}}, M _{1} { : π _{1} = π _{2} <π _{3}}, M _{2} { : π _{1} <π _{2} = π _{3}}. The likelihood is . With the expected values π _{ j } and their crude estimators: , ,
The are the maximum likelihood estimates under the simple order restriction: . The likelihood for the nullmodel M_{0} is:
where provided w _{ j } = n _{ j }
The likelihood for the model M_{1} is:
where , for
and
The likelihood for the model M_{2} is:
where for
and
The modelspecific ORIC are: ORIC(M _{ r }) = log L ()  penalty(M _{ r }).
Where the penalty terms are
Withw(M _{0}) = n _{1.} + n _{2.} + n _{3.} w(M _{1}) = n _{1.} + n _{2.}, n _{3.} w(M _{2}) = n _{1.}, n _{2} + n _{3.}
Because P{1,1, w(M _{0})} = 1 ORIC(M _{0}) = L()  1
Because
Because
References
 1.
Lausen B, Lerche R, Schumacher M: Maximally selected rank statistics for doseresponse problems. Biometrical Journal 2002, 44:131–147.
 2.
Olsen JH, Nielsen A, Schulgen G: Residence near high voltage facilities and risk of cancer in children. British Medical Journal 1993, 307:891–895.
 3.
Costantino JP, Redmond CK, Bearden A: Occupationally related cancer risk among coke oven workers: 30 years of followup. Journal of Occupational and Environmental Medicine 1995, 37:597–604.
 4.
Hill AB: Principles of medical statistics. 9 Edition Oxford University Press, New York 1971, 309–323.
 5.
Agresti A, Coull BA: The analysis of contingency tables under inequality constraints. Journal of Statistical Planning and Inference 2002, 107:45–73.
 6.
Hothorn LA, Vaeth M, Hothorn T: Trend tests for the evaluation of doseresponse relationships in epidemiological exposure studies. [http://www.biostat.unihannover.de/report/ReportAarhus2003.pdf] Research Report 2003–3 Department of Biostatistics, University of Aarhus 2003.
 7.
Royston P, Ambler G, Sauerbrei W: The use of fractional polynomials to model continuous risk variables in epidemiology. International Journal of Epidemiology 1999, 28:964–974.
 8.
Dosemeci M, Benichou J: An alternative test for trend in exposureresponse analysis. J Expo Anal Environ Epidemiol 1998,8(1):9–15.
 9.
Jones AQ, Dewey CE, Dore K, Majowicz SE, McEwen SA, WaltnerToews D: Exposure assessment in investigations of waterborne illness: a quantitative estimate of measurement error. Epidemiologic Perspectives & Innovations 2006, 3:6.
 10.
Benavente Y, Garcia N, DomingoDomenech E: Regular use of hair dyes and risk of lymphoma in Spain. International Journal of Epidemiology 2005, 34:1118–1122.
 11.
Greenland S: Avoiding power loss associated with categorization and ordinal scores in doseresponse and trend analysis. Epidemiology 1995, 6:450–454.
 12.
Ascherio A, Weisskopf MG, O'Reilly EJ: Coffee consumption, gender, and Parkinson's disease mortality in the Cancer Prevention Study II cohort: The modifying effects of estrogen. American Journal of Epidemiology 2004, 160:977–84.
 13.
Grimsrud TK, Berge SR, Haldorsen T, Andersen A: Exposure to different forms of nickel and risk of lung cancer. American Journal of Epidemiology 2002, 156:1123–1132.
 14.
Anraku K: An information criterion for parameters under a simple order restriction. Biometrika 1999, 86:141–152.
 15.
Leuraud K, Benichou J: Tests for monotonic trend from casecontrol data: Cochranarmitagemantel trend test, isotonic regression and single and multiple contrast tests. Biometrical Journal 2004, 46:731–749.
 16.
Robertson T, Wright FT, Dykstra RL: Order restricted statistical inference. New York. Wiley 1988.
 17.
Bretz F, Hothorn LA: Detecting doseresponse using contrasts: asymptotic power and sample size determination for binomial data. Statistics in Medicine 2002, 21:3325–3335.
 18.
Hirotsu C, Marumo K: Changepoint analysis as a method for isotonic inference. Scandinavian Journal of Statistics 2002, 29:125–138.
 19.
PastorBarriuso R, Guallar E, Coresh J: Transition models for changepoint estimation in logistic regression. Statistics in Medicine 2003, 22:1141–1162.
 20.
Hirotsu C, Kuriki S, Hayter AJ: Multiple comparison procedures based on the maximal component of the cumulative chisquare statistic. Biometrika 1992, 79:381–392.
 21.
Neuhaeuser M, Hothorn LA: Trend tests for dichotomous endpoints with application in carcinogenicity studies. Drug Information Journal 1997, 30:463–469.
 22.
Stewart WH, Ruberg SJ: Detecting dose response with contrasts. Statistics in Medicine 2000, 19:913–921.
 23.
Breiman L: Bagging Predictors. Machine Learning 1996, 24:123–140.
 24.
R Development Core Team. R: [http://www.Rproject.org] A language and environment for statistical computing R Foundation for Statistical Computing, Vienna, Austria 2005.
 25.
Hothorn LA: Multiple comparisons and multiple contrasts in randomized doseresponse trials – confidence interval orient approaches. J Biopharm Stat 2006, 16:7111–731.
 26.
Schaarschmidt F, Sill M, Hothorn LA: Approximate simultaneous confidence intervals for multiple contrasts of binomial proportions. Biometrical Journal 2008, 50:782–792.
 27.
Hothorn T, Bretz F, Westfall P: Simultaneous inference in general parametric models. Biometrical Journal 2008, 50:346–363.
 28.
Bretz F, Hothorn T, Westfall P: On multiple comparisons in R. R News 2002, 3:14–17.
 29.
Williams DA: Comparison of several dose levels with a zero dose control. Biometrics 1972, 28:519–531.
Acknowledgements
This paper was worked out during the first author's sabbatical at the Department of Biostatistics of the University of Aarhus, under the financial support of the Volkswagen Stiftung grand no. II/78 956.
Author information
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
LAH adapted the multiple contrast tests on epidemiological casecontrol studies and performed part of the simulation study. MV selected, analyzed, and interpreted the epidemiological examples and designed the simulation study. TH developed the majority voting algorithm and wrote the R program.
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Hothorn, L.A., Vaeth, M. & Hothorn, T. Trend tests for the evaluation of exposureresponse relationships in epidemiological exposure studies. Epidemiol Perspect Innov 6, 1 (2009). https://doi.org/10.1186/1742557361
Received:
Accepted:
Published: