|
|
||||||||
EXCLUDING SCIENCE FROM THE COURTROOM |
Correspondence: Requests for reprints should be sent to Ronald L. Melnick, National Institute for Environmental Health Sciences, P.O. Box 12233, Research Triangle Park, NC 27709 (e-mail: melnickr{at}niehs.nih.gov).
| ABSTRACT |
|---|
|
|
|---|
In the US Supreme Courts Daubert v Merrell Dow Pharmaceuticals, Inc decision, federal judges were directed to examine the scientific method underlying expert evidence and admit that which is scientifically reliable and relevant.
However, if a judge does not have adequate training or experience in dealing with scientific uncertainty, understand the full value or limit of currently used methodologies, or recognize hidden assumptions, misrepresentations of scientific data, or the strengths of scientific inferences, he or she may reach an incorrect decision on the reliability and relevance of evidence linking environmental factors to human disease.
This could lead to the unfair exclusion of valid scientific evidence, particularly that which is essential to a plaintiffs case in toxic tort litigation.
| INTRODUCTION |
|---|
|
|
|---|
The case of Daubert v Merrell Dow Pharmaceuticals, Inc concerned whether or not Bendectin can cause birth defects in humans. The district court maintained that expert testimony based on in vitro and live animal studies, pharmacological similarities between Bendectin and other substances known to cause birth defects, and unpublished reanalyses of negative epidemiological studies on Bendectin were inadmissible evidence of causality.
What is reliable and therefore admissible scientific evidence? According to the Daubert opinion it is the following: (1) evidence based on a testable theory or technique; (2) the theory or technique has been peer reviewed; (3) the technique has a known error rate; and (4) there is general acceptance of the underlying science. Because there are no clear guidelines on how to objectively determine scientific validity, judges may make decisions based on their own values and preconceived notions.4 The criteria for admissible evidence indicated in the Daubert decision can be met without achieving scientific validity and, conversely, validity may exist without meeting these criteria.5
The burden on the judge is considerable because failure to fully understand the scientific issues or to distinguish reliable from unreliable testimony could result in a decision whereby juries would not hear expert witnesses present relevant, reliable, and legitimate evidence. The decision not to admit expert testimony by judges, who Chief Justice Rehnquist labeled "amateur scientists,"6 could lead to the exclusion of scientific evidence essential to a plaintiffs claims in toxic tort litigation. Thus, a Daubert motion provides a special opportunity for defendants to exclude incriminating evidence from a court proceeding.
Scientific data relevant to human health effects come in many different forms (e.g., clinical trials, epidemiological studies in humans, controlled studies in experimental animals, or laboratory studies in in vitro systems), which have strengths and limitations. Understanding the relevance and reliability of the diverse experimental approaches and findings generally depends on how the study was designed, how the data were collected, analyzed, and evaluated, and the different perspectives put forward by experts in multiple disciplines (e.g., epidemiology, toxicology, pathology, medical cellular/molecular biology, chemistry, statistics, biomathematical modeling, etc.). It is unrealistic to expect a judge untrained in these areas to understand all of the underlying issues that might impact the validity and relevance of data from each of these disciplines with respect to determining human health risks.
Weak experimental designs and methods bias data used to render interpretations of human health risk toward not finding a risk even if a risk exists. For example, an insensitive analytical method that does not detect an agent in the environment or an insensitive or inadequately designed study that does not achieve statistical significance for an adverse effect would not necessarily mean that the agent was not present in the environment or that risk does not exist. A laboratory study with animal group sizes of 2050 usually cannot detect a significant risk of 10% or less; a study in 10 healthy adult male volunteers is not only insensitive but also provides little information about risks to children, the elderly, women, susceptible subgroups, or those exposed simultaneously to other toxic agents; an under-powered cancer epidemiological study cannot rule out the possibility of increased cancer risk in exposed populations. For these reasons and more, health agencies develop guidelines for judging the adequacy of experimental data for evaluating human health risks. For example, to reach the conclusion that human evidence suggests the lack of carcinogenicity, the World Health Organizations International Agency for Research on Cancer7 requires data from multiple, mutually consistent, adequately powered epidemiological studies covering the full range of human exposures that exclude with reasonable certainty bias, confounding, and chance as well as provide individual and pooled estimates of risk near unity with narrow confidence intervals. In addition, the International Agency for Research on Cancer cautions that "latent periods substantially shorter than 30 years cannot provide evidence for lack of carcinogenicity."7
Unless a judge has had specific training in the multiple scientific and engineering fields relevant to the expert testimony being presented, it is unlikely that he or she alone would be able to recognize all the biases or hidden assumptions that could render evidence or counterarguments suspect or invalid. If the issues were truly that clear, there would not be disagreement and debate among health scientists concerning their opinions on the relative influence of complex interactions of environmental, genetic, medical, and lifestyle factors on the health of individuals and the public. Although not unexpected, it is generally scientists representing the interests of industry who overstate matters of controversy by downplaying the value of scientific evidence that was not obtained from human exposure studies and by exaggerating the possible role of confounders in epidemiological studies that show positive associations.8 In order to render a valid decision, it is essential that the judge be able to recognize exaggerations or misrepresentations of scientific controversy. By disallowing testimony of the plaintiffs witnesses who offer opinions within the boundaries of normal scientific debate, the judge has essentially interjected his or her bias into complex environmental health issues that may not be resolved in the scientific community.
A judge who does not fully understand critical aspects of scientific methodology and interpretation of data may dismiss the evidence of expert witnesses who provide opinions based on methods well established and commonly accepted in the scientific and health communities. Whereas some judges may have claimed that results from animal studies cannot be extrapolated to humans,9 this opinion is contrary to the positions of all public health agencies, both national and international. For example, the preamble to all International Agency for Research on Cancer Monographs on the Evaluation of Carcinogenic Risks to Humans states that "in the absence of adequate data on humans, it is biologically plausible and prudent to regard agents and mixtures for which there is sufficient evidence of carcinogenicity in experimental animals as if they presented a carcinogenic risk to humans."7 The latter view is based on the fact that all known human carcinogens that have been adequately tested have produced significant carcinogenic effects in animal models.10 Rodents have been widely used as models for humans in toxicity and drug safety tests because of the trans-species similarities in physiological and biochemical processes. The scientific community considers animal data to be relevant and reliable for human risk assessments, even though most animal studies are performed at exposures higher than those to which humans are exposed in the environment. Concerning human exposures in the workplace, many bioassays have been conducted at similar exposure levels.
Low-dose extrapolations from experimental studies in animals to human exposure levels are necessary because the lower limit of detection of additional risk in animals, which is approximately 10% above the background rate, is an unacceptable level of risk for humans. Dose-response analyses of experimental data are critical for estimating human risk at environmental or occupational exposures. Without adequate familiarity with experimental designs, data analyses, and methods for evaluating human risk, a judges pretrial Daubert decision could well lead to a jury being denied hearing "reliable and relevant" evidence from knowledgeable experts; such a decision usurps the jurys role of assessing the validity of scientific testimony and determining whether opinions are plausible.5,11
Because the defendant would certainly hire scientists from multiple disciplines with biases favoring their positions, the cross-examination process and presentation of relevant contrary evidence or opinions by the defendants experts to the jury is a fairer way of revealing testimony that is reliable and credible. The dismissal of expert testimony prior to a trial based on the possible mistaken perceptions of a trial judge is inconsistent with our national principle of equal and impartial justice for all citizens. Consider the situation in which a Daubert motion is made and by which a judge decides that the plaintiffs expert witnesses in a toxic tort case are relevant and reliable. Shouldnt the judge automatically then determine that the opinions of the defendants expert witnesses (those who would claim that the defendants products or agents released into the environment are not harmful to human health) are also reliable and relevant? If the latter opinions are not found to be reliable, then presumably the judge would exclude their testimony as evidence to be presented before a jury. Such a decision is theoretically possible, but would likely be challenged for not allowing the defendants the means of obtaining witnesses in their favor. Likewise, not allowing the plaintiff the right to a trial by jury is unfair to the party in a toxic tort case that is seeking compensation for its injuries. Why shouldnt a jury be allowed to hear all of the relevant scientific evidence and opinions regarding adverse health effects that may result when a company pollutes the environment or workplaces of US citizens? Because the plaintiffs carry the burden of proof in toxic tort litigation, dismissal of expert testimony affects plaintiffs more than it does defendants. Thus, the application of Daubert in jury trials tips the scales of justice strongly in favor of defendants, who may have adversely affected the health of others through negligent or irresponsible emission or manufacture of harmful agents.
| INCOMPLETE SCIENCE DOES NOT JUSTIFY EXCLUDING EVIDENCE |
|---|
|
|
|---|
National and international agencies that provide evaluations of human health risks do not rely solely on associations observed in epidemiological studies. Most often, no adequate studies have been performed, especially on newly introduced chemicals. Rather, expert multidisciplinary panels use all of the available and relevant scientific evidence in reaching their overall conclusions. Interestingly, regarding dioxin and ethylene oxide, the International Agency for Research on Cancer13,14 and the National Toxicology Program15 have determined that both of these chemicals are known human carcinogens (i.e., sufficient evidence exists that there is a causal relationship between exposure to the agent and human cancer), although there was less than compelling evidence of carcinogenicity from studies in humans. Evaluating each piece of evidence separately, as might occur in a Daubert decision, could often lead to incorrect judgments of causality of human disease.16
For most toxic agents, reliable epidemiological evidence is not available. Protection of public health is based on primary prevention and acting on warning signals from all relevant sources of information.17 By reducing or eliminating exposure to cancer suspect agents, we may thankfully never see enough cancer patients to confirm their carcinogenicity. The alternative of waiting for dead bodies to appear before taking any preventative action has been referred to as the "body in the morgue approach."18
Most scientific interpretations related to health risks are based on a variety of assumptions; some are explicit, whereas others are frequently based on the different ways in which individuals evaluate available evidence and consider alternative explanations. It is virtually impossible to state with absolute certainty that an individuals disease condition was due solely to a specific exposure; likewise, it is impossible to state with absolute certainty that a past exposure to a particular toxic agent did not contribute in some way to that disease. If a judge requires nearly absolute certainty of causation, then he or she has raised the standard of proof for plaintiffs in such toxic tort cases to a nearly unachievable level. Recognizing the difficulty in drawing conclusions from epidemiological studies, Sir Bradford Hill developed a series of widely used criteria for determining causality in cancer epidemiology.19 On the issue of making health-based decisions with incomplete evidence, Hill noted "all scientific work is incompletewhether it is observational or experimental. All scientific work is liable to be upset or modified by advancing knowledge. That does not confer upon us a freedom to ignore the knowledge we already have, or to postpone the action that it appears to demand at a given time."
Because of uncertainties or lack of complete information on disease processes and how intrinsic and extrinsic factors may be involved, it is not possible to estimate precisely the level of human health risk from experimental toxicity data. Although many industrial chemicals have been studied for toxic effects in animals, no toxicological information is available for the majority of chemicals. Also, although new mechanistic insights on disease processes are advancing daily, there is still much to be learned on how environmental factors, human variability (e.g., genetics, gender, age, exposure to other agents), and lifestyle factors (e.g., diet, exercise, alcohol consumption) interact to influence the likelihood of disease outcome. Exposure issues such as timing, duration, frequency, and intensity, as well as exposures to other agents and latency for clinical manifestation of the disease (e.g., cancer latency may be as long as 2040 years) also impact on evaluations of disease-exposure relationships, yet precise information on these factors are not always available for exposed populations. Because of uncertainties, scientists may come to different conclusions on the relevance of specific findings to disease causation. With appropriate hypothesis testing, knowledge gained can reduce uncertainty. However, even with additional study, it is unlikely that we will know completely the mechanisms of disease causation by environmental agents and thus prove that a specific exposure was the sole cause of an individuals diseased condition. The fact that uncertainty exists does not mean that valid evidence cannot establish realistic links between exposure and disease causality.
| HEALTH-PROTECTIVE DECISIONS PREVENT NEEDLESS SUFFERING |
|---|
|
|
|---|
In some instances, judges have excluded epidemiological evidence that shows a statistically significant increase in risk when those studies did not demonstrate increased risks greater than a doubling (relative risk of 2.0) in exposed populations. The reasoning behind this legal threshold is that if the relative risk in an exposed population is greater than 2.0, then for an exposed individual, disease causality is more likely than not to have been due to that exposure, that is, it exceeds 50% for exposed individuals. However, this judgment fails to recognize that risk probabilities are underestimated for exposures that accelerate the time of disease occurrence,26 that is, the time until cancers are detected is reduced in the exposed population. Several additional flaws in such rulings are also noticeable. First, for agents that are prevalent in the environment, human exposure may occur at multiple locations or sources (e.g., environmental tobacco smoke, drinking water disinfection byproducts, benzene); consequently, there is no truly unexposed reference population. Thus, the contribution of that agent to the disease rate in the reference population will result in an underestimation of relative risk in the exposed population. Second, if the relative risk estimates were obtained from occupational exposure studies, then the "healthy worker effect" may lead to underestimates of risk when disease rates are compared to the general (less healthy) population. Because of the healthy worker effect, risk for all causes of death is less than 1.0. Consequently, meaningful estimates of relative risk in workers compared with the general population need to be adjusted upwards, or relative risk estimates must be based on incidence in exposed workers versus unexposed workers; but, even then, "unexposed" workers often means "less exposed." Third, because risks are not uniformly distributed in exposed populations, a risk much greater than 2 may exist for various susceptible subgroups, even though the overall risk is 2.0 or less. For example, interindividual variability in the probability of disease causation may occur because of differences in the magnitude, frequency, and duration of exposure; genetic differences that influence how individuals metabolize the agent, produce or eliminate reactive metabolites, repair genetic damage, or predispose an individual to a disease; exposure to other agents (e.g., pharmaceuticals or occupational or consumer chemicals) that affect the behavior of the agent of concern in individuals; differences in health status (e.g., pre-existing disease, immune-system deficiency); and other age- and gender-related differences. Because of the complex nature and multiple interactions among risk factors, an individuals risk cannot be estimated from epidemiological data alone. Focusing on a value of 2.0 as a measure of the relative risk in an exposed population rather than analyzing all of the data that contribute to the risk estimate is an irrational and inappropriate way to judge causality in an exposed individual. Requiring a relative risk greater than 2.0 is not a valid reason for dismissing pertinent evidence relevant to an individuals claim of lost years of healthy life.
| THE DATA QUALITY ACT SUPPRESSES SCIENTIFIC EVIDENCE |
|---|
|
|
|---|
| CONCLUSIONS |
|---|
|
|
|---|
| Acknowledgments |
|---|
| Footnotes |
|---|
The author is with the Environmental Toxicology Program, National Institute of Environmental Health Sciences, National Institutes of Health, Bethesda, Md.
Note. This commentary does not represent a view of the National Institute of Environmental Health Sciences, the National Institutes of Health, or the Department of Health and Human Services.
Accepted for publication July 27, 2004.
| References |
|---|
|
|
|---|
2. Cranor C. Scientific inferences in the laboratory and the law. Am J Public Health. 2005;95:S121S128.
3. Wagner W. The perils of relying on interested parties to evaluate scientific quality. Am J Public Health. 2005; 95:S99S106.
4. Cecil JS. Ten years of judicial gatekeeping under Daubert. Am J Public Health. 2005;95:S74S80
5. Jasanoff S. Laws knowledge: science for justice in legal settings. Am J Public Health. 2005;95:S49S58.
6. Haack S. Trial and error: the Supreme Courts philosophy of science. Am J Public Health. 2005;95:S66S73.
7. International Agency for Research on Cancer. Some traditional herbal medicines, some mycotoxins, naphthalene and styrene. IARC Monogr Eval Carcinog Risks Hum. 2002;82:1556.[Medline]
8. Huff J. IARC monographs, industry influence, and upgrading, downgrading, and under-grading chemicals: a personal point of view. Int J Occup Environ Health. 2002;8:249270.[Web of Science][Medline]
9. Berger MA. What has a decade of Daubert wrought? Am J Public Health. 2005;95:S59S65.
10. Tomatis L, Melnick RL, Haseman J, Barrett JC, Huff J. Alleged misconceptions distort perceptions of environmental cancer risks. FASEB J. 2001;15: 195203.
11. Lakoff G. A cognitive scientist looks at Daubert. Am J Public Health. 2005;95:S114S120.
12. Melnick RL. Carcinogenicity and mechanistic insights on the behavior of epoxides and epoxide-forming chemicals. Ann N Y Acad Sci. 2002;982: 177189.[Web of Science][Medline]
13. International Agency for Research on Cancer. 2,3,7,8-Tetrachlorodibenzopara-dioxin. IARC Monogr Eval Carcinog Risks Hum. 1997;69:33343.[Medline]
14. International Agency for Research on Cancer. Ethylene oxide. IARC Monogr Eval Carcinog Risks Hum. 1994; 60:73159.[Medline]
15. National Toxicology Program. Report on Carcinogens. Carcinogen Profiles 2002. 10th ed. Research Triangle Park, NC: US Department of Health and Human Services; 2002.
16. Krimsky S. The weight of scientific evidence in policy and law. Am J Public Health. 2005;95:S129S136.
17. Tomatis L, Huff J, Hertz-Picciotto I, et al. Avoided and avoidable risks of cancer. Carcinogenesis. 1997;18: 97105.
18. Moure-Eraso R. Primary prevention and precaution in hazard identification in the NIEHS/NTP: body in the morgue approach. Public Health Rep. 2002;117:564573.[CrossRef][Web of Science][Medline]
19. Hill AB. The environment and disease: association or causation? Proc R Soc Med. 1965;58:295300.[Web of Science][Medline]
20. Huff J. Chemicals and cancer in humans: first evidence in experimental animals. Environ Health Perspect. 1993; 100:201210.[Web of Science][Medline]
21. Occupational Safety and Health Administration. Occupational exposure to 1,3-butadiene; Final rule. Fed Reg. 1996;61:5674656856.
22. National Toxicology Program. NTP Toxicology and Carcinogenesis Studies of 1,3-Butadiene (CAS No. 106-99-0) in B6C3F1 Mice (Inhalation Studies). Natl Toxicol Program Tech Rep Ser. 1984; 288:1111.[Medline]
23. Huff JE, Melnick RL, Solleveld HA, Haseman JK, Powers M, Miller RA. Multiple organ carcinogenicity of 1,3-butadiene in B6C3F1 mice after 60 weeks of inhalation exposure. Science. 1985; 227:548549.
24. Macaluso M, Larson R, Delzell E, et al. Leukemia and cumulative exposure to butadiene, styrene and benzene among workers in the synthetic rubber industry. Toxicology. 1996;113: 190202.[CrossRef][Web of Science][Medline]
25. Melnick RL, Huff J, Chou BJ, Miller RA. Carcinogenicity of 1,3-butadiene in C57BL/6 x C3H F1 mice at low exposure concentrations. Cancer Res. 1990; 50:65926599.
26. Greenland S. Relation of probability of causation to relative risk and doubling dose: a methodologic error that has become a social problem. Am J Public Health. 1999;89:11661169.
27. Hayes T, Haston K, Tsui M, Hoang A, Haeffele C, Vonk A. Herbicides: feminization of male frogs in the wild. Nature. 2002;419:895896.[CrossRef][Medline]
28. Hayes TB, Collins A, Lee M, et al. Hermaphroditic, demasculinized frogs after exposure to the herbicide atrazine at low ecologically relevant doses. Proc Natl Acad Sci USA. 2002;99: 54765480.
29. Sass J, Devine J. The Center for Regulatory Effectiveness invokes the Data Quality Act to reject published studies on atrazine toxicity. Environ Health Perspect. 2004;112:A18.[Web of Science][Medline]
30. Kavlock RJ, Daston GP, DeRosa C, et al. Research needs for the risk assessment of health and environmental effects of endocrine disruptors: a report of the US EPA-sponsored workshop. Environ Health Perspect. 1996;104(suppl 4): 715740.
31. Stoker TE, Laws SC, Guidici DL, Cooper RL. The effect of atrazine on puberty in male Wistar rats: an evaluation in the protocol for the assessment of pubertal development and thyroid function. Toxicol Sci. 2000;58:5059.
32. Friedmann AS. Atrazine inhibition of testosterone production in rat males following peripubertal exposure. Reprod Toxicol. 2002;16:275279.[CrossRef][Web of Science][Medline]
This article has been cited by other articles:
![]() |
E. Bingham, L. Boden, R. Clapp, P. Hoppin, S. Krimsky, D. Michaels, D. Ozonoff, and A. Robbins BINGHAM ET AL. RESPOND Am J Public Health, February 1, 2006; 96(2): 206 - 207. [Full Text] [PDF] |
||||
![]() |
D. Michaels Scientific Evidence and Public Policy Am J Public Health, July 1, 2005; 95(S1): S5 - S7. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |