
    William DAUBERT, Joyce Daubert, individually and as Guardians Ad Litem for Jason Daubert, a minor; Anita De Young, individually, and as Guardian Ad Litem for Eric Schuller, Plaintiffs-Appellants, v. MERRELL DOW PHARMACEUTICALS, INC., a Delaware corporation, Defendant-Appellee.
    No. 90-55397.
    United States Court of Appeals, Ninth Circuit.
    Argued and Submitted March 22, 1994.
    Decided Jan. 4, 1995.
    
      Michael H. Gottesman, Georgetown Univ. Law Center, Washington, DC, Kenneth J. Chesebro, Cambridge, MA, Mary F. Gillick, Luce, Forward, Hamilton & Scripps, San Diego, CA, and Barry J. Nace, Paulson, Nace, Norwin & Sellinger, Washington, DC, for the plaintiffs-appellants.
    Charles Fried, Cambridge, MA, Hall R. Marston, George E. Berry, Dickson, Carlson & Campillo, Santa Monica, CA, and Joel I. Klein, and Richard G. Taranto, Washington, DC, for the defendant-appellee.
    Before: KOZINSKI and O’SCANNLAIN, Circuit Judges, and McNAMEE, District Judge.
    
      
      The Honorable Stephen M. McNamee, United States District Judge for the District of Arizona, sitting by designation.
    
   KOZINSKI, Circuit Judge.

On remand from the United States Supreme Court, we undertake “the task of ensuring that an expert’s testimony both rests on a reliable foundation and is relevant to the task at hand.” Daubert v. Merrell Dow Pharmaceuticals, Inc., — U.S. -, -, 113 S.Ct. 2786, 2799, 125 L.Ed.2d 469 (1993).

I

A. Background

Two minors brought suit against Merrell Dow Pharmaceuticals, claiming they suffered limb reduction birth defects because their mothers had taken Bendectin, a drug prescribed for morning sickness to about 17.5 million pregnant women in the United States between 1957 and 1982. See Resp’t’s Br. on Writ of Cert. at 2; Turpin v. Merrell Dow Pharmaceuticals, Inc., 959 F.2d 1349, 1350 (6th Cir.1992). This appeal deals with an evidentiary question: whether certain expert scientific testimony is admissible to prove that Bendectin caused the plaintiffs’ birth defects.

For the most part, we don’t know how birth defects come about. We do know they occur in 2-3% of births, whether or not the expectant mother has taken Bendectin. See Jose F. Cordero & Godfrey P. Oakley, Jr., Drug Exposure During Pregnancy: Some Epidemiologic Considerations, 26 Clinical Obstetrics & Gynecology 418, 424-25 (June 1983). Limb defects are even rarer, occurring in fewer than one birth out of every 1000. Turpin, 959 F.2d at 1353. But scientists simply do not know how teratogens (chemicals known to cause limb reduction defects) do their damage: They cannot reconstruct the biological chain of events that leads from an expectant mother’s ingestion of a teratogenic substance to the stunted development of a baby’s limbs. Nor do they know what it is about teratogens that causes them to have this effect. No doubt, someday we will have this knowledge, and then we will be able to tell precisely whether and how Bendectin (or any other suspected teratogen) interferes with limb development; in the current state of scientific knowledge, however, we are ignorant.

Not knowing the mechanism whereby a particular agent causes a particular effect is not always fatal to a plaintiff’s claim. Causation can be proved even when we don’t know precisely how the damage occurred, if there is sufficiently compelling proof that the agent must have caused the damage somehow. One method of proving causation in these circumstances is to use statistical evidence. If 50 people who eat at a restaurant one evening come down with food poisoning during the night, we can infer that the restaurant’s food probably contained something unwholesome, even if none of the dishes is available for analysis. This inference is based on the fact that, in our health-conscious society, it is highly unlikely that 50 people who have nothing in common except that they ate at the same restaurant would get food poisoning from independent sources.

It is by such means that plaintiffs here seek to establish that Bendectin is responsible for their injuries. They rely on the testimony of three groups of scientific experts. One group proposes to testify that there is a statistical link between the ingestion of Bendectin during pregnancy and limb reduction defects. These experts have not themselves conducted epidemiological (human statistical) studies on the effects of Bendectin; rather, they have reanalyzed studies published by other scientists, none of whom reported a statistical association between Bendectin and birth defects. Other experts proffered by plaintiffs propose to testify that Bendectin causes limb reduction defects in humans because it causes such defects in laboratory animals. A third group of experts sees a link between Bendectin and birth defects because Bendectin has a chemical structure that is similar to other drugs suspected of causing birth defects.

The opinions proffered by plaintiffs’ experts do not, to understate the point, reflect the consensus within the scientific community. The FDA — an agency not known for its promiscuity in approving drugs — continues to approve Bendectin for use by pregnant women because “available data do not demonstrate an association between birth defects and Bendectin.” U.S. Department of Health and Human Services News, No. P80-45 (Oct. 7, 1980). Every published study here and abroad — and there have been many — concludes that Bendectin is not a teratogen. Turpin, 959 F.2d at 1353-56. In fact, apart from the small but determined group of scientists testifying on behalf of the Bendectin plaintiffs in this and many other cases, there doesn’t appear to be a single scientist who has concluded that Bendectin causes limb reduction defects.

It is largely because the opinions proffered by plaintiffs’ experts run counter to the substantial consensus in the scientific community that we affirmed the district court’s grant of summary judgment the last time the case appeared before us. Daubert v. Merrell Dow Pharmaceuticals, Inc., 951 F.2d 1128, 1131 (9th Cir.1992). The standard for admissibility of expert testimony in this circuit at the time was the so-called Frye test: Scientific evidence was admissible if it was based on a scientific technique generally accepted as reliable within the scientific community. Frye v. United States, 293 F. 1013, 1014 (D.C.Cir.1923). We found that the district court properly applied this standard, and affirmed. The Supreme Court reversed, holding that Frye was superseded by Federal Rule of Evidence 702, — U.S. at -, 113 S.Ct. at 2794, and remanded for us to consider the admissibility of plaintiffs’ expert testimony under this new standard.

B. Procedural Issues

First, however, we address plaintiffs’ argument that we should simply remand the case so the district court can make the initial determination of admissibility under the new standard announced by the Supreme Court. There is certainly something to be said for this position, as the district court is charged with making the initial determination whether to admit evidence. In the peculiar circumstances of this case, however, we have determined that the interests of justice and judicial economy will best be served by deciding those issues that are properly before us and, in the process, offering guidance on the application of the Daubert standard in this circuit.

The district court already made a determination as to admissibility, albeit under a different standard than we apply on remand, and granted summary judgment based on its exclusion of plaintiffs’ expert testimony. Daubert v. Merrell Dow Pharmaceuticals, Inc., 727 F.Supp. 570, 575-76 (S.D.Cal.1989). A grant of summary judgment may be sustained on any basis supported by the record, Leonard v. Clark, 12 F.3d 885, 889 (9th Cir.1993), so we shall consider whether the district court’s grant of summary judgment can be sustained under the new standard announced by the Supreme Court. Our review here is, of course, very narrow: We will affirm the summary judgment only if, as a matter of law, the proffered evidence would have to be excluded at trial. The district court’s power is far broader; were we to conclude that the expert testimony is not per se inadmissible, the district court on remand would nevertheless have discretion to reject it under Rule 403 or 702. Daubert, — U.S. at -, 113 S.Ct. at 2798. Such a ruling would be reviewed under the deferential abuse of discretion standard.

One other procedural matter detains us. According to plaintiffs, they weren’t required to come forward with any evidence to survive summary judgment because the affidavit of Merrell’s expert was itself inadmissible under Daubert; the burden thus never shifted to plaintiffs to demonstrate a genuine issue as to causation. Plaintiffs not only fail to mention the many other exhibits offered by Merrell, they also misunderstand the moving party’s burden on summary judgment. Because plaintiffs bear the ultimate burden of proof on causation, Merrell had only to point to the absence of a genuine issue of material fact; it wasn’t required to produce any evidence at all. See Maffei v. Northern Insulation of New York, 12 F.3d 892, 899 (9th Cir.1993). Thus, the admissibility of Merrell’s expert’s affidavit is beside the point; the question is whether plaintiffs adduced enough admissible evidence to create a genuine issue of material fact as to whether Bendectin caused their injuries. See Elkins v. Richardson-Merrell, Inc., 8 F.3d 1068, 1071-72 (6th Cir.1993). It is to that question we now turn.

II

A. Brave New World

Federal judges ruling on the admissibility of expert scientific testimony face a far more complex and daunting task in a post-Daubert world than before. The judge’s task under Frye is relatively simple: to determine whether the method employed by the experts is generally accepted in the scientific community. Solomon, 753 F.2d at 1526. Under Daubert, we must engage in a difficult, two-part analysis. First, we must determine nothing less than whether the experts’ testimony reflects “scientific knowledge,” whether their findings are “derived by the scientific method,” and whether their work product amounts to “good science.” — U.S. at -, -, 113 S.Ct. at 2795, 2797. Second, we must ensure that the proposed expert testimony is “relevant to the task at hand,” id. at -, 113 S.Ct. at 2797, i.e., that it logically advances a material aspect of the proposing party’s case. The Supreme Court referred to this second prong of the analysis as the “fit” requirement. Id. at -, 113 S.Ct. at 2796.

The first prong of Daubert puts federal judges in an uncomfortable position. The question of admissibility only arises if it is first established that the individuals whose testimony is being proffered are experts in a particular scientific field; here, for example, the Supreme Court waxed eloquent on the impressive qualifications of plaintiffs’ experts. Id. at - n. 2, 113 S.Ct. at 2791 n. 2. Yet something doesn’t become “scientific knowledge” just because it’s uttered by a scientist; nor can an expert’s self-serving assertion that his conclusions were “derived by the scientific method” be deemed conclusive, else the Supreme Court’s opinion could have ended with footnote two. As we read the Supreme Court’s teaching in Daubert, therefore, though we are largely untrained in science and certainly no match for any of the witnesses whose testimony we are reviewing, it is our responsibility to determine whether those experts’ proposed testimony amounts to “scientific knowledge,” constitutes “good science,” and was “derived by the scientific method.”

The task before us is more daunting still when the dispute concerns matters at the very cutting edge of scientific research, where fact meets theory and certainty dissolves into probability. As the record in this case illustrates, scientists often have vigorous and sincere disagreements as to what research methodology is proper, what should be accepted as sufficient proof for the existence of a “fact,” and whether information derived by a particular method can tell us anything useful about the subject under study.

Our responsibility, then, unless we badly misread the Supreme Court’s opinion, is to resolve disputes among respected, well-credentialed scientists about matters squarely within their expertise, in areas where there is no scientific consensus as to what is and what is not “good science,” and occasionally to reject such expert testimony because it was not “derived by the scientific method.” Mindful of our position in the hierarchy of the federal judiciary, we take a deep breath and proceed with this heady task.

B. Deus ex Machina

The Supreme Court’s opinion in Daubert focuses closely on the language of Fed.R.Evid. 702, which permits opinion testimony by experts as to matters amounting to “scientific ... knowledge.” The Court recognized, however, that knowledge in this context does not mean absolute certainty. — U.S. at -, 113 S.Ct. at 2795. Rather, the Court said, “in order to qualify as ‘scientific knowledge,’ an inference or assertion must be derived by the scientific method.” Id. Elsewhere in its opinion, the Court noted that Rule 702 is satisfied where the proffered testimony is “based on scientifically valid principles.” Id. at -, 113 S.Ct. at 2799. Our task, then, is to analyze not what the experts say, but what basis they have for saying it.

Which raises the question: How do we figure out whether scientists have derived their findings through the scientific method or whether their testimony is based on scientifically valid principles? Each expert proffered by the plaintiffs assures us that he has “utiliz[ed] the type of data that is generally and reasonably relied upon by scientists” in the relevant field, see, e.g., Newman Aff. at 5, and that he has “utilized the methods and methodology that would generally and reasonably be accepted” by people who deal in these matters, see, e.g., Gross Aff. at 5. The Court held, however, that federal judges perform a “gatekeeping role,” Daubert, — U.S. at -, 113 S.Ct. at 2798; to do so they must satisfy themselves that scientific evidence meets a certain standard of reliability before it is admitted. This means that the expert’s bald assurance of validity is not enough. Rather, the party presenting the expert must show that the expert’s findings are based on sound science, and this will require some objective, independent validation of the expert’s methodology.

While declining to set forth a “definitive checklist or test,” id. at -, 113 S.Ct. at 2796, the Court did list several factors federal judges can consider in determining whether to admit expert scientific testimony under Fed.R.Evid. 702: whether the theory or technique employed by the expert is generally accepted in the scientific community; whether it’s been subjected to peer review and publication; whether it can be and has been tested; and whether the known or potential rate of error is acceptable. Id. at -, 113 S.Ct. at 2796-97. We read these factors as illustrative rather than exhaustive; similarly, we do not deem each of them to be equally applicable (or applicable at all) in every case. Rather, we read the Supreme Court as instructing us to determine whether the analysis undergirding the experts’ testimony falls within the range of accepted standards governing how scientists conduct their research and reach their conclusions.

One very significant fact to be considered is whether the experts are proposing to testify about matters growing naturally and directly out of research they have conducted independent of the litigation, or whether they have developed their opinions expressly for purposes of testifying. That an expert testifies for money does not necessarily cast doubt on the reliability of his testimony, as few experts appear in court merely as an eleemosynary gesture. But in determining whether proposed expert testimony amounts to good science, we may not ignore the fact that a scientist’s normal workplace is the lab or the field, not the courtroom or the lawyer’s office.

That an expert testifies based on research he has conducted independent of the litigation provides important, objective proof that the research comports with the dictates of good science. See Peter W. Huber, Galileo’s Revenge: Junk Science in the Courtroom 206-09 (1991) (describing how the prevalent practice of expert-shopping leads to bad science). For one thing, experts whose findings flow from existing research are less likely to have been biased toward a particular conclusion by the promise of remuneration; when an expert prepares reports and findings before being hired as a witness, that record will limit the degree to which he can tailor his testimony to serve a party’s interests. Then, too, independent research carries its own indicia of reliability, as it is conducted, so to speak, in the usual course of business and must normally satisfy a variety of standards to attract funding and institutional support. Finally, there is usually a limited number of scientists actively conducting research on the very subject that is germane to a particular case, which provides a natural constraint on parties’ ability to shop for experts who will come to the desired conclusion. That the testimony proffered by an expert is based directly on legitimate, preexisting research unrelated to the litigation provides the most persuasive basis for concluding that the opinions he expresses were “derived by the scientific method.”

We have examined carefully the affidavits proffered by plaintiffs’ experts, as well as the testimony from prior trials that plaintiffs have introduced in support of that testimony, and find that none of the experts based his testimony on preexisting or independent research. While plaintiffs’ scientists are all experts in their respective fields, none claims to have studied the effect of Bendectin on limb reduction defects before being hired to testify in this or related cases.

If the proffered expert testimony is not based on independent research, the party proffering it must come forward with other objective, verifiable evidence that the testimony is based on “scientifically valid principles.” One means of showing this is by proof that the research and analysis supporting the proffered conclusions have been subjected to normal scientific scrutiny through peer review and publication. Huber, Galileo’s Revenge at 209 (suggesting that “[t]he ultimate test of [a scientific expert’s] integrity is her readiness to publish and be damned”).

Peer review and publication do not, of course, guarantee that the conclusions reached are correct; much published scientific research is greeted with intense skepticism and is not borne out by further research. But the test under Daubert is not the correctness of the expert’s conclusions but the soundness of his methodology. See n. 11 infra. That the research is accepted for publication in a reputable scientific journal after being subjected to the usual rigors of peer review is a significant indication that it is taken seriously by other scientists, i.e., that it meets at least the minimal criteria of good science. Daubert, — U.S. at -, 113 S.Ct. at 2797 (“[S]crutiny of the scientific community is a component of ‘good science.’”). If nothing else, peer review and publication “increase the likelihood that substantive flaws in methodology will be detected.” Daubert, — U.S. at -, 113 S.Ct. at 2797.

Bendectin litigation has been pending in the courts for over a decade, yet the only review the plaintiffs’ experts’ work has received has been by judges and juries, and the only place their theories and studies have been published is in the pages of federal and state reporters. None of the plaintiffs’ experts has published his work on Bendectin in a scientific journal or solicited formal review by his colleagues. Despite the many years the controversy has been brewing, no one in the scientific community — except defendant’s experts — has deemed these studies worthy of verification, refutation or even comment. It’s as if there were a tacit understanding within the scientific community that what’s going on here is not science at all, but litigation.

Establishing that an expert’s proffered testimony grows out of pre-litigation research or that the expert’s research has been subjected to peer review are the two principal ways the proponent of expert testimony can show that the evidence satisfies the first prong of Rule 702. Where such evidence is unavailable, the proponent of expert scientific testimony may attempt to satisfy its burden through the testimony of its own experts. For such a showing to be sufficient, the experts must explain precisely how they went about reaching their conclusions and point to some objective source — a learned treatise, the policy statement of a professional association, a published article in a reputable scientific journal or the like — to show that they have followed the scientific method, as it is practiced by (at least) a recognized minority of scientists in their field. See United States v. Rincon, 28 F.3d 921, 924 (9th Cir.1994) (research must be described “in sufficient detail that the district court [can] determine if the research was scientifically valid”).

Plaintiffs have made no such showing. As noted above, plaintiffs rely entirely on the experts’ unadorned assertions that the methodology they employed comports with standard scientific procedures. In support of these assertions, plaintiffs offer only the trial and deposition testimony of these experts in other cases. While these materials indicate that plaintiffs’ experts have relied on animal studies, chemical structure analyses and epidemiological data, they neither explain the methodology the experts followed to reach their conclusions nor point to any external source to validate that methodology. We’ve been presented with only the experts’ qualifications, their conclusions and their assurances of reliability. Under Daubert, that’s not enough.

This is especially true of Dr. Palmer — the only expert willing to testify “that Bendectin did cause the limb defects in each of the children.” Palmer Aff. at 8. In support of this conclusion, Dr. Palmer asserts only that Bendectin is a teratogen and that he has examined the plaintiffs’ medical records, which apparently reveal the timing of their mothers’ ingestion of the drug. Dr. Palmer offers no tested or testable theory to explain how, from this limited information, he was able to eliminate all other potential causes of birth defects, nor does he explain how he alone can state as a fact that Bendectin caused plaintiffs’ injuries. We therefore agree with the Sixth Circuit’s observation that “Dr. Palmer does not testify on the basis of the collective view of his scientific discipline, nor does he take issue with his peers and explain the grounds for his differences. Indeed, no understandable scientific basis is stated. Personal opinion, not science, is testifying here.” Turpin, 959 F.2d at 1360. For this reason, Dr. Palmer’s testimony is inadmissible as a matter of law under Rule 702.

The failure to make any objective showing as to admissibility under the first prong of Rule 702 would also fatally undermine the testimony of plaintiffs’ other experts, but for the peculiar posture of this case. Plaintiffs submitted their experts’ affidavits while Frye was the law of the circuit and, although they’ve not requested an opportunity to augment their experts’ affidavits in light of Daubert, the interests of justice would be disserved by precluding plaintiffs from doing so. Given the opportunity to augment their original showing of admissibility, plaintiffs might be able to show that the methodology adopted by some of their experts is based on sound scientific principles. For instance, plaintiffs’ epidemiologists might validate their reanalyses by explaining why they chose only certain of the data that was available, or the experts relying on animal studies might point to some authority for extrapolating human causation from teratogenicity in animals.

Were this the only question before us, we would be inclined to remand to give plaintiffs an opportunity to submit additional proof that the scientific testimony they proffer was “derived by the scientific method.” Daubert, however, establishes two prongs to the Rule 702 admissibility inquiry. See pp. 1315-16 supra. We therefore consider whether the testimony satisfies the second prong of Rule 702: Would plaintiffs’ proffered scientific evidence “assist the trier of fact to ... determine a fact in issue”? Fed.R.Evid. 702.

C. No Visible Means of Support

In elucidating the second requirement of Rule 702, Daubert stressed the importance of the “fit” between the testimony and an issue in the case: “Rule 702’s ‘helpfulness’ standard requires a valid scientific connection to the pertinent inquiry as a precondition to admissibility.” — U.S. at -, 113 S.Ct. at 2796. Here, the pertinent inquiry is causation. In assessing whether the proffered expert testimony “will assist the trier of fact” in resolving this issue, we must look to the governing substantive standard, which in this case is supplied by California tort law.

Plaintiffs do not attempt to show causation directly; instead, they rely on experts who present circumstantial proof of causation. Plaintiffs’ experts testify that Bendectin is a teratogen because it causes birth defects when it is tested on animals, because it is similar in chemical structure to other suspected teratogens, and because statistical studies show that Bendectin use increases the risk of birth defects. Modern tort law permits such proof, but plaintiffs must nevertheless carry their traditional burden; they must prove that their injuries were the result of the accused cause and not some independent factor. In the case of birth defects, carrying this burden is made more difficult because we know that some defects — including limb reduction defects — occur even when expectant mothers do not take Bendectin, and that most birth defects occur for no known reason.

California tort law requires plaintiffs to show not merely that Bendectin increased the likelihood of injury, but that it more likely than not caused their injuries. See Jones v. Ortho Pharmaceutical Corp., 163 Cal.App.3d 396, 403, 209 Cal.Rptr. 456 (1985). In terms of statistical proof, this means that plaintiffs must establish not just that their mothers’ ingestion of Bendectin increased somewhat the likelihood of birth defects, but that it more than doubled it — only then can it be said that Bendectin is more likely than not the source of their injury. Because the background rate of limb reduction defects is one per thousand births, plaintiffs must show that among children of mothers who took Bendectin the incidence of such defects was more than two per thousand.

None of plaintiffs’ epidemiological experts claims that ingestion of Bendectin during pregnancy more than doubles the risk of birth defects. To evaluate the relationship between Bendectin and limb reduction defects, an epidemiologist would take a sample of the population and compare the frequency of birth defects in children whose mothers took Bendectin with the frequency of defects in children whose mothers did not. See DeLuca, 911 F.2d at 946. The ratio derived from this comparison would be an estimate of the “relative risk” associated with Bendectin. See generally Joseph L. Fleiss, Statistical Methods for Rates and Proportions (2d ed. 1981). For an epidemiological study to show causation under a preponderance standard, “the relative risk of limb reduction defects arising from the epidemiological data ... will, at a minimum, have to exceed ‘2’.” DeLuca, 911 F.2d at 958. That is, the study must show that children whose mothers took Bendectin are more than twice as likely to develop limb reduction birth defects as children whose mothers did not. While plaintiffs’ epidemiologists make vague assertions that there is a statistically significant relationship between Bendectin and birth defects, none states that the relative risk is greater than two. These studies thus would not be helpful, and indeed would only serve to confuse the jury, if offered to prove rather than refute causation. A relative risk of less than two may suggest teratogenicity, but it actually tends to disprove legal causation, as it shows that Bendectin does not double the likelihood of birth defects.
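The arithmetic behind the "more than double" threshold can be illustrated with a short sketch. It is offered only as an illustration and is not part of the opinion: the function names are hypothetical, the cohort counts in the usage note are invented, and the share formula (RR - 1) / RR assumes that the exposed and unexposed groups differ only in exposure, so that all excess incidence among the exposed is attributable to the drug.

```python
from fractions import Fraction


def relative_risk(exposed_cases, exposed_total, unexposed_cases, unexposed_total):
    """Incidence among the exposed divided by incidence among the unexposed."""
    exposed_rate = Fraction(exposed_cases, exposed_total)
    unexposed_rate = Fraction(unexposed_cases, unexposed_total)
    return exposed_rate / unexposed_rate


def attributable_share(rr):
    """Share of exposed cases attributable to the exposure: (RR - 1) / RR.

    Assumption (not from the opinion): the groups are otherwise comparable,
    so the background rate applies equally to exposed children.
    """
    rr = Fraction(rr)
    if rr <= 1:
        return Fraction(0)  # no excess incidence among the exposed
    return (rr - 1) / rr


def exceeds_preponderance(rr):
    """True only when the attributable share is above one half, i.e. RR > 2."""
    return attributable_share(rr) > Fraction(1, 2)
```

With a background rate of one defect per 1000 births, a hypothetical exposed cohort showing three per 1000 gives `relative_risk(3, 1000, 1, 1000) == 3` and an attributable share of 2/3. A cohort showing exactly two per 1000 gives a share of exactly 1/2, which does not exceed the preponderance line; this is why the relative risk must be greater than two, not merely equal to it.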

With the exception of Dr. Palmer, whose testimony is inadmissible under the first prong of the Rule 702 analysis, see p. 1319 supra, the remaining experts proffered by plaintiffs were equally unprepared to testify that Bendectin caused plaintiffs’ injuries; they were willing to testify only that Bendectin is “capable of causing” birth defects. Crescitelli Aff. at 3, 8; Glasser Aff. at 6, 8; Gross Aff. at 9; Newman Aff. at 5, 9; Swan Aff. at 7. Plaintiffs argue “these scientists use the words ‘capable of causing’ meaning that it does cause. This is an ambiguity of language.... If something is capable of causing damage in humans, it does.” Tape of Oral Arg. Mar. 22, 1994. But what plaintiffs must prove is not that Bendectin causes some birth defects, but that it caused their birth defects. To show this, plaintiffs’ experts would have had to testify either that Bendectin actually caused plaintiffs’ injuries (which they could not say) or that Bendectin more than doubled the likelihood of limb reduction birth defects (which they did not say).

As the district court properly found below, “the strongest inference to be drawn for plaintiffs based on the epidemiological evidence is that Bendectin could possibly have caused plaintiffs’ injuries.” 727 F.Supp. at 576. The same is true of the other testimony derived from animal studies and chemical structure analyses — these experts “testify to a possibility rather than a probability.” Turpin, 959 F.2d at 1360. Plaintiffs do not quantify this possibility, or otherwise indicate how their conclusions about causation should be weighted, even though the substantive legal standard has always required proof of causation by a preponderance of the evidence. Unlike these experts’ explanation of their methodology, this is not a shortcoming that could be corrected on remand; plaintiffs’ experts could augment their affidavits with independent proof that their methods were sound, but to augment the substantive testimony as to causation would require the experts to change their conclusions altogether. Any such tailoring of the experts’ conclusions would, at this stage of the proceedings, fatally undermine any attempt to show that these findings were “derived by the scientific method.” Plaintiffs’ experts must, therefore, stand by the conclusions they originally proffered, rendering their testimony inadmissible under the second prong of Fed.R.Evid. 702.

Conclusion

The district court’s grant of summary judgment is AFFIRMED. 
      
      . Limb reduction defects involve incomplete development of arms, legs, fingers and toes, such as the defects associated with the Thalidomide disaster of the 1960s.
     
      
      . We had adopted Frye as the law of the circuit in United States v. Solomon, 753 F.2d 1522, 1526 (9th Cir.1985).
     
      
      . These factors raise many questions, such as how do we determine whether the rate of error is acceptable, and by what standard? Or, what should we infer from the fact that the methodology has been tested, but only by the party’s own expert or experts? Do we ask whether the methodology they employ to test their methodology is itself methodologically sound? Such questions only underscore the basic problem, which is that we must devise standards for acceptability where respected scientists disagree on what’s acceptable.
     
      
      . Two of the four factors mentioned by the Supreme Court would be difficult or impossible to apply to the expert testimony in this case. Only one of plaintiffs’ experts has done original research. Dr. Crescitelli mentions that he “specifically performed studies” on Bendectin and its antihistamine component, Aff. at 3, but does not explain the nature of those studies or the methodology employed. The others have examined the available literature and studies within their respective fields and drawn different conclusions than the scientists who performed the original work. As to such derivative analytical work, it makes little sense to ask whether the technique employed “can be (and has been) tested,” Daubert, - U.S. at -, 113 S.Ct. at 2796, or what its “known or potential rate of error” might be, id. at -, 113 S.Ct. at 2797.
     
      
      . There are, of course, exceptions. Fingerprint analysis, voice recognition, DNA fingerprinting and a variety of other scientific endeavors closely tied to law enforcement may indeed have the courtroom as a principal theatre of operations. See, e.g., United States v. Chischilly, 30 F.3d 1144, 1153 (9th Cir.1994) (admitting expert testimony concerning a DNA match as proof the defendant committed sexual abuse and murder). As to such disciplines, the fact that the expert has developed an expertise principally for purposes of litigation will obviously not be a substantial consideration.
     
      
      . We refer, of course, to publication in a generally-recognized scientific journal that conditions publication on a bona fide process of peer review. See Daubert, - U.S. at -, 113 S.Ct. at 2797 (“The fact of publication (or lack thereof) in a peer-reviewed journal thus will be ... relevant....”) (emphasis added). See generally The Journal’s Peer-Review Process, 321 New Eng. J.Med. 837 (1989).
     
      
      . For instance, peer review might well have brought to light the more glaring arithmetical errors in the testimony presented by plaintiffs' experts in other Bendectin cases. See DeLuca v. Merrell Dow Pharmaceuticals, Inc., 791 F.Supp. 1042, 1048 (D.N.J.1992), aff'd, 6 F.3d 778 (3d Cir.1993).
     
      
      . As Judge Frank Johnson has succinctly noted, "the examination of a scientific study by a cadre of lawyers is not the same as its examination by others trained in the field of science or medicine." Perry v. United States, 755 F.2d 888, 892 (11th Cir.1985).
     
      
      . There may well be good reasons why a scientific study has not been published. For example, it may be too recent or of insufficiently broad interest. Daubert, - U.S. at -, 113 S.Ct. at 2797. These reasons do not apply here. Except with respect to the views expressed in this litigation, plaintiffs’ experts have been well-published, see, e.g., Crescitelli Aff. at 3 (authored 125 formal papers, 80-100 short notes or abstracts, a half-dozen reviews, and articles concerning antihistamines and related compounds), and the opinions they proffer, if supported by sound methodology, would doubtless be greedily devoured by the machinery of peer review. A conclusion that Bendectin causes birth defects would be of significant public interest both in this country (where millions of women have taken Bendectin and the FDA continues to approve its use) and abroad (where Bendectin is still widely used). That plaintiffs’ experts have been unable or unwilling to publish their work undermines plaintiffs’ claim that the findings these experts proffer are “ground[ed] in the methods and procedures of science” and “derived by the scientific method.” Daubert, - U.S. at -, -, 113 S.Ct. at 2795, 2796.
     
      
      . This showing would not, of course, be conclusive. Proffering scientific testimony and making an initial showing that it was derived by the scientific method enables a party to establish a prima facie case as to admissibility under Rule 702. The opposing party would then be entitled to challenge that showing. This it could do by presenting evidence (including expert testimony) that the proposing party’s expert employed unsound methodology or failed to assiduously follow an otherwise sound protocol. Where the opposing party thus raises a material dispute as to the admissibility of expert scientific evidence, the district court must hold an in limine hearing (a so-called Daubert hearing) to consider the conflicting evidence and make findings about the soundness and reliability of the methodology employed by the scientific experts. See Fed.R.Evid. 104(a) (“In making its determination [the court] is not bound by the rules of evidence.”); Fed.R.Evid. 706 (on the use of court-appointed experts).
     
      
      . This underscores the difference between Daubert and Frye. Under Frye, the party proffering scientific evidence had to show it was based on the method generally accepted in the scientific community. The focus under Daubert is on the reliability of the methodology, and in addressing that question the court and the parties are not limited to what is generally accepted; methods accepted by a minority in the scientific community may well be sufficient. However, the party proffering the evidence must explain the expert’s methodology and demonstrate in some objectively verifiable way that the expert has both chosen a reliable scientific method and followed it faithfully. Of course, the fact that one party’s experts use a methodology accepted by only a minority of scientists would be a proper basis for impeachment at trial.
     
      
      . Dr. Palmer could not similarly bolster his testimony. Unlike the other experts, who speak in terms of probabilities, Dr. Palmer goes so far as to conclude that plaintiffs' injuries were in fact caused by Bendectin rather than another cause. The record in this case categorically refutes the notion that anyone can tell what caused the birth defects in any given case. See p. 1313 supra.
      
     
      
      . No doubt, there will be unjust results under this substantive standard. If a drug increases the likelihood of birth defects, but doesn’t more than double it, some plaintiffs whose injuries are attributable to the drug will be unable to recover. There is a converse unfairness under a regime that allows recovery to everyone that may have been affected by the drug. Under this regime, all potential plaintiffs are entitled to recover, even though most will not have suffered an injury that can be attributed to the drug. One can conclude from this that unfairness is inevitable when our tools for detecting causation are imperfect and we must rely on probabilities rather than more direct proof. In any event, this is a matter to be sorted out by the states, whose substantive legal standards we are bound to apply. See O'Melveny & Myers v. FDIC, - U.S. -, -, 114 S.Ct. 2048, 2053, 129 L.Ed.2d 67 (1994).
     
      
      . The only exception is Dr. Done, who in another case presented metaanalysis studies purporting to show a relative risk greater than two. But his conclusion in that case rested on a demonstrably faulty methodology, see DeLuca, 791 F.Supp. at 1047-59, and perhaps for that reason was not proffered here.
     
      
      . For a more complete explanation of the relationship between the burden of proof and relative risk, see Robert P. Charrow & David E. Bernstein, Scientific Evidence in the Courtroom: Admissibility and Statistical Significance after Daubert 28-33 (Wash.Legal Found., 1994).
     
      
      . A statistical study showing a relative risk of less than two could be combined with other evidence to show it is more likely than not that the accused cause is responsible for a particular plaintiff’s injury. For example, a statistical study may show that a particular type of birth defect is associated with some unknown causes, as well as two known potential causes — e.g., smoking and drinking. If a study shows that the relative risk of injury for those who smoke is 1.5 as compared to the general population, while it is 1.8 for those who drink, a plaintiff who does not drink might be able to reanalyze the data to show that the study of smoking did not account for the effect of drinking on the incidence of birth defects in the general population. By making the appropriate comparison — between non-drinkers who smoke and non-drinkers who do not smoke — the teetotaller plaintiff might be able to show that the relative risk of smoking for her is greater than two. Here, however, plaintiffs’ experts did not seek to differentiate these plaintiffs from the subjects of the statistical studies. The studies must therefore stand or fall on their own.
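
      The arithmetic behind the smoking and drinking illustration can be sketched as follows. This is an editorial illustration only, not part of the record: every count is hypothetical, the function names are invented for the example, and the comparison group is simplified to the unexposed rather than the general population. The share of cases attributable to an exposure is computed with the standard epidemiological identity (RR - 1) / RR, which exceeds one half only when the relative risk exceeds two.

```python
def relative_risk(exposed_cases, exposed_total, unexposed_cases, unexposed_total):
    """Incidence among the exposed divided by incidence among the unexposed."""
    exposed_rate = exposed_cases / exposed_total
    unexposed_rate = unexposed_cases / unexposed_total
    return exposed_rate / unexposed_rate


def attributable_fraction(rr):
    """Share of cases among the exposed attributable to the exposure: (RR - 1) / RR."""
    return (rr - 1.0) / rr


def more_likely_than_not(rr):
    """True only when the attributable share exceeds one half, i.e. RR > 2."""
    return attributable_fraction(rr) > 0.5


# Hypothetical counts, chosen only to mirror the shape of the footnote's example.
# Pooled comparison: smokers against a comparison group that includes drinkers.
pooled_rr = relative_risk(30, 10_000, 20, 10_000)

# Restricted comparison: non-drinkers who smoke against non-drinkers who do not.
stratum_rr = relative_risk(25, 10_000, 10, 10_000)

for label, rr in (("pooled", pooled_rr), ("non-drinkers only", stratum_rr)):
    print(
        f"{label}: relative risk {rr:.2f}, "
        f"attributable share {attributable_fraction(rr):.0%}, "
        f"more likely than not: {more_likely_than_not(rr)}"
    )
```

      On these assumed figures the pooled relative risk falls below two while the restricted comparison rises above it, which is the kind of differentiation plaintiffs’ experts did not attempt here.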
     
      
      . The Supreme Court recognized that the “fit” requirement “goes primarily to relevance,” Daubert, — U.S. at -, 113 S.Ct. at 2795, but it obviously did not intend the second prong of Rule 702 to be merely a reiteration of the general relevancy requirement of Rule 402. In elucidating the “fit” requirement, the Supreme Court noted that scientific expert testimony carries special dangers to the fact-finding process because it “ ‘can be both powerful and quite misleading because of the difficulty in evaluating it.’ ” Id. at -, 113 S.Ct. at 2798 (quoting Weinstein, Rule 702 of the Federal Rules of Evidence Is Sound; It Should Not Be Amended, 138 F.R.D. 631, 632 (1991)). Federal judges must therefore exclude proffered scientific evidence under Rules 702 and 403 unless they are convinced that it speaks clearly and directly to an issue in dispute in the case, and that it will not mislead the jury.
     
      
      . Dr. Palmer’s testimony would easily meet Rule 702’s fit requirement, were it not rendered inadmissible by the total lack of scientific basis for his conclusions. See pp. 1319-20 & n. 12 supra. Dr. Palmer's testimony thus illustrates how the two prongs of Rule 702 work in tandem to ensure that junk science is kept out of the federal courtroom.
     
      
      . Several circuits have conducted a similar analysis in finding plaintiffs' expert testimony insufficient to prove causation as a matter of law. See Elkins, 8 F.3d at 1071-72; Turpin, 959 F.2d at 1359-61; Ealy v. Richardson-Merrell, Inc., 897 F.2d 1159, 1163 (D.C.Cir.1990); Brock v. Merrell Dow Pharmaceuticals, Inc., 874 F.2d 307, 311-15 (5th Cir.1989); Lynch v. Merrell-Nat’l Labs., 830 F.2d 1190, 1195-97 (1st Cir.1987).
     