|
|
||||||||
RESEARCH AND PRACTICE |
Gabriel M. Leung, Tai-Hing Lam, Thuan Q. Thach, and Anthony J. Hedley are with the Department of Community Medicine, University of Hong Kong Medical Centre, Hong Kong.
Correspondence: Requests for reprints should be sent to G. M. Leung, MD, MPH, Department of Community Medicine, 21 Sassoon Road, Patrick Manson Building, University of Hong Kong, Pokfulam, Hong Kong (e-mail: gmleung{at}hku.hk).
| ABSTRACT |
|---|
|
|
|---|
Objectives. We sought to systematically review the evidence for population-based mammography as applied to a Chinese population.
Methods. Primary reports for meta-analysis were identified by a search of MEDLINE and the Cochrane Library. Outcome measures included breast cancerrelated mortality, the number needed to be screened to prevent 1 death, and the positive predictive value of mammography.
Results. Pooled relative risk for breast cancerrelated death in the screened group was 0.80 (95% confidence interval = 0.71, 0.90). Applied to Hong Kong, this figure translates into a number needed to screen of 1 302 healthy women screened annually for 10 years to prevent 1 death. Conclusions. Evidence is insufficient to justify population-based breast cancer screening by mammography for women in Hong Kong and other Asian populations with low breast cancer prevalence.
| INTRODUCTION |
|---|
|
|
|---|
We have reviewed the evidence for population-based screening for early breast cancer detection and examined the applicability of these results to a Chinese population. We first performed an updated systematic review and meta-analysis of clinical trials evaluating mammography screening. We then applied these results to women in Hong Kong, where 95% of the resident population are ethnic Chinese.4 We restricted our attention to mass screening at the population level, rather than issues relating to opportunistic screening or case finding. We also only examined this issue in women aged at least 50 years, given the lack of proven efficacy in screening women younger than 50 years, even in Western populations.
| METHODS |
|---|
|
|
|---|
Two authors (G. M. L. and T. Q. T.) independently reviewed these 88 papers to identify studies that met eligibility criteria for inclusion in the meta-analysis. Inclusion criteria required: (1) an experimental RCT design that measured clinical endpoints, including breast cancerrelated mortality; (2) a duration of follow-up lasting 5 years or more and a minimum of 10 breast cancer deaths; and (3) the reporting of relative risk (RR) or an odds ratio (OR) with 95% confidence intervals (CI), or data that would allow their calculation. We excluded studies that involved exclusively women younger than 50 years at the time of recruitment, were published in languages other than English or only in abstract form, or that enrolled primarily symptomatic patients with breast lumps, pain, nipple discharge, or enlarged lymph nodes.
Of 24 references that satisfied these criteria, we excluded 16 because of multiple publications from the same study. Thus, 8 articles from 7 studies were finally selected for quantitative pooling of results.512 The same 2 authors separately abstracted the relevant data. Disagreement was settled by consensus. For studies that reported more than 1 set of results due to updating of findings from longer periods of follow-up, we used the latest results published. A meta-analysis was performed using DerSimonian and Lairds13 random effects model, Greenlands14 fixed effects model, and Petos15 assumption-free method.
A test for heterogeneity was performed. Because the power of statistical tests of heterogeneity is low, a relatively high critical value for P of 0.2 was selected a priori to avoid underestimating the presence of heterogeneity.
| RESULTS |
|---|
|
|
|---|
|
|
Geo-ethnic differences in breast cancer epidemiology.
Figure 2
shows the estimated age-specific incidences of breast cancer for the age group 5069,19 comparing the rates of Asian regions with those of White populations from Canada, Scotland, Sweden, and the United States, where the primary studies were performed. There is a 1- to 2-fold difference in breast cancer risk between Asian women and North Americans and Europeans, on whom the original screening trials were carried out.19 These differences indicate that the positive predictive value (PPV) of any test, which depends on the underlying disease rates in the screened population, will be much lower in Asian regions than in the West.
|
In Hong Kong, there were 868 new cases of breast cancer for women aged 50 years or more in 1996, an incidence of 123.3 per 100 000.24 From the International Agency for Research on Cancers GLOBOCAN database, we obtained corresponding rates per 100 000 of 361.6 (White) and 283.5 (Black), 299.5, 277.4, and 255.1 for the United States, Canada, Sweden, and the UK, respectively19 (arithmetic mean = 295.4 per 100 000). Assuming a linear relationship between prevalence ratio (P) and incidence rate (I), as per the formula D/(ND)
I, where N is the size of the population at risk for the disease, D the subset of the population with the disease, and D/(ND) is the prevalence ratio,25 and a consistent relationship across different population groups (Western and Chinese women), we estimated Hong Kongs breast cancer prevalence at screening to be around 146.1 per 100 000, according to the formula PWest/IWest = PHK/IHK.
Using the optimal ranges of sensitivity (83%95%) and specificity (93.5%99.1%) reported in a recent meta-analysis,21 the PPV for Hong Kong women ranges from 1.8% to 13.4%, assuming regular annual screening in those aged 50 years or more. So we would expect at least 86%, and as many as 98%, of positive screens to be false positives. In practice, we anticipate that the accuracy would be lower than these quoted ranges from trials. Because the average Chinese breast has a smaller volume (224.5 cm3 vs 585.1 cm3 in British women) and is denser with less fat content compared with Caucasians breasts,26,27 we believe that the true sensitivity and specificity in Hong Kong would correspond to the minimum values of these ranges.
Number needed to screen. The results of most clinical trials are presented as relative risk reductions but do not take into account the role of event rate or incidence of disease on overall clinical benefit. Statistical significance is often regarded as an index of clinical relevance, ignoring the effect of sample size on significance. Number needed to screen (NNS) is defined as the number of people that need to be screened to prevent 1 death or 1 adverse event.28 We calculated the NNS for breast cancerrelated mortality for screening, using Hong Kong women aged 50 years or more in 1996, in whom there were 270 breast cancerrelated deathsa mortality rate of 38.4 per 100 000.24,29 Applying the pooled relative risk reduction from our meta-analysis, we estimated the absolute risk reduction of breast cancerrelated mortality to be 0.106% over a screening period of 13.8 years and the NNS to be 1302 (95% CI = 898, 2604) women for 10 years to prevent 1 case of breast cancer death. This estimate compares unfavorably with the corresponding 10-year NNS of 666 in US women (the 1996 mortality rate for US women aged 50 years or more was 83.4 per 100 00030).
This extrapolation is valid only if we assume that the relative risk reduction observed in Western trials is independent of the baseline risk for developing breast cancer.31 Adjusting the NNS or number needed to treat for the baseline risk of the patient population implies, as underlined by Cook and Sackett,32 that the relative risk reduction is constant for all levels of disease incidence. Such an assumption may be valid for some diseases, such as hypertension treatment to prevent stroke,33 but not for others. For instance, in the overview of the effects of coronary artery bypass graft in patients with stable coronary artery disease, the relative risk reductions associated with surgery, instead of being equal to 39% in each third of risk, were around 45% in both the middle and highest thirds of risk, whereas there was a 17% relative risk increase in the lowest third.34 Therefore, there is a possibility that we may have underestimated the NNS to save 1 breast cancerrelated death.
We used cancer statistics from 1996 for breast cancer incidence and mortality because they were the latest and the highest age-standardized numbers available. There is currently no population-based screening program for breast cancer in Hong Kong, only periodically advertised services in isolated hospitals and laboratory facilities since 1993. The intervening 3 years would not have allowed sufficient time for any effects of sporadic screening to influence breast cancer population statistics.35 So the PPV and NNS results derived from the 1996 statistics would not have been affected by widespread opportunistic screening, and may be biased toward higher PPV and lower NNS estimates.
Natural history of breast cancer. A dramatic increase in the incidence of ductal carcinoma in situ (DCIS) in asymptomatic women has been reported, coinciding with the widespread adoption of screening mammography. In the United States, the total recorded number of DCIS cases in 1992 was 200% higher than expected, based on rates in 1983 when mammographic screening became widespread36 according to trends between 1973 and 1983. In Hong Kong, similar findings were made in an audit of a hospitals screening program.37 Of the asymptomatic, nonpalpable malignancies detected through mammography, 53% were DCIS, with the remainder being invasive cancer.37 Whereas early detection of invasive cancer has been shown to be beneficial in terms of survival, the prognostic value of DCIS detection is currently unknown. Increasing evidence suggests that most DCIS cases never progress to the invasive stage during a womans lifetime.3843 Earlier autopsy studies first proposed this view,4043 later supported by the large population-based Surveillance, Epidemiology, and End Results (SEER) program, in which the absolute risk of dying from breast cancer among women with DCIS was found to be very low (1.9% within 10 years).39 Miller et al.6 argued that the excellent survival (87.5% alive at 10 years) of women with DCIS and early impalpable invasive cancer in CNBSS II was almost certainly due to a combination of lead time and length bias. In the light of this, it is prudent to wait for more definitive evidence regarding improvements in survival from screening before recommending mass mammography in Hong Kong. For women who proceed with opportunistic screening in the meantime, we should at least inform them of the likelihood of being diagnosed with DCIS before mammographic examination, and that only some DCIS cases may be clinically significant.
Balance of benefit, harm and cost. The discussion of whether to screen cannot be limited to statistical validation of tests and histological staging of cancer types. We must also consider the harm inflicted by screening. Even though it is not known whether detection of DCIS actually contributes to lower mortality, it is clear that almost all women with mammographically detected DCIS undergo disfiguring operations. In the US SEER program, between 1983 and 1992, 43.8% of DCIS cases were treated with mastectomy and 53.3% with lumpectomy.36
High rates of false positivity are an important concern in screening mammography. Elmore et al.44 reported a 49.1% cumulative false-positive risk, and an 18.6% rate of biopsy in healthy women after 10 mammograms. They estimated that for every $100 spent on screening, an additional $33 was spent to evaluate these false-positive results. A positive screen inevitably leads to further confirmatory studies, ranging from a repeat mammogram to a fine-needle aspiration biopsy, or an open biopsy. The anxiety and psychological trauma associated with these interventions can be considerable.45
Women in a screening program deserve to know the complication rates of between 8%10%4648 for fine needle aspiration and open biopsy, because many will be subject to these investigations, given their low PPV of 1.8% to 13.4%. Complications ranged from prolonged bleeding and hematoma formation, to abscess and wound dishesion.46,48
The net balance of benefit, harm, and cost can be summarized for every 100 000 Hong Kong Chinese women aged 50 years or more screened annually for 10 years (Figure 3
). With the best-case scenario of a sensitivity of 95% and specificity of 99.1% and a prevalence at screening of 146.1 per 100 000, we would expect 10 370 "positive" results, 8980 of which would be false-positive cases. Biopsies for 18.6%44 of women with false positives with a complication rate of 8%4648 would lead to an avoidable iatrogenic adverse event in 134; however, according to our 10-year NNS estimate, fewer than 77 breast cancerrelated deaths would have been avoided, even assuming optimal trial conditions with 100% uptake and follow-up. In practice, the number of lives saved as a result of screening is likely to be much smaller where uptake by and follow-up of women are often less than optimal.
|
| CONCLUSIONS |
|---|
|
|
|---|
There is an ongoing clinical trial in Singapore, mostly in Chinese women, to evaluate the efficacy of mammography screening in 50- to 64-year-old women. Although intermediate results are reported favorably, enrolled women have yet to be followed for a sufficiently long period to examine the relevant outcomes such as mortality reduction.22 Until then, health care professionals must maintain the ethical position of equipoise and counsel patients about both the potential benefits and hazards of screening.
Asian health systems should pay particular attention to the Wests experience with mammographic screening programs. Olsen and Gotzsche49 have recently raised serious doubts about the scientific validity of most of the original trials that led Western governments to promote organized screening. They, and the accompanying Lancet editorial,50 concluded that there was "no reliable evidence that screening for breast cancer reduces mortality." Now the Swedish, Canadian, and British national screening programs, among others, are facing the dilemma of whether to carry on an unproven clinical activity or to stop and dismantle the screening infrastructure they have worked so hard to put in place. As Wilson and Jungner18 cautioned 3 decades ago, screening should not be initiated until there is absolute, solid evidence supporting its effectiveness because organized programs are almost impossible to undo and stop. The East must not repeat the mistake of the West.
Our discussion has only dealt with mass, population-based mammographic screening of well women. For those at high risk for the disease, careful individual clinical assessment should guide the need for and frequency of mammographic screening. We suggest resources that may be allocated for a comprehensive population screening program in Asian populations would be better directed at raising public awareness of the issues exposed here, along with case-finding in targeted high-risk groups where mammography may be a truly beneficial, and necessary, intervention.
| Acknowledgments |
|---|
Human Participant Protection
No institutional review board approval was needed for this study.
| Footnotes |
|---|
Accepted for publication December 13, 2001.
| References |
|---|
|
|
|---|
2. Singapore Breast Cancer Foundation Web site. Available at: http://www.bcf.org.sg/fmammogram.htm. Accessed October 23, 2000.
3. Morimoto T, Sasa M, Yamaguchi T, Harada K, Sagara Y. High detection rate of breast cancer by mass screening using mammography in Japan. Jpn J Cancer Res. 1994;85:11931195.[Medline]
4. Hong Kong Annual Digest 1999. Hong Kong, China: Government Printers, 2000.
5. Shapiro S, Venet W, Strax P, Venet L. Periodic Screening for Breast Cancer. The Health Insurance Plan Project and Its Sequelae, 19631986. Baltimore, Md: Johns Hopkins University Press, 1988.
6. Miller AB, To T, Baines CJ, Wall C. Canadian National Breast Screening Study 2: 13-year results of a randomized trial in women aged 5059 years. J Natl Cancer Inst. 2000;92:14901499.
7. Alexander FE, Anderson TJ, Brown HK, et al. 14 years of follow-up from the Edinburgh randomized trial of breast-cancer screening. Lancet. 1999;353:19031908.[Medline]
8. Andersson I, Aspegren K, Janzon L, et al. Mammographic screening and mortality from breast cancer: the Malmo mammographic screening trial. BMJ. 1988;297:943948.
9. Nystrom L, Rutqvist LE, Wall S, et al. Breast cancer screening with mammography: overview of Swedish randomised trials. Lancet. 1993;341:973978.[Medline]
10. Tabar L, Fagerberg G, Chen HH, et al. Efficacy of breast cancer screening by age. New results from the Swedish Two-County Trial. Cancer. 1995;75:25072517.[Medline]
11. Frisell J, Lidbrink E, Hellstrom L, Rutqvist LE. Followup after 11 yearsupdate of mortality results in the Stockholm mammographic screening trial. Breast Cancer Res Treat. 1997;45:263270.[Medline]
12. Bjurstam N, Bjorneld L, Duffy SW, et al. The Gothenburg breast screening trial: first results on mortality, incidence, and mode of detection for women ages 3949 years at randomization. Cancer. 1997;80:20912099.[Medline]
13. DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials. 1986;7:177188.[Medline]
14. Greenland S. Quantitative methods in the review of epidemiologic literature. Epidemiol Rev. 1987;9:130.
15. Yusuf S, Peto R, Lewis J, Collins R, Sleight P. Beta blockade during and after myocardial infarction: an overview of the randomized trials. Prog Cardiovasc Dis. 1985;27:335371.[Medline]
16. Kerlikowske K, Grady D, Rubin SM, Sandrock C, Ernster VL. Efficacy of screening mammography. A meta-analysis. JAMA. 1995;273:149154.[Abstract]
17. National Screening Committee criteria. Available at: http://www.doh.gov.uk/nsc/pdfs/criteria.pdf. Accessed September 24, 2000.
18. Wilson JMG, Jungner G. Principles and Practice of Screening for Disease. Geneva: World Health Organization, 1968. Public Health Papers No. 34.
19. Parkin DM, Whelan SL, Ferlay J, Raymond L, Young J. Cancer Incidence in Five Continents. Vol. VII. Lyon, France: International Agency for Research on Cancer;1997. IARC Scientific Publications No. 143.
20. Morrison AS. Screening. In: Rothman KJ, Greenland S, eds. Modern Epidemiology. 2nd ed. Philadelphia, Pa: Lippincott-Raven Publishers; 1998:499518.
21. Mushlin AI, Kouides RW, Shapiro DE. Estimating the accuracy of screening mammography: a meta-analysis. Am J Prev Med. 1998;14:143153.[Medline]
22. Ng EH, Ng FC, Tan PH, et al. Results of intermediate measures from a population-based, randomized trial of mammographic screening prevalence and detection of breast carcinoma among Asian women: the Singapore Breast Screening Project. Cancer. 1998;82:15211528.[Medline]
23. Roberts MM, Alexander FE, Anderson TJ, et al. Edinburgh trial of screening for breast cancer: mortality at seven years. Lancet. 1990;335:241246.[Medline]
24. Hong Kong Cancer Registry. Cancer Incidence and Mortality in Hong Kong 19951996. Hong Kong, China: Hospital Authority, 1999.
25. Rothman KJ, Greenland S. Measures of disease frequency. In: Rothman KJ, Greenland S, eds. Modern Epidemiology. 2nd ed. Philadelphia: Lippincott-Raven Publishers; 1998:43.
26. Alagaratnam TT, Wong J. Limitations of mammography in Chinese females. Clin Radiol. 1985;36:175177.[Medline]
27. Fok CM, Reynolds VB, Tan KA, Ong CL, Chong PY. Clustered intramammary microcalcifications not associated with a mass. Singapore Med J. 1995;36:2931.[Medline]
28. Rembold CM. Number needed to screen: development of a statistic for disease screening. BMJ. 1998;317:30712.
29. Smeeth L, Haines A, Ebrahim S. Numbers needed to treat derived from meta-analysessometimes informative, usually misleading. BMJ. 1999;318:15481551.
30. National Cancer Institute. SEER Cancer Statistic Review, 19731996. Available at: http://seer.cancer.gov/Publications/CSR1973_1996/. Accessed March 15, 2001.
31. Chatellier G, Zapletal E, Lemaitre D, Menard J, Degoulet P. The number needed to treat: a clinically useful nomogram in its proper context. BMJ. 1996;312:426429.
32. Cook RJ, Sackett DL. The number needed to treat: a clinically useful measure of treatment effect. BMJ. 1995;310:452454.
33. Collins R, Peto R, MacMahon S, et al. Blood pressure, stroke, and coronary heart disease. Part 2, Short-term reductions in blood pressure: overview of randomized drug trials in their epidemiological context. Lancet. 1990;335:827838.[Medline]
34. Yusuf S, Zucker D, Peduzzi P, et al. Effect of coronary artery bypass graft surgery on survival: overview of 10-year results from randomized trials by the Coronary Artery Bypass Graft Surgery Trialists Collaboration. Lancet. 1994;34:563570.
35. Chu KC, Smart CR, Tarone RE. Analysis of breast cancer mortality and stage distribution by age for the Health Insurance Plan clinical trial. J Natl Cancer Inst. 1988;80:11251132.
36. Ernster VL, Barclay J, Kerlikowske K, Grady D, Henderson C. Incidence of and treatment for ductal carcinoma in situ of the breast. JAMA. 1996;275:913918.[Abstract]
37. Lau Y, Lau PY, Chan CM, Yip A. The potential impact of breast cancer screening in Hong Kong. Aust N Z J Surg. 1998;68:707711.[Medline]
38. Jatoi I, Baum M. Mammographically detected ductal carcinoma in situ: are we overdiagnosing breast cancer? Surgery. 1995;118:118120.[Medline]
39. Ernster VL, Barclay J, Kerlikowske K, Wilkie H, Ballard-Barbash R. Mortality among women with ductal carcinoma in situ of the breast in the population-based surveillance, epidemiology and end results program. Arch Intern Med. 2000;160:953958.
40. Rosen PP, Braum DW Jr, Kinne DE. The clinical significance of pre-invasive breast carcinoma. Cancer. 1980;46:919925.[Medline]
41. Page DL, Dupont WD, Rogers LW, Landenberger M. Intraductal carcinoma of the breast: follow-up after biopsy only. Cancer. 1982;49:751758.[Medline]
42. Alpers CE, Wellings SR. The prevalence of carcinoma in situ in normal and cancer-associated breasts. Hum Pathol. 1985;16:796807.[Medline]
43. Nielsen M, Thomsen JL, Primdahl S, Dyreborg U, Andersen JA. Breast cancer and atypia among young and middle-aged women: a study of 110 medicolegal autopsies. Br J Cancer. 1987;56:814819.[Medline]
44. Elmore JG, Barton MB, Moceri VM, Polk S, Arena PJ, Fletcher SW. Ten-year risk of false positive screening mammograms and clinical breast examinations. N Engl J Med. 1998;338:108996.
45. Wright CJ, Mueller CB. Screening mammography and public health policy: the need for perspective. Lancet. 1995;346:2932.[Medline]
46. Kaelin CM, Smith TJ, Homer MJ, et al. Safety, accuracy, and diagnostic yield of needle localization biopsy of the breast performed using local anesthesia. J Am Coll Surg. 1994;179:267272.[Medline]
47. Mueller X, Amery A, Lallemand RC. Biopsy of mammographically-detected breast lesions in a district hospital. Eur J Surg Oncol. 1993;19:415419.[Medline]
48. Helvie MA, Ikeda DM, Adler DD. Localization and needle aspiration of breast lesions: complications in 370 cases. AJR Am J Roentgenol. 1991;157:711714.
49. Olsen O, Gotzsche PC. Cochrane review on screening for breast cancer with mammography. Lancet. 2001;358:13401342.[Medline]
50. Horton R. Screening mammographyan overview revisited. Lancet. 2001;358:12841285.[Medline]
This article has been cited by other articles:
![]() |
G. M. Leung, P. P. S. Woo, B. J. Cowling, C. S. H. Tsang, A. N. Y. Cheung, H. Y. S. Ngan, K. Galbraith, and T.-H. Lam Who receives, benefits from and is harmed by cervical and breast cancer screening among Hong Kong Chinese? J. Public Health Med., September 1, 2008; 30(3): 282 - 292. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. P.S. Woo, J. J. Kim, and G. M. Leung What Is the Most Cost-Effective Population-Based Cancer Screening Program for Chinese Women? J. Clin. Oncol., February 20, 2007; 25(6): 617 - 624. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Gur, J. S. Stalder, L. A. Hardesty, B. Zheng, J. H. Sumkin, D. M. Chough, B. E. Shindel, and H. E. Rockette Computer-aided Detection Performance in Mammographic Examination of Masses: Assessment Radiology, November 1, 2004; 233(2): 418 - 423. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Tseng and C. Fang SCREENING MAMMOGRAPHY IN ASIAN AMERICAN WOMEN Am J Public Health, September 1, 2003; 93(9): 1378 - 1378. [Full Text] [PDF] |
||||
![]() |
T. Lam and G. Leung Geoethnic-sensitive and cross-culture collaborative epidemiological studies Int. J. Epidemiol., April 1, 2003; 32(2): 178 - 180. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |