|
|
||||||||
RESEARCH AND PRACTICE |
The authors are with the Center for Health Quality, Outcomes and Economic Research (a Veterans Affairs Health Services Research and Development National Center of Excellence), Bedford VA Medical Center, Bedford, Mass, and the Health Services Department, Boston University School of Public Health, Boston, Mass.
Correspondence: Requests for reprints should be sent to Nancy R. Kressin, PhD, Center for Health Quality, Outcomes and Economic Research, VA Medical Center, 200 Springs Rd, Building 70 (152), Bedford, MA 01730 (e-mail: nkressin{at}bu.edu).
| ABSTRACT |
|---|
|
|
|---|
Objectives. We examined agreement of administrative data with self-reported race/ethnicity and identified correlates of agreement.
Methods. We used Veterans Affairs administrative data and VA 1999 Large Health survey race/ethnicity data.
Results. Relatively low rates of agreement (approximately 60%) between data sources were largely the result of administrative data from patients whose race/ethnicity was unknown, with least agreement for Native American, Asian, and Pacific Islander patients. After exclusion of patients with missing race/ethnicity, agreement improved except for Native Americans. Agreement did not increase substantially after inclusion of data from individuals indicating multiple race/ethnicities. Patients for whom there was better agreement between data sources tended to be less educated, nonsolitary living, younger, and White; to have sufficient food; and to use more inpatient Department of Veterans Affairs (VA) care.
Conclusions. Better reporting of race/ethnicity data will improve agreement between data sources. Previous studies using VA administrative data may have underestimated racial disparities.
| INTRODUCTION |
|---|
|
|
|---|
A few previous studies have examined the reliability of racial classifications in administrative data from specific states, the federal government, and national insurance programs. Blustein documented that racial classifications for patients with multiple admissions in hospital discharge data in New York state lacked reliability, especially for nonAfrican American and non-White racial categories.1 When California birth certificate race/ethnicity data were compared with race/ethnicity information obtained by interview, Baumeister et al. found that the sensitivity of the birth certificate data was significantly lower for Native Americans.2 In a review of vital statistics on race and ethnicity, Hahn and colleagues noted inconsistencies between birth and death records of infants, especially for Hispanic persons and for races/ethnicities other than White and African American.3 Pan and colleagues compared racial designations in Medicare and Medicaid data, finding significant amounts of contradictory information on race/ethnicity between the programs, with the greatest discrepancies for Hispanic, "other," and Asian classifications.4 Boehmer and colleagues documented that study outcomes differed markedly depending on whether the source of race/ethnicity information was Department of Veterans Affairs (VA) administrative data or self-report data. Specifically, additional race/ethnicity differences in the use of tooth extraction versus root canal therapy were found when self-report data were used.5 They also noted discrepancies in race/ethnicity classifications between the data sources.
Because a number of recent studies on racial variations in cardiac care have been based on VA databases,610 understanding the accuracy of these data is especially important. One study examined the concordance between medical record data on race/ethnicity in the VA and race/ethnicity as recorded in inpatient administrative data files, finding good agreement.11 However, this finding is not surprising, given that medical record data serve as the source for inpatient data on race/ethnicity. The agreement of the administrative files with patient self-report was unknown, as were the sociodemographic and health factors associated with such agreement.
The purpose of this study was to extend previous research by examining the agreement of VA administrative data on race/ethnicity with patient self-reported race/ethnicity, using information obtained from the largest federal survey ever conducted in the Veterans Health Administration.12 Thus, in addition to examining general rates of agreement, we assessed the effect of including or excluding patients with missing race/ethnicity information or with multiple race/ethnicity designations in the survey data. A secondary goal of our study was to identify any sociodemographic and health characteristics of patients associated with agreement between data sources.
| METHODS |
|---|
|
|
|---|
1999 Large Health Survey of Veteran Enrollees. In 1999, the largest and most detailed survey of veterans using VA health services was conducted to ascertain their health status and health practices.12 Patients were sampled from the March 1999 enrollment file, which contained 3 760 200 enrollees. After exclusion of 146 323 veterans who had died or who had "bad" addresses, the final mailable sampling frame was 3 613 877. A total of 1 406 049 enrollees were sent surveys by mail using a stratified random sample (those who died or who were ineligible because of bad addresses were excluded), and a total of 887 775 surveys were received, resulting in a response rate of 63%. These surveys included questions about the patients race/ethnicity, other basic sociodemographic characteristics, and health. We excluded patients whose race/ethnicity was missing, and for most analyses, we excluded patients who indicated more than 1 race/ethnicity, leaving an analysis sample of 730 149.
Measures
VA administrative data on race/ethnicity.
Race/ethnicity information is recorded in both inpatient and outpatient VA administrative files, which exist for each fiscal year. In the outpatient files, race/ethnicity designations are assigned by the registration clerk on the basis of visual inspection. For inpatient files, the race/ethnicity information is extracted from the medical record documentation provided by the clinician. In the summary file we used, each person-level record contains several variables related to patient birthdate or age, gender, and race/ethnicity. Because in other work with these files we had noted that information on race/ethnicity sometimes varied from file to file, we created a summary race/ethnicity variable based on all the indicators for race/ethnicity in the inpatient and outpatient utilization files for the 3 years represented. For patients with more than 1 value (e.g., both African American and White) across all utilization records, we assigned the value present most often in the patients records. This decision affected only a small number of cases; for each of the 3 race/ethnicity categories, the majority (99.8%) of patients had consistent codes. For patients without a majority value or for whom there was no race/ethnicity information, we assigned a value of "unknown." Thus, each patients record indicated whether the patients race/ethnicity was White, African American, other (Hispanic, American Indian, Asian and Pacific Islander were grouped together in the file during its preparation), or unknown. Patients in the file who had been seen on an outpatient basis only in 1996, when race/ethnicity information was not yet included in the administrative database, were excluded from our analysis sample.
Large Survey data on race/ethnicity. Our measurement of race/ethnicity was the patients self-reported race/ethnicity provided in the 1999 Large Health Survey of Veteran Enrollees in response to a question developed by the Office of Management and Budget for use in federal surveys.13 Patients were asked to indicate their race/ethnicity in response to the following single question, "What is your race/ethnicity?" Patients were instructed to mark all responses that applied, including "American Indian or Alaskan native," "Asian," "Black or African American," "Spanish, Hispanic, or Latino," "Native Hawaiian or other Pacific Islander," or "White." We excluded patients from our sample whose race/ethnicity was missing from this file (n = 115 349; 13.1% of the original sample). Patients who had indicated more than 1 race/ethnicity (n = 34 113; 3.8% of the original sample) were also excluded from some analyses, leaving a sample of 730 149 veterans.
Sociodemographic characteristics. Age (in 1998) was taken from the administrative data. Patients were asked to report their educational level by selecting 1 of the following responses: "never attended school or only kindergarten," "grades 1 through 8," "grades 9 through 11," "grades 12 or GED (general equivalency diploma)," "college 1 year to 3 years," or "college graduate or graduate school."
As in the Behavioral Risk Factor Surveillance System questionnaire, patients were asked, "In the past 30 days have you been concerned about having enough food for you or your family?" (yes or no) to determine sufficiency of food.14
For health status, we used a single item that assesses general health, drawn from the Veterans Short Form (SF)-36 ("In general, would you say your health is excellent, very good, good, fair or poor?").15
Patients were asked whether they lived alone (yes or no) and to indicate whether they were married, divorced, separated, widowed, or never married. For the purposes of our analyses, we dichotomized this variable, creating 2 groups: currently married and other.
For employment status, patients were asked whether they were currently employed for wages, self-employed, looking for work and unemployed for more than 1 year, looking for work and unemployed for less than a year, retired, homemaker, student, or unable to work. These responses were grouped as employed, retired, unemployed, or "other." For some analyses, these categories were further subdivided as employed or not employed.
Under health care utilization, we calculated total number of inpatient stays between 1996 and 1998 and total number of outpatient visits between 1996 and 1998 from the administrative data.
Analyses
We conducted cross-tabulations between self-reported and administrative data, examining percentage agreement between the 2 data sources (administrative vs self-report data on race/ethnicity). Next, we conducted multivariate logistic regression analyses to identify sociodemographic characteristics, health factors, and health care utilization patterns associated with the likelihood of agreement in race/ethnicity between the 2 databases.
| RESULTS |
|---|
|
|
|---|
|
|
Patients with more than 1 self-reported race/ethnicity presented more opportunities for concordant classification with the administrative data (e.g., a man who considers himself both White and Native American had 2 chances for being administratively classified in a category consistent with 1 of his selfdesignations). Thus, we conducted an additional analysis to examine concordant classifications between all patients with 1 or 2 self-designations of race/ethnicity (99.6% of the sample) and the administrative records of race/ethnicity. As shown in Table 3
, for patients who reported some combinations of race/ethnicities, the concordance between administrative and self-report classifications increased dramatically when the multiple racial classifications were taken into account. For example, patients self-reporting as both Hispanic and White had a total of 70.1% concordance with administrative data when both categories were taken into account, as opposed to 45% or 26% when only 1 category, Hispanic or White, was considered. Similarly, patients reporting combinations of African American and White or African American and Hispanic were approximately twice as likely to have self-designations concordant with the administrative data if 2 racial categories were included. For other combinations, the addition of a second racial category made much less difference, adding 0% to 7% more likelihood of concordance between the 2 data sources.
|
2 tests). Patients with agreement regarding race/ethnicity had worse self-perceived general health (3.68 vs 3.29; a higher score indicates worse health), more inpatient stays (0.80 vs 0.07), and more outpatient visits (37.2 vs 13.6; all P < .0001, using t tests).
Finally, we conducted multivariate logistic regression analyses to examine the unique association of sociodemographic and health care utilization variables with known race in the administrative data, and then to examine the association of these variables with agreement on race/ethnicity between administrative and self-report data (Table 4
). First, among all patients, we examined sociodemographic and health factors associated with known race in the administrative data. Compared with patients for whom race was unknown, patients with complete race information in the administrative data tended to be older, unmarried, less educated, and unemployed; to have insufficient food; to have more inpatient stays and outpatient visits; to be of "other" race/ethnicity, and to report worse health status. Second, when we restricted our analysis to patients for whom race was known in the administrative data, we found that those more likely to show agreement between the 2 data sources on race tended to be younger, White, nonsolitary living, and less educated; to have no problems with food sufficiency; and to have more inpatient stays.
|
| DISCUSSION |
|---|
|
|
|---|
One other possible strategy for improving the concordance between VA and selfreported race/ethnicity data is to consider patients multiple races/ethnicities when making administrative classifications (e.g., allowing patients to indicate more than 1 race/ethnicity, as is done on the US Census), thereby allowing more opportunities for administrative designations to match patients self-designations. Our analyses showed that this approach would improve agreement when patients considered themselves combinations of Hispanic, White, or African American, but not when other combinations of races/ethnicities are involved. However, given that these 3 combinations of racial classifications accounted for only 0.06% of our total sample, this strategy is unlikely to have a large effect on the overall quality of the administrative race/ethnicity data.
In the bivariate results, the agreement between VA race/ethnicity data and self-report was similar for White, African American, and Hispanic patients and notably lower for Asian, Pacific Islander, and Native American patients. However, the logistic regression results indicated that after control for a variety of sociodemographic factors, African Americans and "others" (including Hispanic, Asian, Pacific Islander, and Native American patients) were less likely than Whites to have agreement between the data sources. This implies that studies examining racial disparities in health care within the VA are particularly likely to have poor agreement with selfreported race/ethnicity data for non-White patients. However, because African Americans were described as White almost 5% of the time, whereas Whites were described as African American only 0.44% of the time, estimates of racial disparities between these 2 groups are likely to be diminished owing to the characteristics of the administrative database. Similarly, Hispanic patients were listed as White almost 11% of the time, whereas Whites were listed as "other" less than 1% of the time; estimates of disparities between these 2 groups are also likely attenuated by the classification as White of a significant proportion of Hispanic patients in the administrative data.
To illustrate this point, consider a prominent VA study on racial disparities in cardiac care. Whittle and colleagues6 observed significantly different crude rates of cardiac catheterization, 19.3% for Whites and 11.8% for African Americans; they observed rates of 1.8% and 0.8%, respectively, for percutaneous transluminal coronary angioplasty; and they observed rates of 5.0% and 1.6%, respectively, for coronary artery bypass grafting. If African Americans were misclassified as White 5% of the time and Whites were misclassified as African American 0.44% of the time, as in our findings, correct identification of race/ethnicity may actually have inflated the procedure rates for Whites and reduced them for African Americans. This assumes that the African Americans misclassified as Whites had rates of invasive procedures similar to those of other African Americans. Consequently, use of self-reports of veterans race/ethnicity could actually have magnified the racial differences in procedure use observed by Whittle et al. Our results highlight important implications of the quality of VA data on race/ethnicity for past and present research findings.
These results extend those in the previous literature because of their focus on administrative data from the VAs national databases. This focus is an important addition to the field, as so many studies of health and health disparities rely on the VAs data on race/ethnicity.68,10 The results also extend those of Boehmer et al.5 by detailing the effects on rates of agreement of excluding patients with unknown race, as well as by elucidating the sociodemographic and health factors associated with available race data and with agreement between self-reported and administrative data on race.
This study was limited by its reliance on a summary file of VA administrative data on race/ethnicity, which grouped individuals of Hispanic, Pacific Islander, Asian, and Native American race/ethnicity together. Because all these groups were included in the "other" category, the level of detail originally available in the VA databases for each racial category was eliminated in this file. Thus, we could not report levels of agreement with self-reported race/ethnicity within each of these 4 categories. However, the benefit of using this file is that the file summarizes 3 years of administrative data on race/ethnicity, as opposed to containing data from a single year.
Our exploration of factors associated with agreement between self-reported race/ethnicity and race/ethnicity in the administrative data files indicated that younger, lesseducated patients who possess some social and material resources and who use VA inpatient care more often are likely to have higher levels of agreement between race/ethnicity information in the 2 databases we studied. In contrast, patients whose administrative data is most likely to include known race have consistently fewer social and economic resources, report worse health, and use more inpatient and outpatient VA care. Other studies have noted that users of VA care are more likely to have a low family income and low labor force participation and are less likely to have a family physician or private health insurance.16 Our results suggest that even within this socioeconomically challenged population, those using the system more often are more likely to have agreement between self-report and administratively classified race/ethnicity. Thus, results indicate that the more opportunities the VA has to record race/ethnicity, the more likely its data are to agree with patient self-reports.
| Acknowledgments |
|---|
We gratefully acknowledge the programming assistance of Arkadiy Pitman, Megan Amuan, and Michelle Orner.
Human Participation Protection
Human studies approval for this research was granted by the Bedford VA Medical Center.
| Footnotes |
|---|
Contributors
N. R. Kressin led the analysis and drafted the article. B. H. Chang provided statistical expertise. A. Hendricks advised on the administrative data and participated in the analyses. L. E. Kazis advised on the survey data and participated in the analyses.
Accepted for publication May 30, 2003.
| References |
|---|
|
|
|---|
2. Baumeister L, Marchi K, Pearl M, Williams R, Braveman P. The validity of information of "race" and "Hispanic ethnicity" in California birth certificate data. Health Serv Res. 2000;35:869883.[ISI][Medline]
3. Hahn RA, Mulinare J, Teutsch SM. Inconsistencies in coding of race and ethnicity between birth and death in US infants: a new look at infant mortality, 1983 through 1985. JAMA. 1992;267:259263.[Abstract]
4. Pan CX, Glynn RJ, Mogun H, Choodnovskiy I, Avom J. Definition of race and ethnicity in older people in Medicare and Medicaid. J Am Geriatr Soc. 1999;47:730733.[ISI][Medline]
5. Boehmer U, Kressin N, Berlowitz D, Christiansen C, Kazis L, Jones J. Self-reported vs administrative race/ethnicity data and study results. Am J Public Health. 2002;92:14711473.
6. Whittle J, Conigliaro J, Good CB, Lofgren RP. Racial differences in the use of invasive cardiovascular procedures in the Department of Veterans Affairs medical system. N Engl J Med. 1993;329:621627.
7. Peterson ED, Wright SM, Daley J, Thibault GE. Racial variation in cardiac procedure use and survival following acute myocardial infarction in the Department of Veterans Affairs. J Am Med Assoc. 1994;271:11751180.[Abstract]
8. Mirvis DM, Burns R, Gaschen L, Cloar FT, Graney M. Variation in utilization of cardiac procedures in the Department of Veterans Affairs health care system: effect of race. J Am Coll Cardiol. 1994;24:12971304.[Abstract]
9. Mickelson JK, Blum CM, Geraci JM. Acute myocardial infarction: clinical characteristics, management and outcome in a metropolitan Veterans Affairs Medical Center teaching hospital. J Am Coll Cardiol. 1997;29:915925.[Abstract]
10. Mirvis DM, Graney MJ. Variations in the use of cardiac procedures in the Veterans Health Administration. Am Heart J. 1999;137:706713.[ISI][Medline]
11. Kashner TM. Agreement between administrative files and written medical records: a case of the Department of Veterans Affairs. Med Care. 1998;36:13241336.[ISI][Medline]
12. Kazis L. Health Status and Outcomes of Veterans: Physical and Mental Component Summary Scores Veterans SF-36. 1999 Large Health Survey of Veteran Enrollees. Executive Report. Washington, DC; Bedford, MA: Department of Veterans Affairs, Veterans Health Administration, Office of Quality and Performance and VHA Health Assessment Project, Center for Health Quality, Outcomes and Economic Research; May 2000.
13. Office of Management and Budget. Standards for maintaining, collecting and presenting federal data on race and ethnicity. Available at: http:/www.whitehouse.gov/omb/inforeg/r&e_app-a-update.pdf. Accessed March 18, 2002.
14. National Center for Chronic Disease Prevention and Health Promotion. Behavioral Risk Factor Surveillance System Survey Data. Hyattsville, Md: National Center for Chronic Disease Prevention, Centers for Disease Control and Prevention and Health Promotion, US Dept of Health and Human Services; 1999.
15. Ware J, Kosinski M, Bayliss MS. Comparison of methods for the scoring and the statistical analysis of the SF-36 health profile and summary measures: summary of results from the Medical Outcomes Study. Med Care. 1995;33:AS264AS279.[ISI][Medline]
16. Wolinsky FD, Coe RM, Mosely RM 2nd, Homan SM. Veterans and non-veterans use of health services: a comparative analysis. Med Care. 1985;23:13581371.[ISI][Medline]
This article has been cited by other articles:
![]() |
P. E. Rivard, S. L. Luther, C. L. Christiansen, Shibei Zhao, S. Loveland, A. Elixhauser, P. S. Romano, and A. K. Rosen Using Patient Safety Indicators to Estimate the Impact of Potential Adverse Events on Outcomes Med Care Res Rev, February 1, 2008; 65(1): 67 - 87. [Abstract] [PDF] |
||||
![]() |
R. Kimerling, K. Gima, M. W. Smith, A. Street, and S. Frayne The Veterans Health Administration and Military Sexual Trauma Am J Public Health, December 1, 2007; 97(12): 2160 - 2166. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. O'Hare, D. Bertenthal, K. E. Covinsky, C. S. Landefeld, S. Sen, K. Mehta, M. A. Steinman, A. Borzecki, and L. C. Walter Mortality Risk Stratification in Chronic Kidney Disease: One Size for All Ages? J. Am. Soc. Nephrol., March 1, 2006; 17(3): 846 - 853. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Kilbourne, M. S. Bauer, X. Han, G. L. Haas, P. Elder, C. B. Good, M. Shad, J. Conigliaro, and H. Pincus Racial Differences in the Treatment of Veterans With Bipolar Disorder Psychiatr Serv, December 1, 2005; 56(12): 1549 - 1555. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. N. West, A. M. Geiger, S. M. Greene, E. L. Harris, I.-L. A. Liu, M. B. Barton, J. G. Elmore, S. Rolnick, L. Nekhlyudov, A. Altschuler, et al. Race and Ethnicity: Comparing Medical Records to Self-Reports J Natl Cancer Inst Monographs, November 1, 2005; 2005(35): 72 - 74. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Maizlish and L. Herrera A Record Linkage Protocol for a Diabetes Registry at Ethnically Diverse Community Health Centers J. Am. Med. Inform. Assoc., May 1, 2005; 12(3): 331 - 337. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. James, F. T. Bourgeois, and M. W. Shannon Association of Race/Ethnicity with Emergency Department Wait Times Pediatrics, March 1, 2005; 115(3): e310 - e315. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. R. Kressin, B.-H. Chang, J. Whittle, E. D. Peterson, J. A. Clark, A. K. Rosen, M. Orner, T. C. Collins, L. G. Alley, and L. A. Petersen Racial Differences in Cardiac Catheterization as a Function of Patients' Beliefs Am J Public Health, December 1, 2004; 94(12): 2091 - 2097. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. H. Bradley, J. Herrin, Y. Wang, R. L. McNamara, T. R. Webster, D. J. Magid, M. Blaney, E. D. Peterson, J. G. Canto, C. V. Pollack, Jr, et al. Racial and Ethnic Differences in Time to Acute Reperfusion Therapy for Patients Hospitalized With Myocardial Infarction JAMA, October 6, 2004; 292(13): 1563 - 1572. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Ibrahim, S. B. Thomas, and M. J. Fine Achieving Health Equity: An Incremental Journey Am J Public Health, October 1, 2003; 93(10): 1619 - 1621. [Full Text] [PDF] |
||||
![]() |
M. J. Fine and J. G. Demakis The Veterans Health Administration's Promotion of Health Equity for Racial and Ethnic Minorities Am J Public Health, October 1, 2003; 93(10): 1622 - 1624. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |