The Contribution of First-name Information to the Accuracy of Racial-and-Ethnic Imputations Varies by Sex and Race-and-Ethnicity Among Medicare Beneficiaries.


Journal

Medical care
ISSN: 1537-1948
Titre abrégé: Med Care
Pays: United States
ID NLM: 0230027

Informations de publication

Date de publication:
01 08 2022
Historique:
entrez: 7 7 2022
pubmed: 8 7 2022
medline: 12 7 2022
Statut: ppublish

Résumé

Data on race-and-ethnicity that are needed to measure health equity are often limited or missing. The importance of first name and sex in predicting race-and-ethnicity is not well understood. The objective of this study was to compare the contribution of first-name information to the accuracy of basic and more complex racial-and-ethnic imputations that incorporate surname information. We imputed race-and-ethnicity in a sample of Medicare beneficiaries under 2 scenarios: (1) with only sparse predictors (name, address, sex) and (2) with a rich set (adding limited administrative race-and-ethnicity, demographics, and insurance). A total of 284,627 Medicare beneficiaries who completed the 2014 Medicare Consumer Assessment of Healthcare Providers and Systems survey and reported race-and-ethnicity were included. Hispanic, non-Hispanic Asian/Pacific Islander, and non-Hispanic White racial-and-ethnic imputations are more accurate for males than females under both sparse-predictor and rich-predictor scenarios; adding first-name information increases accuracy more for females than males. In contrast, imputations of non-Hispanic Black race-and-ethnicity are similarly accurate for females and males, and first names increase accuracy equally for each sex in both sparse-predictor and rich-predictor scenarios. For all 4 racial-and-ethnic groups, incorporating first-name information improves prediction accuracy more under the sparse-predictor scenario than under the rich-predictor scenario. First-name information contributes more to the accuracy of racial-and-ethnic imputations in a sparse-predictor scenario than in a rich-predictor scenario and generally narrows sex gaps in accuracy of imputations.

Sections du résumé

BACKGROUND
Data on race-and-ethnicity that are needed to measure health equity are often limited or missing. The importance of first name and sex in predicting race-and-ethnicity is not well understood.
OBJECTIVE
The objective of this study was to compare the contribution of first-name information to the accuracy of basic and more complex racial-and-ethnic imputations that incorporate surname information.
RESEARCH DESIGN
We imputed race-and-ethnicity in a sample of Medicare beneficiaries under 2 scenarios: (1) with only sparse predictors (name, address, sex) and (2) with a rich set (adding limited administrative race-and-ethnicity, demographics, and insurance).
SUBJECTS
A total of 284,627 Medicare beneficiaries who completed the 2014 Medicare Consumer Assessment of Healthcare Providers and Systems survey and reported race-and-ethnicity were included.
RESULTS
Hispanic, non-Hispanic Asian/Pacific Islander, and non-Hispanic White racial-and-ethnic imputations are more accurate for males than females under both sparse-predictor and rich-predictor scenarios; adding first-name information increases accuracy more for females than males. In contrast, imputations of non-Hispanic Black race-and-ethnicity are similarly accurate for females and males, and first names increase accuracy equally for each sex in both sparse-predictor and rich-predictor scenarios. For all 4 racial-and-ethnic groups, incorporating first-name information improves prediction accuracy more under the sparse-predictor scenario than under the rich-predictor scenario.
CONCLUSION
First-name information contributes more to the accuracy of racial-and-ethnic imputations in a sparse-predictor scenario than in a rich-predictor scenario and generally narrows sex gaps in accuracy of imputations.

Identifiants

pubmed: 35797457
doi: 10.1097/MLR.0000000000001732
pii: 00005650-202208000-00002
doi:

Types de publication

Journal Article Research Support, U.S. Gov't, P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

556-562

Informations de copyright

Copyright © 2022 Wolters Kluwer Health, Inc. All rights reserved.

Déclaration de conflit d'intérêts

The authors declare no conflict of interest.

Références

US Department of Health and Human Services. Disparities; 2021. Available at: www.healthypeople.gov/2020/about/foundation-health-measures/Disparities . Accessed January 31, 2022.
Office of Disease Prevention and Health Promotion. Healthy People 2030 Framework. Available at: https://health.gov/healthypeople/about/healthy-people-2030-framework . Accessed January 31, 2022.
Haas A, Elliott MN, Dembosky JW, et al. Imputation of race/ethnicity to enable measurement of HEDIS performance by race/ethnicity. Health Serv Res. 2019;54:13–23.
Elliott MN, Morrison PA, Fremont A, et al. Using the Census Bureau’s surname list to improve estimates of race/ethnicity and associated disparities. Health Serv Outcomes Res Methodol. 2009;9:69.
Perkins RC. Evaluating the Passel-Word Spanish Surname List: 1990 Decennial Census Post Enumeration Survey Results Population Division Working Paper. Washington, DC: US Census Bureau; 1993.
Abrahamse AF, Morrison PA, Bolton NM. Surname analysis for estimating local concentration of Hispanics and Asians. Popul Res Policy Rev. 1994;13:383–398.
Fiscella K, Fremont AM. Use of geocoding and surname analysis to estimate race and ethnicity. Health Serv Res. 2006;41:1482–1500.
Morrison P, Word D, Coleman C. Using first names to estimate racial proportions in populations; 2001. Population Association of America Annual Meeting. Washington, DC.
Voicu I. Using first name information to improve race and ethnicity classification. Stat Public Policy. 2018;5:1–13.
Wong EC, Palaniappan LP, Lauderdale DS. Using name lists to infer Asian racial/ethnic subgroups in the healthcare setting. Med Care. 2010;48:540–546.
Adjaye-Gbewonyo D, Bednarczyk RA, Davis RL, et al. Using the Bayesian Improved Surname Geocoding Method (BISG) to create a working classification of race and ethnicity in a diverse managed care population: a validation study. Health Serv Res. 2014;49:268–283.
Smith CK, Bonauto DK. Improving occupational health disparity research: testing a method to estimate race and ethnicity in a working population. Am J Ind Med. 2018;61:640–648.
Gooding GE, Kreider RM. Women’s marital naming choices in a nationally representative sample. J Fam Issues. 2009;31:681–701.
Taylor P, Wang W, Parker K, et al. The Rise of Intermarriage: Rates, Characteristics Vary by Race and Gender Social & Demographic Trends. Washington, DC: Pew Research Center; 2012.
Scott CG. Identifying the race of ethnicity of SSI recipients. Soc Secur Bull. 1999;62:9–20.
Social Security Administration. Privacy Program: A Central Source For Information About SSA’s Privacy Compliance Program. 2009. Available at: www.ssa.gov/privacy/pia/RECS%20PIA%20Nov%202009.htm . Accessed May 27, 2021.
Filice CE, Joynt KE. Examining race and ethnicity information in Medicare administrative data. Med Care. 2017;55:e170–e176.
Lauderdale DS, Goldberg J. The expanded racial and ethnic codes in the Medicare data files: their completeness of coverage and accuracy. Am J Public Health. 1996;86:712–716.
Waldo DR. Accuracy and bias of race/ethnicity codes in the Medicare enrollment database. Health Care Financ Rev. 2004;26:61–72.
Arday SL, Arday DR, Monroe S, et al. HCFA’s racial and ethnic data: current accuracy and recent improvements. Health Care Financ Rev. 2000;21:107–116.
Zaslavsky AM, Ayanian JZ, Zaborski LB. The validity of race and ethnicity in enrollment data for Medicare beneficiaries. Health Serv Res. 2012;47:1300–1321.
Derose SF, Contreras R, Coleman KJ, et al. Race and ethnicity data quality and imputation using US Census data in an integrated health system: the Kaiser Permanente Southern California experience. Med Care Res Rev. 2013;70:330–345.
Elliott MN, Becker K, Beckett MK, et al. Using indirect estimates based on name and census tract to improve the efficiency of sampling matched ethnic couples from marriage license data. Public Opin Q. 2013;77:375–384.
Zhang Y. Assessing fair lending risks using race/ethnicity proxies. Manag Sci. 2016;64:1–45.
Lauderdale DS, Kestenbaum B. Mortality rates of elderly Asian American populations based on Medicare and Social Security data. Demography. 2002;39:529–540.
Tzioumis K. Demographic aspects of first names. Sci Data. 2018;5:180025.
Klein DJ, Elliott MN, Haviland AM, et al. Understanding nonresponse to the 2007 Medicare CAHPS Survey. Gerontologist. 2011;51:843–855.
Purcell NJ, Kish L. Postcensal estimates for local areas (or domains). Int Stat Rev. 1980;48:3–18.
US Census Bureau. Frequently occurring surnames from the 2010 Census; 2016. Available at: www.census.gov/topics/population/genealogy/data/2010_surnames.html . Accessed May 28, 2019.
Word DL, Coleman C, Nunziata R, et al. Demographic Aspects of Surnames From Census 2000. Washington, DC: US Census Bureau; 2000.
Hand DJ, Till RJ. A simple generalisation of the area under the ROC curve for multiple class classification problems. Mach Learn. 2001;45:171–186.
Hosmer DW, Lemeshow S. Applied Logistic Regression. New York, NY: John Wiley & Sons Inc.; 2000.
Efron B, Gong G. A leisurely look at the Bootstrap, the Jackknife, and Cross-Validation. Am Stat. 1983;37:36–48.
McCaffrey DF, Elliott MN. Power of tests for a dichotomous independent variable measured with error. Health Serv Res. 2008;43:1085–1101.
Centers for Medicare & Medicaid Services. Racial, Ethnic, & Gender Disparities in Health Care in Medicare Advantage. Baltimore, MD: Centers for Medicare & Medicaid Services; 2021:1–179.
Fryer RG, Levitt SD. The causes and consequences of distinctively Black names. Q J Econ. 2004;119:767–805.

Auteurs

Ann Haas (A)

RAND Corporation, Pittsburgh, PA.

John L Adams (JL)

Kaiser Permanente Center for Effectiveness & Safety Research.
Kaiser Permanente Bernard J. Tyson School of Medicine, Pasadena, CA.

Amelia M Haviland (AM)

RAND Corporation, Pittsburgh, PA.
Carnegie Mellon University, Pittsburgh, PA.

Jacob W Dembosky (JW)

RAND Corporation, Pittsburgh, PA.

Peter A Morrison (PA)

Morrison & Associates Inc, Nantucket, MA.

Sarah Gaillot (S)

Centers for Medicare & Medicaid, Services, Baltimore, MD.

Allen M Fremont (AM)

RAND Corporation, Santa Monica, CA.

Jennifer L Gildner (JL)

RAND Corporation, Santa Monica, CA.

Loida Tamayo (L)

Centers for Medicare & Medicaid, Services, Baltimore, MD.

Marc N Elliott (MN)

RAND Corporation, Santa Monica, CA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH