Journal Article

Scientific Reports

Using Machine Learning to Uncover Hidden Heterogeneities in Survey Data

Christina Ramirez and et al.

View

Journal Article

CEI fact sheet cover with older woman and infographic

Summary

Published Date: November 05, 2019

Survey responses in public health surveys are heterogeneous. The quality of a respondent's answers depends on many factors, including cognitive abilities, interview context, and whether the interview is in person or self-administered. A largely unexplored issue is how the language used for public health survey interviews is associated with the survey response. Authors introduce a machine learning approach, Fuzzy Forests, which they use for model selection. They use the 2013 California Health Interview Survey (CHIS) as the training sample and the 2014 CHIS as the test sample.

Authors find that non-English language survey responses differ substantially from English responses in reported health outcomes.

Heterogeneity among the Asian languages suggest that caution should be used when interpreting results that compare across these languages. The 2013 Fuzzy Forests model also correctly predicted 86% of good health outcomes using 2014 data as the test set.

Share this article

Related Publications

CEI fact sheet cover with older woman and infographic

Trajectories of Alcohol Screening and Brief Intervention (ASBI) Performance and Their Associations With Long-Term Performance and Alcohol Use Outcomes: An Observational Study in a Large US Integrated Healthcare Delivery System

CEI fact sheet cover with older woman and infographic

Factors Related to the Health of Ethnic Hispanic Children in the United States: Application of Multiple Disadvantage Model

CEI fact sheet cover with older woman and infographic

Impact of Adverse Childhood Experiences (ACEs) on Mental Health Help-Seeking Among Asian American Adults: Findings from the 2021 California Health Interview Survey

CEI fact sheet cover with older woman and infographic

COVID-19 Information Sources and Vaccination Status Among Californian Adults by Generation Using the 2022 California Health Interview Survey: Cross-Sectional Study

CEI fact sheet cover with older woman and infographic

Paying the Price: Californians Struggle with the High Cost of Health Care

CEI fact sheet cover with older woman and infographic

Process Evaluation of a Parish-Based Intervention to Reduce Mental Health-Related Stigma

CEI fact sheet cover with older woman and infographic

Health Literacy and Digital Health Service Use Among Community Residents in Taiwan: implications From a Pilot Assessment Using the HLS-SF12

CEI fact sheet cover with older woman and infographic

Under Pressure: Mental Wellness, Access, and Systems of Care for Women and Girls in Sonoma County

CEI fact sheet cover with older woman and infographic

SARS-CoV-2 Infection During Pregnancy and Neurodevelopmental Outcomes in Early Childhood

CEI fact sheet cover with older woman and infographic

Perceived Neighborhood-level Assets and Barriers to Weight-related Behaviors Among Ethnically Diverse Black Adults

CEI fact sheet cover with older woman and infographic

Childhood Adversity And Self-Rated Health Disparities by Citizenship in Middle-Aged-And-Older Latino Adults in California

CEI fact sheet cover with older woman and infographic

Protocol for a Cluster Randomized Trial to Evaluate a Faith-Based Breast Cancer Screening Navigation Model