Finding the patient's voice using big data: Analysis of users' health-related concerns in the ChaCha question-and-answer service (2009-2012)

Chad Priest, Amy Knopf, Doyle Groves, Janet Carpenter, Christopher Furrey, Anand Krishnan, Wendy Miller, Julie Otte, Mathew Palakal, Da Sarah Wiehe, Jeffrey Wilson

Research output: Contribution to journal › Article

9 Citations (Scopus)

Abstract

Background: The development of effective health care and public health interventions requires a comprehensive understanding of the perceptions, concerns, and stated needs of health care consumers and the public at large. Big datasets from social media and question-and-answer services provide insight into the public's health concerns and priorities without the financial, temporal, and spatial encumbrances of more traditional community-engagement methods and may prove a useful starting point for public-engagement health research (infodemiology). Objective: The objective of our study was to describe user characteristics and health-related queries of the ChaCha question-and-answer platform, and discuss how these data may be used to better understand the perceptions, concerns, and stated needs of health care consumers and the public at large. Methods: We conducted a retrospective automated textual analysis of anonymous user-generated queries submitted to ChaCha between January 2009 and November 2012. A total of 2.004 billion queries were read, of which 3.50% (70,083,796/2,004,243,249) were missing 1 or more data fields, leaving 1.934 billion complete lines of data for these analyses. Results: Males and females submitted roughly equal numbers of health queries, but content differed by sex. Questions from females predominantly focused on pregnancy, menstruation, and vaginal health. Questions from males predominantly focused on body image, drug use, and sexuality. Adolescents aged 12-19 years submitted more queries than any other age group. Their queries were largely centered on sexual and reproductive health, and pregnancy in particular. Conclusions: The private nature of the ChaCha service provided a perfect environment for maximum frankness among users, especially among adolescents posing sensitive health questions. Adolescents' sexual health queries reveal knowledge gaps with serious, lifelong consequences. 
The nature of questions to the service provides opportunities for rapid understanding of health concerns and may lead to development of more effective tailored interventions.
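
The query counts in the Methods portion of the abstract can be checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' code: the two totals are taken from the abstract, while the function name `summarize_missing` and the variable names are assumptions introduced only for illustration.

```python
# Figures quoted in the abstract (Methods).
TOTAL_QUERIES = 2_004_243_249      # all queries read
INCOMPLETE_QUERIES = 70_083_796    # queries missing 1 or more data fields


def summarize_missing(total: int, missing: int) -> tuple[int, float]:
    """Return (complete_count, percent_missing) for a set of records.

    Hypothetical helper for illustration; not part of the published study.
    """
    complete = total - missing
    percent_missing = 100 * missing / total
    return complete, percent_missing


complete, percent_missing = summarize_missing(TOTAL_QUERIES, INCOMPLETE_QUERIES)

print(f"{percent_missing:.2f}% of queries were incomplete")   # 3.50%
print(f"{complete:,} complete lines remain")                  # 1,934,159,453
print(f"about {complete / 1e9:.3f} billion")                  # 1.934
```

Both derived values agree with the abstract: roughly 3.50% of queries were incomplete, leaving about 1.934 billion complete lines.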

Original language: English (US)
Article number: e44
Journal: Journal of Medical Internet Research
Volume: 18
Issue number: 3
DOI: 10.2196/jmir.5033
State: Published - Mar 1 2016

Fingerprint

Reproductive Health
Health
Public Health
Delivery of Health Care
Social Media
Health Priorities
Pregnancy
Menstruation
Body Image
Sexuality
Age Groups
Research
Pharmaceutical Preparations

Keywords

  • Adolescent
  • Big data
  • ChaCha
  • Health information seeking
  • Infodemiology
  • Infoveillance
  • Patient engagement
  • Question-and-answer service
  • Sexual health
  • Social media

ASJC Scopus subject areas

  • Health Informatics

Cite this

Finding the patient's voice using big data: Analysis of users' health-related concerns in the ChaCha question-and-answer service (2009-2012). / Priest, Chad; Knopf, Amy; Groves, Doyle; Carpenter, Janet; Furrey, Christopher; Krishnan, Anand; Miller, Wendy; Otte, Julie; Palakal, Mathew; Wiehe, Da Sarah; Wilson, Jeffrey.

In: Journal of Medical Internet Research, Vol. 18, No. 3, e44, 01.03.2016.

@article{3f49e029d6ca4d0683bcb2ae559b261e,
title = "Finding the patient's voice using big data: Analysis of users' health-related concerns in the ChaCha question-and-answer service (2009-2012)",
abstract = "Background: The development of effective health care and public health interventions requires a comprehensive understanding of the perceptions, concerns, and stated needs of health care consumers and the public at large. Big datasets from social media and question-and-answer services provide insight into the public's health concerns and priorities without the financial, temporal, and spatial encumbrances of more traditional community-engagement methods and may prove a useful starting point for public-engagement health research (infodemiology). Objective: The objective of our study was to describe user characteristics and health-related queries of the ChaCha question-and-answer platform, and discuss how these data may be used to better understand the perceptions, concerns, and stated needs of health care consumers and the public at large. Methods: We conducted a retrospective automated textual analysis of anonymous user-generated queries submitted to ChaCha between January 2009 and November 2012. A total of 2.004 billion queries were read, of which 3.50{\%} (70,083,796/2,004,243,249) were missing 1 or more data fields, leaving 1.934 billion complete lines of data for these analyses. Results: Males and females submitted roughly equal numbers of health queries, but content differed by sex. Questions from females predominantly focused on pregnancy, menstruation, and vaginal health. Questions from males predominantly focused on body image, drug use, and sexuality. Adolescents aged 12-19 years submitted more queries than any other age group. Their queries were largely centered on sexual and reproductive health, and pregnancy in particular. Conclusions: The private nature of the ChaCha service provided a perfect environment for maximum frankness among users, especially among adolescents posing sensitive health questions. Adolescents' sexual health queries reveal knowledge gaps with serious, lifelong consequences. 
The nature of questions to the service provides opportunities for rapid understanding of health concerns and may lead to development of more effective tailored interventions.",
keywords = "Adolescent, Big data, ChaCha, Health information seeking, Infodemiology, Infoveillance, Patient engagement, Question-and-answer service, Sexual health, Social media",
author = "Chad Priest and Amy Knopf and Doyle Groves and Janet Carpenter and Christopher Furrey and Anand Krishnan and Wendy Miller and Julie Otte and Mathew Palakal and Wiehe, {Da Sarah} and Jeffrey Wilson",
year = "2016",
month = "3",
day = "1",
doi = "10.2196/jmir.5033",
language = "English (US)",
volume = "18",
journal = "Journal of Medical Internet Research",
issn = "1439-4456",
publisher = "Journal of Medical Internet Research",
number = "3",

}

TY - JOUR

T1 - Finding the patient's voice using big data

T2 - Analysis of users' health-related concerns in the ChaCha question-and-answer service (2009-2012)

AU - Priest, Chad

AU - Knopf, Amy

AU - Groves, Doyle

AU - Carpenter, Janet

AU - Furrey, Christopher

AU - Krishnan, Anand

AU - Miller, Wendy

AU - Otte, Julie

AU - Palakal, Mathew

AU - Wiehe, Da Sarah

AU - Wilson, Jeffrey

PY - 2016/3/1

Y1 - 2016/3/1

N2 - Background: The development of effective health care and public health interventions requires a comprehensive understanding of the perceptions, concerns, and stated needs of health care consumers and the public at large. Big datasets from social media and question-and-answer services provide insight into the public's health concerns and priorities without the financial, temporal, and spatial encumbrances of more traditional community-engagement methods and may prove a useful starting point for public-engagement health research (infodemiology). Objective: The objective of our study was to describe user characteristics and health-related queries of the ChaCha question-and-answer platform, and discuss how these data may be used to better understand the perceptions, concerns, and stated needs of health care consumers and the public at large. Methods: We conducted a retrospective automated textual analysis of anonymous user-generated queries submitted to ChaCha between January 2009 and November 2012. A total of 2.004 billion queries were read, of which 3.50% (70,083,796/2,004,243,249) were missing 1 or more data fields, leaving 1.934 billion complete lines of data for these analyses. Results: Males and females submitted roughly equal numbers of health queries, but content differed by sex. Questions from females predominantly focused on pregnancy, menstruation, and vaginal health. Questions from males predominantly focused on body image, drug use, and sexuality. Adolescents aged 12-19 years submitted more queries than any other age group. Their queries were largely centered on sexual and reproductive health, and pregnancy in particular. Conclusions: The private nature of the ChaCha service provided a perfect environment for maximum frankness among users, especially among adolescents posing sensitive health questions. Adolescents' sexual health queries reveal knowledge gaps with serious, lifelong consequences. 
The nature of questions to the service provides opportunities for rapid understanding of health concerns and may lead to development of more effective tailored interventions.

KW - Adolescent

KW - Big data

KW - ChaCha

KW - Health information seeking

KW - Infodemiology

KW - Infoveillance

KW - Patient engagement

KW - Question-and-answer service

KW - Sexual health

KW - Social media

UR - http://www.scopus.com/inward/record.url?scp=84962082041&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84962082041&partnerID=8YFLogxK

U2 - 10.2196/jmir.5033

DO - 10.2196/jmir.5033

M3 - Article

C2 - 26960745

AN - SCOPUS:84962082041

VL - 18

JO - Journal of Medical Internet Research

JF - Journal of Medical Internet Research

SN - 1439-4456

IS - 3

M1 - e44

ER -