A natural language processing framework to analyse the opinions on HPV vaccination reflected in twitter over 10 years (2008 - 2017)

Xiao Luo, Gregory Zimet, Setu Shah

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

In this research, we developed a natural language processing (NLP) framework to investigate the opinions on HPV vaccination reflected on Twitter over a 10-year period–2008–2017. The NLP framework includes sentiment analysis, entity analysis, and artificial intelligence (AI)-based phrase association mining. The sentiment analysis demonstrates the sentiment fluctuation over the past 10 years. The results show that there are more negative tweets in 2008 to 2011 and 2015 to 2016. The entity extraction and analysis help to identify the organization, geographical location and events entities associated with the negative and positive tweets. The results show that the organization entities such as FDA, CDC and Merck occur in both negative and positive tweets of almost every year, whereas the geographical location entities mentioned in both negative and positive tweets change from year to year. The reason is because of the specific events that happened in those different locations. The objective of the AI-based phrase association mining is to identify the main topics reflected in both negative and positive tweets and detailed tweet content. Through the phrase association mining, we found that the main negative topics on Twitter include “injuries”, “deaths”, “scandal”, “safety concerns”, and “adverse/side effects”, whereas the main positive topics include “cervical cancers”, “cervical screens”, “prevents”, and “vaccination campaigns”. We believe the results of this research can help public health researchers better understand the nature of social media influence on HPV vaccination attitudes and to develop strategies to counter the proliferation of misinformation.

Original languageEnglish (US)
Pages (from-to)1496-1504
Number of pages9
JournalHuman Vaccines and Immunotherapeutics
Volume15
Issue number7-8
DOIs
StatePublished - Aug 3 2019

Fingerprint

Natural Language Processing
Artificial Intelligence
Vaccination
Organizations
Social Media
Immunization Programs
Centers for Disease Control and Prevention (U.S.)
Research
Uterine Cervical Neoplasms
Public Health
Communication
Research Personnel
Safety
Wounds and Injuries

Keywords

  • HPV vaccine
  • Natural language processing
  • public health
  • sextually transmitted infection
  • social media

ASJC Scopus subject areas

  • Immunology and Allergy
  • Immunology
  • Pharmacology

Cite this

A natural language processing framework to analyse the opinions on HPV vaccination reflected in twitter over 10 years (2008 - 2017). / Luo, Xiao; Zimet, Gregory; Shah, Setu.

In: Human Vaccines and Immunotherapeutics, Vol. 15, No. 7-8, 03.08.2019, p. 1496-1504.

Research output: Contribution to journalArticle

@article{944e31bb84194db5a886b00dc6760608,
title = "A natural language processing framework to analyse the opinions on HPV vaccination reflected in twitter over 10 years (2008 - 2017)",
abstract = "In this research, we developed a natural language processing (NLP) framework to investigate the opinions on HPV vaccination reflected on Twitter over a 10-year period–2008–2017. The NLP framework includes sentiment analysis, entity analysis, and artificial intelligence (AI)-based phrase association mining. The sentiment analysis demonstrates the sentiment fluctuation over the past 10 years. The results show that there are more negative tweets in 2008 to 2011 and 2015 to 2016. The entity extraction and analysis help to identify the organization, geographical location and events entities associated with the negative and positive tweets. The results show that the organization entities such as FDA, CDC and Merck occur in both negative and positive tweets of almost every year, whereas the geographical location entities mentioned in both negative and positive tweets change from year to year. The reason is because of the specific events that happened in those different locations. The objective of the AI-based phrase association mining is to identify the main topics reflected in both negative and positive tweets and detailed tweet content. Through the phrase association mining, we found that the main negative topics on Twitter include “injuries”, “deaths”, “scandal”, “safety concerns”, and “adverse/side effects”, whereas the main positive topics include “cervical cancers”, “cervical screens”, “prevents”, and “vaccination campaigns”. We believe the results of this research can help public health researchers better understand the nature of social media influence on HPV vaccination attitudes and to develop strategies to counter the proliferation of misinformation.",
keywords = "HPV vaccine, Natural language processing, public health, sextually transmitted infection, social media",
author = "Xiao Luo and Gregory Zimet and Setu Shah",
year = "2019",
month = "8",
day = "3",
doi = "10.1080/21645515.2019.1627821",
language = "English (US)",
volume = "15",
pages = "1496--1504",
journal = "Human Vaccines and Immunotherapeutics",
issn = "2164-5515",
publisher = "Landes Bioscience",
number = "7-8",

}

TY - JOUR

T1 - A natural language processing framework to analyse the opinions on HPV vaccination reflected in twitter over 10 years (2008 - 2017)

AU - Luo, Xiao

AU - Zimet, Gregory

AU - Shah, Setu

PY - 2019/8/3

Y1 - 2019/8/3

N2 - In this research, we developed a natural language processing (NLP) framework to investigate the opinions on HPV vaccination reflected on Twitter over a 10-year period–2008–2017. The NLP framework includes sentiment analysis, entity analysis, and artificial intelligence (AI)-based phrase association mining. The sentiment analysis demonstrates the sentiment fluctuation over the past 10 years. The results show that there are more negative tweets in 2008 to 2011 and 2015 to 2016. The entity extraction and analysis help to identify the organization, geographical location and events entities associated with the negative and positive tweets. The results show that the organization entities such as FDA, CDC and Merck occur in both negative and positive tweets of almost every year, whereas the geographical location entities mentioned in both negative and positive tweets change from year to year. The reason is because of the specific events that happened in those different locations. The objective of the AI-based phrase association mining is to identify the main topics reflected in both negative and positive tweets and detailed tweet content. Through the phrase association mining, we found that the main negative topics on Twitter include “injuries”, “deaths”, “scandal”, “safety concerns”, and “adverse/side effects”, whereas the main positive topics include “cervical cancers”, “cervical screens”, “prevents”, and “vaccination campaigns”. We believe the results of this research can help public health researchers better understand the nature of social media influence on HPV vaccination attitudes and to develop strategies to counter the proliferation of misinformation.

AB - In this research, we developed a natural language processing (NLP) framework to investigate the opinions on HPV vaccination reflected on Twitter over a 10-year period–2008–2017. The NLP framework includes sentiment analysis, entity analysis, and artificial intelligence (AI)-based phrase association mining. The sentiment analysis demonstrates the sentiment fluctuation over the past 10 years. The results show that there are more negative tweets in 2008 to 2011 and 2015 to 2016. The entity extraction and analysis help to identify the organization, geographical location and events entities associated with the negative and positive tweets. The results show that the organization entities such as FDA, CDC and Merck occur in both negative and positive tweets of almost every year, whereas the geographical location entities mentioned in both negative and positive tweets change from year to year. The reason is because of the specific events that happened in those different locations. The objective of the AI-based phrase association mining is to identify the main topics reflected in both negative and positive tweets and detailed tweet content. Through the phrase association mining, we found that the main negative topics on Twitter include “injuries”, “deaths”, “scandal”, “safety concerns”, and “adverse/side effects”, whereas the main positive topics include “cervical cancers”, “cervical screens”, “prevents”, and “vaccination campaigns”. We believe the results of this research can help public health researchers better understand the nature of social media influence on HPV vaccination attitudes and to develop strategies to counter the proliferation of misinformation.

KW - HPV vaccine

KW - Natural language processing

KW - public health

KW - sextually transmitted infection

KW - social media

UR - http://www.scopus.com/inward/record.url?scp=85070110637&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85070110637&partnerID=8YFLogxK

U2 - 10.1080/21645515.2019.1627821

DO - 10.1080/21645515.2019.1627821

M3 - Article

C2 - 31194609

AN - SCOPUS:85070110637

VL - 15

SP - 1496

EP - 1504

JO - Human Vaccines and Immunotherapeutics

JF - Human Vaccines and Immunotherapeutics

SN - 2164-5515

IS - 7-8

ER -