How Search Engine Data Enhance the Understanding of Determinants of Suicide in India and Inform Prevention: Observational Study.


Journal

Journal of medical Internet research
ISSN: 1438-8871
Titre abrégé: J Med Internet Res
Pays: Canada
ID NLM: 100959882

Informations de publication

Date de publication:
04 01 2019
Historique:
received: 20 02 2018
accepted: 24 09 2018
revised: 12 09 2018
entrez: 6 1 2019
pubmed: 6 1 2019
medline: 22 1 2020
Statut: epublish

Résumé

India is home to 20% of the world's suicide deaths. Although statistics regarding suicide in India are distressingly high, data and cultural issues likely contribute to a widespread underreporting of the problem. Social stigma and only recent decriminalization of suicide are among the factors hampering official agencies' collection and reporting of suicide rates. As the product of a data collaborative, this paper leverages private-sector search engine data toward gaining a fuller, more accurate picture of the suicide issue among young people in India. By combining official statistics on suicide with data generated through search queries, this paper seeks to: add an additional layer of information to more accurately represent the magnitude of the problem, determine whether search query data can serve as an effective proxy for factors contributing to suicide that are not represented in traditional datasets, and consider how data collaboratives built on search query data could inform future suicide prevention efforts in India and beyond. We combined official statistics on demographic information with data generated through search queries from Bing to gain insight into suicide rates per state in India as reported by the National Crimes Record Bureau of India. We extracted English language queries on "suicide," "depression," "hanging," "pesticide," and "poison". We also collected data on demographic information at the state level in India, including urbanization, growth rate, sex ratio, internet penetration, and population. We modeled the suicide rate per state as a function of the queries on each of the 5 topics considered as linear independent variables. A second model was built by integrating the demographic information as additional linear independent variables. Results of the first model fit (R In this work, we used search data and demographics to model suicide rates. In this way, search data serve as a proxy for unmeasured (hidden) factors corresponding to suicide rates. Moreover, our procedure for outlier rejection serves to single out states where the suicide rates have substantially different correlations with demographic factors and query rates.

Sections du résumé

BACKGROUND
India is home to 20% of the world's suicide deaths. Although statistics regarding suicide in India are distressingly high, data and cultural issues likely contribute to a widespread underreporting of the problem. Social stigma and only recent decriminalization of suicide are among the factors hampering official agencies' collection and reporting of suicide rates.
OBJECTIVE
As the product of a data collaborative, this paper leverages private-sector search engine data toward gaining a fuller, more accurate picture of the suicide issue among young people in India. By combining official statistics on suicide with data generated through search queries, this paper seeks to: add an additional layer of information to more accurately represent the magnitude of the problem, determine whether search query data can serve as an effective proxy for factors contributing to suicide that are not represented in traditional datasets, and consider how data collaboratives built on search query data could inform future suicide prevention efforts in India and beyond.
METHODS
We combined official statistics on demographic information with data generated through search queries from Bing to gain insight into suicide rates per state in India as reported by the National Crimes Record Bureau of India. We extracted English language queries on "suicide," "depression," "hanging," "pesticide," and "poison". We also collected data on demographic information at the state level in India, including urbanization, growth rate, sex ratio, internet penetration, and population. We modeled the suicide rate per state as a function of the queries on each of the 5 topics considered as linear independent variables. A second model was built by integrating the demographic information as additional linear independent variables.
RESULTS
Results of the first model fit (R
CONCLUSIONS
In this work, we used search data and demographics to model suicide rates. In this way, search data serve as a proxy for unmeasured (hidden) factors corresponding to suicide rates. Moreover, our procedure for outlier rejection serves to single out states where the suicide rates have substantially different correlations with demographic factors and query rates.

Identifiants

pubmed: 30609976
pii: v21i1e10179
doi: 10.2196/10179
pmc: PMC6682304
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e10179

Informations de copyright

©Natalia Adler, Ciro Cattuto, Kyriaki Kalimeri, Daniela Paolotti, Michele Tizzoni, Stefaan Verhulst, Elad Yom-Tov, Andrew Young. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 04.01.2019.

Références

Proc SIGCHI Conf Hum Factor Comput Syst. 2016 May;2016:2098-2110
pubmed: 29082385
Asian J Psychiatr. 2017 Jun;27:48-52
pubmed: 28558895
BMC Public Health. 2018 Jul 6;18(1):846
pubmed: 29980237
AMIA Annu Symp Proc. 2006;:244-8
pubmed: 17238340
JMIR Ment Health. 2018 May 23;5(2):e43
pubmed: 29792291
Indian J Psychol Med. 2016 Nov-Dec;38(6):560-566
pubmed: 28031593
J Med Internet Res. 2016 Feb 11;18(2):e35
pubmed: 26868917
J Epidemiol Glob Health. 2016 Dec;6(4):217-227
pubmed: 27080191
Indian J Psychiatry. 2012 Oct;54(4):304-19
pubmed: 23372232
PLoS One. 2016 Apr 13;11(4):e0152285
pubmed: 27073900
Aust N Z J Psychiatry. 2018 Sep;52(9):856-863
pubmed: 29726275
Psychiatry Res. 2015 Aug 30;228(3):913-7
pubmed: 26003510
Br Med Bull. 2015 Jun;114(1):127-34
pubmed: 25958380
Sci Rep. 2017 Oct 11;7(1):13006
pubmed: 29021528
Lancet. 2012 Jun 23;379(9834):2343-51
pubmed: 22726517
Crisis. 2014;35(1):51-9
pubmed: 24121153
EPJ Data Sci. 2016;5(1):32
pubmed: 32355600
Nature. 2009 Feb 19;457(7232):1012-4
pubmed: 19020500
Indian J Psychiatry. 2015 Oct-Dec;57(4):348-54
pubmed: 26816422
Br J Psychiatry. 2008 Nov;193(5):402-5
pubmed: 18978322
J Affect Disord. 2013 Jun;148(2-3):411-2
pubmed: 23182592
Eur Arch Psychiatry Clin Neurosci. 2012 Feb;262(1):39-46
pubmed: 21505949
Clin Infect Dis. 2008 Dec 1;47(11):1443-8
pubmed: 18954267

Auteurs

Natalia Adler (N)

United Nations International Children's Emergency Fund (UNICEF), New York, NY, United States.

Ciro Cattuto (C)

ISI Foundation, Torino, Italy.

Kyriaki Kalimeri (K)

ISI Foundation, Torino, Italy.

Daniela Paolotti (D)

ISI Foundation, Torino, Italy.

Michele Tizzoni (M)

ISI Foundation, Torino, Italy.

Stefaan Verhulst (S)

The Governance Lab, New York University, New York, NY, United States.

Elad Yom-Tov (E)

Microsoft Research, Herzeliya, Israel.

Andrew Young (A)

The Governance Lab, New York University, New York, NY, United States.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH