Predicting hypertension using machine learning: Findings from Qatar Biobank Study

AlKaabi, Latifa A; Ahmed, Lina S; Al Attiyah, Maryam F; Abdel-Rahman, Manar E

Author	AlKaabi, Latifa A
Author	Ahmed, Lina S
Author	Al Attiyah, Maryam F
Author	Abdel-Rahman, Manar E
Available date	2020-10-28T10:19:55Z
Publication Date	2020-10-16
Publication Name	PLoS ONE
Identifier	http://dx.doi.org/10.1371/journal.pone.0240370
Citation	AlKaabi LA, Ahmed LS, Al Attiyah MF, Abdel-Rahman ME (2020) Predicting hypertension using machine learning: Findingsfrom Qatar Biobank Study. PLoS ONE 15(10): e0240370. https://doi.org/10.1371/journal.pone.0240370
Identifier	e0240370
URI	http://hdl.handle.net/10576/16825
Abstract	Hypertension, a global burden, is associated with several risk factors and can be treated by lifestyle modifications and medications. Prediction and early diagnosis is important to prevent related health complications. The objective is to construct and compare predictive models to identify individuals at high risk of developing hypertension without the need of invasive clinical procedures. This is a cross-sectional study using 987 records of Qataris and long-term residents aged 18+ years from Qatar Biobank. Percentages were used to summarize data and chi-square tests to assess associations. Predictive models of hypertension were constructed and compared using three supervised machine learning algorithms: decision tree, random forest, and logistics regression using 5-fold cross-validation. The performance of algorithms was assessed using accuracy, positive predictive value (PPV), sensitivity, F-measure, and area under the receiver operating characteristic curve (AUC). Stata and Weka were used for analysis. Age, gender, education level, employment, tobacco use, physical activity, adequate consumption of fruits and vegetables, abdominal obesity, history of diabetes, history of high cholesterol, and mother's history high blood pressure were important predictors of hypertension. All algorithms showed more or less similar performances: Random forest (accuracy = 82.1%, PPV = 81.4%, sensitivity = 82.1%), logistic regression (accuracy = 81.1%, PPV = 80.1%, sensitivity = 81.1%) and decision tree (accuracy = 82.1%, PPV = 81.2%, sensitivity = 82.1%. In terms of AUC, compared to logistic regression, while random forest performed similarly, decision tree had a significantly lower discrimination ability (p-value<0.05) with AUC's equal to 85.0, 86.9, and 79.9, respectively. Machine learning provides the chance of having a rapid predictive model using non-invasive predictors to screen for hypertension. Future research should consider improving the predictive accuracy of models in larger general populations, including more important predictors and using a variety of algorithms.
Language	en
Publisher	PLoS ONE
Subject	Prediction model Predictors Logistic regression Decision tree Random forest Machine Learning Hypertension High blood pressure
Title	Predicting hypertension using machine learning: Findings from Qatar Biobank Study
Type	Article
Issue Number	10
Volume Number	15
dc.accessType	Open Access

Files in this item

Name:: Predicting hypertension using ...
Size:: 909.0Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Public Health [‎433‎ items ]

Show simple item record

Predicting hypertension using machine learning: Findings from Qatar Biobank Study

Files in this item

This item appears in the following Collection(s)

Video