• English
    • العربية
  • العربية
  • Login
  • QU
  • QU Library
  •  Home
  • Communities & Collections
View Item 
  •   Qatar University Digital Hub
  • Qatar University Institutional Repository
  • Academic
  • Student Thesis & Dissertations
  • College of Health Sciences
  • Biomedical Sciences
  • View Item
  • Qatar University Digital Hub
  • Qatar University Institutional Repository
  • Academic
  • Student Thesis & Dissertations
  • College of Health Sciences
  • Biomedical Sciences
  • View Item
  •      
  •  
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    MACHINE LEARNING PREDICTION OF DIABETES FROM A PUBLICLY AVAILABLE DATASET

    View/Open
    Khalid Alqahtani_OGS Approved Project.pdf (1.380Mb)
    Date
    2024-01
    Author
    AL-QAHTANI, KHALID DABSAN H H
    Metadata
    Show full item record
    Abstract
    A wide array of medical conditions necessitate invasive diagnostic techniques, with diabetes being one of the most well-known among them. To address this challenge, a predictive model was developed using a publicly available dataset comprising over a million participants from the USA, sourced from the Behavioral Risk Factor Surveillance System (BRFSS). The dataset spans the years 2019, 2020, and 2021. After conducting a thorough literature review to establish connections with features defining a participant's diabetic status, three primary class features and 30 additional features were chosen from this three-year dataset. These class and feature selections were adapted to be compatible with the application used to construct the predictive model. The model itself was constructed using the Weka application. Missing data were transformed and the Principle Component filter was applied to reduce the data from a multidimensional space to a 2D space. The model was trained on four different classifiers, with Random Forest emerging as the most effective classifier. To validate the model, an unseen dataset from 2021 was employed, employing the supply test set method in Weka. The model demonstrated an ability to predict more than 70% of the cases and 98% of the controls. This predictive model can be used to predict various health conditions using publicly available data, reducing the reliance on invasive diagnostic methods. Furthermore, it holds promise for the Hamad Medical Corporation in assessing the risk of diabetes development among individuals in Qatar by leveraging the features incorporated in this model.
    DOI/handle
    http://hdl.handle.net/10576/51491
    Collections
    • Biomedical Sciences [‎66‎ items ]

    entitlement


    Qatar University Digital Hub is a digital collection operated and maintained by the Qatar University Library and supported by the ITS department

    Contact Us | Send Feedback
    Contact Us | Send Feedback | QU

     

     

    Home

    Submit your QU affiliated work

    Browse

    All of Digital Hub
      Communities & Collections Publication Date Author Title Subject Type Language Publisher
    This Collection
      Publication Date Author Title Subject Type Language Publisher

    My Account

    Login

    Statistics

    View Usage Statistics

    Qatar University Digital Hub is a digital collection operated and maintained by the Qatar University Library and supported by the ITS department

    Contact Us | Send Feedback
    Contact Us | Send Feedback | QU

     

     

    Video