• English
    • العربية
  • العربية
  • Login
  • QU
  • QU Library
  •  Home
  • Communities & Collections
  • Help
    • Item Submission
    • Publisher policies
    • User guides
    • FAQs
  • About QSpace
    • Vision & Mission
View Item 
  •   Qatar University Digital Hub
  • Qatar University Institutional Repository
  • Academic
  • Student Thesis & Dissertations
  • College of Arts & Sciences
  • Mathematics, Statistics & Physics
  • View Item
  • Qatar University Digital Hub
  • Qatar University Institutional Repository
  • Academic
  • Student Thesis & Dissertations
  • College of Arts & Sciences
  • Mathematics, Statistics & Physics
  • View Item
  •      
  •  
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Assessment and Prediction of Body Fat Composition Using A Variety of Machine Learning Algorithms

    View/Open
    TAHSIN RAAHILA SHAJAHAN_OGSApproved_Thesis.pdf (430.0Kb)
    Date
    2023-06
    Author
    Shajahan, Tahsin Raahila
    Metadata
    Show full item record
    Abstract
    Body composition is critical for health outcomes and has been researched in various populations and conditions like obesity, diabetes, and many more. Qatar Biobank collected anthropometric and biomedical data from individuals across all age groups. Body fat and lean mass are important measures of body composition which help identify several health risks including cardiovascular health and nutrition. Machine learning (ML) algorithms in Python were used to predict Total Fat Percentage (TFP) and Total Lean Mass (TLM). All the variables in the dataset are used to test different ML algorithms on the TFP variable. Based on performance metrics like R2, Mean Absolute Error and Root Mean Square Error; linear regression, support vector regression (SVR) and extreme gradient boosting (XGBoost) performed well. Subsequently, further analysis on these models were performed using feature selection methods like forward, backward, stepwise and information gain for multiple cross-validation (CV) levels. We found that backward selection with a 10-CV on the SVR model predicted TFP the best with R2 of 86.7% (train), R2 of 80.2% (test) and MAE of 0.025 (train), MAE of 0.030 (test). Some of the best variables selected via this model are: testosterone, urea, gender, body mass index (BMI) and bone mineral density (BMD) Next, TLM is analyzed using the three models that were selected earlier for TFP. It was found that linear regression and SVR models predicted TLM well, while XGBoost performed poorly. Since backward selection with 10-CV produced good results for TFP, the same is applied to the models for feature selection. Based on the results obtained we conclude that linear regression model after feature section predicts TLM the best with R2 of 83.7% (train), R2 of 82.9% (test) and MAE of 0.313 (train), MAE of 0.313 (test). Some of the best variables explaining TLM are: gender, age, BMI, cholesterol and BMD.
    DOI/handle
    http://hdl.handle.net/10576/48144
    Collections
    • Mathematics, Statistics & Physics [‎35‎ items ]

    entitlement


    Qatar University Digital Hub is a digital collection operated and maintained by the Qatar University Library and supported by the ITS department

    Contact Us | Send Feedback
    Contact Us | Send Feedback | QU

     

     

    Home

    Submit your QU affiliated work

    Browse

    All of Digital Hub
      Communities & Collections Publication Date Author Title Subject Type Language Publisher
    This Collection
      Publication Date Author Title Subject Type Language Publisher

    My Account

    Login

    Statistics

    View Usage Statistics

    About QSpace

    Vision & Mission

    Help

    Item Submission Publisher policiesUser guides FAQs

    Qatar University Digital Hub is a digital collection operated and maintained by the Qatar University Library and supported by the ITS department

    Contact Us | Send Feedback
    Contact Us | Send Feedback | QU

     

     

    Video