Mathematics, Statistics & Physics
http://hdl.handle.net/10576/3082
2024-03-28T20:36:30Z
http://hdl.handle.net/10576/48144
Assessment and Prediction of Body Fat Composition Using A Variety of Machine Learning Algorithms
Shajahan, Tahsin Raahila
Body composition is critical to health outcomes and has been studied across various populations and in conditions such as obesity and diabetes. Qatar Biobank collected anthropometric and biomedical data from individuals across all age groups. Body fat and lean mass are important measures of body composition that help identify health risks related to cardiovascular health and nutrition. Machine learning (ML) algorithms in Python were used to predict Total Fat Percentage (TFP) and Total Lean Mass (TLM).
All variables in the dataset were used to test different ML algorithms on the TFP variable. Based on performance metrics such as R2, Mean Absolute Error (MAE), and Root Mean Square Error (RMSE), linear regression, support vector regression (SVR), and extreme gradient boosting (XGBoost) performed well. Subsequently, further analysis of these models was performed using feature selection methods (forward, backward, stepwise, and information gain) at multiple cross-validation (CV) levels. We found that backward selection with 10-fold CV on the SVR model predicted TFP best, with R2 of 86.7% (train) and 80.2% (test) and MAE of 0.025 (train) and 0.030 (test). Some of the strongest variables selected by this model are testosterone, urea, gender, body mass index (BMI), and bone mineral density (BMD).
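For reference, the performance metrics cited above follow their standard definitions, with $y_i$ the observed values, $\hat{y}_i$ the predictions, and $\bar{y}$ the sample mean:
$$R^2 = 1 - \frac{\sum_{i=1}^{n}(y_i-\hat{y}_i)^2}{\sum_{i=1}^{n}(y_i-\bar{y})^2},\qquad \mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}\lvert y_i-\hat{y}_i\rvert,\qquad \mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}(y_i-\hat{y}_i)^2}.$$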
Next, TLM was analyzed using the three models selected earlier for TFP. It was found that the linear regression and SVR models predicted TLM well, while XGBoost performed poorly. Since backward selection with 10-fold CV produced good results for TFP, the same approach was applied for feature selection. Based on the results obtained, we conclude that the linear regression model after feature selection predicts TLM best, with R2 of 83.7% (train) and 82.9% (test) and MAE of 0.313 (train and test). Some of the best variables explaining TLM are gender, age, BMI, cholesterol, and BMD.
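A minimal sketch of the kind of pipeline described above, backward feature selection with 10-fold CV wrapped around an SVR, can be written with scikit-learn as follows. The data, column names, and hyperparameters below are illustrative stand-ins only; the Qatar Biobank dataset and the thesis' exact preprocessing are not reproduced here.

# Sketch only: synthetic stand-in data, not the Qatar Biobank dataset.
import numpy as np
import pandas as pd
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.metrics import mean_absolute_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

# Hypothetical predictors standing in for Biobank variables (names illustrative only).
rng = np.random.default_rng(42)
n = 500
X = pd.DataFrame(rng.normal(size=(n, 6)),
                 columns=["testosterone", "urea", "gender", "bmi", "bmd", "age"])
y = 0.4 * X["bmi"] - 0.3 * X["testosterone"] + 0.1 * X["bmd"] + rng.normal(0, 0.1, n)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# Backward feature selection with 10-fold CV around an SVR estimator,
# mirroring the configuration reported above as best for TFP.
svr = SVR(kernel="rbf")
selector = SequentialFeatureSelector(svr, direction="backward",
                                     n_features_to_select=4, cv=10, scoring="r2")
model = make_pipeline(StandardScaler(), selector, svr)
model.fit(X_train, y_train)

for split, Xs, ys in [("train", X_train, y_train), ("test", X_test, y_test)]:
    pred = model.predict(Xs)
    print(f"{split}: R2={r2_score(ys, pred):.3f}  MAE={mean_absolute_error(ys, pred):.3f}")

Replacing SVR with LinearRegression in the same pipeline corresponds to the configuration reported above as best for TLM.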
2023-06-01T00:00:00Z
http://hdl.handle.net/10576/47659
VOLATILITY ESTIMATION IN MISSING AT RANDOM HIGH-FREQUENCY FINANCIAL TIME SERIES
ACHAIBOU, FERIEL
Over the past 15 years or more, capital markets have seen significant development, with the introduction of high-frequency trading and a shift of markets toward algorithmic trading. High-frequency and automated trading have long been believed to be a source of price shocks and rising volatility. Therefore, growing interest has recently been given to modeling volatility with high-frequency financial data. However, financial data can still be missing despite modern technology that allows data collection on a very fine time scale. Thus, this thesis focuses on the estimation of regression and volatility functions from missing data using a nonparametric heteroscedastic regression model. A Nadaraya-Watson type estimator is used when the response variable is a real-valued random variable subject to a missing-at-random mechanism, while the predictor is a completely observed infinite-dimensional (functional) random variable. Based on the observed data, we first introduce a simplified estimator as well as an inverse-probability-weighted estimator. Second, these initial estimators are used to impute missing values and to define estimators of the regression and volatility operators based on the imputed data. Third, the performance of the proposed estimators is assessed using simulated data. Finally, an application to the estimation and forecasting of the daily volatility of Brent oil price returns, conditionally on 1-minute frequency daily natural gas returns curves, is also investigated.
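In rough terms, and as a sketch rather than the thesis' exact formulation (with $\delta_i$ the indicator that $Y_i$ is observed, $K$ a kernel, $d$ a semi-metric on the functional space, $h$ a bandwidth, and $p(x)=P(\delta=1\mid X=x)$ estimated by $\hat{p}$), the simplified and inverse-probability-weighted Nadaraya-Watson estimators take the form
$$\hat{m}_S(x)=\frac{\sum_{i=1}^{n}\delta_i\,K\!\left(d(x,X_i)/h\right)Y_i}{\sum_{i=1}^{n}\delta_i\,K\!\left(d(x,X_i)/h\right)},\qquad \hat{m}_{IPW}(x)=\frac{\sum_{i=1}^{n}\frac{\delta_i}{\hat{p}(X_i)}\,K\!\left(d(x,X_i)/h\right)Y_i}{\sum_{i=1}^{n}\frac{\delta_i}{\hat{p}(X_i)}\,K\!\left(d(x,X_i)/h\right)},$$
and, under the heteroscedastic model $Y_i=m(X_i)+\sigma(X_i)\varepsilon_i$ with $E(\varepsilon_i\mid X_i)=0$ and $E(\varepsilon_i^2\mid X_i)=1$, the volatility operator can be estimated by applying the same weights to $Y_i^2$ and setting $\hat{\sigma}^2(x)=\widehat{E}(Y^2\mid X=x)-\hat{m}(x)^2$.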
2023-06-01T00:00:00Z
http://hdl.handle.net/10576/44991
ON THE PREFERENCE OF ZERO-INFLATION MODELS WITH THE PRESENCE OF DATA CONTAMINATION
ELSOUSY, REEM MOHAMAD RIFAAT
Data has become central to how researchers solve problems and improve everyday life, and it is not unusual for different data sources to generate data with different characteristics. In fields such as engineering, epidemiology, psychology, sociology, public health, agriculture, road safety, economics, biology, medicine, and others, the dependent variable commonly contains a high proportion of zeros; this phenomenon is called zero-inflation of the count. For example, the number of deaths in a car accident is most likely zero, and counting rare birds in a specific region will yield zero in most sectors. When the outcome consists of non-negative count values with excessive zeros, classical models cannot properly infer the relationship between the covariates and the dependent variable. Furthermore, some approaches, such as neural networks and logistic regression, require an equal percentage of the output classes when modeling the data, for instance an equal number of observations for class 0 and class 1. Following this leads to discarding a large portion of the data, resulting in inference based on a sample that does not represent the entire population. To cope with these downsides, researchers have focused on developing methods to model the entire set of zero-inflated count data. Several approaches have been constructed to express the dependency between the covariates and the dependent variable in the presence of excessive zero counts. However, discrepancies in performance were found when certain factors played a role, and some algorithms suffer from specific obstacles: Poisson and zero-inflated Poisson regression are restricted by the assumption of equidispersion; the negative binomial cannot handle zero-inflation in the response; and the zero-inflated negative binomial may fail to converge in some cases. Moreover, other factors can influence model selection, such as the proportion of zeros, the sample size, and the degree of dispersion; different values of these factors can change the performance of a model. In addition, the structure of the selected population can be another factor that affects the results, as can the condition of the data, whether it contains outliers, missing data, or measurement errors. Considering all of the above, this study aims to lay down basic guidelines that researchers can follow when dealing with zero-inflated phenomena.
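As a rough illustration of the modeling choice discussed above, the following sketch simulates zero-inflated counts and compares a Poisson fit with a zero-inflated Poisson fit by AIC using statsmodels. The data, coefficients, and zero-inflation proportion are invented for illustration and are not from this study.

# Sketch only: simulated zero-inflated counts, not data from the thesis.
import numpy as np
import statsmodels.api as sm
from statsmodels.discrete.count_model import ZeroInflatedPoisson

rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=n)
X = sm.add_constant(x)                      # design matrix for the count part
lam = np.exp(0.5 + 0.8 * x)                 # Poisson mean of the count component
pi = 0.30                                   # assumed proportion of structural zeros
structural_zero = rng.random(n) < pi
y = np.where(structural_zero, 0, rng.poisson(lam))

poisson_fit = sm.Poisson(y, X).fit(disp=0)
zip_fit = ZeroInflatedPoisson(y, X, exog_infl=np.ones((n, 1)),
                              inflation='logit').fit(disp=0)

# The model with the lower AIC is preferred; with excessive zeros the ZIP
# fit typically dominates the plain Poisson fit on data like this.
print(f"Poisson AIC: {poisson_fit.aic:.1f}   ZIP AIC: {zip_fit.aic:.1f}")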
2023-06-01T00:00:00Z
http://hdl.handle.net/10576/41067
Reliability analysis of the Stress-Strength model from truncated Pareto distribution based on progressive Type-II censored samples.
Ali, Hadeel Mohammed
In this project, we studied stress-strength reliability (SSR) models. The stress-strength model has many applications in engineering problems, for example the strength of a building subjected to an earthquake, the strength of a rocket motor relative to its working pressure, and the strength of a bridge. We estimated the reliability parameter using the maximum likelihood estimation method in three cases (the arbitrary case, the common truncated case, and the common resilience parameter case). We computed the maximum likelihood estimator (MLE) of the reliability parameter R, studied the properties of the estimator of R through extensive simulation studies, and illustrated our method through some real data examples. Moreover, we computed generalized confidence intervals based on pivotal quantities, as well as bootstrap confidence intervals. We found that the confidence interval is wider in the arbitrary parameter case, and that there is no large difference between the estimators of the reliability parameter obtained using the different methods.
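For context, in the continuous case the stress-strength reliability of a strength $X$ against an independent stress $Y$ is commonly written as
$$R = P(Y < X) = \int F_Y(t)\, f_X(t)\, dt, \qquad \hat{R} = R(\hat{\theta}_X, \hat{\theta}_Y),$$
where the plug-in estimator $\hat{R}$ follows from the invariance property of maximum likelihood, and a percentile bootstrap interval is obtained from the empirical $\alpha/2$ and $1-\alpha/2$ quantiles of the bootstrap replicates $\hat{R}^{*}$. This is a generic formulation; the thesis' expressions specific to the truncated Pareto distribution under progressive Type-II censoring are not reproduced here.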
2023-01-01T00:00:00Z