Update Data authored by Adham Beshr's avatar Adham Beshr
--- ---
title: Data title: Data
--- ---
## Data Chapter ## Data Chapter
### 1. Data Source ### 1. Data Source
...@@ -89,7 +90,12 @@ title: Data ...@@ -89,7 +90,12 @@ title: Data
#### 5.2. Impact of Fake Data #### 5.2. Impact of Fake Data
- The addition of fake data was evaluated by comparing the performance of machine learning models with and without fake data. The performance drop or improvement was analyzed based on the **Mean Squared Error (MSE)** and **R²** scores. - The inclusion of fake data was carefully evaluated as it significantly influenced the model's accuracy. Adding more than 25% fake data caused the accuracy to drop drastically, making the model unreliable.
- To maintain balance and robustness, the fake data was limited to 25%. This ensured enough variety without heavily compromising model performance.
- The evaluation showed a decline in performance metrics such as **Mean Squared Error (MSE)** and **R²** scores when fake data exceeded this threshold, emphasizing the importance of minimizing synthetic data.
### 6. Data Splitting ### 6. Data Splitting
- The dataset was split into **training** and **testing** sets using a **70:30** ratio for model evaluation. - The dataset was split into **training** and **testing** sets using a **70:30** ratio for model evaluation.
... ...
......