| ... | ... | @@ -133,7 +133,7 @@ The data generation process involves augmenting the original dataset with synthe |
|
|
|
Step 1: Determine Number of Synthetic Rows to Add
|
|
|
|
The synthetic data will be generated to augment the dataset by approximately 25%. This is done by calculating 25% of the total number of rows in the original dataset.
|
|
|
|
|
|
|
|
num_synthetic_rows = int(len(data) * 0.25)
|
|
|
|
{width=386 height=35}
|
|
|
|
Purpose: This step ensures that the synthetic data constitutes 25% of the total dataset, allowing the model to benefit from the additional data without overwhelming the original dataset.
|
|
|
|
####
|
|
|
|
Step 2: Generate Synthetic Dates
|
| ... | ... | |
| ... | ... | |