Script to create folds for 5-fold cross validation.
Since we have a small dataset, we will do 5-fold cross-validation rather than the normal hold-out set based validation. This will provide us a more accurate idea of our model's generalization ability since it will be less influenced by the variance of any single split. Additionally, since the dataset is small, it won't be too compute-heavy to do this form of validation.