Data and codes for the Paper "Leveraging domain knowledge in data augmentation to boost concrete strength prediction accuracy with automated machine learning and deep learning"
- The
data.csvis collected from https://archive.ics.uci.edu/dataset/165/concrete+compressive+strength - The
data_aug.pyprovides the hyperparameter optimization process of four data augmentation methods: GaussianCopula, CTGAN, CopulaGAN, and TVAE. The generation of initial and finalized synthetic data is presented in the script. - The
filter.pydefines three anomaly detection methods: RANSAC, IF, and LOF. - The
automl.pydefines the AutoML and AutoDL frameworks. - The
prediction.pydefines the whole experimental process.