Data and codes for the Paper "Enhancing the prediction accuracy of concrete properties with knowledge constrained data augmentation and tabular foundation model"
- The folder
datasetsstores the 15 datasets used in the study. - The
data_aug.pydefines four data augmentation models: GaussianCopula, CTGAN, CopulaGAN, and TVAE. - The
filter.pydefines three anomaly detection methods: RANSAC, IF, and LOF. - The
model.pydefines the TabPFN framework - The
benchmark.pydefines eight benchmark ML models, which are used to compare with TabPFN - The
prediction.pydefines the whole experimental process. - All synthetic datasets can be downloaded here. The password is
MT1G.