Robust Gaussian Process Regression Based on Iterative Trimming (ITGP)
- Nonparametric, flexible, and robust nonlinear regression based on Gaussian process.
- Outperforming the robust Gaussian process with Student’s-t likelihood significantly in many test cases.
The Gaussian process (GP) regression can be severely biased when the data are contaminated by outliers. ITGP is a new robust GP regression algorithm that iteratively trims the most extreme data points. While the new algorithm retains the attractive properties of the standard GP as a nonparametric and flexible regression method, it can greatly improve the model accuracy for contaminated data even in the presence of extreme or abundant outliers. It is also easier to implement compared with previous robust GP variants that rely on approximate inference. Applied to a wide range of experiments with different contamination levels, the proposed method significantly outperforms the standard GP and the popular robust GP variant with the Student-t likelihood in most test cases.
pip install robustgp
Dependency: GPy
One can start with examples in this notebook.
from robustgp import ITGP
# train ITGP
res = ITGP(X, Y, alpha1=0.5, alpha2=0.975, nsh=2, ncc=2, nrw=1)
gp, consistency = res.gp, res.consistency
# make prediction
y_avg, y_var = gp.predict(x_new)
y_var *= consistency
Here gp
is a GPy.core.GP
object, whose usage is further illustrated here.
-
See the docstring of ITGP for detailed usage and full API.
-
See this notebook for a complete example.
Comparison with standard Gaussian process and t-likelihood Gaussian process in terms of RMSE. See the algorithm paper for details.
-
Algorithm paper: Robust Gaussian Process Regression Based on Iterative Trimming, Zhao-Zhou Li, Lu Li, & Zhengyi Shao, 2020
-
First application: Modeling Unresolved Binaries of Open Clusters in the Color-Magnitude Diagram. I. Method and Application of NGC 3532, Lu Li, Zhengyi Shao, Zhao-Zhou Li, Jincheng Yu, Jing Zhong, & Li Chen, 2020
The MIT License