-
Notifications
You must be signed in to change notification settings - Fork 30
Description
In testing the new code for this pull request, I found an issue with processing the delayed mode SeaExplorer time series data for missions for which certain sensors (oxygen in this case) are severely oversampled. These missions end up with delayed mode data files that contain fewer actual (non-nan) data points than the realtime files. In other words, we are losing data during the processing.
Currently, the dropna
function is used to remove the oversampled oxygen data when converting the raw data. The dropna
function is working correctly, however note that the resulting data has many nan values in it, for both the CTD and optics. These nan values will often not co-occur.
I think the problem in the processing is caused by using the GPCTD_TEMPERATURE
as the default time base in seaexplorer.py
. This variable contains nan values that are not all co-located with the nan values in the oxygen and optical variables. It's desirable to use the CTD as the time base, but we may need to do some interpolation to avoid losing data when the other variables are mapped onto this base.