
Interpolate irfs #141

Merged
merged 44 commits into from
Apr 26, 2021
Conversation

jsitarek
Contributor

this is PR #125, redone this time from a branch of the main repository rather than from a fork

the main changes are:

  • functions to interpolate effective area and migration matrix
  • functions to read IRFs (symmetric to the create_... functions)

the code already has all the requests from PR #125 implemented, so please let me know if you have any further comments or if we can merge it.

one note: the IRF-reading functions are made on purpose such that they can read either a single file or a list of files. This does not complicate those functions too much (it is done by a common auxiliary function) and it is very useful, because in real situations (either for interpolation or for averaging of IRFs) we will very often be reading in a bunch of IRF files, and instead of writing a separate function for each type of IRF to deal with arrays it is good to have such a universal tool.
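The single-file-or-list behaviour described above can be reduced to a small dispatching helper. A minimal sketch (all names here are hypothetical, not pyirf's actual API; in practice the `reader` argument would be a function that opens one FITS file):

```python
def read_one_or_many(files, reader):
    """Apply ``reader`` to a single path (given as a string) or to each
    path in a list of paths, returning one result or a list of results.

    This is the common auxiliary pattern: every IRF type gets single-file
    and multi-file reading for free from one shared helper.
    """
    if isinstance(files, str):
        return reader(files)
    return [reader(f) for f in files]
```

With this pattern, `read_aeff("irf.fits")` and `read_aeff(["irf1.fits", "irf2.fits"])` can share one implementation.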

jsitarek and others added 29 commits February 24, 2021 19:08
from the JSON config file the code gets the parameters according to which to perform the interpolation,
as well as the files with IRFs and their corresponding parameter values.
Then it reads in a data file and checks the values of those parameters;
in the end it performs the interpolation.
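The config-driven workflow described in this commit message can be sketched as follows. This is a minimal illustration, not pyirf's actual implementation: the config schema, function names, and the choice of a single zenith parameter with per-bin `np.interp` are all assumptions.

```python
import json

import numpy as np


def interpolate_aeff(config_path, load_aeff, target_zenith):
    """Interpolate effective area to ``target_zenith`` (sketch).

    The JSON config lists IRF files together with the parameter value
    (here: zenith angle) each one was produced at.  ``load_aeff`` reads
    one file and returns the effective area per energy bin.
    """
    with open(config_path) as f:
        cfg = json.load(f)

    zeniths = np.array([entry["zenith"] for entry in cfg["irfs"]])
    # shape (n_files, n_energy_bins)
    aeffs = np.array([load_aeff(entry["file"]) for entry in cfg["irfs"]])

    order = np.argsort(zeniths)
    # interpolate each energy bin independently over the zenith grid
    return np.array([
        np.interp(target_zenith, zeniths[order], column)
        for column in aeffs[order].T
    ])
```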
…olation.py

- added unit tests for two of the functions (one is disabled for the moment)
- added writing of the output FITS file with interpolated Aeff
now the code automatically extracts the unit of each read quantity
added a unit test (disabled for the moment) to test this new function
…ific anymore, now the key where the interpolation parameters are stored has to be specified. Updated also the unit test and the example macro of generating the interpolated IRFs.
…the data from the bulk of pyirf, it is moved to the examples file
….table.Table

removed read_unit_from_HDUL function and its unit test since it is unnecessary now
…ved also the corresponding unit test.

included the (before missing) test data for this test. Since the test files are small they are added directly to the repository
the function that was reading the lo and hi bins and previously also joining them into a single array now only reads the bins
added functions (in binning.py) to split or join the bins between the lo/hi scheme (used in FITS files) and single arrays (used in IRF computing functions)
added unit tests for all the above functions
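The lo/hi scheme mentioned above stores, for each bin, its lower and upper edge as separate columns, while the IRF-computing functions use a single array of edges. A minimal sketch of the split/join pair (illustrative only; pyirf's actual `binning.py` functions may differ, e.g. in validating that the bins are contiguous):

```python
import numpy as np


def join_bin_lo_hi(bin_lo, bin_hi):
    """Join contiguous lo/hi bin columns (FITS scheme) into one array
    of n+1 edges: all lower edges plus the last upper edge."""
    return np.append(bin_lo, bin_hi[-1])


def split_bin_lo_hi(edges):
    """Split a single array of n+1 edges back into the lo/hi scheme."""
    return edges[:-1], edges[1:]
```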
…t dispersion matrices to have the same sequence of variables as in the create/read dispersion matrix functions (before it was using the sequence as read from the FITS file)
…functions

changed read_fits_bins_lo_hi so that it returns the 0-th row of the table read from the file
changed read_fits_bins_lo_hi to be able to read from a list of files at the same time
…er of parameters

used the binning functions to simplify the generation of energy and theta tables in create_aeff2d_hdu function
@jsitarek
Contributor Author

jsitarek commented Mar 24, 2021

the status: I answered all the comments of @maxnoe; most of them are implemented, for one I asked for clarification, and on the question of reading tables of IRFs I'm not sure if we have converged.
there is also still the issue with failing tests because of a seemingly missing file (it is committed to the repository, yet it is not visible to the test); I would appreciate some hint about this problem

Member

@HealthyPear HealthyPear left a comment


The CI fails because test files are not supposed to be part of the repository: we actively ignore any data file with extensions like *.fits.gz or *.hdf5 in our .gitignore file.

I think we should keep all the test files together (namely, together with the ED/FACT comparison files)

@codecov

codecov bot commented Apr 23, 2021

Codecov Report

Merging #141 (d9568eb) into master (650c801) will increase coverage by 1.02%.
The diff coverage is 96.95%.


@@            Coverage Diff             @@
##           master     #141      +/-   ##
==========================================
+ Coverage   86.86%   87.89%   +1.02%     
==========================================
  Files          39       41       +2     
  Lines        1348     1503     +155     
==========================================
+ Hits         1171     1321     +150     
- Misses        177      182       +5     
Impacted Files Coverage Δ
pyirf/interpolation.py 81.81% <81.81%> (ø)
pyirf/binning.py 96.20% <90.00%> (-0.90%) ⬇️
pyirf/cuts.py 94.33% <100.00%> (+0.58%) ⬆️
pyirf/io/gadf.py 100.00% <100.00%> (ø)
pyirf/tests/test_binning.py 100.00% <100.00%> (ø)
pyirf/tests/test_cuts.py 98.55% <100.00%> (+0.27%) ⬆️
pyirf/tests/test_interpolation.py 100.00% <100.00%> (ø)


Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@jsitarek
Contributor Author

Hi @maxnoe @HealthyPear
Discussing with Max, I finally divided the PR: the interpolation stays here, and the rest of the new code (the new io functions) I will put into a separate branch/PR so we can continue the discussion there.
So, can we merge now?

Member

@maxnoe maxnoe left a comment


Mostly looks good, thanks.

I have one nitpick over the location / behaviour of the compare_irf_cuts function and a question concerning the effective area interpolation.

pyirf/io/gadf.py (outdated review thread, resolved)
pyirf/tests/test_interpolation.py (outdated review thread, resolved)

# remove zeros and log it
effective_area[effective_area < min_effective_area] = min_effective_area
effective_area = np.log(effective_area)
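Putting the snippet above together with the interpolation step it feeds, a minimal sketch of the clip-log-interpolate-exponentiate scheme over zenith (the function name, the zenith axis, and the floor value of 1 are assumptions for illustration, not pyirf's actual implementation):

```python
import numpy as np


def interpolate_aeff_logspace(aeffs, zeniths, target_zenith,
                              min_effective_area=1.0):
    """Interpolate effective area in log space over zenith (sketch).

    ``aeffs`` has shape (n_zeniths, n_energy_bins).  Values below the
    assumed floor ``min_effective_area`` are clipped before taking the
    log, mirroring the snippet above, so empty bins do not produce -inf.
    """
    aeffs = np.clip(np.asarray(aeffs, dtype=float),
                    min_effective_area, None)
    log_aeff = np.log(aeffs)

    # interpolate each energy bin independently, then undo the log
    interpolated = np.exp([
        np.interp(target_zenith, zeniths, column) for column in log_aeff.T
    ])
    # bins that stayed at the clipping floor are restored to zero
    interpolated[interpolated <= min_effective_area] = 0.0
    return interpolated
```

Note that linear interpolation in log space yields the geometric mean at the midpoint, which is exactly the "safer near threshold" behaviour argued for below.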
Member


Why do we interpolate effective area in logspace? The only axis over which effective area is expected to vary over multiple orders of magnitude is energy, but that's not the interpolation axis.

Wouldn't normal linear interpolation make more sense for different values of e.g. azimuth / zenith?

Contributor Author


the reason is mostly threshold effects: when you interpolate the energy bins close to the threshold, the zenith angle shifts the whole curve in energy scale, which means that a difference in zenith can produce a large difference in collection area, and thus it is safer to interpolate such points in log space.

Member

@maxnoe maxnoe Apr 26, 2021


Ok, let's merge it like this and maybe we need to make this optional and check on simulations which interpolation scheme performs better.

Contributor


I think interpolation in log(Aeff)-log(E) space should work well at all energies, not just at threshold, so for simplicity I would keep the log-log approach everywhere (at least as the default).

Member


But energy is not used in the interpolation at all. The interpolation happens for all energy bins independently.

And then it boils down to what kind of behaviour you expect of effective area vs. the changing (interpolation) variable.
That would just need to be checked for the specific interpolation one is doing.

Contributor


Ah, right, the interpolation in energy is done by gammapy.
I think Julian's point that log(Aeff) is also better for interpolation in zenith is probably true, but there is no other way of knowing than testing it.

… returns false rather than raising exception when cuts differ
@maxnoe maxnoe requested a review from HealthyPear April 26, 2021 07:46
@moralejo
Contributor

moralejo commented Nov 3, 2021

Hi, I think that besides interpolation we definitely need extrapolation, because one cannot build a grid that covers all possible cases inside it (much less in an efficient way).

I see that the scipy function used has no extrapolation option (even though it would be trivial to add). Could you explore the possibility of using instead the functions in https://matplotlib.org/stable/api/tri_api.html? One can get the parameters of the plane defined by the 3 grid points closest to the given target direction, and then evaluate the plane at that point...
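The plane-through-three-points idea can be sketched with plain NumPy, without matplotlib.tri (function name and 2-D parameter space are illustrative; note that simply taking the three nearest grid nodes can yield a degenerate, collinear triangle, which a real implementation would have to guard against):

```python
import numpy as np


def plane_extrapolate(grid_points, values, target):
    """Evaluate the plane through the 3 grid nodes closest to ``target``.

    ``grid_points`` has shape (n, 2), e.g. (cos(zenith), azimuth);
    ``values`` has shape (n,).  Unlike interpolation restricted to the
    convex hull, the plane can also be evaluated outside the grid.
    """
    grid_points = np.asarray(grid_points, dtype=float)
    values = np.asarray(values, dtype=float)
    target = np.asarray(target, dtype=float)

    # pick the three grid nodes nearest to the target direction
    nearest = np.argsort(np.linalg.norm(grid_points - target, axis=1))[:3]
    pts = grid_points[nearest]

    # solve z = a*x + b*y + c for the plane through the three points
    A = np.column_stack([pts, np.ones(3)])
    a, b, c = np.linalg.solve(A, values[nearest])
    return a * target[0] + b * target[1] + c
```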

@maxnoe
Member

maxnoe commented Nov 3, 2021

I think this comment is better in the open issue about changes to the interpolation than in this closed PR.

Labels
enhancement New feature or request input/output Format and file extensions of the input/output data.