Good work on this project!
I noticed that both the code and the paper use a vFoV sampling range of 20° to 105° in the data generation process. Could you explain the rationale behind this choice? It seems limited for equivalent focal-length coverage and may not be very robust. Below are my test results using a 28–200 mm lens:
