Hi. First, thank you for your interesting work in the domain of long context!
I have some questions about the experimental part of the article, specifically the experiments in Table 2 and Table 3.
Why do these two experiments show only the performance of the LIFT and ICT methods? I think results for other previously proposed fine-tuning techniques, such as LoRA, should be added for comparison.