Missing details #1

vgaraujov · 2022-06-10T18:20:09Z

Hi! Thanks for the contribution. I want to use your probing tasks (EN and ES); however, I came across some problems:

Regarding EN data for tasks 4, 5, and 6. You don't include code for extracting EN data. Are you supposed to provide the final EN data? The data in the data_en folder seems to be incomplete. Could you please take a look at it?
Regarding EN data for tasks 4, 5, and 6. You provide code for extracting EN data. However, there is a missing module in the script: from discourse_tree_utils import *. Is it a missing file?

I would really appreciate it if you could help me with this.

hotzjacobb · 2022-11-08T00:45:47Z

@vgaraujov Hi Vladimir, the dataset that the authors cite in the paper is the Penn Treebank. This is a proprietary dataset, so they are unfortunately not allowed to share it. Hopefully you have access to it through an institution? https://catalog.ldc.upenn.edu/LDC2002T07 Feel free to message me.

As for how it's parsed, I'm not sure, but I may need to look into this myself, so I might be able to post an update here.

Cheers. (:
