Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TREC 2024 Tip-of-the-Tongue #271

Open
8 tasks
mam10eks opened this issue Sep 16, 2024 · 3 comments
Open
8 tasks

TREC 2024 Tip-of-the-Tongue #271

mam10eks opened this issue Sep 16, 2024 · 3 comments

Comments

@mam10eks
Copy link
Contributor

Dataset Information:

The training and dev data of the TREC 2023 Tip-of-the-Tongue track are now available: https://trec-tot.github.io/guidelines

Description from the website:

Tip of the tongue: The phenomenon of failing to recall something from memory, combined with partial recall and the feeling that recall is imminent.

Links to Resources:

Dataset ID(s) & supported entities:

  • tip-of-the-tongue/2024: corpus
  • tip-of-the-tongue/2024/test: test queries

Checklist

Mark each task once completed. All should be checked prior to merging a new dataset.

  • Dataset definition (in ir_datasets/datasets/[topid].py)
  • Tests (in tests/integration/[topid].py)
  • Metadata generated (using ir_datasets generate_metadata command, should appear in ir_datasets/etc/metadata.json)
  • Documentation (in ir_datasets/etc/[topid].yaml)
  • Downloadable content (in ir_datasets/etc/downloads.json)
    • Download verification action (in .github/workflows/verify_downloads.yml). Only one needed per topid.
    • Any small public files from NIST (or other potentially troublesome files) mirrored in https://github.com/seanmacavaney/irds-mirror/. Mirrored status properly reflected in downloads.json.

Additional comments/concerns/ideas/etc.

@mam10eks
Copy link
Contributor Author

I think this should be rather fast, I think it should be easy to integrate this into the code of the previous year: https://github.com/allenai/ir_datasets/blob/master/ir_datasets/datasets/trec_tot.py

@mam10eks
Copy link
Contributor Author

I will try to make a pull request :)

@mam10eks
Copy link
Contributor Author

I have created a pull request with some tests here: #272

As soon as this is merged, we could close the issue :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant