Updating information for citing the UraLex 2.0 dataset on Zenodo #20
Updated the publication information of the articles using the UraLex 2.0 data in README.
Updated the uralex_documentation.md file with information on published papers for the citations:
De Heer, Mervi, Rogier Blokland, Michael Dunn & Outi Vesakoski. 2024. “Loanwords in basic vocabulary as an indicator of borrowing profiles”. Journal of Language Contact 16(1). 54–103. https://doi.org/10.1163/19552629-bja10057.
Syrjänen, Kaj, Luke Maurits, Unni-Päivä Leino, Terhi Honkola, Jadranka Rota & Outi Vesakoski. 2021. “Crouching TIGER, hidden structure: Exploring the nature of linguistic data using TIGER values”. Journal of Language Evolution 6(2). 99–118. https://doi.org/10.1093/jole/lzab004.
I'll try to look into this.
Is that UraLex 1.0 plus https://github.com/bedlan/uralex-ns ?
Ah, I see, there are changes at https://github.com/bedlan/uralex as well.
Hi, the Northern Samoyedic (NS) expansion is intended to become a separate sub-release in the UraLex project because it has been compiled on somewhat different principles than the UraLex 1.0 and 2.0 versions. Therefore we are not planning to merge the NS expansion directly with 2.0 for a new release. This is to highlight the new, interesting innovations of the NS part.
I made them (hopefully in the right place). We have several branches of UraLex, and I'm trying to update the citation information for UraLex 2.0 that also appears on Zenodo ("When you use this dataset, please also cite the following papers, introducing it: ...") before finalizing version 3.0.
@MervideHeer one question: Are we just updating the citation information here, or is actual new data supposed to be added? As far as I can see, the last changes to the data are from 2021. I suppose there is more data now, in particular regarding loanwords. Is that correct?
I've updated the info at https://zenodo.org/records/4777568 and at https://github.com/lexibank/uralex/releases/tag/v2.0 |
Thank you for the update and for taking the time to look at the issue! I see that the CLDF-validation tag has changed to green again. However, my goal in the pull request was to update both papers using the 2.0 dataset. I can see that the first paper, "De Heer et al.", is updated but the second is not. It should be: Syrjänen, Kaj, Luke Maurits, Unni-Päivä Leino, Terhi Honkola, Jadranka Rota & Outi Vesakoski. 2021. “Crouching TIGER, hidden structure: Exploring the nature of linguistic data using TIGER values”. Journal of Language Evolution 6(2). 99–118. https://doi.org/10.1093/jole/lzab004.
At the moment, only the citations need an update. For the upcoming version 3.0, we have prepared big changes: not only is the loanword information updated, but updated reflexes, cognate assessments and new languages are coming as well. So no new data is being added to 2.0 right now.
Ah, I see. Second paper should be updated now, too. |
I can see them both there now! |
Dear Lexibank contributors,
I have updated the UraLex 2.0 basic vocabulary dataset README and Documentation with citation information on published papers. It would be good if the Zenodo record of UraLex 2.0 were updated as well. Could these changes be merged?
I can also see that UraLex's CLDF-validation tag is failing. Can I fix it somehow?
On behalf of the BEDLAN team
Mervi de Heer