Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add CharacTER #633

Open
stancld opened this issue Nov 24, 2021 · 13 comments
Open

Add CharacTER #633

stancld opened this issue Nov 24, 2021 · 13 comments
Labels
Milestone

Comments

@stancld
Copy link
Contributor

stancld commented Nov 24, 2021

🚀 Feature

Add CharacTER, a text metric used for NMT evaluation.

Sources:
Paper - CharacTER: Translation Edit Rate on Character Level
Repo

Motivation / Abstract

Recently, the capability of character-level evaluation measures for machine translation output has been confirmed by several metrics. This work proposes translation edit rate on character level (CharacTER), which calculates the character level edit distance while performing the shift edit on word level. The novel metric shows high system-level correlation with human rankings, especially for morphologically rich languages. It outperforms the strong CHRF by up to 7% correlation on different metric tasks. In addition, we apply the hypothesis sentence length for normalizing the edit distance in CharacTER, which also provides significant improvements compared to using the reference sentence length. (Wang et al., 2016)

@stancld stancld added the enhancement New feature or request label Nov 24, 2021
@Rajathbharadwaj
Copy link

Hey @stancld @SkafteNicki, I want to contribute, how do I get started? Thanks.

@stancld
Copy link
Contributor Author

stancld commented Nov 29, 2021

Hi @Rajathbharadwaj, I hope you can have a look at #641 and try to prepare a PR for CharacTER accordingly. Once you have anything ready, you can open a draft PR and ping us :]

@SkafteNicki
Copy link
Member

Hi @Rajathbharadwaj, agree with @stancld that PR #641 I good place to look what how a complete implementation (+ docs ) should be structured. If you do not have time to fully implement it all, feel free to also open a PR with a partial implementation and then we can guide you to the rest :]

@Rajathbharadwaj
Copy link

Okay, I will check out and hopefully open a draft PR. Thanks!

@Rajathbharadwaj
Copy link

@SkafteNicki @stancld So I did check out the PR #641 and I can see there are some helper functions or some extra functions compared to the source repo. It'd be great if I could get some help in those regards. I will go ahead and create the module now. Thanks!

@Borda Borda added this to the v0.7 milestone Dec 8, 2021
@stancld
Copy link
Contributor Author

stancld commented Dec 9, 2021

Hello @Rajathbharadwaj, any update on this here? Let me know if you need any help :]

@Borda Borda modified the milestones: v0.7, v0.8 Jan 6, 2022
@stancld
Copy link
Contributor Author

stancld commented Jan 19, 2022

Hi @Rajathbharadwaj, any updates? :]

@Rajathbharadwaj
Copy link

Hey sorry about the delay. I will start contributing, been a bit busy.

Thanks.

@Borda
Copy link
Member

Borda commented Jan 26, 2022

Hey sorry about the delay. I will start contributing, been a bit busy.

no pressure, take your time 🐰

@Borda Borda changed the title [New metric] Add CharacTER Add CharacTER Jan 26, 2022
@ankitaS11
Copy link
Contributor

ankitaS11 commented Feb 25, 2022

Hey @Borda @stancld @SkafteNicki, I want to pick this issue, if its fine with everyone.

@stancld
Copy link
Contributor Author

stancld commented Feb 25, 2022

@ankitaS11 Yes, it's fine to take this one, but please be sure #855 to be merged before diving into this one :]

@ankitaS11
Copy link
Contributor

Sure. Will make that PR ready for review by this weekend.

@Borda Borda modified the milestones: v0.8, v0.9 Mar 22, 2022
@stancld
Copy link
Contributor Author

stancld commented Apr 9, 2022

Hi @ankitaS11, any updates here? :]

@SkafteNicki SkafteNicki removed this from the v0.9 milestone May 12, 2022
@SkafteNicki SkafteNicki added this to the future milestone Oct 26, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

7 participants
@Borda @SkafteNicki @Rajathbharadwaj @stancld @ankitaS11 and others