Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

i18n Infobox property translations #261

Open
tanvibhakta opened this issue Mar 25, 2019 · 3 comments
Open

i18n Infobox property translations #261

tanvibhakta opened this issue Mar 25, 2019 · 3 comments

Comments

@tanvibhakta
Copy link

tanvibhakta commented Mar 25, 2019

The field names of all Hindi pages that have infoboxes are being returned in English. The Hindi Wikipedia, as well as the official REST API, have these field names written in Hindi.

Screen Shot 2019-03-25 at 1 19 24 PM

This is a link to the official page:
https://hi.wikipedia.org/wiki/%E0%A4%AE%E0%A4%82%E0%A4%97%E0%A4%B2%E0%A4%AF%E0%A4%BE%E0%A4%A8

Screen Shot 2019-03-25 at 1 19 38 PM
This is a link to the official page:
https://hi.wikipedia.org/wiki/%E0%A4%AE%E0%A4%B9%E0%A4%BE%E0%A4%A4%E0%A5%8D%E0%A4%AE%E0%A4%BE_%E0%A4%97%E0%A4%BE%E0%A4%82%E0%A4%A7%E0%A5%80

@spencermountain spencermountain changed the title Infobox field names of all Hindi pages are in English i18n Infobox property translations Mar 26, 2019
@spencermountain
Copy link
Owner

ah, yeah the templates are keyed in english - which seems like a gross oversight by wikipedia.

They must have a mapping for each language somewhere in the parsoid project, or maybe elsewhere, but I haven't been able to find it.

here's the wikiscript, for reference - we're simply grabbing whatever it has right now.
image

Even in english, the presentation of infoboxes are rendered in ad-hoc ways. I think it probably should be handled in a secondary library - like wtf-infobox-render or something, as I imagine it's quite complicated.

related

@spencermountain
Copy link
Owner

i emailed one of the parsoid people about this
cheers

@spencermountain
Copy link
Owner

spencermountain commented Mar 26, 2019

yeah, it's not as simple as I had hoped, the templates get copy+pasted, and the translations are holed-up like this:

 |data7     = {{{mission_duration<includeonly>|</includeonly>}}}
 |label8    = मापी गयी दूरी

Every language will have a different solution for each template.
I don't see an easy multilingual fix. Open to ideas

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants