Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add automatic suggestion service for transliteration of Cyrillic script to Latin #11923

Closed
nijel opened this issue Jun 24, 2024 · 11 comments · Fixed by #12154
Closed

Add automatic suggestion service for transliteration of Cyrillic script to Latin #11923

nijel opened this issue Jun 24, 2024 · 11 comments · Fixed by #12154
Assignees
Labels
backlog This is not on the Weblate roadmap for now. Can be prioritized by sponsorship. enhancement Adding or requesting a new feature. good first issue Opportunity for newcoming contributors. hacktoberfest This is suitable for Hacktoberfest. Don’t try to spam. help wanted Extra attention is needed.
Milestone

Comments

@nijel
Copy link
Member

nijel commented Jun 24, 2024

Describe the problem

Several languages can be written in Cyrillic and Latin. As these are just different script for the same language, it should be possible to automatically convert these.

Describe the solution you would like

There is cyrtranslit module which can do transliteration of Cyrillic script to Latin. It can be used to automatically generate _Latn and _Cyrl translations.

Describe alternatives you have considered

No response

Screenshots

No response

Additional context

No response

@nijel nijel added hacktoberfest This is suitable for Hacktoberfest. Don’t try to spam. help wanted Extra attention is needed. good first issue Opportunity for newcoming contributors. backlog This is not on the Weblate roadmap for now. Can be prioritized by sponsorship. labels Jun 24, 2024
Copy link

This issue has been added to the backlog. It is not scheduled on the Weblate roadmap, but it eventually might be implemented.

In case you need this feature soon, please consider helping or push it by funding the development.

Copy link

This issue seems to be a good fit for newbie contributors. You are welcome to contribute to Weblate! Don't hesitate to ask any questions you would have while implementing this.

You can learn about how to get started in our contributors documentation.

@nijel nijel added the enhancement Adding or requesting a new feature. label Jun 24, 2024
@Wells245
Copy link

Hello Nijel. I'd like to take this issue as my first open source contribution but I'm not familiar with Weblate. Could someone give me a quick overview on where in the project it goes and how it's needed to be implemented?

@nijel
Copy link
Member Author

nijel commented Jun 26, 2024

The backends live in https://github.com/WeblateOrg/weblate/tree/main/weblate/machinery

An example commit to add a new backend: fa73d22

It will just not call the remote service, but use the library. List of supported languages probably needs to be hard-coded.

@gersona
Copy link
Collaborator

gersona commented Jul 26, 2024

@nijel for the sake of clarity, this issue only concerns Cyrillic to Latin transliteration right ? not in both ways

@nijel
Copy link
Member Author

nijel commented Jul 26, 2024

I think both ways make sense. The machinery should support whatever is possible with the library

@gersona
Copy link
Collaborator

gersona commented Jul 27, 2024

for languages that support both Cyrillic and Latin, is the type of script always informed in the code (like "sr@cyrillic" or "bs_Cyrl")? if so, how to determine current script of code such as "sr", or should I check the actual content of the text to translate ?

@nijel
Copy link
Member Author

nijel commented Jul 27, 2024

Check language definitions in https://github.com/WeblateOrg/language-data/. Codes like sr@Latn should be used. For languages without a modifier, there usually is an existing practice whether it is Latin or Cyrillic, but I don't think we have this documented somewhere. When I need info like this I usually review translations on Hosted Weblate to see if they agree (in this case, look at https://hosted.weblate.org/languages/sr/).

@gersona
Copy link
Collaborator

gersona commented Jul 29, 2024

about the MachineTranslation.max_score, since it's just transcribing from a script to another, is it safe to give it a 100 score ?

@nijel
Copy link
Member Author

nijel commented Jul 29, 2024

Yes, makes sense

@nijel nijel added this to the 5.7 milestone Aug 5, 2024
Copy link

github-actions bot commented Aug 5, 2024

Thank you for your report; the issue you have reported has just been fixed.

  • In case you see a problem with the fix, please comment on this issue.
  • In case you see a similar problem, please open a separate issue.
  • If you are happy with the outcome, don’t hesitate to support Weblate by making a donation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backlog This is not on the Weblate roadmap for now. Can be prioritized by sponsorship. enhancement Adding or requesting a new feature. good first issue Opportunity for newcoming contributors. hacktoberfest This is suitable for Hacktoberfest. Don’t try to spam. help wanted Extra attention is needed.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants