this post was submitted on 21 Jan 2024
53 points (100.0% liked)

askchapo


Ask Hexbear is the place to ask and answer ~~thought-provoking~~ questions.

Rules:

  1. Posts must ask a question.

  2. If the question asked is serious, answer seriously.

  3. Questions where you want to learn more about socialism are allowed, but questions in bad faith are not.

  4. Try [email protected] if you have questions about moderation, site policy, the site itself, development, volunteering or the mod team.


I found this here and have verified its accuracy by copy-pasting into Google Translate myself.

My question is: is this discrepancy due directly to an intentional decision to translate differently, or is it because Google Translate has been trained on news articles that were manually translated for English-speaking audiences?

(To be clear, both paragraphs should involve one person kicking another in the nuts, unless I'm missing something.)

all 13 comments
[–] [email protected] 17 points 9 months ago (1 children)

Yeah that's weird. There's no change in the wording, aside from who is kicking who, so it's not like it should be switching up the translation.

[–] [email protected] 10 points 9 months ago (1 children)

Machine translators make heavy use of machine learning/LLMs on the back end. This is necessary to an extent since the same phrase can have different meanings depending on context, but it also means that the usual biases from machine learning can crop up easily. The most famous example is that if you translated something like "I waved to the doctor" and "I waved to the nurse" to Spanish, it used to give the masculine form/pronouns for the first sentence and the feminine form/pronouns for the second sentence, even though there is no indication of gender in the English version. So there's a good chance that the context of who is kicking who can cause Google Translate to interpret the same phrase differently due to this bias.

[–] [email protected] 2 points 9 months ago (1 children)

It might also be trying to translate at the phrase or sentence level and applying statistics. The two sentences might occur in more contexts (in its statistical model) where one gets translated literally and the other idiomatically.
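
A toy sketch of that idea, in case it helps. The counts below are invented for illustration, the context labels and function names are hypothetical, and none of this reflects how Google Translate is actually built. It only shows how picking the most frequent rendering per context can give a literal translation in one case and an idiomatic one in the other.

```python
from collections import Counter, defaultdict

# Invented toy "parallel data": (context, source phrase, English rendering).
# These examples are made up; they are not real corpus statistics.
TOY_PAIRS = [
    ("context_a", "patada en los huevos", "kick in the balls"),
    ("context_a", "patada en los huevos", "kick in the balls"),
    ("context_a", "patada en los huevos", "give a hard time"),
    ("context_b", "patada en los huevos", "give a hard time"),
    ("context_b", "patada en los huevos", "give a hard time"),
    ("context_b", "patada en los huevos", "kick in the balls"),
]


def build_table(pairs):
    """Count how often each rendering was seen for a (context, phrase) pair."""
    table = defaultdict(Counter)
    for context, phrase, rendering in pairs:
        table[(context, phrase)][rendering] += 1
    return table


def translate(table, context, phrase):
    """Return the most frequent rendering for this context, or None if unseen."""
    counts = table.get((context, phrase))
    if not counts:
        return None
    return counts.most_common(1)[0][0]


table = build_table(TOY_PAIRS)
literal = translate(table, "context_a", "patada en los huevos")
idiomatic = translate(table, "context_b", "patada en los huevos")
```

Same phrase, different surrounding context, different output, purely because of what the (made-up) data happened to contain.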

[–] [email protected] 4 points 9 months ago

that's what's meant by bias in this context. those statistical differences arise due to bias.

[–] [email protected] 16 points 9 months ago (1 children)
[–] [email protected] 13 points 9 months ago

https://web.archive.org/web/20240112225209/https://blog.duolingo.com/duolingo-max/

ai enshittification + rampant techbro fascism??! what is even going on? what is happening on twitter? who do i hunt?

[–] [email protected] 11 points 9 months ago

It's probably an LLM unintentionally absorbing bias from its source material.

[–] [email protected] 5 points 9 months ago

That's weird. When I do it one paragraph at a time, it translates "kick in the balls" correctly.

[–] [email protected] 4 points 9 months ago* (last edited 9 months ago) (1 children)

Try writing "patada"; "patá" seems like an abbreviated colloquialism. Also, "en el huevo" means "in one nut"; it should be "en los huevos".

[–] [email protected] 2 points 9 months ago

Me estoy convirtiendo en El Bromas (I'm turning into the Joker) jokerfication

[–] [email protected] 2 points 9 months ago* (last edited 9 months ago)

99% of the time when one of these black box algorithms produces a weird, inconsistent result, it's because it's badly designed, not because of any single conscious decision someone made to obscure a particular idea. Google Translate was an early application of LLMs, and to bust one's balls is a well known expression that you could translate as "give one a hard time."

[–] [email protected] 2 points 9 months ago* (last edited 9 months ago)

Also it should be “the curtain rises”. “Sube” is the simple present third person conjugation of subir, as well as the imperative second person command, so it could mean either, but the translator chose the wrong one based on context.