Several problematic messages

Edited by author.
Last edit: 13:53, 8 March 2022

I've also seen the recent imports by Fuzzybot made **today 8 March**, overwriting existing TRANSLATED messages, but that MIXED unrelated messages (in the same language), or BROKE them in several parts RANDOMLY distributed.

"FuzzyBot" should NOT break any existing translation, it can only (eventually) mark some of them to become "!!FUZZY". But it seems that the bot that is marking messages as "!!FUZZY" is BROKEN, probably using unitialized global variable (such as a variable containing the last parsed input line, which is not necessarily the full message, when in fact the source format was splitting a single translated message into several source lines.

Who did that broken import ???? Why FuzzyBot does not indicate that information in its edit comment which is invariably saying that it is importing new text.


Look at what it did (in any language) for:

Wikipedia Android Mobile: Wikipedia-android-strings-customize toolbar description text ("Customize the bottom toolbar with 5 links you find most helpful.")

Some words at end may have been dropped at end and appended to another message (completely unrelated, and belonging to another message group, inside the "Wikimedia" meta-group):

Wiki Ed Dashboard: Wikiedudashboard-campaign.data wikidata info ("Detailed breakdown of Wikidata activity, based on edit summaries")

Creating NON-SENSE in both cases ! (I have already fixed that for French, but all other translations marked FUZZY in French have actually NO change at all (only the unneeded "!!FUZZY" mark that forced a review).

You've detected this non-sense in "/li", I detect the same in "/fr". (Look at Fuzzybot's edit history: this affects a LOT of languages that it broke each time, all of them with the same "dummy" edit comment: many unjustified reverts, deleted valid translations, borken translations). You can find in fact changes in 380 languages.
All updates using FUZZYBOT, made today on Sunday 8 March since early morning (initiated by some unknown user, at around 08:00 UTC, and later around 11:00 UTC), are VERY SUSPECT.
Or the import tool inside TWN site itself has a new bug. And in that case, we need an "Emergency STOP" to this bot which is causing damages.
Verdy p (talk)12:21, 8 March 2022

It looks like the messages have changed, as a new message "MediaWiki:Skin-action-move/li" now exists, which I think is the one on the tab.

Ooswesthoesbes (talk)13:24, 8 March 2022

Yes; after looking at this for a while trying to analyze what happened with FuzzyBot here, I think I can safely say that the current situation is OK with regards to the action-* and skin-action-* messages. The issue Verdy p mentions is unrelated to what you initially brought up.

Jon Harald Søby (talk)14:28, 8 March 2022

Well, it can be the eventual meaning, but as it stands, "MediaWiki:Action-move" is still the one that displays on the tabs. In any case, it is rather idiotic to change the meaning/use of such priority messages. I have locally changed the messages on my home wiki, as the width of the tabs made action navigation practically impossible...

Ooswesthoesbes (talk)15:40, 8 March 2022
 
 

See also:

The two messages that were mixed (one word "ligne" in French deleted at end of message 1, and appended to message 2, via FuzzyBot import), both reverted:

  1. Wikiedudashboard-campaign.data editor usernames info ("List of editor usernames, one per line")
  2. Wikipedia-android-strings-customize toolbar description text ("Customize the bottom toolbar with 5 links you find most helpful.")
Verdy p (talk)13:53, 8 March 2022
 

Just for others to understand the problem: The issue was with these two messages:

And most importantly, only for French. The problem was caused by the change to fr.yml in this commit – two new messages were inserted one line too high up, resulting in the original "data_editor_usernames_info" message being split into two at a line break, removing the word "ligne" from one message and adding the same word to an unrelated message.

Jon Harald Søby (talk)14:33, 8 March 2022

That's a weird effect of the ".yml" format, which uses continuation lines, instead of explicit delimitation of end of messages.

The source edit was to insert some messages at "random" position, in the middle of another message (between its first line, and the following continuation line.

I wonder if this happens regularly for other "*.yml" localisation files (there are many of them, such edits are probably automated). This should not happen if it used the JSON format (at least the syntax error would be detected early), or if that message (supposed to be interpreted as one line, without any linebreak) was not using the YML continuation syntax to split them above some arbitrary line length (really not needed in localization source files).

The YML format is cool for human edits (file per file), but not for automated edits using basic regexps (e.g. with sed, awk or bash script) which are dangerous across a large set of files when the human will superficially look at the changes (and no syntax error will be detected). For now there were only two locale files "en" and "fr", but soon there will be hundreds generated from translations in TWN.

See https://github.com/WikiEducationFoundation/WikiEduDashboard/issues/4824

Verdy p (talk)14:43, 8 March 2022