Wikipedia talk:Administrator intervention against vandalism

WT:AIV

This is not the page for reporting vandalism.
The page to report persistent vandalism is at Administrator intervention against vandalism.

This is the talk page for discussing Administrator intervention against vandalism and anything related to its purposes and tasks.

Put new text under old text. Click here to start a new topic.
New to Wikipedia? Welcome! Learn to edit; get help.

Archives: Index, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17: 14 days

AIV helperbot information

HBC AIV helperbot14 assists with the management of vandalism reports. Edit the following parameters in the page header to control the bot's behavior:

RemoveBlocked: On to enable automatic removal of blocked users from the list. Any other value will disable this functionality (only in cases of bot malfunctions, please).
MergeDuplicates: On to enable automatic merging of multiple reports of the same person. Any other value will disable this functionality.
AutoMark: On to enable automatic marking of users with special IPs or membership in categories as defined at User:HBC AIV helperbot/Special IPs. Any other value will disable this functionality.
FixInstructions: On to enable automatic repair of the reporting instruction HTML comments in the User-reported section of the page, as defined at Wikipedia:Administrator intervention against vandalism/instructions. Any other value will disable this functionality.
AutoBacklog: On to enable the automatic switching on or off of the {{adminbacklog}} message. Any other value will disable this functionality. Associated parameters are:
- AddLimit: The number of vandalism reports at which the {{adminbacklog}} message will be made visible.
- RemoveLimit: The number of vandalism reports at which the backlog message will be disabled ({{noadminbacklog}}).

Gladiators 2024 vandal (potential ip hopper)

Latest comment: 12 days ago1 comment1 person in discussion

https://en.wikipedia.org/wiki/Special:Contributions/2A0E:CB01:4E:400:D400:330D:492E:87AD

https://en.wikipedia.org/wiki/Special:Contributions/2A06:5906:3E08:8F00:D9B1:C672:85A2:DA42

suspected sockpuppets of blocked ip https://en.wikipedia.org/wiki/Special:Contributions/78.86.131.106

continuing to add unsourced infomation to Gladiators 2024 article in spite of block (including offensive false material to the section on the Gladiators Ready! book) Visokor (talk) 19:14, 24 November 2024 (UTC)Reply

repeated vandalism in ahir clan page

Latest comment: 7 days ago3 comments2 people in discussion

User talk:HistorianAlferedo has indulged in repeated vandalism by reverting valid sourced contents from university of chicago, JN university, london school of economics.

Also the user is editing contents with intention of caste POV using raj sources from 1900

kindly edit the article ahir clan and provide inputs , Thanks Drisha herjee (talk) 02:42, 30 November 2024 (UTC)Reply

I have left you a note at your talkpage [1]. - Ratnahastin (talk) 02:45, 30 November 2024 (UTC)Reply

I have to disagree the user HistorianAlferedo is using British Raj sources

The user HistorianAlferedo has removed contents from university of chicago, JN university, london school of economics, Oxford , removing edits which follow Wikipedia:Reliable sources - Wikipedia is vandalism

overall user HistorianAlfered repeatedly removes academic scholarly contents, Drisha herjee (talk) 02:59, 30 November 2024 (UTC)Reply

Homoglyph vandalism

Latest comment: 2 days ago30 comments6 people in discussion

In response to a particular instance of vandalism raised at ANI (here; permalink), I added a subsection there to discuss a class of vandalism that it represents. Due to its subject, this discussion more properly belongs here, where it can get a more focused airing from those interested in vandalism as a topic, and also so it can end up archived somewhere where it can more easily be found, if need be. Content of the previous discussion at ANI follows: Mathglot (talk) 19:35, 3 December 2024 (UTC)Reply

Copy of discussion originally at ANI.

Although they are already indeffed, I wanted to call attention to the Mojibake edit linked by Gaismagorm. Τhis is a particularly pernicious form of vandalism that I call homoglyph vandalism (but I'd appreciate hearing the expression used at Wikipedia, if there is one). It involves replacing one character, say, a Latin capital T (Unicode U+0054) with another one, say a Greek capital letter Tau (U+03A4), or a Cyrillic Capital letter Te (U+0422) which has the identical, or almost identical appearance as the original latin T. You can see this in operation at Washeans's edit, where the first letter of the first word in the expression "The result is a systematic replacement of symbols..." in the original is Latin letter capital T (UTF-8: 54) but was replaced with homoglyph Greek capital letter Tau (UTF-8 CE A4) in the wikicode.

It is not by coincidence that they vandalized this article and not some other one, because the topic of the article is related to the type of vandalism they performed; they probably felt pretty clever about themselves doing it, right up to the point were they got indeffed. I am not aware of useful tools for detecting homoglyph vandalism at Wikipedia, but if there is anything at Toolforge, I'd like to know about it. We need a tool to help vandalism fighters detect and correct vandalism of this sort. Not sure if the AWB flavor of regex is powerful enough to write a pattern that would highlight script characters that appear to be embedded in characters belonging to a different unicode script block, but if it is, that might be one way. Mathglot (talk) 00:59, 2 December 2024 (UTC)Reply

As the editor who had to revert it, and as someone who is probably in the 99th percentile of editors for potential awareness of this issue, it took me a solid 20 seconds staring at the diff to realize what was actually changed. An ability to check for this seems technically difficult—surely it would end up being a "notice one diff by a user and the whole house of cards comes tumbling down" thing? Remsense ‥ 论 01:07, 2 December 2024 (UTC)Reply

presumably so. Sometimes I just search up common words in the search but replace l's with capital I's or the other way around, and use that to find vandalism. Gaismagorm (talk) 01:10, 2 December 2024 (UTC)Reply

Mathglot, please see User:Radarhump. Drmies (talk) 04:11, 2 December 2024 (UTC)Reply

(edit conflict) Diffs highlighting words that look identical, and unexpected differences in the byte length are two of the tells of homoglyph vandalism. I did a test edit to this section to demonstrate this. If you look at rev. 1260701025 of 04:02, 2 December 2024 by Mathglot, you will see that that edit replaced the 'T' in the first letter of the word 'This' in rev. 1260672475 of 00:59, 2 December 2024 with Greek letter capital Tau (U+0422). Note the diff (Special:Diff/1260699524/1260701025) highlighting the word 'This' with no visible change to the word 'This', and then look at the History, and note that the difference in byte length: rev. 1260701025 is one byte longer (363,186 bytes) than rev. 1260699524, because UTF-8 requires only one byte to render a Latin T, but two bytes to render a Tau.

These are two of the clues that help find this type of vandalism, the first being a word that is highlighted with no visible change; and the second is the byte count. The latter is easiest to use when only one word is changed, or multiple words but without additional text being added. But careful character counting may reveal it, if one of the encodings requires more UTF-8 bytes than the other, which is normally the case if one of the characters was Latin and the other was not. Mathglot (talk) 04:36, 2 December 2024 (UTC)Reply

I remember a case of this from a few years ago. The tell was a redlink which I knew should have gone to a DAB page, and the corrupting alphabet was Cyrillic. It was a real head-scratcher until I worked out what was going on. Fortunately, the editor had never been very active, and had given up. I cleaned them out by copying suspect characters in their edits into the searchbar; but that requires familiarity with the corrupting alphabet, and it might have been simpler to link every word and see what turned red on preview. Narky Blert (talk) 08:31, 2 December 2024 (UTC)Reply

Moved from WP:ANI § User:Washweans

My interest in raising this here at AIV is multipronged, including introducing the topic to those who might not be aware of it, and to stimulate discussion about it, especially regarding methods and tools to detect and repair it. I would hope that one thing that would come out of a discussion here would be a Help- or Info page-style write-up about the topic at an appropriate venue, directed at vandalism fighters who could go there to read up about it and get advice about how to deal with it. Mathglot (talk) 19:35, 3 December 2024 (UTC)Reply

If we could compile a list of the homoglyphs used for this type of vandalism, it would be, I think, pretty straightforward to put together some Javascript that vandal patrollers could use to more readily identify it (maybe causing the homoglyph characters to highlight in bright blue or the like?) Seraphimblade ^{Talk to me} 19:45, 3 December 2024 (UTC)Reply

Having a list of glyph collisions is another great idea. Unicode is a big place and that could be a long list, but we are a big project with a lot of motivated vandal fighters and other interested parties, and if we started a subpage or draft somewhere initiating such a list, it could grow organically over time and if properly formatted, perhaps could be used as the data page upon which the javascript could run (I presume JS can read an external data page?) Mathglot (talk) 19:54, 3 December 2024 (UTC)Reply

Now that I think about it, this may be an area where LLM might shine. I am going to give it a try with Chat GPT, and will report back. Maybe that can be the germ of a list that Seraphimblade is talking about. Mathglot (talk) 20:00, 3 December 2024 (UTC)Reply

Did it work? I also feel like that there is likely one online. I'll go and see if I can find one. Gaismagorm (talk) 01:23, 4 December 2024 (UTC)Reply

@Mathglot https://github.com/codebox/homoglyph/blob/master/raw_data/chars.txt found something that could be useful. Gaismagorm (talk) 01:23, 4 December 2024 (UTC)Reply

That's at least a great place to start, and it's MIT licensed, so entirely fine to use here. We'd probably want to take out just the ASCII ones so that it's not too heavy a load on the user, but that'll certainly get us going. Let me see if I can put together a quick prototype based upon that. Seraphimblade ^{Talk to me} 02:02, 4 December 2024 (UTC)Reply

Gaismagorm, I was on a couple of other things and then away, but it looks like you've found a great resource in the meantime, good work! Mathglot (talk) 07:38, 4 December 2024 (UTC)Reply

Thanks! Gaismagorm (talk) 11:24, 4 December 2024 (UTC)Reply

Another way your idea could be helpful, is to write a bot based on the homoglyph list, which would categorize the page in Category:Wikipedia articles with possible homoglyphs and tag the suspect word(s) inline with a new inline template, allowing vandal fighters to deal with the issue in an organized fashion, attempting to reduce the category to empty and the transclusion list of the template to none. Mathglot (talk) 20:21, 3 December 2024 (UTC)Reply

Perhaps an edit filter could be used (I'm not good with code so I have no clue if that would be feasible). It probably shouldn't disallow the edits, but tagging might be nice. Gaismagorm (talk) 01:26, 4 December 2024 (UTC)Reply

Yes, an edit filter is another possibility. It could alert you in the same way that the {{Alert}} template warns you to check the user's page history and logs when you hit Save for the first time, but then lets you save it, if you click Save a second time.

To me, the interesting (and non-obvious) part of any automated detector task, is in defining exactly what you want to flag, which requires a heuristic of some sort, which will inherently have the standard, precision and recall tension between wanting to catch as many genuine cases as possible, while minimizing false positives. Seraphimblade is probably wrestling with that issue right now, and opening up discussion about what the heuristic may help. For example, ideally we wouldn't want it to tag this page (a false positive here would not be a disaster however) although it would be good if it tagged the intentional homoglyph test-word highlighted by the Diff program in this diff. It's not a trivial task to define what exactly you want to tag. Mathglot (talk) 07:55, 4 December 2024 (UTC)Reply

I think at least to start, especially if we're not doing anything like disallowing or auto-reverting anything but just flagging it, a certain number of false positives are acceptable. So, yes, you might see some false positives in the midst of some words written in Greek, Cyrillic, whatever have you, but presumably most people will know "That's not malicious." There's also the question of vandals learning to game any heuristic there is, so if, for example, we say "Don't flag a character if it's surrounded by other non-ASCII characters", vandals could use several in a row to avoid tripping the detection. So, certainly not an easy question, and I doubt there's a perfect solution that will result in no false positives or negatives. The question is more whether false positives or false negatives are more tolerable. Seraphimblade ^{Talk to me} 08:03, 4 December 2024 (UTC)Reply

I would say false positives. I also put the question to an LLM; see the subsection below. Mathglot (talk) 09:40, 4 December 2024 (UTC)Reply

@Narky Blert:, your link-everything to see what turns red is a great idea, and suggests a technique that could be automated via template or other tools. In templating, there is the #ifexists parser conditional, which implies that Lua and Toolforge tools would have access to similar functionality, although I'm not familiar with exactly how they do it. Perhaps other tools might be able to be designed, based on finding "unexpected" byte count changes due to the UTF-8 issue regarding the number of bytes to represent Latin vs. non-Latin characters. (edit conflict) Mathglot (talk) 19:49, 3 December 2024 (UTC)Reply

With save disabled (to stymie MOS:OVERLINKers), such a tool might find general use as a rough-and-ready preview spellchecker. (I've has the embarrassing experience of adding a well-crafted well-cited sentence to an article, only for a pagewatcher to correct my glaring typo.) Narky Blert (talk) 16:53, 4 December 2024 (UTC)Reply

Detection heuristic options

Starting this subsection as a place to discuss how to define the detection heuristic; i.e., what do we want to flag, along with questions about tilt towards finding more cases at the risk of more false positives, or the other way. I put the question about defining a heuristic to an LLM, and recorded the response at /Homoglyph detection heuristic. (Feel free to retitle the page or move it to another location.) There's way more there than is desirable or doable for a first effort, but perhaps some of the ideas will be helpful. Some are not, but I hope those will be obvious; in particular, as written, it would find words enclosed in {{lang}} or {{ill}} templates which it should not, but those should be easy exclusions. (They are not immune to homoglyph vandalism, but the heuristic should be defined to handle those cases differently.) Mathglot (talk) 09:57, 4 December 2024 (UTC)Reply

Looking through it some more, I'm not all that impressed. I think we can do better. Mathglot (talk) 11:08, 4 December 2024 (UTC)Reply

I Think it might make sense to flag edits the insert homoglyphs into words starting and ending with a standard english character, or an edit adding in a large amount of homoglyphs but that has a byte change of 0 (in order to account for people who would replace entire words with homoglyphs. Gaismagorm (talk) 16:30, 4 December 2024 (UTC)Reply

also it should flag edits the create words ending/starting with a homoglyph, but with english characters within Gaismagorm (talk) 16:34, 4 December 2024 (UTC)Reply

That's one of the tells: it won't have a byte change of 0 if they replace an entire English word with homoglyphs, it will have a byte change of (at least) the number of letters in the word. E.g., changing This to homoglyphs will result in a byte change of +4 or larger, because of the way that UTF-8 works. Mathglot (talk) 06:14, 5 December 2024 (UTC)Reply

ah i see Gaismagorm (talk) 11:24, 5 December 2024 (UTC)Reply

If this discussion results in a usable tool of some sort, it might be useful to log occurrences of individual cases presented to users and the subsequent choice they made (not-change/change,and before-after). Those results could then be used to refine the tool further. Mathglot (talk) 11:15, 4 December 2024 (UTC)Reply

Danielle LoPresti - possible IP hopper, vandal

Latest comment: 2 days ago3 comments2 people in discussion

IP user 69.209.27.213, user "KaliIsComingForYou", 2600:1700:9584:c10:7ce9:5ae2:1f1b:e1bf, 2600:1700:9584:c10:1c35:46a4:fdae:4281 have repeatedly removed sourced materials on BLP. Review of IP addresses indicate possible connection to linked sources. They have removed benign and well-publicized facts, such as the fact the subject was married.

Subject is a public figure recently divorced. Sourced materials are related to a domestic violence restraining order.

Page may warrant at least a temporary protection with sourced materials preserved. ASunnyDisposition (talk) 16:36, 4 December 2024 (UTC)Reply

@ASunnyDisposition: As I've mentioned in my edit summary and your talk page, the other users are correct; scribd.com is not a reliable source, and that material should not be in the article without an actually reliable source. Per WP:BLP, please do not restore it without finding such a source. Writ Keeper ⚇♔ 16:47, 4 December 2024 (UTC)Reply

Thank you! These editors also removed sources earlier in the history of the page, including items from reputable sources. The flag about scribd (even if including legitimate legal docs) is helpful. Thanks! ASunnyDisposition (talk) 17:44, 4 December 2024 (UTC)Reply

Add topic