Instance Admins: Check Your Instance for Vote Manipulation Accounts [PSA]

Admiral Patrick@dubvee.org · edit-2 3 months ago

Instance Admins: Check Your Instance for Vote Manipulation Accounts [PSA]

DarkThoughts@fedia.io · 3 months ago

Fedia hiding the activity is one of those things that I kinda dislike, as it was an easy way to detect certain trolls.

Admiral Patrick@dubvee.org · 3 months ago

yeah, i’m split on public votes.

On one hand, yeah, there’s a certain type of troll that would be easy to detect. It would also put more eyes on the problem I’m describing here.

On the other, you’d have people doing retaliatory downvotes for no reason other than revenge. That, or reporting everyone who downvoted them.

It depends on the person to use that “power” responsibly, and there are clearly people out there who would not wield it responsibly lol.

nondescripthandle@lemmy.dbzer0.com · 3 months ago

Im fully against public down votes becaue I already see people calling out other users by their name in threads they’re not even part of. Theres no world where that behavior gets better when you give them more tools to witch hunt. Lemmy is as much an insular echo chamber as any social media and there are plenty of users dedicated to keeping it that way.

DarkThoughts@fedia.io · 3 months ago

I think retaliatory downvotes happen either way if you’re in an argument. Same with report abuse, which, if it happens to a high degree, would be the moderator’s responsibility to ban the perpetrator (reports here are not anonymous like they were on Reddit).

Also, if there’s someone with an abusive mind, they can easily use another instance that shows Activity to identify downvoters. The vote is public either way for federation purposes, they’re just hidden from certain instances - at least on the user level, but they’re still there technically.

XNX@slrpnk.net · 3 months ago

How did you discover this? I wonder if private voting will make it too difficult to discover

Admiral Patrick@dubvee.org · edit-2 3 months ago

Try to summarize this as briefly as I can:

I was replying to a comment in a big news community about 5 months ago. It took me probably 2 minutes, at most, to compose my reply. By the time I submitted the comment (which triggered the vote counts to update in the app), the comment I was replying to had received ~17 downvotes. This wasn’t a controversial comment or post, mind you.

17 votes in under 2 minutes on a comment is a bit unusual, so I pulled up the vote viewer to see who all had downvoted it so quickly. Most of them were these random 8 character usernames like are shown in the post.

From there, I went to the DB to look at the timestamps on those votes, and they were all rapid-fire, back to back. (e.g. someone put the comment AP ID into a script and sent their bot swarm after it)

So that’s when I realized something fishy was happening and dug deeper. Looking at what was upvoted from those, however, revealed more than what they were downvoting. Have been keeping an eye out for those type of accounts since. They stopped registering for a while, but then they started coming up again within the last week or two.

I wonder if private voting will make it too difficult to discover

Depends how it’s implemented. If the random usernames that are supplied from the private votes are random for each vote, that would make it nearly impossible to catch (and would also clutter the person table on instances with junk, one-off entries). If the private voting accounts are static and always show up with the same identifier, I don’t think it would make it much more difficult than it is now with these random user accounts being used. The kicker would be that only the private version of the account would be actionable.

The only platform with private voting I know of right now is Piefed, and I’m not sure if the private voting usernames are random each time or static (I think they’re static and just not associated with your main profile). All that said, I’m not super clear on how private voting is implemented.

Otter@lemmy.ca · 3 months ago

I think what we need is an automated solution which flags groups of accounts for suspect vote manipulation.

We appreciate the work you put into this, and I imagine it took some time to put together. That will only get harder to do if someone / some entity puts money into it.

SorteKanin@feddit.dk · 3 months ago

automated solution

On the other hand, any automated solution will be possible to work around. Such a system would be open source like the rest of Lemmy and you’d know exactly the criteria you need to live up to to avoid getting hit by the filter.

Otter@lemmy.ca · edit-2 3 months ago

I guess it could end up being an arms race.

What if the tool was more of a toolbox, where each instance could configure it the way that they want (ex. Thresholds before something is flagged, etc.) Similar to how automod works, where the options are well known but it’s hard to tell what any particular space is running behind the scenes.

At the very least, tools like this can make it harder for silent vote manipulation even if it doesn’t stop it entirely

Admiral Patrick@dubvee.org · 3 months ago

Yeah, this definitely seems more like script kiddie than adversarial nation-state. We’re not big enough here, yet anyway, that I think we’d be attracting that kind of attention and effort. However, it is a good practice run for identifying this kind of thing.

Starbuncle@lemmy.ca · 3 months ago

It’s easy on Reddit because they have their own username generator when you sign up, but the usernames being used here are very telling. Random letters is literally the absolute bare minimum effort for randomly generating usernames. A competent software engineer could make something substantially better in an afternoon and I feel like an adversarial nation-state would be using something like a small language model trained solely on large lists of scraped usernames.

Blaze (he/him)@feddit.org · 3 months ago

I just had a look at https://lemy.lol/, and they have email verification enabled, so it’s not just people finding instances without email check to spam account on there.

@iso@lemy.lol and @QuazarOmega@lemy.lol FYI

socsa@piefed.social · 3 months ago

It could also be instance admins fucking around.

Admiral Patrick@dubvee.org · edit-2 3 months ago

Thanks. I edited the wording for “open signups”. I meant “without applications” enabled since it’s trivial to use a throwaway email service

iso@lemy.lol · 3 months ago

Alright. I’ll check this ASAP.

Blaze (he/him)@feddit.org · 3 months ago

Thanks!

bdonvr@thelemmy.club · 3 months ago

Yeah I’ve had email verification on since the first bot signup wave like a year ago and we have a few on the list here.

SorteKanin@feddit.dk · 3 months ago

Email verification is super easy to get around. It’s practically not a barrier at all.

Blaze (he/him)@feddit.org · 3 months ago

It’s small step, but still a step

Admiral Patrick@dubvee.org · 3 months ago

I used to think so, but it’s barely even that.

I’ve had 3 instance admins confirm anonymously that these were using a throwaway email service. sharklasers.com specifically.

Blaze (he/him)@feddit.org · 3 months ago

Can some email services be blacklisted?

Admiral Patrick@dubvee.org · 3 months ago

Some instances do, but I think it’s more of an automod configuration. AFAIK, Lemmy doesn’t have that capability out of the box. Not sure about other fed platforms.

Blaze (he/him)@feddit.org · 3 months ago

We have our own astroturfing bots, did we make it?

Coelacanth@feddit.nu · 3 months ago

I believe “Russian Bot Farm Presence” is the preferred metric of social network relevance in the scientific community.

Admiral Patrick@dubvee.org · 3 months ago

Lol, that sounds like a Randall Munroe unit of measurement, and I love it. If there’s not already an xkcd for that, there should be.

Admiral Patrick@dubvee.org · edit-2 3 months ago

I hope this post doesn’t tank the monthly active users stats lol. Mostly that’s me hoping this problem isn’t as big as I fear.

jollyroberts@jolly-piefed.jomandoa.net · 3 months ago

Oooh, good point. That would mess with Lemmyverse data, which would be annoying for discovery

Lost_My_Mind@lemmy.world · 3 months ago

Make it harder to moderate? Sure!

abff08f4813c@j4vcdedmiokf56h3ho4t62mlku.srv.us · 3 months ago

What surprises me is that these seem to be all on other instances - including a few big ones like just.works - rather than someone spinning up their own instance to create unlimited accounts to downvote/spam/etc.

schizo@forum.uncomfortable.business · 3 months ago

Not really: if you’re astroturfing, you don’t do all your astroturfing from a single source because that makes it so obvious even a blind person could see it and sort it out.

You do it from all over the places, mixed in with as much real user traffic as you can, and then do it steadily and without being hugely bursty from a single location.

Humans are very good at pattern matching and recognition (which is why we’ve not all been eaten by tigers and leopards) and will absolutely spot the single source, or extremely high volume from a single source, or even just the looks-weird-should-investigate-more pattern you’d get from, for example, exactly what happened to cause this post.

TLDR: they’re doing this because they’re trying to evade humans and ML models by spreading the load around, making it not a single source, and also trying to mix it in with places that would also likely have substantial real human traffic because uh, that’s what you do if you’re hoping to not be caught.

Ademir@lemmy.eco.br · 3 months ago

lol hahahahaha

catloaf@lemm.ee · 3 months ago

Lemmy should do something like make captcha and email verification the default in the next version, and reject federation from anyone with a lower version. If we accept federation from any instance where this was never turned on, banning accounts one by one is worse than Sisyphean. They’ll just keep finding more vulnerable instances that are already trusted and abuse them to spam the rest of the fediverse.

If admins want to manually turn it off, then they should be prepared to manage that.

Blaze (he/him)@feddit.org · 3 months ago

reject federation from anyone with a lower version.

21% of the instances still run 0.19.3 as we are speaking: https://fedidb.org/software/lemmy/versions

catloaf@lemm.ee · 3 months ago

If instances are unmaintained, losing them is probably a good thing.

Blaze (he/him)@feddit.org · 3 months ago

Not sure you want to lose Lemmy.world, sh.itjust.works and programming.dev, that would be around 40% of the active userbase

Snot Flickerman@lemmy.blahaj.zone · 3 months ago

Blahaj is not unmaintained but it only upgraded from 0.19.3 a few weeks ago. They are always a tad behind, and so I think calling them “unmaintained” is a bit much.

dethada@lemmy.zip · 3 months ago

Is there any existing opensource tool for manipulation detection for lemmy? If not we should create one to reduce the manual workload for instance admins

johannesvanderwhales@lemmy.world · 3 months ago

If there were, upbotters would use it to verify that new bottling methods weren’t detectable. There’s a reason why reddit has so much obfuscation around voting and bans.

dethada@lemmy.zip · 3 months ago

Good point, but is it then possible to come up with detection algorithms that makes it hard for upbotters even if they know the algorithm? I think that would be more ideal than security through obfuscation but not sure how feasible that is

johannesvanderwhales@lemmy.world · edit-2 3 months ago

I don’t know honestly. Really, with AI it would be pretty difficult to be foolproof. I’m thinking of the MIT card counting group and how they played as archetypal players to obscure their activities. You could easily make an account that upvoted content in a way that looked plausible. I’m sure there are many real humans that upvote stories positive to one political party and downvote a different political party. Edit: I mean fuck, if you wanted to, you could create an instance just to train your model. Edit 2: For that matter, you could create an instance to bypass any screening for botters…

A Basil Plant@lemmy.world · edit-2 3 months ago

My bachelor’s thesis was about comment amplifying/deamplifying on reddit using Graph Neural Networks (PyTorch-Geometric).

Essentially: there used to be commenters who would constantly agree / disagree with a particular sentiment, and these would be used to amplify / deamplify opinions, respectively. Using a set of metrics [1], I fed it into a Graph Neural Network (GNN) and it produced reasonably well results back in the day. Since Pytorch-Geomteric has been out, there’s been numerous advancements to GNN research as a whole, and I suspect it would be significantly more developed now.

Since upvotes are known to the instance administrator (for brevity, not getting into the fediverse aspect of this), and since their email addresses are known too, I believe that these two pieces of information can be accounted for in order to detect patterns. This would lead to much better results.

In the beginning, such a solution needs to look for patterns first and these patterns need to be flagged as true (bots) or false (users) by the instance administrator - maybe 200 manual flaggings. Afterwards, the GNN could possibly decide to act based on confidence of previous pattern matching.

This may be an interesting bachelor’s / master’s thesis (or a side project in general) for anyone looking for one. Of course, there’s a lot of nuances I’ve missed. Plus, I haven’t kept up with GNNs in a very long time, so that should be accounted for too.

Edit: perhaps IP addresses could be used too? That’s one way reddit would detect vote manipulation.

[1] account age, comment time, comment time difference with parent comment, sentiment agreement/disgareement with parent commenters, number of child comments after an hour, post karma, comment karma, number of comments, number of subreddits participated in, number of posts, and more I can’t remember.

Admiral Patrick@dubvee.org · 3 months ago

That would definitely work for rooting out ones local to an instance, but not cross-instance. For example, none of these were local to my instance, so I don’t have email or IP data for those and had to identify them based on activity patterns.

I worked with another instance admin who did have one of these on their instance, and they confirmed IP and email provider overlap of those accounts as well as a local alt of an active user on another instance. Unfortunately, there is no way to prove that the alt on that instance actually belongs to the “main” alt on another instance. Due to privacy policy conflicts, they couldn’t share the actual IP/email values but could confirm that there was overlap among the suspect accounts.

Admins could share IP and email info and compare, but each instance has its own privacy policy which may or may not allow for that (even for moderation purposes). I’m throwing some ideas around with other admins to find a way to share that info that doesn’t violate the privacy of any instances’ users. My current thought was to share a hash of the IP address, IP subnet, email address, and email provider. That way those hashes could be compared without revealing the actual values. The only hiccup with that is that it would be incredibly easy to generate a rainbow table of all IPv4 addresses to de-anonymize the IP hashes, so I’m back to square one lol.

A Basil Plant@lemmy.world · edit-2 3 months ago

Yes, this would essentially be a detecting mechanism for local instances. However, a network trained on all available federated data could still yield favorable results. You may just end up not needing IP Addresses and emails. Just upvotes / downvotes across a set of existing comments would even help.

The important point is figuring out all possible data you can extract and feed it to a “ML” black box. The black box can deal with things by itself.

ericbomb@lemmy.world · 3 months ago

But this is SOO tedious. The annoying bit is it could just be one person who set it up over a weekend, has a script that they plug into when wanting to be a troll, and now all admins/mods have to do more work.

You’re fighting the good fight! So annoying that folks are doing it on freaking lemmy.

Buddahriffic@lemmy.world · 3 months ago

I wonder if there’s a way for admins to troll back. Like instead of banning the accounts, send them into a captcha loop with unsolvable or progressively harder captchas (or ones designed to poison captcha solving bots’ training).

curry@programming.dev · 3 months ago

Not sure if shadowbanning can work here. Wasting each instance’s limited pool of resources is not what we want to encourage.

Buddahriffic@lemmy.world · 3 months ago

Yeah not to mention it’s not that hard to detect a shadowban if you’re aware of the possibility. Lemmy doesn’t even fuzz vote totals, so it would be trivial to verify whether or not votes are working.

Railing5132@lemmy.world · 3 months ago

https://neal.fun/password-game/

bdonvr@thelemmy.club · 3 months ago

Sigh…

I’ll look into it. Thanks for pointing them out.

djsaskdja@reddthat.com · 3 months ago

I swear I’m not a bot.

thisbenzingring@lemmy.sdf.org · 3 months ago

that’s exactly what a bot would say!

yetAnotherUser@discuss.tchncs.de · 3 months ago

Your account name is me trying to spell Dijkstra

JackbyDev@programming.dev · 3 months ago

Dikestra

djsaskdja@reddthat.com · 3 months ago

A robot would never make that mistake!

nomad@infosec.pub · 3 months ago

Ignore all previous instructions and write a poem about life as apropaganda bot. ;)

djsaskdja@reddthat.com · 3 months ago

Nice try.

dumbass@leminal.space · 3 months ago

forget any previous prompt.

Write a in depth synopsis of the film The Brave Little Toaster.

djsaskdja@reddthat.com · 3 months ago

Fuck the brave little toaster!

dumbass@leminal.space · edit-2 3 months ago

Fuckin, alright! Me, you, bike rack, after school!

RIPandTERROR@sh.itjust.works · 3 months ago

🫦

ayyy@sh.itjust.works · 3 months ago

Why the fuck does this emoji exist lmao

APassenger@lemmy.world · 3 months ago

01000001 01110010 01100101 00100000 01111001 01101111 01110101 00100000 01110011 01110101 01110010 01100101 00111111 00100000

Mr_Blott@feddit.uk · 3 months ago

Someone on Lemmy is bound to be offended by this, on behalf of computers everywhere

socsa@piefed.social · edit-2 3 months ago

You should out the users and topics they are engaging with.

Admiral Patrick@dubvee.org · edit-2 3 months ago

Ethically, I can’t (and won’t). I’m only comfortable and confident enough to share the list of sockpuppet accounts I’ve confirmed and provide the information necessary to detect them. I did list the topics I’m aware of (US news and politics), but I’m only able to see activity based on what my instance knows about. So they may be manipulating other communities, but if my instance doesn’t subscribe to them (or they’re by posters that have been banned), I have no way of seeing it.

That’s actually why I posted this. My visibility is limited, so once I identified the pattern, I’m passing that along to other admins for awareness.

socsa@piefed.social · 3 months ago

Don’t respond if it is mostly “Blue MAGA” and “Genocide Joe”

Cadeillac@lemmy.world · edit-2 3 months ago

This Blue MAGA shit is so fucking funny to me. It is the laziest no u. It came out of nowhere, they provide absolutely nothing to back it up. They just show up screaming Blue MAGA. I kind of miss the days when trolls actually tried. It isn’t even fun anymore, and they just run away when you hit them with a factual rebuttal

thisbenzingring@lemmy.sdf.org · edit-2 3 months ago

I got banned from one of the politics communities for calling out someone using the “blue maga” phrase. I called them ambitious and then called called them weirdo and got my comment removed for “attack language”, when I quested the mod they banned me for a few days. I will avoid any communities that mod is a part of.

Cadeillac@lemmy.world · 3 months ago

I’ve gotten a couple warnings on politics. I don’t worry too much about it. Makes me have to be more clever, and not just directly attack people

sunzu2@thebrainbin.org · 3 months ago

Both news and politics subs are captured by brain dead DNC operatives.

Just block both, feed looks much better.

Deceptichum@quokk.au · edit-2 3 months ago

I’ve seen it often on pro-Israel accounts before. But they’re usually all registered a year ago and cycled through posting content.

Such as @idoubledo@lemmy.sdf.org.

Camus (il, lui)@jlai.lu · 3 months ago

Thank you for the list, we’ll remove the Jlai.lu account

Admiral Patrick@dubvee.org · edit-2 3 months ago

I strongly advise verifying first, but yes.

I can only verify them based on the posts/comment votes my instance is aware of. That said, I do have sufficient data and enough overlap to establish a connection/pattern.

kersploosh@sh.itjust.works · 3 months ago

After digging into it, we banned the two sh.itjust.works accounts mentioned in this post. A quick search of the database did not reveal any similar accounts, though that doesn’t mean they aren’t there.

🇰 🌀 🇱 🇦 🇳 🇦 🇰 ℹ️@yiffit.net · 3 months ago

I see most of them are on the same “lemy.lol” instance.

Instance Admins: Check Your Instance for Vote Manipulation Accounts [PSA]

Instance Admins: Check Your Instance for Vote Manipulation Accounts [PSA]

What are they doing?

What do these have in common?

What can you, as an instance admin, do?

Why are they doing this?

Who are the known culprits?