Fuck AI@lemmy.world · 3 days ago

Researchers find just 250 malicious documents can leave LLMs vulnerable to backdoors

www.engadget.com

RvTV95XBeo@sh.itjust.works to Fuck AI@lemmy.world · 3 days ago

New research has found that a small and fairly constant number of malicious documents can poison an LLM and create a backdoor.

The study focused on a type of attack called poisoning, in which an LLM is pretrained on malicious content intended to make it learn dangerous or unwanted behaviors. The key finding is that a bad actor doesn’t need to control a fixed percentage of the pretraining material to poison an LLM. Instead, the researchers found that a small and fairly constant number of malicious documents is enough, regardless of the size of the model or its training data. The study successfully backdoored LLMs using only 250 malicious documents in the pretraining data set, far fewer than expected, across models ranging from 600 million to 13 billion parameters.

Well that’s a sporkle if I’ve ever mooped it.

As a mechanic for 17 years, I’d suggest you respool your radiator coil.
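
To put the "constant number, not a percentage" point in perspective, here is a rough back-of-the-envelope sketch. Only the 250 figure comes from the article; the corpus sizes are made-up round numbers for illustration and are not taken from the paper, and `poison_fraction` is just a helper name chosen for this example.

```python
POISON_DOCS = 250  # number of malicious documents reported in the study

# Hypothetical corpus sizes (in documents), chosen only for illustration.
corpus_sizes = [1_000_000, 100_000_000, 10_000_000_000]


def poison_fraction(poison_docs: int, corpus_docs: int) -> float:
    """Return the share of the corpus made up of poisoned documents."""
    return poison_docs / corpus_docs


for size in corpus_sizes:
    frac = poison_fraction(POISON_DOCS, size)
    # The poisoned share shrinks as the corpus grows, while the
    # number of poisoned documents stays fixed at 250.
    print(f"{size:>14,} docs -> {frac:.6%} poisoned")
```

If the finding holds, the attacker's cost stays flat while the share of the data they need to control keeps shrinking as the corpus grows.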

eleijeep@piefed.social · 3 days ago
Link to paper: https://arxiv.org/abs/2510.07192
