order @order

0 posts0 participants0 posts today

**Manel Guerra** @mgc@mastodont.cat · Mar 31

Generar contingut amb IA per contrarrestar l'excés de cerques amb IA. Què pot sortir malament?

Al blog: Bloquejar cerques d'IA embrutant (també) dades

https://www.manelguerra.com/blog/bloquejar-cerques-ia/

Pàgina de Manel Guerra · Mar 31Bloquejar cerques d'IA embrutant (també) dadesL’augment de les cerques a Internet via eines d’Intel·ligència Artificial (penseu en qualsevol xat que s’usi) en comptes de amb els cercadors tradicionals (Google, Bing, etc.) estan fent augmentar (molt) les peticions d’informació als servidors de pàgines web. Degut a l’estructura de la xarxa, les pàgines no només estan en un únic servidor sinó que, si son suficientment populars, es van repartint còpies (actualitzades gairebé instantàniament) a diferents punts de la xarxa (arreu del món), per facilitar la rapidesa de visualització. Aquestes còpies es serveixen mitjançant uns serveis (empreses) mitjancers que faciliten el control, actualització i servei.

#blog #ia #rag

**hubertf** @hubertf@mastodon.social · Mar 22

Mar 22

hubertf @hubertf@mastodon.social

A message you do not want to see when loading "just data" in your AI / ML framework. Beware!

Description in comments.

#ctf #HackTheBox #AI

**Friedemann** @frebelt@mastodon.online · Mar 12 *

Mar 12 *

Friedemann @frebelt@mastodon.online

Hi #Admins ,

Can you give me quotes that explain your fight against #AIScraping? I'm looking for (verbal) images, metaphors, comparisons, etc. that explain to non-techies what's going on. (efforts, goals, resources...)

I intend to publish your quotes in a text on @campact 's blog¹ (DE, German NGO).

The quotes should make your work visible in a generally understandable way

¹ https://blog.campact.de/author/friedemann/

Campact BlogFriedemann EbeltFriedemann Ebelt engagiert sich für digitale Grundrechte. Im Campact-Blog schreibt er darüber, wie Digitalisierung fair, frei und nachhaltig gelingen kann. Er hat Ethnologie und Kommunikationswissenschaften studiert und interessiert sich für alles, was zwischen Politik, Technik, und Gesellschaft passiert. Sein vorläufiges Fazit: Wir müssen uns besser digitalisieren!

#TDM #MastoAdmin #DataPoisoning

**Friedemann** @frebelt@mastodon.online · Mar 12 *

Mar 12 *

Friedemann @frebelt@mastodon.online

Liebe #Admins ,

für meine @campact -Kolumne¹ suche ich #Techies, die der Welt da draußen den fight #Admins vs. #KIScraping erklären. Mit welchen Bildern, Metaphern, Vergleichen beschreibt ihr, was da abgeht? (Zeitaufwand, Sinn, Ressourcen, Tools…)

Die Zitate sollen allgemeinverständlich eure Arbeit sichtbar machen.
(namentlich, pseudoym, anonym → gern angeben)

¹https://blog.campact.de/author/friedemann/

#TDM #AdminLeiden #MastoAdmin

**Greg Lloyd** @Roundtrip@federate.social · Jan 19 *

Jan 19 *

Greg Lloyd @Roundtrip@federate.social

“We find that replacement of just 0.001% of training tokens with medical misinformation results in harmful models more likely to propagate medical errors. Furthermore, we discover that corrupted models match the performance of their corruption-free counterparts on open-source benchmarks routinely used to evaluate medical LLMs. Using biomedical knowledge graphs to screen medical LLM outputs, we propose a harm mitigation strategy…”

#LLM #misinformation #datapoisoning
https://www.nature.com/articles/s41591-024-03445-1

NatureMedical large language models are vulnerable to data-poisoning attacks - Nature MedicineLarge language models can be manipulated to generate misinformation by poisoning of a very small percentage of the data on which they are trained, but a harm mitigation strategy using biomedical knowledge graphs can offer a method for addressing this vulnerability.

**Miguel Afonso Caetano** @remixtures@tldr.nettime.org · Jan 13

Jan 13

Miguel Afonso Caetano @remixtures@tldr.nettime.org

"The adoption of large language models (LLMs) in healthcare demands a careful analysis of their potential to spread false medical knowledge. Because LLMs ingest massive volumes of data from the open Internet during training, they are potentially exposed to unverified medical knowledge that may include deliberately planted misinformation. Here, we perform a threat assessment that simulates a data-poisoning attack against The Pile, a popular dataset used for LLM development. We find that replacement of just 0.001% of training tokens with medical misinformation results in harmful models more likely to propagate medical errors. Furthermore, we discover that corrupted models match the performance of their corruption-free counterparts on open-source benchmarks routinely used to evaluate medical LLMs. Using biomedical knowledge graphs to screen medical LLM outputs, we propose a harm mitigation strategy that captures 91.9% of harmful content (F1 = 85.7%). Our algorithm provides a unique method to validate stochastically generated LLM outputs against hard-coded relationships in knowledge graphs. In view of current calls for improved data provenance and transparent LLM development, we hope to raise awareness of emergent risks from LLMs trained indiscriminately on web-scraped data, particularly in healthcare where misinformation can potentially compromise patient safety."

https://www.nature.com/articles/s41591-024-03445-1?utm_source=substack&utm_medium=email

#AI #GenerativeAI #LLMs

**LabPlot** @LabPlot@floss.social · Nov 13, 2024 *

Nov 13, 2024 *

LabPlot @LabPlot@floss.social

Safeguarding #OpenData: #Cybersecurity essentials and skills for #data providers by Publications Office of the #EuropeanUnion

This webinar provides an overview of the fundamentals of open data and the complexity in terms of cybersecurity.

https://www.youtube.com/watch?v=6kPiY_8hRwQ

YouTube'Safeguarding open data: cybersecurity essentials and skills for data providers’ data.europa.euBy Publications Office of the European Union

#InfoSec #Security #DataPoisoning

**Norobiik @Norobiik@noc.social** @Norobiik@noc.social · Jan 20, 2024

Jan 20, 2024

Norobiik @Norobiik@noc.social @Norobiik@noc.social

#Nightshade is an offensive #DataPoisoning tool, a companion to a defensive style protection tool called #Glaze, which The Register covered in February last year.

Nightshade poisons #ImageFiles to give indigestion to models that ingest data without permission. It's intended to make those training image-oriented models respect content creators' wishes about the use of their work. #LLM #AI

How artists can poison their pics with deadly Nightshade to deter #AIScrapers
https://www.theregister.com/2024/01/20/nightshade_ai_images/

The Register · Jan 20, 2024How artists can poison their pics with deadly Nightshade to deter AI scrapersBy Thomas Claburn

**Coach Pāṇini ®** @paninid@mastodon.world · Dec 28, 2023

Dec 28, 2023

Coach Pāṇini ® @paninid@mastodon.world

/imagine salt-and-thorium mini-reactors designed by #cyberpunk and #solarpunk at #Microsoft.

Widespread LLM usage was Chernobyl of the internet.

https://www.superversive.co/blog/synthetic-chernobyl

SuperversiveSynthetic Chernobyl — SuperversiveMy cyberpunk pastime in Midjourney is to imagine thousands of salt-and-thorium mini-reactors powering desalination plants supporting walkable American villages with e-rickshaws. Microsoft is training language models to generate documentation to build nuclear reactors. That is the solarpunk future

#AI #LLMs #DataPoisoning

**Matt Willemsen** @mattotcha@mastodon.social · Dec 19, 2023

Dec 19, 2023

Matt Willemsen @mattotcha@mastodon.social

Data poisoning: how artists are sabotaging AI to take revenge on image generators
https://theconversation.com/data-poisoning-how-artists-are-sabotaging-ai-to-take-revenge-on-image-generators-219335 #DataPoisoning #AI #art #sabotage #revenge #Nightshade #ImageScraping

The ConversationData poisoning: how artists are sabotaging AI to take revenge on image generatorsAs AI developers indiscriminately suck up online content to train their models, artists are seeking ways to fight back.

**FeralRobots** @FeralRobots@mastodon.social · Dec 10, 2023 *

Dec 10, 2023 *

FeralRobots @FeralRobots@mastodon.social

My predicted Word of the Year for 2024: #ModelCollapse
#DataPoisoning
https://mas.to/@carnage4life/111556407042548417

mas.toDare Obasanjo (@carnage4life@mas.to)Attached: 1 image The AI snake is already eating its own tail. xAI’s Grok responds with ChatGPT error messages because there’s now so much ChatGPT content on the web that new LLMs are being trained on LLM output. One can imagine LLMs getting worse over time in much the same way Google Search has gotten worse over time due to SEO spam.

**Stef Walter** @stephaniewalter@front-end.social · Nov 2, 2023

Nov 2, 2023

Stef Walter @stephaniewalter@front-end.social

This new data poisoning tool lets artists fight back against generative AI (7min) an interesting way to fight back against companies that scrap art online. I wonder how long before AI companies fight this back. Sounds like an eternal battle.

https://www.technologyreview.com/2023/10/23/1082189/data-poisoning-artists-fight-generative-ai/

#GenerativeAI #DataPoisoning
https://www.technologyreview.com/2023/10/23/1082189/data-poisoning-artists-fight-generative-ai/

MIT Technology Review · Oct 23, 2023This new data poisoning tool lets artists fight back against generative AIBy Melissa Heikkilä

**Scripter** @scripter@social.tchncs.de · Oct 30, 2023

Oct 30, 2023

Scripter @scripter@social.tchncs.de

"Data Poisoning" ist der Sand im Getriebe der künstlichen Intelligenz - Edition Zukunft - derStandard.at › https://www.derstandard.at/story/3000000192919/data-poisoning-ist-der-sand-im-getriebe-der-kuenstlichen-intelligenz #KI #DataPoisoning

DER STANDARD · Oct 28, 2023"Data Poisoning" ist der Sand im Getriebe der künstlichen IntelligenzKunstschaffende kämpfen mit Nachahmungen ihrer Werke durch KI-Tools wie Midjourney. Die Software "Nightshade" soll das verhindern – unter anderem, indem sie aus Hunden Katzen macht

**IT News** @itnewsbot@schleuss.online · Oct 25, 2023

Oct 25, 2023

IT News @itnewsbot@schleuss.online

University of Chicago researchers seek to “poison” AI art generators with Nightshade - Enlarge (credit: Getty Images)

On Friday, a team of researcher... - https://arstechnica.com/?p=1978501 #largelanguagemodels #universityofchicago #adversarialattacks #foundationmodels #machinelearning #aitrainingdata #imagesynthesis #datapoisoning #nightshade #aiethics #benzhao #biz⁢ #google #metaai #openai #aiart #glaze #meta #ai

Ars Technica · Oct 25, 2023University of Chicago researchers seek to “poison” AI art generators with NightshadeBy Benj Edwards

**🅻🅸🅲🅴 ()** @alice@lgbtqia.space · Jul 20, 2023 *

Jul 20, 2023 *

🅻🅸🅲🅴 () @alice@lgbtqia.space

Hey, I'm putting together a practical guide on personal data pollution. You can find it at the link below.

I'd love suggestions and feedback on what's there—Issues and PRs welcome!

---

#Data #Privacy #DataPrivacy #DataPollution #DataPoisoning #Misinformation #Anonymity

https://codeberg.org/alicewatson/personal-data-pollution

Continued thread

**NKCS | NCC-DE** @nkcs@social.bund.de · May 8, 2023

May 8, 2023

NKCS | NCC-DE @nkcs@social.bund.de

Projekte sollen u.a. in den Bereichen #AI #DeepLearning oder #DataPoisoning forschen und werden als Research and Innovation Action mit einer Quote von 100% gefördert. Mit 4-6 Mio.€ Förderung pro Projekt sollen 2 Projekte gefördert werden.

**Linux Magazine** @linuxmagazine@fosstodon.org · Feb 10, 2023 *

Feb 10, 2023 *

Linux Magazine @linuxmagazine@fosstodon.org

On the DVD: The March 2023 issue of Linux Magazine features @MXLinux and Puppy Linux #fossapup 9.5 https://www.linux-magazine.com/Issues/2023/268/This-Month-s-DVD #Linux #OpenSource #MXLinux #MachineLearning #DataPoisoning #FOSS #PuppyLinux

Linux Magazine Issue 268 DVD: MX Linux 21.3 (64-bit) and Puppy Linux FOSSAPUP 9.5 (64-bit)

**Linux Magazine** @linuxmagazine@fosstodon.org · Feb 10, 2023

Feb 10, 2023

Linux Magazine @linuxmagazine@fosstodon.org

March 2023 issue “Data Poisoning” is available now! Find a copy on your local newsstand or get it from us https://bit.ly/Linux-Newsstand #Linux #OpenSource #MachineLearning #DataPoisoning #ML #MXLinux #PuppyLinux #FossaPup #GNOME44 #NuTyX #Debian #Minuimus #Golang #Docker #FOSS

Linux Magazine Issue 268 - March 2023: Data Poisoning
MakerSpace: Feed your fish with a Raspberry Pi. FREE DVD FOSSAPUP and MX Linux

**heise online (inoffiziell)** @heiseonline@squeet.me · Apr 28, 2022

Apr 28, 2022

heise online (inoffiziell) @heiseonline@squeet.me

heise+ | Sicherheit: Schutz vor Data Poisoning und anderen Angriffen auf KI-Systeme

Fehlerhafte Daten können Machine-Learning-Systeme zu folgenreichen Irrtümern verleiten. Ein Praxisbeispiel zeigt, wie so etwas verhindert werden soll.
Sicherheit: Schutz vor Data Poisoning und anderen Angriffen auf KI-Systeme

heise onlineSicherheit: Schutz vor Data Poisoning und anderen Angriffen auf KI-SystemeBy Mirko Ross

#künstlicheintelligenz #unternehmenssoftware #bigdata

Replied in thread

**Doc Edward Morbius ** @dredmorbius@toot.cat · Dec 10, 2020

Dec 10, 2020

Doc Edward Morbius  @dredmorbius@toot.cat

@mhoye The thought occurs: #chaffing / #DataPoisoning.

If we're going to live in a world in which every utterance and action is tracked, issue and utter as much as posssible.

Wire up a speech-aware-and-capable GPT-3 to your phone, have it handle telemarketers, scammers, and political calls. Simply to tie up their time.

Create positive-emotive socmed bots to #pumpUp your #socialcredit score.

Unleash bots on your political opposition's media channels. Have them call in to talk radio, and #ZoomBomb calls and conferences.

Create plausible deniability. Post selfies from a ddozen, or a thousand, places you're not.

Create #DigitalSmog to choke the #FAANG s.

Fight fire with fire.

Recent searches

Search options

Administered by:

Server stats:

#datapoisoning