Friend-ly enby-box

Clive Thompson

6 Monate her • •

Clive Thompson
6 Monate her • •

@peterfr @ASRG

Dieser Beitrag wurde bearbeitet. (6 Monate her)

Als Antwort auf Clive Thompson

peterfr

Als Antwort auf Clive Thompson • 6 Monate her • •

That heading – Sabot in the Age of AI – is 👌🏽

Als Antwort auf peterfr

Clive Thompson

Als Antwort auf peterfr • 6 Monate her • •

heh heh yes

Als Antwort auf Clive Thompson

Marisa

Als Antwort auf Clive Thompson • 6 Monate her (Empfangen 5 Monate her) • •

not sure what this means but sounds helpful!

Als Antwort auf Marisa

Clive Thompson

Als Antwort auf Marisa • 5 Monate her • •

@marisa
basically, one key way that companies like OpenAI train their language AI is by using "web crawler" software that roams around online, copying the text off web sites ("web scraping", as it's called) so they can have a consistently refreshed pile o' text for training their AI

you need *lots* of freshly written human words to train an AI -- and people are constantly writing stuff on their sites!

So what these tools do is ...

@Marisa

Als Antwort auf Clive Thompson

Clive Thompson

Als Antwort auf Clive Thompson • 5 Monate her • •

@marisa
... attempt to detect if an OpenAI web-crawler is trying to copy the text off a web site ...

... and if so, it generates fake pages with crap text -- which the OpenAI web crawler *assumes are real*, and thus dutifully copies

So OpenAI winds up feeding junk/fake/mangled text as training material into its next version of ChatGPT

The attitude is: "So, you wanna copy our site, so you can train your AI -- without us getting a penny from you? Okay, here's some *junk data*"

@Marisa

Als Antwort auf Clive Thompson

Marisa

Als Antwort auf Clive Thompson • 5 Monate her • •

Oh... I love it(!)

Als Antwort auf Marisa

Clive Thompson

Als Antwort auf Marisa • 5 Monate her • •

@marisa

🤘 🤖

@Marisa

Als Antwort auf Clive Thompson

Marisa

Als Antwort auf Clive Thompson • 5 Monate her • •

thank you for taking the time to explain. ✨ am learning so much here

Als Antwort auf Marisa

Clive Thompson

Als Antwort auf Marisa • 5 Monate her • •

@marisa

that's what the internet is for!

@Marisa

Als Antwort auf Clive Thompson

Jonathan Lamothe

Als Antwort auf Clive Thompson • 6 Monate her (Empfangen 5 Monate her) • •

@Clive Thompson I love this!

@Clive Thompson

Als Antwort auf Jonathan Lamothe

Clive Thompson

Als Antwort auf Jonathan Lamothe • 5 Monate her • •

@me

🤘

@Jonathan Lamothe

Unbekannter Ursprungsbeitrag

Clive Thompson

Unbekannter Ursprungsbeitrag • 5 Monate her • •

@carraway
very cool!

@Devin

Als Antwort auf Clive Thompson

Wulfy

Als Antwort auf Clive Thompson • 5 Monate her • •

“The Net interprets censorship as damage and routes around it.”

Als Antwort auf Clive Thompson

elle mundy

Als Antwort auf Clive Thompson • 5 Monate her • •

iocaine has fork bomb vibes. i love it

Als Antwort auf Clive Thompson

Chris

Als Antwort auf Clive Thompson • 5 Monate her • •

adding a watermark over text, making and flattening a pdf is a solid way to really mess with anything involving OCR for machine learning

Als Antwort auf Chris

Clive Thompson

Als Antwort auf Chris • 5 Monate her • •

yep yep

Dieser Beitrag wurde bearbeitet. (5 Monate her)

Unbekannter Ursprungsbeitrag

Clive Thompson

Unbekannter Ursprungsbeitrag • 5 Monate her • •

@raph_v
good question!

I don't really know

@Raph V.

Unbekannter Ursprungsbeitrag

Clive Thompson

Unbekannter Ursprungsbeitrag • 5 Monate her • •

@raph_v
Oh I did, thank you for pointing it out!

@Raph V.

Als Antwort auf Clive Thompson

peterfr

Als Antwort auf Clive Thompson • 5 Monate her • •

@raph_v maybe you missed @asrg ‘s reply?

tldr.nettime.org/@asrg/1139072…

ASRG (@asrg@tldr.nettime.org)

@raph_v@mstdn.social @clive@saturation.social @peterfr@mastodon.art .. Here’s an interesting approach from @gedankenstuecke@scholar.social that you might find helpful! ⟶ https://scholar.social/@gedankenstuecke/113899799818100252

^tldr.nettime

@ASRG @Raph V.

⇧