blogs.social
Techdirt [Unofficial] @techdirt.com.web.brid.gy
May 3
An Open Training Set For AI Goes Global

As many of the AI stories on Walled Culture attest, one of the most contentious areas in the latest stage of AI development is the sourcing of training data. Creating high-quality large language models (LLMs) requires massive quantities of training data. In the current genAI stampede, many companies are simply scraping everything they can off […]

techdirt.com · common corpus · pleias · ai
🔥 Popular
Incident Report: CVE-2026-LGTM
@andrewnez.bsky.social · ♥ 0 · ↗ 30
The AT-URI Syntax Mess
@bnewbold.net · ♥ 20 · ↗ 6
Reading Proposal 0016: What atproto’s “Permissioned Data” Actually Does
@ngerakines.me · ♥ 12 · ↗ 2
Atmosphere Field Reporter Corps
@leaflet.pub · ♥ 9 · ↗ 1
Unsubscribing tags.pub from Open Registration Relays
@evanprodromou.socialwebfoundation.org.ap.brid.gy · ♥ 0 · ↗ 9
Giving Labels More Context
@bnewbold.net · ♥ 5 · ↗ 3
📌 Trending tags
#chart 68 #weekly 60 #song 52 #Allgemein 39 #album 15 #atproto 12 #Links 8 #daily 7 #photography 6 #cv 6 #blogging 6 #governance 6 #politics 6 #ai 6 #11ty 6 #atprotocol 5 #News 5 #open-access 5 #digital-humanities 5 #Brott 5