Researchers Scrape 2 Billion Discord Messages and Publish Them Online
(www.404media.co)
Ok, and in regular English?
Well, the DOI is a digital identifier for papers and other data references for sciency stuff. But that DOI just points to the actual paper: https://www.arxiv.org/pdf/2502.00627
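For anyone wondering how a DOI "points to" something: you normally paste it after the public resolver address, https://doi.org/, and the resolver redirects you to wherever the record lives. Here is a minimal sketch of just the URL-building step. The function name `doi_to_url` is made up for illustration, the sanity check is deliberately crude, and the DOI is the one quoted in this thread, copied as-is without checking that it resolves.

```python
from urllib.parse import quote

# Public DOI resolver; it redirects to the publisher or repository page.
DOI_RESOLVER = "https://doi.org/"


def doi_to_url(doi: str) -> str:
    """Turn a bare DOI (optionally prefixed with 'DOI:') into a resolver URL.

    Hypothetical helper for illustration only; it does not contact the network.
    """
    cleaned = doi.strip()
    # Drop a leading "doi:" label if someone pasted it along with the identifier.
    if cleaned.lower().startswith("doi:"):
        cleaned = cleaned[4:].strip()
    # Crude sanity check: DOIs start with "10." and have a prefix/suffix split.
    if not cleaned.startswith("10.") or "/" not in cleaned:
        raise ValueError(f"does not look like a DOI: {doi!r}")
    # Keep the '/' separator, percent-encode anything unsafe for a URL path.
    return DOI_RESOLVER + quote(cleaned, safe="/")


if __name__ == "__main__":
    # DOI string as quoted in the thread.
    print(doi_to_url("DOI: 10.5281/zenodo.146585059"))
```

Opening the resulting link in a browser is what actually follows the redirect; the sketch only shows that the identifier itself is not a web address until the resolver is put in front of it.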
Link to where the archive is: https://zenodo.org/records/15170676 but it's been restricted from downloading.
EDIT: lol I love the Internet Archive. It's 120 GiB if anyone wants to try downloading it and see if it works.
https://web.archive.org/web/20250521011912/https://zenodo.org/records/15170676/files/dataset.zst?download=1
The only error I can see is "the data is" should be "the data are". Stylistically I would also change "utilizing" to "using", which conveys exactly the same meaning and is more accessible.
Otherwise I believe this is regular English. It's ok to have difficulty comprehending language, but it's more productive to ask questions about the parts you don't understand.
Can I help with any specific questions?
"The data is" is correct English in this context. I was talking more about DOI: 10.5281/zenodo.146585059.
But others have said what it is.
Anything that people write is correct really; language is created by anyone making sounds or patterns to share meaning.
Data is a plural though. Seppo dominance in using it as singular is changing it outside that miserable empire, but it's one of my hangups haha.