this post was submitted on 17 Aug 2024
43 points (95.7% liked)

Technology

[–] [email protected] 15 points 2 months ago

The coolest and most frightening thing about all that is that the number of books they train the models on is immense, but the model data is tiny by comparison. And while the compression is amazingly lossy, it still keeps an amazing amount of the data in there.

To Nvidia's credit, the trained models do not contain the contents of the books, but they can still tell you intimate details about the books without being able to provide a photographic reproduction of everything in them.

We've literally created something that can analyze books in the same way that we read them and retain the same lossy levels of information. That's honestly pretty f****** amazing.

Obviously, intellectual property laws aren't designed for this. Hell, even our concept of intellectual property isn't designed for this. If this were a corporation that hired a thousand people to read a bunch of books and be on tap for queries about the information in those books, nobody would complain. One purchased copy of each book would be enough to cover the intellectual property restrictions for this.

Also, obviously, this isn't what happened, and people see money lying on the table.