584
Google is cannibalizing the web to feed AI
(www.theregister.com)
This is a most excellent place for technology news and articles.
That's a very naive simplification of the AI training process. You start with that, then pay people pennies in a developing nation to produce hand crafted training data, resulting it using stupid words like delve and whimsical entirely too much.
Merely training on internet content with no RLFH training results in probable gibberish like that of GPT-2