I call that 'malint' (malicious intent)
Not that fine, though. Quality of life came a long way.
Copyleft foi um termo pensado por Richard Stallman para brincar com a ideia de copyright. São tipos de licenças, mas que são muitas vezes desrespeitadas. O ideal principal do copyleft é acabar com a propriedade intelectual.
As pessoas que estavam por trás da computação pessoal nos anos 70, que viam o potencial para o computador servir pessoas individuais ao invés de empresas e governos, ficaram chateadas quando as empresas começaram a colocar licenças nos programas de hardware que antes elas podiam alterar e moldar de acordo com a sua vontade. Foi daí que surgiu o copyleft. As empresas respeitam a GPL porque ela organizacionalmente funciona, não porque é repleta de proteção jurídica ou poder no lobbying.
Soon enough.
We come from the sea.
Copyleft tem tudo a ver com descentralização da informação e conhecimento. Pirataria não é só pegar as coisas de graça, mas também uma filosofia de descentralização do conhecimento. Espero que fique claro agora.
Data Annotation? That's not good work. I had better luck with Freelancer.com than with sending resumes. Plus ridiculous logic tests that take the knowledge of a linguist. I say Freelancer.com rather than Upwork because Freelancer.com is cheaper and you can shoot your bids away at more jobs.
Onboard the train to dystopia.
Google destroyed the opposition when building a search engine tool, this is nothing like the case with Google. Many websites generate robots.txt and other Terms of Service that are impossible for common people to follow these days. It's very hard to scrape, serve and be compliant at the same time. And as small fish you have to. Search engine maintenance occupies too much space and serving the pages with quality requires quick database management tools.
This gap might be closed by AI, but not before it. Even though true alternatives like GigaBlast existed.
The current LLM status has a vibrant open-weights scenario, which is centered on HuggingFace but it's the code away from being served in other places. AI uses datasets/corpus of texts, which can be shared by Universities/Institutions around the world, as they are currently.
LLM/AI is at arms reach from the people, no matter how much money Big Tech puts on Datacenters. The scary part is what Google always used to do best, lobbying for monopolization. Aside from that, we're safe.
If people can build it, it can serve the people. Think of open-weights LLMs. If we got a couple of 32B models that score as high as GPT-4o and Claude-3.5, why not use them? It can be run on mid-high end hardware. There are developers out there doing a good job. It doesn't need to be a datacenter/big tech company centered scenario.
That's very true.
Don't make me believe this is the kind of talk that's going on Twitter.