Proton working overtime to discourage me from renewing.
Technology
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below; to ask if your bot can be added, please contact us.
- Check for duplicates before posting; duplicates may be removed.
- Accounts 7 days and younger will have their posts automatically removed.
I don’t think they are that biased. They say in the article that AI models from all the leading companies are not private and shouldn’t be trusted with your data. The article focuses on DeepSeek given that it’s the new big thing. Of course, since it’s controlled by China, that makes data privacy even less of a thing that can be trusted.
Should we trust Deepseek? No. Should we trust OpenAI? No. Should we trust anything that is not developed by an open community? No.
I don’t think Proton is biased; they are explaining the risks with DeepSeek specifically and mention how other AIs aren’t much better. The article is not titled “DeepSeek vs OpenAI” or anything like that. I don’t get why people bag on Proton when they are the biggest privacy-focused player that could (almost) replace Google for most people!
Exactly.
Also, none of the article applies if you run the model yourself, since the main risk is whatever the host does with your data. The model itself has no logic.
I would never use a hosted AI service, but I would probably use a self-hosted one. We are trying a few models out at work and we're hosting them ourselves.
Tutamail is a great email provider that takes security very seriously. Switched a few days ago and I'm very happy.
Yet not great from a privacy perspective. They don't even allow third party email apps.
Why do they even have to give their goddamn opinion? Who asked? Why should they care?
It would be fair if ChatGPT or any american service received the same treatment, but the only article I found from 2023 seems quite neutral :/
Well, actually it seems quite fair-ish 🤷
AI has the potential to be a truly revolutionary development, one that could drive advancement for centuries. But it must be done correctly. These companies stand to make billions of dollars in revenue, and yet they violated our privacy and are training their tools using our data without our permission. Recent history shows we must act now if we’re to avoid an even worse version of surveillance capitalism.
Also from 2023 : https://proton.me/blog/ai-gdpr
I don’t see how what they wrote is controversial, unless you’re a tankie.
Given that you can download Deepseek, customize it, and run it offline in your own secure environment, it is actually almost irrelevant how people feel about China. None of that data goes back to them.
That's why I find all the "it comes from China, therefore it is a trap" rhetoric to be so annoying, and frankly dangerous for international relations.
Compare this to OpenAI, where your only option is to use the US-hosted version, where it is under the jurisdiction of a president who has no care for privacy protection.
TBF you almost certainly can't run R1 itself. The model is way too big and compute intensive for a typical system. You can only run the distilled versions which are definitely a bit worse in performance.
Lots of people (if not most people) are using the service hosted by Deepseek themselves, as evidenced by the ranking of Deepseek on both the iOS app store and the Google Play store.
Yeah, the article makes mostly legit points: if you're contacting the chatbot hosted in China, it is harvesting your data. Just like if you contact OpenAI or Copilot or Claude or Gemini, they're all collecting all of your data.
I do find it somewhat strange that they only talk about DeepSeek's hosted models.
It's absolutely trivial to download the models and run them locally yourself, and then you're not giving any data back to them. I would think that Proton would be all over that as a privacy scenario.
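For anyone curious what "running it locally" can look like in practice, here's a minimal sketch using Ollama as one popular option (this assumes Ollama is installed from ollama.com, and that the distilled model tag shown is available in its registry; tags and sizes may change):

```shell
# Pull a distilled DeepSeek-R1 variant small enough for a consumer GPU.
# Note: this downloads several gigabytes of model weights.
ollama pull deepseek-r1:14b

# Chat entirely on your own machine; prompts never leave your computer.
ollama run deepseek-r1:14b "Explain the difference between TCP and UDP."
```

The point of the local setup is exactly what the comment above says: the weights are just data on your disk, so there's no channel for your conversations to flow back to any host.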
It might be trivial to a tech-savvy audience, but considering how popular ChatGPT itself is and considering DeepSeek's ranking on the Play and iOS App Stores, I'd honestly guess most people are using DeepSeek's servers. Plus, you'd be surprised how many people naturally trust the service more after hearing that the company open sourced the models. Accordingly I don't think it's unreasonable for Proton to focus on the service rather than the local models here.
I'd also note that people who want the highest quality responses aren't using a local model, as anything you can run locally is a distilled version that is significantly smaller (at a small, but non-trivial overall performance cost).
You should try the comparison between the larger models and the distilled models yourself before you make judgment. I suspect you're going to be surprised by the output.
All of the models are basically generating possible outcomes based on noise. So if you ask the same model the same question five different times in five different sessions, you're going to get five different variations on an answer.
You will find that an x-out-of-five score between models is not that significantly different.
For certain cases larger models are advantageous: if you need a model to return a substantial amount of content, like asking it to write you a chapter of a story, larger models will definitely give you better output and better variation.
But if you're asking it to help you with a piece of code or explain some historical event, the average 14B model that will fit on any computer with a video card will give you a perfectly serviceable answer.
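The "generating outcomes based on noise" point above can be sketched with a toy example of temperature-scaled sampling. The vocabulary and logits here are invented for illustration; a real model does the same thing over tens of thousands of tokens at every step:

```python
import math
import random

def sample_token(logits, temperature=1.0, rng=None):
    """Sample one token index from logits using temperature scaling.

    Higher temperature flattens the distribution (more varied output);
    temperature near 0 approaches greedy argmax (deterministic output).
    """
    rng = rng or random
    scaled = [l / temperature for l in logits]
    m = max(scaled)                          # subtract max for numeric stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]        # softmax probabilities
    r = rng.random()
    cum = 0.0
    for i, p in enumerate(probs):
        cum += p
        if r < cum:
            return i
    return len(probs) - 1

# Toy vocabulary and logits standing in for a model's next-token scores.
vocab = ["the", "a", "an", "this", "that"]
logits = [2.0, 1.5, 0.5, 0.3, 0.1]

# Five "sessions" (different random seeds) can each pick a different token.
picks = [vocab[sample_token(logits, temperature=1.0, rng=random.Random(seed))]
         for seed in range(5)]
print(picks)
```

This is why re-asking the same question gives different answers, and why a handful of side-by-side comparisons between a full model and a distilled one can look closer than the benchmark numbers suggest.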
Anyone promoting LLMs without a big side of skepticism is exposing their bias.
Now this is something people can be mad at