1
16
submitted 1 day ago by [email protected] to c/[email protected]
2
31
submitted 1 day ago* (last edited 1 day ago) by [email protected] to c/[email protected]
3
34
submitted 1 day ago by [email protected] to c/[email protected]
4
9
submitted 1 day ago by [email protected] to c/[email protected]

Apple published a paper criticizing the capabilities of Large Language Models (LLMs) in reasoning and formal logic. The paper builds on previous arguments made by Gary Marcus and Subbarao Kambhampati about LLMs' limitations in generalizing beyond their training distribution.

The authors of the paper demonstrated that even the latest "reasoning models" fail to reason reliably on classic problems like the Tower of Hanoi. LLMs cannot solve the Tower of Hanoi reliably, even with the solution algorithm given to them.

The paper argues that LLMs are not a substitute for well-specified conventional algorithms and have limitations that are becoming clearer. LLMs are not a direct route to AGI and while the field of neural networks is not dead, current approach has clear limitations.

The paper highlights the importance of combining human adaptiveness with computational brute force and reliability in AI development.

5
99
submitted 1 day ago by [email protected] to c/[email protected]
6
2
submitted 21 hours ago by [email protected] to c/[email protected]
7
-2
submitted 14 hours ago by [email protected] to c/[email protected]
8
7
submitted 1 day ago by [email protected] to c/[email protected]

I dont now if this is the correct community to post this. My computer shows signal to the screen when in the os (arch) but when I spam F2 or F12 to get into bios or boot selection it doesnt show anything to the screen. I have an optiplex 3020.

9
98
submitted 3 days ago by [email protected] to c/[email protected]
10
19
submitted 2 days ago by [email protected] to c/[email protected]
11
575
submitted 4 days ago by [email protected] to c/[email protected]

"this morning, as I was finishing up work on a video about a new mini Pi cluster, I got a cheerful email from YouTube saying my video on LibreELEC on the Pi 5 was removed because it promoted:

Dangerous or Harmful Content Content that describes how to get unauthorized or free access to audio or audiovisual content, software, subscription services, or games that usually require payment isn't allowed on YouTube.

I never described any of that stuff, only how to self-host your own media library.

This wasn't my first rodeo—in October last year, I got a strike for showing people how to install Jellyfin!

In that case, I was happy to see my appeal granted within an hour of the strike being placed on the channel. (Nevermind the fact the video had been live for over two years at that point, with nary a problem!)

So I thought, this case will be similar:

  • The video's been up for over a year, without issue
  • The video's had over half a million views
  • The video doesn't promote or highlight any tools used to circumvent copyright, get around paid subscriptions, or reproduce any content illegally

Slam-dunk, right? Well, not according to whomever reviewed my appeal. Apparently self-hosted open source media library management is harmful.

Who knew open source software could be so subversive?"

12
72
Teachers Are Not OK (www.404media.co)
submitted 4 days ago by [email protected] to c/[email protected]

cross-posted from: https://lemmy.ml/post/31251815

I asked teachers to tell me how AI has changed how they teach.

The response from teachers and university professors was overwhelming. In my entire career, I’ve rarely gotten so many email responses to a single article, and I have never gotten so many thoughtful and comprehensive responses.

One thing is clear: teachers are not OK.

They describe trying to grade “hybrid essays half written by students and half written by robots,” trying to teach Spanish to kids who don’t know the meaning of the words they’re trying to teach them in English, and students who use AI in the middle of conversation. They describe spending hours grading papers that took their students seconds to generate: “I've been thinking more and more about how much time I am almost certainly spending grading and writing feedback for papers that were not even written by the student,” one teacher told me. “That sure feels like bullshit.”

13
8
submitted 2 days ago by [email protected] to c/[email protected]
14
14
submitted 3 days ago by [email protected] to c/[email protected]
15
23
submitted 3 days ago by [email protected] to c/[email protected]

AI companies claim their tools couldn't exist without training on copyrighted material. It turns out, they could and it just takes more work. To prove it, AI researchers trained a model on a dataset that uses only public domain and openly licensed material.

What makes it difficult is curating the data, but once the data has been curated once, in principle everyone can use it without having to go through the painful part. So the whole "we have to violate copyright and steal intellectual property" is (as everybody already knew) total BS.

16
18
submitted 3 days ago by [email protected] to c/[email protected]
17
19
submitted 4 days ago by [email protected] to c/[email protected]
18
3
submitted 3 days ago by [email protected] to c/[email protected]
19
26
submitted 5 days ago by [email protected] to c/[email protected]

In April, Palantir co-founder Joe Lonsdale got into a brawl with former Coinbase chief technology officer and Network State advocate Balaji Srinivasan. It wasn’t on a prominent stage or even Twitter/X; it happened in a Signal group chat that’s become a virtual gathering place for influential tech figures. Srinivasan wasn’t going along with the tech right’s aggressive anti-China rhetoric, so Lonsdale accused him of “insane CCP thinking.” “Not sure what leaders hang out w you in Singapore but on this you have been taken over by a crazy China mind virus,” he wrote.

Before Semafor published its story on the Signal chats that led with the billionaire spat, both Lonsdale and Srinivasan dismissed any notion their exchange was anything but a friendly disagreement. Surely, such wealthy people have much more in common than they do separating them. But the exchange does expose an ideological rift that will likely only grow in the coming years as more of the tech industry openly aligns itself with the security state to pursue lucrative military contracts.

Lonsdale and Srinivasan are arguably on either side of that divide. Palantir is part of the vanguard of defense tech companies openly championing collaboration with the US government. It claims to want to defend American power in the twenty-first century, positioning China as a civilizational threat — in part to mask the commercial threat Shenzhen poses to Silicon Valley. Lonsdale was even helping staff the Trump administration. The Network State movement, on the other hand, wants to escape the authority of the United States — or any other government — entirely, and doesn’t feel it’s part of that fight.

20
10
submitted 4 days ago* (last edited 4 days ago) by [email protected] to c/[email protected]

The project implements sparse multiplication and fuses up/down projections in the MLP layers through low rank weight activations. Work is based on Deja Vu and Apple's LLM in a Flash.

This approach avoids loading and computing activations with feed forward layer weights whose outputs will eventually be zeroed out.

It's a lossless approach as these weights anyway do not contribute in the current token prediction. It does however, need the predictors to be accurate in clustering the weights.

The result? 5X faster MLP layer performance in transformers with 50% lesser memory consumption avoiding the sleeping nodes in every token prediction. For Llama 3.2, Feed forward layers accounted for 30% of total weights and forward pass computation resulting in 1.6-1.8x increase in throughput:

Sparse LLaMA 3.2 3B vs LLaMA 3.2 3B (on HuggingFace Implementation):

- Time to First Token (TTFT):  1.51× faster (1.209s → 0.803s)
- Output Generation Speed:     1.79× faster (0.7 → 1.2 tokens/sec)  
- Total Throughput:            1.78× faster (0.7 → 1.3 tokens/sec)
- Memory Usage:                26.4% reduction (6.125GB → 4.15GB)
21
37
submitted 5 days ago by [email protected] to c/[email protected]
22
64
submitted 6 days ago by [email protected] to c/[email protected]

There are more than a billion PCs in use and, according to StatCounter, only 71 percent of them run Windows. Among the rest, about 4 percent run Linux. That's tens of millions of people with Ubuntu, Mint, Debian, etc as their desktop operating system. I envy them.

Windows 11 has become more annoying lately as it shoves ads for XBox Game Pass in my face, pushes AI features no one asked for and demands that I reconsider the choices I made during installation on a regular basis. Plus, it just isn't that attractive.

I'm ready to try joining that industrious four percent and installing Linux on my computers to use as my main OS, at least for a week. I'll blog about the experience here.

It's hard to give up Windows forever because so many applications only run in Microsoft's OS. For example, the peripheral software that runs with many keyboards and mice isn't available for Linux. Lots of games will not run under Linux. So I think it's likely I'll be using Windows again, at least some of the time, after this week is through.

However, for now, I'm going to give Linux a very serious audition and document the experience.

23
11
submitted 4 days ago by [email protected] to c/[email protected]
24
8
submitted 4 days ago by [email protected] to c/[email protected]
25
14
submitted 5 days ago by [email protected] to c/[email protected]
view more: next ›

Technology

38339 readers
123 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 6 years ago
MODERATORS