cm0002@lemmings.world

AI - Artificial intelligence

Aii

cm0002@lemmings.world · English · 13 hours ago

Cutting LLM Memory by 84%: A Deep Dive into Fused Kernels

towardsdatascience.com

cm0002@lemmings.world · English · 23 hours ago

Shrinking AI memory boosts accuracy

www.ed.ac.uk

cm0002@lemmy.cafe · English · 3 days ago

Mistral Small Creative beats Claude Opus 4.5 at explaining transformers — 50x cheaper, higher scores

substack.com

atro_city@fedia.io · 5 days ago

Turn Microsoft into "Microslop" everywhere with this new browser extension — CEO Satya Nadella discourages the term, but it's having the opposite effect

fedia.io

cm0002@lemmy.cafe · English · 3 days ago

DeepSeek just published a paper on conditional memory via scalable lookup

github.com

codeinabox · English · 4 days ago

Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs

arxiv.org

cm0002@suppo.fi · English · 8 days ago

Jensen Huang Is Begging You to Stop Being So Negative About AI

gizmodo.com

codeinabox · English · 4 days ago

Will Your AI Teammate Bring Bagels to Standup?

redmonk.com

cm0002@infosec.pub · English · 7 days ago

DroPE: Extending the Context of Pretrained LLMs by Dropping their Positional Embeddings

pub.sakana.ai

cm0002@toast.ooo · English · 11 days ago

ChatGPT May Be Eroding Critical Thinking Skills, According to a New MIT Study

time.com

cm0002@lemdro.id · English · 17 days ago

I don't care how well your "AI" works - fiona fokus

fokus.cool

cm0002@no.lastname.nz · English · 21 days ago

A Convolutional Neural Network implemented entirely from scratch in x86-64 assembly using AVX-512, performing cat vs dog image classification without any ML frameworks or libraries.

github.com

chasteinsect · English · 21 days ago

AI is hallucinating its way into research, and that’s not even where the problem starts

thelibre.news

codeinabox · English · 22 days ago

The Gorman Paradox: Where Are All The AI-Generated Apps?

codemanship.wordpress.com

cm0002@lemy.lol · English · 22 days ago

Tencent just released WeDLM 8B, it's a diffusion language model that runs 3-6× faster than vLLM-optimized Qwen3-8B on math reasoning tasks.

huggingface.co

onlinepersona · English · 23 days ago

The AI-collapse pre-mortem - Bert Hubert's writings

berthub.eu

cm0002@lemmy.cafe · English · 25 days ago

The Computer Chronicles - Artificial Intelligence (1984)

www.youtube.com

cm0002@toast.ooo · English · 25 days ago

How to Deal With AI Restrictions When Upscaling Images

ivanca.github.io

cm0002@mander.xyz · English · 28 days ago

I'm running a 120B local LLM on 24GB of VRAM, and now it powers my smart home

www.xda-developers.com

monica_b1998@lemmy.world · English · 28 days ago

Guided learning lets “untrainable” neural networks realize their potential

news.mit.edu