Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
gpu
Follow
Hide
Posts
Left menu
đ
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Intel Xe3P Leaks 160GB LPDDR5X; FlashAttention-2 in CuTe & Custom CUDA GPT-2 Engine
soy
soy
soy
Follow
May 19
Intel Xe3P Leaks 160GB LPDDR5X; FlashAttention-2 in CuTe & Custom CUDA GPT-2 Engine
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
Building llama.cpp from source on a Dell Precision T5820 with an RTX 3090 Ti (after seven power cycles)
Ian L. Paterson
Ian L. Paterson
Ian L. Paterson
Follow
May 18
Building llama.cpp from source on a Dell Precision T5820 with an RTX 3090 Ti (after seven power cycles)
#
ai
#
llm
#
gpu
#
linux
Comments
Add Comment
16 min read
Three Months of Speed-Up Experiments on a 3090 Ti: Autoregressive DFlash MTP for Qwen3.6-27B
Ian L. Paterson
Ian L. Paterson
Ian L. Paterson
Follow
May 18
Three Months of Speed-Up Experiments on a 3090 Ti: Autoregressive DFlash MTP for Qwen3.6-27B
#
ai
#
llm
#
gpu
#
performance
Comments
Add Comment
18 min read
Why MTP doesn't speed up your llama.cpp inference (and how to actually fix it)
Alan West
Alan West
Alan West
Follow
May 18
Why MTP doesn't speed up your llama.cpp inference (and how to actually fix it)
#
llm
#
performance
#
machinelearning
#
gpu
Comments
Add Comment
5 min read
267 tok/s local inference on RTX 5090 â llama.cpp MTP + Qwen3-35B-A3B MoE
gen
gen
gen
Follow
May 18
267 tok/s local inference on RTX 5090 â llama.cpp MTP + Qwen3-35B-A3B MoE
#
llm
#
machinelearning
#
llama
#
gpu
Comments
Add Comment
1 min read
GPU Bottleneck Analyzer, NVIDIA Rubin VRAM Demands, and Qwen VRAM Optimization
soy
soy
soy
Follow
May 18
GPU Bottleneck Analyzer, NVIDIA Rubin VRAM Demands, and Qwen VRAM Optimization
#
gpu
#
nvidia
#
hardware
1
 reaction
Comments
Add Comment
4 min read
Production-Ready GPU Inference Autoscaling on EKS with Karpenter, KEDA, and Dragonfly
Mark Johnson
Mark Johnson
Mark Johnson
Follow
May 17
Production-Ready GPU Inference Autoscaling on EKS with Karpenter, KEDA, and Dragonfly
#
kubernetes
#
aws
#
devops
#
gpu
Comments
Add Comment
26 min read
GPU Hardware & Driver Update: RTX 5090 Benchmarks, llama.cpp MTP, Windows 11 Fix
soy
soy
soy
Follow
May 17
GPU Hardware & Driver Update: RTX 5090 Benchmarks, llama.cpp MTP, Windows 11 Fix
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
CUDA Cutile-rs Beta, AMD FSR 4.1 Release, & Forza Horizon 6 GPU Benchmarks
soy
soy
soy
Follow
May 16
CUDA Cutile-rs Beta, AMD FSR 4.1 Release, & Forza Horizon 6 GPU Benchmarks
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
Same eBPF, Different Vendor: Tracing libhip Calls on AMD ROCm
Ingero Team
Ingero Team
Ingero Team
Follow
May 15
Same eBPF, Different Vendor: Tracing libhip Calls on AMD ROCm
#
linux
#
programming
#
gpu
#
performance
Comments
Add Comment
3 min read
Best GPU for Llama 70B in 2026 (48GB+ VRAM Required)
Thurmon Demich
Thurmon Demich
Thurmon Demich
Follow
May 15
Best GPU for Llama 70B in 2026 (48GB+ VRAM Required)
#
gpu
#
llama
#
70b
#
vram
Comments
Add Comment
6 min read
From TCP Retransmits to MCP-Driven Cluster Investigations: An eBPF GPU Agent Retrospective
Ingero Team
Ingero Team
Ingero Team
Follow
May 14
From TCP Retransmits to MCP-Driven Cluster Investigations: An eBPF GPU Agent Retrospective
#
ebpf
#
gpu
#
observability
#
mcp
1
 reaction
Comments
Add Comment
8 min read
From Zero to Supercomputing: A Beginner-Friendly Guide to Using HPC Clusters Like CINECA
Ikegbo Ogochukwu
Ikegbo Ogochukwu
Ikegbo Ogochukwu
Follow
May 14
From Zero to Supercomputing: A Beginner-Friendly Guide to Using HPC Clusters Like CINECA
#
gpu
#
cluster
#
supercomputers
#
ai
Comments
Add Comment
5 min read
What Inference-Platform Benchmark Posts Leave Out
Ingero Team
Ingero Team
Ingero Team
Follow
May 13
What Inference-Platform Benchmark Posts Leave Out
#
machinelearning
#
ai
#
gpu
#
performance
Comments
Add Comment
8 min read
Why CUDA kernels silently corrupt memory and how to catch the bug
Alan West
Alan West
Alan West
Follow
May 12
Why CUDA kernels silently corrupt memory and how to catch the bug
#
cuda
#
rust
#
debugging
#
gpu
Comments
Add Comment
5 min read
đ
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account
âď¸
HTTPS ¡ dev.to
Otomatik (DoH)
YĂśncĂź DNS
TTNet DNS
Google DNS
Ănizlemeyi BaĹlat â
â Home
đ