DEV Community

# gpu

soy

May 19

Intel Xe3P Leaks 160GB LPDDR5X; FlashAttention-2 in CuTe & Custom CUDA GPT-2 Engine

#gpu #nvidia #hardware

3 min read

Ian L. Paterson

May 18

Building llama.cpp from source on a Dell Precision T5820 with an RTX 3090 Ti (after seven power cycles)

#ai #llm #gpu #linux

16 min read

Ian L. Paterson

May 18

Three Months of Speed-Up Experiments on a 3090 Ti: Autoregressive DFlash MTP for Qwen3.6-27B

#ai #llm #gpu #performance

18 min read

Alan West

May 18

Why MTP doesn't speed up your llama.cpp inference (and how to actually fix it)

#llm #performance #machinelearning #gpu

5 min read

gen

May 18

267 tok/s local inference on RTX 5090 – llama.cpp MTP + Qwen3-35B-A3B MoE

#llm #machinelearning #llama #gpu

1 min read

soy

May 18

GPU Bottleneck Analyzer, NVIDIA Rubin VRAM Demands, and Qwen VRAM Optimization

#gpu #nvidia #hardware

4 min read

May 17

Production-Ready GPU Inference Autoscaling on EKS with Karpenter, KEDA, and Dragonfly

#kubernetes #aws #devops #gpu

26 min read

soy

May 17

GPU Hardware & Driver Update: RTX 5090 Benchmarks, llama.cpp MTP, Windows 11 Fix

#gpu #nvidia #hardware

3 min read

soy

May 16

CUDA Cutile-rs Beta, AMD FSR 4.1 Release, & Forza Horizon 6 GPU Benchmarks

#gpu #nvidia #hardware

3 min read

May 15

Same eBPF, Different Vendor: Tracing libhip Calls on AMD ROCm

#linux #programming #gpu #performance

3 min read

May 15

Best GPU for Llama 70B in 2026 (48GB+ VRAM Required)

#gpu #llama #70b #vram

6 min read

May 14

From TCP Retransmits to MCP-Driven Cluster Investigations: An eBPF GPU Agent Retrospective

#ebpf #gpu #observability #mcp

8 min read

Ikegbo Ogochukwu

May 14

From Zero to Supercomputing: A Beginner-Friendly Guide to Using HPC Clusters Like CINECA

#gpu #cluster #supercomputers #ai

5 min read

May 13

What Inference-Platform Benchmark Posts Leave Out

#machinelearning #ai #gpu #performance

8 min read

Alan West

May 12

Why CUDA kernels silently corrupt memory and how to catch the bug

#cuda #rust #debugging #gpu

5 min read
