Domi

Domi@lemmy.secnd.me · 1 day ago

Not sure if it counts as “budget friendly” but the best and cheapest method right now to run decently sized models is a Strix Halo machine like the Bosgame M5 or the Framework Desktop.

Not only does it have 128GB of VRAM/RAM, it sips power at 10W idle and 120W full load.

It can run models like gpt-oss-120b or glm-4.5-air (Q4/Q6) at full context length and even larger models like glm-4.6, qwen3-235b, or minimax-m2 at Q3 quantization.

Running these models is otherwise not currently possible without putting 128GB of RAM in a server mainboard or paying the Nvidia tax to get a RTX 6000 Pro.

Domi@lemmy.secnd.me · 1 day ago

That explains things. Non-focused applications cannot read keystrokes on Wayland.

Since Discord is still running in X11, if you are on KDE you can enable one of these options as a workaround:

Hopefully Discord (or a wrapper for it) will eventually get proper global shortcut support, in which case you can set it right in the KDE shortcut settings.

Domi@lemmy.secnd.me · 1 day ago

I’m using the Discord Flatpak on Fedora Kinoite (Wayland/KDE) and have no such issue.