Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Apparently it is the same as the DeepseekV3 architecture and already supported by llama.cpp once the new name is added. Here's the PR: https://github.com/ggml-org/llama.cpp/pull/18936


has been merged




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: