What about HuggingFace? It has basically everything. Kimi-K2-Thinking is available, along with a config and a modeling class that seem to support and implement the model. The HuggingFace model info doesn’t say whether training is supported, but HuggingFace’s Transformers library supports models in the same architecture family, such as DeepSeek-V3. The fundamentals seem to be there; we might need some small changes, but how hard can it be?