-
Tsinghua University
- Beijing, China
-
02:40
(UTC +08:00) - ubecwang@gmail.com
- @UbecWang
- https://ubecc.github.io/
Highlights
- Pro
Pinned Loading
-
THUDM/slime
THUDM/slime Publicslime is an LLM post-training framework for RL Scaling.
-
verl-project/verl
verl-project/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
-
OpenRLHF/OpenRLHF
OpenRLHF/OpenRLHF PublicAn Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
-
THUDM/SWE-Dev
THUDM/SWE-Dev Public[ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
Python 59
-
sgl-project/sglang
sgl-project/sglang PublicSGLang is a high-performance serving framework for large language models and multimodal models.
-
claude-code-router-py
claude-code-router-py PublicA lightweight proxy server that accepts Anthropic Messages API requests and forwards them to any OpenAI-compatible backend, converting formats in both directions. The client sees a standard Anthrop…
If the problem persists, check the GitHub status page or contact support.



