Repositories list
15 repositories
MultipoleAttention (Public)
CDLM (Public)
plan-and-act (Public)
sciml-agent (Public) - SciMLAgents: Write the Solver, Not the Solution
reward-under-attack (Public)
ETS (Public)
QuantSpec (Public)
SqueezedAttention (Public)
Tool2Vec (Public) - Efficient and Scalable Estimation of Tool Representations in Vector Space
TinyAgent (Public)
KVQuant (Public) - [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
SqueezeLLM (Public) - [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
LLMCompiler (Public) - [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
LLM2LLM (Public) - [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
open_source_projects (Public)