Fluxon

As GPU throughput keeps climbing, CPU and I/O paths increasingly become the hidden bottlenecks that drag down AI training and inference efficiency. Fluxon is built to aggressively consolidate the complexity of low-level storage and transport so more of the system budget can be spent on model work instead of data-plane plumbing.

Built on a unified Rust-based storage-and-transport foundation, Fluxon exposes three standardized interfaces that target the core bottlenecks in AI systems:

KV/RPC (Unified key-value and RPC): Breaks data silos and enables efficient cross-process, cross-node reuse of inference-side KVCache and latent cache
MQ (Elastic message queue): Decouples system dependencies and supports elastic message transport across heterogeneous resource pools
FS (S3-compatible file, object, and cache acceleration system): Unifies multi-form storage so one system can cache key-value, file, and object data, while supporting remote access, S3 forwarding, and large-scale cross-cluster migration for AI data and model files

🧱 Foundation Capabilities

End-to-end Rust: moves connection handling, protocol encoding/decoding, state-machine progression, shared-memory management, and observability collection into Rust hot paths
Integrated storage and transport: prioritizes the cross-process shared-memory fast path and optimizes storage and transport within one unified data plane
High-performance inter-node transport: inside the cluster, RDMA is preferred, with automatic TCP fallback, and NICs can be enabled, disabled, and switched dynamically from the GUI
Automatic inter-node relay: supports automatic relay / forwarding across nodes and sub-clusters, reducing the integration cost of complex network topologies
Global memory allocation and governance: uniformly manages global memory allocation, object lifecycles, capacity boundaries, and reclamation policies to avoid fragmentation and uncontrolled growth
Unified role model: master, owner_client, and external_client cooperate in layers, organizing control-plane and data-plane responsibilities into a scalable tree topology while decoupling business service processes from data-plane resource governance and low-level communication paths
Unified object interface: lets the system organize multi-field objects uniformly, balancing API flexibility, ease of use, and room for low-level optimization
Tensor-native zero-copy handoff path: better suited for reusing high-frequency tensor objects across caching and transport paths
Unified observability: uses the Prometheus protocol and Greptime to consolidate metric / trace / log, and includes a built-in GUI for cluster member state, log information, key metrics, and topology
Shared capabilities across all three interfaces: KV/RPC, MQ, and FS reuse the same caching, transport, lease, capacity-governance, and observability substrate

🔌 Interface Capabilities

Fluxon KV/RPC

Designed for world-model inference caches, state sharing, service-to-service calls, and tensor object reuse. In scenarios such as multi-view latent-space prediction, state extrapolation, and prefix-cache reuse, Fluxon KV/RPC provides a more general AI data plane rather than a point solution for only a single KVCache use case.

Local cache replicas and eventually consistent read path: prioritizes local fast-path hits while synchronizing metadata asynchronously in the background
Batched reclamation and hot-object management: advances invalid-object cleanup asynchronously through batch_delete, and combines it with TinyLFU to reuse hot objects more efficiently
Simultaneous control over L2 and L3 in AI workloads: keeps global data objects indexed, discoverable, and reusable, reducing redundant memory waste from duplicate residency across cache tiers
KV and RPC synergy: the same parameter organization, caching, and communication foundation serves both state storage and service-to-service calls

Fluxon MQ

Designed for heterogeneous training, data-processing pipelines, and intermediate-state handoff across resource pools. When the producer side and consumer side are split across different machines, different resource pools, or even different sub-clusters, Fluxon MQ consolidates message retention, capacity governance, and cross-cluster placement into one unified messaging layer.

Lease-based retention semantics: binds message retention to the channel, ensuring data has bounded-time reliable retention before actual consumption
channel-level prefix statistics and capacity governance: continuously tracks message counts and capacity usage boundaries for scaling and traffic control
Cross-cluster load-aware placement: uses consumer-side location to decide payload placement, shortening prefetch paths and stabilizing throughput
Co-designed with KV: message shells and member metadata stay on the control plane, while large payloads stay on the FluxonKV data plane, avoiding a second duplicated large-object transport stack

Fluxon FS

Fluxon FS is an S3-compatible file and object cache for AI data and model files. It supports read/write acceleration, remote access, S3 forwarding, cache hits, and large-scale cross-cluster migration. In workloads with high-resolution video, trajectory samples, checkpoints, and other large file objects, these capabilities are unified in one file data plane.

Unified caching system: directly reuses FluxonKV/RPC caching and communication capabilities, splits files into KeyValue shards, and lets one system support accelerated reads and writes for key-value, file, and object caching
S3 forwarding access: supports object-storage access and forwarding for AI data and model files
Transparent Python file semantics: preserves the upper-layer open() / read() / write() experience as much as possible while reducing system-call and cross-process overhead
Specialized optimization for small-file / large-file reads and writes: optimizes concurrency and transport paths by file granularity and read / write path to improve bandwidth utilization and overall throughput
Large-scale cross-cluster migration: supports PB-scale data migration and keeps caching, transport, and failure recovery in one unified path

📊 Benchmark

The benchmark section mainly covers the RPC, KV, and FS data planes, and the related scripts and configurations are primarily under fluxon_test_stack/.

Fluxon RPC Benchmark

The RPC benchmark mainly shows call latency and throughput across different message sizes and concurrency levels, to observe the stability and tail-latency behavior of the service-to-service call path.

Fluxon KV Benchmark

The TCP benchmark shows that Fluxon is significantly ahead of MooncakeStore and Redis on the two read-heavy workloads Read-affinity and Read-Zipf. For put_only, the current main constraint remains the inflight metadata deduplication path rather than payload transport.

Fluxon FS Benchmark

The benchmark results show that small-file reads and large-file writes are already significantly ahead of Alluxio, large-file reads are roughly comparable, and small-file writes still have room for further optimization.

Fluxon MQ Benchmark

MQ currently focuses mainly on scenario problems and data-plane design. The automated runtime entrypoints are test_runner.py and fluxon_test_stack/.

🧰 Runtime Requirements

For Quick Start (Docker):

Docker installed
The Quick Start image bundles the middleware required by the demo flows

For production deployment or building from source:

OS: Linux only
Python: >= 3.10
Rust: Toolchain pinned to 1.93.0; see fluxon_rs/rust-toolchain.toml
External middleware:
- The minimum service plane requires etcd and Greptime
- FluxonFS features such as directory transfer and pre-scan that persist task state also require TiKV PD and TiKV
Docker: Required for Quick Start image workflows and runtime packaging workflows

🚀 Quick Start

Quick Start is the shortest path to try Fluxon. For formal installation, deployment, and operations, see User Docs.

KV Quick Start

docker run --rm -it --network host \
  hanbaoaaa/fluxon_quick_start:0.2.1 \
  --mode kv \
  --etcd-client-port 12379 \
  --master-p2p-port 31000 \
  --panel-port 18080 \
  --greptime-http-port 14000 \
  --kv-http-port 8083

Once inside, you can type:

put demo:hello world
get demo:hello
del demo:hello

Runtime view:

Open the printed link to view the KV Web UI:

Related interface docs:

KV and RPC Interface

MQ Quick Start

docker run --rm -it --network host \
  hanbaoaaa/fluxon_quick_start:0.2.1 \
  --mode mq \
  --etcd-client-port 37379 \
  --kv-master-port 34200 \
  --greptime-http-port 14000 \
  --panel-port 18080

Once inside, you can type:

put hello
put world
exit

The background consumer keeps printing received messages.
Startup also prints the MQ Web UI address.

Runtime view:

Related interface docs:

MQ Interface

FS Quick Start

docker run --rm -it --network host \
  hanbaoaaa/fluxon_quick_start:0.2.1 \
  --mode fs \
  --etcd-client-port 36379 \
  --kv-master-port 34100 \
  --greptime-http-port 14000 \
  --panel-port 34180

Once inside, you can type:

ls
echo "hello fs" > notes.txt
cat notes.txt
ui

FS Quick Start additionally prints:

fs_s3 endpoint
Basic Auth entry; the default username / password is admin / admin

Runtime view:

Open the printed link to view the FS Web UI:

Related interface docs:

FS Interface

🗂️ Repository Structure

fluxon_rs/: Rust core implementation and low-level capabilities
fluxon_py/: Python interfaces, runtime, and bindings
deployment/: deployment and operations toolchain
scripts/: utility scripts and helper entrypoints
setup_and_pack/: packaging and release resource preparation entrypoints
examples/fluxon_quick_start/: minimal runnable environment entrypoint
fluxon_test_stack/: test stack, benchmarks, and gitops entrypoint

🤝 Contributing

Contributions are welcome. Before you start, please read the developer docs on GitHub Pages:

👥 Contributors

Some earlier contribution records are no longer fully reflected in the current commit history. Historical highlights:

yxrxy: FluxonFS implementation and optimization
zTz01: KVCache optimization
pakkah: RDMA support, VLM exploration
unity1263: KV shared-memory design integration, benchmark toolchain
mumupika: Initial MQ implementation
maplestarplayl: IPC integration, SPDK integration
RuileLu: KV lease support
Summage: Initial KV architecture optimization

📄 License

Fluxon is open-sourced under Apache License 2.0, see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github/workflows		.github/workflows
deployment		deployment
examples		examples
fluxon_doc_cn		fluxon_doc_cn
fluxon_doc_en		fluxon_doc_en
fluxon_py		fluxon_py
fluxon_release		fluxon_release
fluxon_rs		fluxon_rs
fluxon_test_stack		fluxon_test_stack
pics		pics
scripts		scripts
setup_and_pack		setup_and_pack
.gitignore		.gitignore
AGENTS.md		AGENTS.md
AGENTS_CN.md		AGENTS_CN.md
LICENSE		LICENSE
README.md		README.md
README_CN.md		README_CN.md
build_config_ext_static.yml		build_config_ext_static.yml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fluxon

🧭 Contents

🧱 Foundation Capabilities

🔌 Interface Capabilities

Fluxon KV/RPC

Fluxon MQ

Fluxon FS

📊 Benchmark

Fluxon RPC Benchmark

Fluxon KV Benchmark

Fluxon FS Benchmark

Fluxon MQ Benchmark

🧰 Runtime Requirements

🚀 Quick Start

KV Quick Start

MQ Quick Start

FS Quick Start

🗂️ Repository Structure

🤝 Contributing

👥 Contributors

📄 License

⭐ Stargazers over time

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Fluxon

🧭 Contents

🧱 Foundation Capabilities

🔌 Interface Capabilities

Fluxon KV/RPC

Fluxon MQ

Fluxon FS

📊 Benchmark

Fluxon RPC Benchmark

Fluxon KV Benchmark

Fluxon FS Benchmark

Fluxon MQ Benchmark

🧰 Runtime Requirements

🚀 Quick Start

KV Quick Start

MQ Quick Start

FS Quick Start

🗂️ Repository Structure

🤝 Contributing

👥 Contributors

📄 License

⭐ Stargazers over time

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages