Always believe that something wonderful is about to happen.
-
The University of New South Wales
- Sydney
-
21:01
(UTC +08:00) - https://github.com/MemoryWorld
- https://tieba.baidu.com/home/main?id=tb.1.2ee9a753.sbaT9G-NbW8839y4_MNw5w?t=1643138221&fr=index
Highlights
- Pro
Pinned Loading
-
-
cuda-kernels
cuda-kernels PublicFused LLM operator kernels from scratch: RMSNorm, RoPE, SwiGLU — Triton kernels benchmarked on RTX 5090
Python
-
llm-inference-bench
llm-inference-bench PublicBenchmarking LLM inference optimization: KV Cache, vLLM, Quantization on RTX 5090
Python
-
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

