🎯
Focusing
ML Engineer @ Mercedes-Benz R&D · M.Tech CS, IISc Bengaluru · LLM inference, fine-tuning & optimization
-
Mercedes-Benz R&D India
- Bengaluru, India
- in/achitya-singh-5512bb162
Pinned Loading
-
efficient-llm-finetuning
efficient-llm-finetuning PublicEfficient LLM fine-tuning & deployment: LoRA, QLoRA, PTQ and QAT — with benchmarking and config-driven pipelines.
Python 2
-
llm-inference-engine
llm-inference-engine PublicA from scratch LLM inference engine build in PyTorch with custom GPT2 transformers, kv cache, paged kv cache, continuous batching and A100 benchmarks
Python 1
-
tinystories-transformer-training
tinystories-transformer-training PublicDecoder-only Transformer trained from scratch with token-based stopping, optimizer & scheduler ablations
Python
-
nn-from-scratch-numpy
nn-from-scratch-numpy PublicThis repo contains MLP implementation from scratch using numpy
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
