achi9629
🎯 Focusing

Pinned

  1. efficient-llm-finetuning Public

    Efficient LLM fine-tuning & deployment: LoRA, QLoRA, PTQ and QAT — with benchmarking and config-driven pipelines.

    Python 2

  2. llm-inference-engine Public

    A from-scratch LLM inference engine built in PyTorch with custom GPT-2 transformers, KV cache, paged KV cache, continuous batching, and A100 benchmarks.

    Python 1

  3. tinystories-transformer-training Public

    Decoder-only Transformer trained from scratch, with token-based stopping and optimizer & scheduler ablations.

    Python

  4. nn-from-scratch-numpy Public

    An MLP implementation written from scratch using NumPy.

    Python