Skip to content
View freddiev4's full-sized avatar
🚢
ship it
🚢
ship it

Organizations

@github-beta @jupyterlab @Cohere-Labs-Community @Hugging-Face-Supporter @Hugging-Face-Helping-Hand @quotient-ai

Block or report freddiev4

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
freddiev4/README.md

✨ Hello! ✨

I build systems and tools for Machine Learning -- previously I worked at GitHub on GitHub Copilot building evals & infra for tab completion and Chat, and did open research with Cohere Labs on Aya.

Follow me on X for updates.

Past Research

Venue Paper Contributions
Nature 2026 Humanity's Last Exam Contributed a difficult math & statistics question about theoretical max damage in Old School Runescape
ACL 2024 (Best Paper Award) Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model Data Engineering to create the Aya Human Annotated set of the Aya eval suite: dataset
ACL 2024 (1st Author) Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Designed, built, and scaled out the backend for an open data annotation platform for over 3000 contributors and 300,000+ annotations for training multilingual instruction-tuned LLMs on mixed-resource languages: repo. Co-created the Aya Score to encourage participants to incorporate more edits during annotation, increasing average annotation length by >50%
CSCW 2017 Crowd Guilds: Worker-led Reputation and Feedback on Crowdsourcing Platforms Helped design a reputation system for workers and requesters to gain experience, receive higher wages and higher quality work

Current Projects

  • rune — a coding agent for data engineering, search, and analytics over my personal data
  • aqueduct — agent-owned DAG-based data pipelines to NAS & S3
  • yts3 — rust library for encoding arbitrary files into lossless video, using YouTube as S3 storage

Pinned Loading

  1. agents agents Public

    plugins, skills, scripts, etc for interacting with coding agents

    Shell 1

  2. pokeshadowbench pokeshadowbench Public

    "Who's that Pokemon?" evals for LLMs

    Python 2

  3. aqueduct aqueduct Public

    agent-owned DAG-based data pipelines to NAS & S3

    Python 3

  4. rune rune Public

    general purpose data agent for swe data tasks

    Python 5

  5. yts3 yts3 Public

    rust library for using YouTube as S3 storage

    Rust 1

  6. fvfs fvfs Public

    virtual filesystem with tiered storage across local disk, NAS, and S3

    Rust 1