Skip to content
View Gustavo-HA's full-sized avatar
🏠
Working from home
🏠
Working from home
  • Centro de Investigación en Matemáticas (CIMAT)
  • 12:24 (UTC -06:00)
  • LinkedIn in/gustavo-ha

Highlights

  • Pro

Block or report Gustavo-HA

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Gustavo-HA/README.md

Hi, I'm Gustavo 👋

I'm a Data Scientist & LLM Engineer based in Monterrey, México, currently finishing my M.Sc. in Data Science at CIMAT. My work lives at the intersection of Natural Language Processing, LLM systems, and MLOps. Right now I'm leading research on tourist review analysis for Mexican destinations, building RAG pipelines, and thinking a lot about how to make language models actually useful in the real world.


Featured Projects

RAG system that turns scattered tourist reviews into strategic intelligence reports for México's Pueblos Mágicos.

Built a full LangGraph Map-Reduce workflow that retrieves reviews from a ChromaDB vector store, extracts structured insights per business type in parallel, and consolidates them into a validated strategic briefing — with a self-correction audit loop and an interactive Streamlit + FastAPI interface on top.

LangGraph LiteLLM ChromaDB FastAPI Streamlit MLflow DVC


End-to-end MLOps pipeline for loan eligibility prediction, deployed on AWS.

Full ML lifecycle: experiment tracking with MLflow, pipeline orchestration with Prefect, cloud infrastructure provisioned with Terraform, and real-time serverless inference via AWS Lambda + Kinesis. Includes unit tests, LocalStack integration tests, and automated linting... the kind of project where the engineering matters as much as the model.

MLflow Prefect Terraform AWS Docker LocalStack


Hybrid Two-Tower architecture that tackles the cold-start problem in music recommendation.

Combined content-based multimodal encoders with sequential user modeling to recommend music even for new users or tracks with no history.

PyTorch HuggingFace


A highlight

At REST-MEX 2025, a national NLP competition focused on Mexican tourism reviews, our team placed 7th out of all participating teams, earning an honorable mention. Good times.


Let's connect

I'm always open to interesting conversations about NLP, LLM systems, or MLOps. Feel free to reach out!

LinkedIn Email

Pinned Loading

  1. loan_prediction loan_prediction Public

    End to End online ML prediction of loan approval.

    Python 3 1

  2. restmex-2025 restmex-2025 Public

    Participación del equipo Corpus Christi en el REST-MEX 2025

    Jupyter Notebook 3 1

  3. DiegoPaniagua23/music-recommendation-multimodal DiegoPaniagua23/music-recommendation-multimodal Public

    Implementation of a Hybrid Two-Tower Architecture for Music Recommendation. Solves cold-start problems by integrating content-based multimodal encoders with sequential user modeling.

    Jupyter Notebook 6

  4. Tourism-Reports-LLM Tourism-Reports-LLM Public

    RAG system to synthesize tourism reports on Pueblos Mágicos from México.

    Python