Skip to content
View StarlitVienna's full-sized avatar

Highlights

  • Pro

Block or report StarlitVienna

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
StarlitVienna/README.md

Hi, I am Vienna

Data Engineer & Machine Learning Architect

I specialize in building high-performance data pipelines and custom deep learning infrastructure. My core focus is architecting real-time systems that handle complex data pipelines.


Engineering

Real-Time Data Ingestion Service

A resilient backend service that maintains persistent WebSocket connections to Binance for live market data ingestion. It features a custom "self-healing" mechanism that automatically detects sequence gaps caused by network instability and asynchronously fetches historical fragments to ensure 100% data continuity. It applies vectorized stationarity transformations via NumPy in real-time.

High-Concurrency Data Delivery Layer

A high-performance REST API built with FastAPI and Async SQLAlchemy. It acts as the bridge between the raw data engine and downstream ML models, serving normalized, scale-invariant financial features with low latency. Designed for asynchronous load handling to support multiple concurrent trading agents.

Deep Learning Research (PyTorch)

A research-grade implementation of a Multi-Target Time-Series Transformer built entirely from scratch. Unlike standard models, this architecture utilizes dual-output decoders to simultaneously solve regression tasks (Log Returns) and classification tasks (Volatility Regimes). It is engineered to handle chaotic, non-stationary financial distributions without look-ahead bias.

Engineered ML Dataset

A curated, high-frequency dataset of 1-minute Bitcoin candles, pre-processed specifically for deep learning stability. Unlike raw OHLCV data, this dataset features rigorous stationarity engineering using Robust Scaling and Arcsinh transformations to eliminate distribution shifts, making it ready for immediate training of gradient-sensitive models.


Tech Stack

Languages

python c cplusplus bash

Backend & Systems

fastapi selenium sqlalchemy sqlite linux

Data Science & Machine Learning

pytorch tensorflow scikit_learn pandas numpy


Contacts

Pinned Loading

  1. quantcandle-engine quantcandle-engine Public

    Websocket for Binance's klines and feature engineering.

    Python

  2. quantcandle-api quantcandle-api Public

    Stationary feature engineering for financial data

    Python

  3. Machine-learning-kaggleDatasets Machine-learning-kaggleDatasets Public

    Solving kaggle datasets with pytorch

    Jupyter Notebook 1

  4. StarlitVienna StarlitVienna Public

    just my README ;)

    2

  5. PlurallScraper PlurallScraper Public

    Projeto com intuito de conseguir fazer backup da aba maestro na plataforma Plurall

    Python 1

  6. Background-Remover-backend Background-Remover-backend Public

    Background Remover backend

    Jupyter Notebook