You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
end-to-end Big Data pipeline that demonstrates advanced proficiency in distributed computing, data engineering, and automated validation. Rather than relying on a single script or a simplified analytics task, the project is designed to simulate a real-world enterprise data architecture where raw data is generated at scale
Este é um projeto acadêmico para a disciplina de "BigData" do curso de Ciências da Computação e ADS do Centro Universitário do Vale do Ipojuca. Consiste em um software de gerenciamento de atendimento de uma clínica veterinária regional.
Repositorio con proyectos y laboratorios de procesamiento de datos utilizando Databricks, Apache Spark y Python. Incluye conceptos clave de Big Data, almacenamiento, procesamiento, análisis y aprendizaje automático.
Explore the capabilities of Amazon EMR Serverless by processing semi-structured review data with Apache Spark, showcasing efficient big data analysis without managing clusters.
Explore and replicate Amazon EMR (Elastic MapReduce) setup and utilization for big data processing and analytics tasks, featuring comprehensive demonstrations from VPC creation to Spark job execution.