Senior Data Engineer focused on building scalable cloud data platforms, reliable analytics workflows, and practical GenAI-enabled solutions.
I build production-style data workflows across GCP, AWS, and Azure, with hands-on work in Python, SQL, Airflow, dbt, BigQuery, Databricks, Spark, and Terraform.
- Scalable batch and orchestration workflows for analytics and operational data use cases
- Cloud-native data platforms with infrastructure as code, testing, and CI/CD
- Analytics engineering layers for trustworthy reporting and decision support
- GenAI-enabled workflows for reporting, knowledge access, and data product experiences
Production-style GCP data platform for IoT energy monitoring using Cloud Run Jobs, Composer, BigQuery, dbt, Terraform, and CI workflows.
AWS data lake infrastructure provisioned with Terraform, modular environments, secure defaults, MWAA orchestration, and analytics-ready services.
ETL and reporting pipeline for B3 reference rates with Airflow orchestration, layered storage, and LLM-assisted executive reporting.
Bilingual technical documentation portfolio covering data architecture, pipelines, quality, observability, governance, and cloud patterns.
- Data platform design
- Analytics engineering
- Workflow orchestration
- Cloud architecture
- Infrastructure as code
- Data quality and observability
- GenAI applied to data workflows
- Strengthening production-style portfolio projects for international opportunities
- Expanding practical GenAI use cases for data teams and analytics workflows
- Deepening multi-cloud platform patterns across GCP, AWS, and Azure
Python SQL Spark PySpark Airflow dbt BigQuery Databricks Terraform GCP AWS Azure
- LinkedIn: linkedin.com/in/lucas-skuja
- GitHub: github.com/lucasskuja
- Email:
lucas_skuja@hotmail.com
I use this GitHub to share practical data platform projects, architecture decisions, and applied experiments at the intersection of data engineering and GenAI.

