I'm an ETL Developer & Data Engineer specializing in enterprise data pipelines and modern data lakehouse architectures.
I build production-grade data solutions that transform raw data into actionable insights, working with massive datasets (18+ billion records) in the healthcare domain.
- Python (PySpark), SQL
- Databricks, Apache Airflow, dbt, Delta Lake, Informatica
- SSMS, PostgreSQL, MongoDB
- Python (NumPy, Pandas), PyTorch, Object Detection Models (YOLO)
- AWS, Ansible, Docker
⭐️ Don’t forget to check out my pinned repositories!
