sadkanuos/README.md

Hi 👋, I'm Sounak.

Data Engineer | AWS | Databricks | PySpark | SQL | ETL Automation



About Me:

  • Data Engineer designing and deploying scalable ETL pipelines, automating data workflows, and building reliable cloud-based data systems, with a focus on improving data accessibility, consistency, and performance across platforms.
  • Key expertise in Python, SQL, PySpark, and Delta Lake.
  • In-depth knowledge of Databricks and AWS services such as Glue, Lambda, Redshift, Athena, and S3 for building end-to-end data engineering solutions, with skills transferable to the Azure ecosystem (ADF, Azure SQL, Azure Functions, ADLS, etc.).
  • Proficient in:
    • Data Ingestion
    • Data Aggregation
    • Data Transformation
    • Data Pre-processing
    • Supervised and unsupervised machine learning techniques
    • Data Validation
  • Implemented an end-to-end data monitoring and transformation ETL pipeline using a medallion architecture and a metadata-driven design, streamlining ingestion, transformation, and validation across multiple data sources.
  • Experienced in integrating data pipelines with BI Tools to enable faster decision-making and improve data visibility.
  • Involved in data governance, lineage tracking, and monitoring to ensure consistent data quality and operational reliability.
  • Experience in creating CI/CD workflows and pipelines using Azure DevOps and GitHub Actions.
  • I am currently exploring frameworks like Apache Airflow, dbt, Great Expectations and Apache Iceberg to diversify my personal toolkit.
  • I am also exploring the Infrastructure as Code paradigm and DevOps practices using Jenkins and Terraform.
  • I hold the AWS Certified Developer - Associate and AWS Certified Data Engineer - Associate certifications.
  • I am also Microsoft Certified in Azure Fundamentals.

💻 Machine Configuration:

Connect with me:

linkedin outlook

Programming Languages:

Tools and Frameworks

Databases

PL/SQL, DynamoDB, Redshift, MongoDB

Cloud Platforms

Pinned

  1. SKU_Unsupervised (Public)

    Post Graduation Major Project (Jupyter Notebook)