Areas of Interest

Data Engineering

I enjoy building scalable data pipelines and integrating diverse data sources to support analytics and machine learning workflows. My experience includes ETL pipeline automation, data modeling, and warehousing using tools like Snowflake, Airflow, and Python.

Data Analysis

I am passionate about uncovering insights through data. From cleaning and transforming datasets to building dashboards and reports, I use tools like Excel, Tableau, and SQL to help stakeholders make informed decisions backed by data.

Machine Learning

I explore machine learning solutions for classification, regression, and prediction tasks. I’ve worked with Scikit-learn, TensorFlow, and PyTorch to train and evaluate models that drive business impact.

Natural Language Processing

I have a deep interest in language models and text analysis. I have used NLP techniques for tasks like summarization, classification, and building both basic and semantic search engines. I use tools like Hugging Face Transformers, NLTK, and vector similarity techniques to develop models that understand text beyond keywords.

Recent Projects

SQL Data Warehouse

A modern SQL data warehouse using Medallion Architecture (Bronze, Silver, Gold) to transform and integrate over 100K+ raw ERP and CRM records into a clean, analytics-ready star schema, achieving 98%+ data quality through robust T-SQL transformations, standardization, and integrity checks.

  • T-SQL
  • SQL Server
  • git Git
  • git GitHub
  • draw.io Draw.io

Beauty Product Purchase Pipeline

An automated end-to-end pipeline to extract, transform, and load beauty purchase data using Python, Snowflake, and Airflow, enabling real-time trend analysis and interactive visualizations in Tableau to uncover spending habits and product preferences.

  • Python Python
  • SQL
  • Snowflake Snowflake
  • Airflow
  • tableau Tableau
  • Docker

Coffee Sales Excel Dashboard

An interactive Excel dashboard designed to empower business decisions by turning raw coffee sales data into clear, actionable insights using advanced formulas, pivot tables, and clean visual storytelling.

  • excel Excel
  • data-viz Data Visualization

Sales & Customer Dashboard

Interactive Sales Performance and Customer Analysis Tableau dashboards designed to help business stakeholders uncover sales trends, understand customer behaviour, and make data-driven decisions.

  • tableau Tableau
  • data-viz Data Visualization

Weather and Sales Trend Pipeline

A Snowflake-powered data pipeline with Streamlit visualization to analyze the relationship between weather patterns and sales trends in Hamburg, Germany.

  • Python Python
  • Snowflake Snowflake
  • Streamlit Streamlit

Sentiment Classification on Product Reviews

Developed and compared five NLP models - FastText, BERT, DistilBERT, RoBERTa, and XLNet - to classify product reviews as positive or negative, using pre-trained architectures and fine-tuning techniques.

  • Python Python
  • huggingface Hugging Face
  • scikit-learn Scikit-learn
  • PyTorch

Bank Institution Term Deposit Predictive Model

Developed a predictive model using XGBoost and Stratified K-Fold to identify customers likely to subscribe to bank term deposits, addressing class imbalance and optimizing performance.

  • Python Python
  • scikit-learn Scikit-learn
  • eda Exploratory Data Analysis