🛠️ Skills & Expertise

💻 Programming

  • Python — ETL pipelines, data analysis, testing
  • SQL — analytics, warehousing, querying large datasets
  • R — statistical workflows, data validation, reproducible analysis
  • Bash — scripting & automation

⚙️ Data Engineering

  • Apache Airflow — orchestrating workflows
  • Apache Spark — distributed data processing
  • dbt — analytics engineering & transformations
  • Parquet & Arrow — high-performance data storage & in-memory analytics
  • DuckDB — fast local analytics

☁️ Cloud & Platforms

Google Cloud

  • BigQuery, Cloud Storage, Pub/Sub
  • Cloud Functions, Cloud Run, Composer

AWS

  • S3, Redshift, Athena, IAM

🗄️ Databases & Analytics

  • PostgreSQL, MongoDB, DuckDB, SQLite
  • Tableau — dashboards & visualization
  • Grafana — monitoring & metrics

🧰 Engineering Practices

  • CI/CD — GitHub Actions
  • Docker — containerized workflows
  • Unit testing & data validation
  • Data quality checks & reproducible pipelines