Skip to content
View davidvanegas2's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report davidvanegas2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
davidvanegas2/README.md

GitHub followers LinkedIn Badge Medium Badge

Hi there, I'm David 👋

I'm David, a passionate Senior Data Engineer with a deep expertise in designing and building scalable, cloud-based data solutions. I specialize in leveraging AWS technologies and Python to transform raw data into actionable insights, helping businesses unlock the full potential of their data.

Currently, I'm a Senior Data Engineer at Workstate, where I contribute to impactful data projects that drive meaningful outcomes for clients. With a solid background in consulting and product-focused roles, I excel at bridging technical excellence with strategic problem-solving.


🛠️ Skills & Expertise:

  • Cloud Technologies: Proficient in AWS services (Glue, EMR, Lambda, S3, Athena, DynamoDB, etc.), Terraform, and Infrastructure as Code (IaC).
  • Programming: Advanced in Python, with a focus on data engineering, automation, and backend development.
  • Big Data: Experience with distributed systems like Apache Kafka and Trino, designing and optimizing data architectures for high-volume pipelines.
  • Consulting: Skilled in scoping and delivering tailored solutions for diverse clients, ensuring efficiency and scalability.

🌍 About Me:

  • 🔭 Current Role: Senior Data Engineer at Workstate
  • 🚀 Specialization: AWS Services, Python Programming, Cloud Data Architectures, Big Data
  • 🌱 Currently Learning: Advanced CI/CD practices and expanding my knowledge in modern data lakehouse frameworks.
  • 💬 Ask Me About: Data Engineering, AWS Services, Python Programming, Big Data, and Cloud Infrastructure.
  • 🌍 Languages: Spanish (Native), English (Fluent).
  • Fun Fact: I love building tools that simplify workflows and improve efficiency—whether it’s for internal teams or clients. Sharing knowledge with the community is my way of growing and giving back.

📫 Let’s Connect!


I’m always looking to collaborate on challenging projects and explore opportunities where I can make a meaningful impact. Let’s connect and build something great together! 🚀

David's github stats

Pinned Loading

  1. iceberg-s3-terraform-glue iceberg-s3-terraform-glue Public

    Automated setup of Apache Iceberg on Amazon S3 using Terraform and AWS Glue Data Catalog. Explore the power of a Lakehouse architecture for data management and analysis, featuring schema discovery,…

    Python 6 1

  2. StreamSoft-Real-Time-Market-Analysis StreamSoft-Real-Time-Market-Analysis Public

    StreamSoft enables real-time analysis of any stock market

    Python 13 1

  3. door2door-de door2door-de Public

    This repository contains a solution to automate the build of a scalable data lake and data warehouse for Door2Door, a company that collects live position data from its fleet of vehicles via GPS sen…

    Python

  4. KafkaRedditFlow KafkaRedditFlow Public

    A practical Kafka data pipeline project showcasing real-time data streaming, AWS MSK, Terraform IaC, and Python. Includes data production, consumption, and storage in S3 via Kinesis Firehose.

    Python