Skip to content
View duc-dn's full-sized avatar
🎯
Focusing
🎯
Focusing
  • VNPT AI
  • Hà Nội
  • 08:02 (UTC -12:00)

Block or report duc-dn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
duc-dn/README.md

👋 Hi, I'm Duc

🚀 Data Engineer @ VNPT AI
🔹 Building scalable Lakehouse & Data Platforms
🔹 Experienced with Big Data, Streaming, and Cloud Infrastructure
🔹 Passionate about Data Infrastructure, APIs, and Workflow Orchestration


🔧 Tech Stack

💻 Programming Languages

  • Python (ETL, APIs, data pipelines, orchestration)
  • Java (Big Data, Kafka, Flink, Spark ecosystem)

📊 Data & Lakehouse

  • Apache Iceberg, Delta Lake
  • Apache Spark, Apache Flink
  • Kafka, Kafka Connect, Debezium (CDC from Postgres/MySQL/MongoDB)

☁️ Cloud & Storage

  • Google BigQuery, Cloud Scheduler
  • AWS S3, MinIO
  • GCS (Google Cloud Storage)

🗄️ Databases & Vector Search

  • PostgreSQL, MySQL, MongoDB
  • Qdrant (Vector Database)

📈 BI & Visualization

  • Apache Superset

🕒 Workflow Orchestration & Scheduling

  • Apache Airflow, Cronjob, Cloud Scheduler

⚙️ DevOps & Infra

  • Docker, Docker Compose
  • Kubernetes, Helm
  • Terraform, GitHub Actions

🌐 API & Software

  • FastAPI
  • Git, GitHub (version control & collaboration)

📌 Featured Projects


🌱 What I’m Learning

  • Data mesh & federated query engines (Trino/Presto, Dremio)
  • Advanced Iceberg optimizations (partitioning, compaction, metadata scaling)
  • Hybrid pipelines (batch + streaming with Flink + Spark)
  • AI/LLM integration with vector databases (Qdrant)

📫 Connect with Me


⭐️ From ducdn

📊GitHub Stats :




Popular repositories Loading

  1. BTL_Java_QLCB BTL_Java_QLCB Public

    Java 2

  2. hudi-cli-with-minio hudi-cli-with-minio Public

    setup Hudi CLI to connect to minio in local

    Shell 2

  3. realtime-analytic realtime-analytic Public

    Đồ án xây dựng mô hình phân tích và xử lý dữ liệu realtime - copyright ducdn

    TypeScript 1

  4. kafka-cluster kafka-cluster Public

    create kafka-cluster with docker-compose

    Python 1

  5. SQL-Server SQL-Server Public

    TSQL

  6. Python_Basics_Tutorial Python_Basics_Tutorial Public

    Forked from CodexploreRepo/python-youtube-tutorials

    Python