Skip to content
View CuteChuanChuan's full-sized avatar
🐈
Karlie and Lily
🐈
Karlie and Lily

Block or report CuteChuanChuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
CuteChuanChuan/README.md

Raymond Hung

Data Engineer | Apache Open Source Contributor

Portfolio LinkedIn GitHub


Tech Stack

Data Processing & Query Engines

Apache Spark Apache DataFusion Apache Airflow Apache Iceberg Apache Kafka Polars PostgreSQL

Languages

Python Rust Scala Java

Infrastructure

Docker Kubernetes AWS GCP


Open Source Contributions

Merged PRs across 5 Apache projects:

Project Focus
Apache DataFusion Spark-compatible functions (json_tuple, size), doc formatting, benchmarks
Apache DataFusion-Comet QueryPlanSerde modular refactoring (Spark accelerator plugin)
Apache DataFusion-Ballista Configurable gRPC timeouts for distributed query engine
Apache Iceberg ErrorProne fixes, test naming, docs
Apache Ozone Dead code removal

Blog

Technical articles on my open source work: cutechuanchuan.github.io/posts

Pinned Loading

  1. apache/datafusion apache/datafusion Public

    Apache DataFusion SQL Query Engine

    Rust 8.4k 2k

  2. apache/datafusion-ballista apache/datafusion-ballista Public

    Apache DataFusion Ballista Distributed Query Engine

    Rust 2k 264

  3. apache/datafusion-comet apache/datafusion-comet Public

    Apache DataFusion Comet Spark Accelerator

    Scala 1.1k 286

  4. apache/iceberg apache/iceberg Public

    Apache Iceberg

    Java 8.6k 3k

  5. DataPulse DataPulse Public

    Lightweight Data Quality Observability Platform

    Python

  6. Dive-Into-Iceberg Dive-Into-Iceberg Public

    In-depth exploration of Apache Iceberg features, performance optimizations, and best practices.

    Scala