Welcome! Let's dive into Apache Iceberg together in this interactive, hands-on workshop. Whether you're brand-new or already familiar with Iceberg, you'll walk away with practical skills and resources to confidently integrate Iceberg into your data workflows.
We'll explore Iceberg fundamentals, core concepts, and real-world table operations using Python-based tools like PyIceberg and DuckDB. Together, we'll follow data through the entire lifecycle—from ingestion and transformation to multi-engine querying.
This first session is a warm-up focused on clear, easy-to-follow examples of basic table operations with PyIceberg to introduce you to the essentials of Iceberg.
If you're having trouble installing Docker Desktop on your laptop, don't worry! We've prepared a Google Colab version of the workshop that doesn't require any local setup. Simply click the link below and follow the instructions provided to run the workshop:
Please make sure your laptop is ready:
- ✅ Working Laptop: Ensure you have network connectivity.
- ✅ Git Installed: Install Git here.
- ✅ Docker Desktop Installed: We'll use Docker for quick setup. Follow instructions for your system:
Want to save time on workshop day? You can build and prepare the environment ahead of time by running:
docker compose up -d --build
Once you see the following message in your terminal
it means all the containers are setup successfully. You can then shutdown them via
docker compose down
In your terminal, navigate to your desired folder and run:
git clone https://github.com/HonahX/iceberg-summit-workshop.gitInside the repo directory, start the environment by running:
docker compose up -dProceed to the next step once you see
in your terminal.
- Open your favorite web brower
- Type the following address into the address bar:
- Press Enter and you shall the see workshop notebook like below.
When you're finished, run the following code in the terminal that ran step 1:
docker compose downOnce you see
every component of the workshop has been shut down.
Ready to go deeper?
Check out the second part of the workshop focused on production use cases:
https://github.com/dlt-hub/workshop-iceberg-summit
We can't wait to get started—see you there! 🎉


