I'm Hamza, a Data Engineer / Data Scientist. I am passionate about everything related to AI, Cloud and Distributed Systems!
Here are the latest posts from my blog [paperboat.dev]:
-
Running Scheduled Tasks (Cron Jobs) on AWS Lambda with Terraform
AWS Lambda is one of the easiest to use and most popular AWS services. Lambda Functions can be used as API endpoints, Pipeline Tasks, or as we will see in this article: Scheduled jobs. -
Farewell Pandas.. Koalas is here to stay! - Koalas: Pandas API on Spark
Koalas is a new library introduced by Spark / Databricks, that mimics the Pandas API, on top of Spark! In this article we will take a look on how to install it, use it, and compare its performance to Pandas. -
Serverless on ARM: Running AWS Lambda on ARM / Graviton2
With the announcement of AWS's Graviton2 ARM availability for Lambda functions, we will be taking a look into it and testing its performance compared to the legacy x86_64 Intel architecture. -
MongoDB 5.0 Throughput Performance vs. Data Integrity Balance - A Comparison with 4.4
High throughput and efficiency is one of the biggest highlights of NoSQL databases. In this post we will review the Throughput performance of MongoDB 5.0 with and without data integrity checks (safe writes).