-
emsalcengiz.github.io Public
Personal blog & portfolio of Emsal Cengiz | Data Engineer | Writing about data engineering, cloud, and software.
HTML UpdatedAug 23, 2025 -
-
music-streaming-log-analysis Public
Script to analyze music streaming logs, calculating the distribution of distinct song plays per client on a specific date. Outputs a summary table of client counts by distinct play count
Python UpdatedAug 18, 2024 -
requests-scala Public
Forked from com-lihaoyi/requests-scalaA Scala port of the popular Python Requests HTTP client: flexible, intuitive, and straightforward to use.
Scala Other UpdatedJan 25, 2023 -
CheckoutMangmentExample Public
Forked from antonagre/CheckoutMangmentExamplerest calling
Java UpdatedSep 27, 2022 -
-
-
data-algorithms-book Public
Forked from mahmoudparsian/data-algorithms-bookMapReduce, Spark, Java, and Scala for Data Algorithms Book
Java Other UpdatedDec 28, 2021 -
DataEngineeringProject Public
Forked from damklis/DataEngineeringProjectExample end to end data engineering project.
Python MIT License UpdatedDec 13, 2021 -
Apache-Beam-examples Public
liked Apache Beam for streaming data transformations
Python UpdatedOct 12, 2021 -
get_users Public
After ETL done by reading static data, an API is designed with flask_sqlalchemy, the purpose is to show the top five users
Python UpdatedAug 25, 2021 -
filtering-process Public
You can do a lot of things with Apache Spark. What I've done here is to work with a static file and create a Batch ETL system.
Python UpdatedAug 2, 2021 -
I made various data normalization operations with python scripts. Target data in CSV format
-
-
Web-APIs Public
ways to make an HTTP request in JavaScript
-
Etl_processing Public
I find Apache Airflow very useful for ETL work. Here I transferred data from the source database(mysql) to the target database(postresql) and used the Airflow Bash Operator.
-
trendyol-bootcamp-spark Public
Forked from dogukannefis-py/trendyol-bootcamp-sparkScala UpdatedJan 22, 2021 -
-
-
bash-scripting Public
A collection of simple Bash scripts.
-
Data-Science--Cheat-Sheet Public
Forked from georgearun/Data-Science--Cheat-SheetCheat Sheets
2 UpdatedNov 19, 2020 -
Udacity-Data-Engineering-Projects Public
Forked from san089/Udacity-Data-Engineering-ProjectsFew projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Python Other UpdatedMar 5, 2020 -
Python Public
Forked from sohailRa/PythonAll Algorithms implemented in Python
-
clean-code-javascript Public
Forked from Zzz-Arvin/clean-code-javascript🛁 Clean Code concepts adapted for JavaScript
-
spark-stream Public
Forked from nologic/spark-streamA very neat mechanism for setting up data flow through a modularized system with Twitter as initial test case.
Scala GNU General Public License v2.0 UpdatedDec 7, 2014