Skip to content

BCSDLab/KOIN_AIRFLOW

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

27 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ›  KOIN_AIRFLOW

KOIN μ„œλΉ„μŠ€μ˜ 데이터 νŒŒμ΄ν”„λΌμΈμ„ κ΄€λ¦¬ν•˜κΈ° μœ„ν•œ Apache Airflow ν™˜κ²½μž…λ‹ˆλ‹€.
GA4 β†’ BigQuery β†’ Dataform으둜 κ΅¬μ„±λœ ETL νŒŒμ΄ν”„λΌμΈμ„ μŠ€μΌ€μ€„λ§, λͺ¨λ‹ˆν„°λ§, μž¬μ‹œλ„ν•˜κΈ° μœ„ν•΄ Airflowλ₯Ό μ‚¬μš©ν•©λ‹ˆλ‹€.


πŸ“Œ Purpose

  • 일 λ‹¨μœ„ 데이터 적재 및 λ³€ν™˜ μž‘μ—… μžλ™ν™”
  • 데이터 νŒŒμ΄ν”„λΌμΈ μ‹€ν–‰ μƒνƒœ κ°€μ‹œν™”
  • μ‹€νŒ¨ μ‹œ μž¬μ‹œλ„ 및 μ•ˆμ •μ μΈ 운영
  • DAG 성곡/μ‹€νŒ¨ μƒνƒœμ— λŒ€ν•œ Slack μ•Œλ¦Ό 제곡

βš™οΈ Architecture

  • Apache Airflow (Docker Compose 기반)
  • Executor: CeleryExecutor
  • Metadata DB: PostgreSQL
  • Message Broker: Redis
  • ETL: Dataform 기반 BigQuery νŒŒμ΄ν”„λΌμΈ
  • Notification: Slack Webhook을 ν†΅ν•œ μž‘μ—… μƒνƒœ μ•Œλ¦Ό

πŸ§ͺ Usage

  • 둜컬 개발 및 ν…ŒμŠ€νŠΈ ν™˜κ²½
  • μ„œλ²„ ν™˜κ²½μœΌλ‘œ ν™•μž₯ κ°€λŠ₯ν•œ ꡬ쑰
  • 운영 ν™˜κ²½μ—μ„œλŠ” 별도 μ„œλ²„(VM)μ—μ„œ μƒμ‹œ 싀행을 μ „μ œ

πŸ”” Notification

  • DAG μ‹€ν–‰ κ²°κ³Ό(성곡/μ‹€νŒ¨)λ₯Ό Slack으둜 전솑
  • 운영 쀑 νŒŒμ΄ν”„λΌμΈ 이상 μ§•ν›„λ₯Ό μ¦‰μ‹œ 인지할 수 μžˆλ„λ‘ ꡬ성

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published