Skip to content

hwRG/AST-Real-time-Emergency-Classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

7 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

AST-Real-time-Emergency-Classification

Base on

๋ณธ ํ”„๋กœ์ ํŠธ๋Š” YuanGongND๋‹˜์ด ๊ตฌํ˜„ํ•œ ast ์†Œ์Šค ์ฝ”๋“œ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๊ตฌํ˜„ํ–ˆ์Šต๋‹ˆ๋‹ค.

Introduction

  • ๊ณ ๋ นํ™” ์‚ฌํšŒ์— ์ ‘์–ด๋“ค๋ฉฐ ๋…ธ์ธ 1์ธ ๊ฐ€๊ตฌ ๋น„์ค‘์ด ์ฆ๊ฐ€ํ•˜๋ฉฐ, ๊ณ ๋…ํ•˜๊ฒŒ ์ƒ๋ช…๊ณผ ์ง๊ฒฐ๋œ ์œ„ํ—˜์— ๋†“์—ฌ์žˆ๋Š” ๋…ธ์ธ ๋น„์ค‘๋„ ์ฆ๊ฐ€ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ์ •๋ถ€์™€ ์ง€์ž์ฒด๋Š” ์ด ๋ฌธ์ œ๋ฅผ ํŒŒ์•…ํ•˜์—ฌ SKT ๋“ฑ ๊ธฐ์—…๊ณผ ํ˜‘์—…ํ•˜์—ฌ ๋„์›€์„ ์ค„ ์ˆ˜ ์žˆ๋Š” ์ธ๊ณต์ง€๋Šฅ์„ ๋ณด๊ธ‰ํ•˜๊ณ  ์žˆ๋‹ค. ์ด์— ๋”ฐ๋ผ AI ์Šคํ”ผ์ปค์— ํƒ‘์žฌ๋  ์ˆ˜ ์žˆ๋Š” ์‹ค์‹œ๊ฐ„์œผ๋กœ ์œ„๊ธ‰ ์ƒํ™ฉ์„ ๊ฐ์ง€ํ•  ์ˆ˜ ์žˆ๋Š” ๊ธฐ๋Šฅ์„ ๊ฐœ๋ฐœํ•˜๊ณ ์ž ํ•ฉ๋‹ˆ๋‹ค.

To Do

  • Real-time์œผ๋กœ ์˜ค๋””์˜ค ์‹ ํ˜ธ๋ฅผ ์ฝ๊ณ , ์‘๊ธ‰ ์ƒํ™ฉ์„ ์˜ˆ์ธกํ•˜๊ณ  ์ฆ‰๊ฐ์ ์ธ ๋Œ€์‘
  • Transformer ๊ธฐ๋ฐ˜ ๊ณ ์„ฑ๋Šฅ Audio Classification ๋ชจ๋ธ AST ์ฑ„ํƒ

Model Architecture

AST(Audio Spectrogram Transformer)

  1. ์˜ค๋””์˜ค์— ๋Œ€ํ•œ STFT ๊ฒฐ๊ณผ๋ฌผ์ธ Spectrogram์„ n๊ฐœ๋งŒํผ ๋‚˜๋ˆ„๊ณ  linear projection ์ˆ˜ํ–‰ํ•ฉ๋‹ˆ๋‹ค.
  2. Linear projection ๊ฒฐ๊ณผ๋ฌผ์— positional embedding์„ ๊ฑฐ์ณ์„œ ํฌ์ง€์…˜ ๊ฐ’์„ ๊ฐ–๊ณ  Transformer์˜ Encoder ํ†ต๊ณผํ•ฉ๋‹ˆ๋‹ค.
  3. Encoder์˜ ๊ฒฐ๊ณผ๋ฌผ์„ ํ™œ์„ฑํ™” ํ•จ์ˆ˜๊ฐ€ sigmoid(softmax)์ธ Dense layer๋ฅผ ์ง€๋‚˜ ์ตœ์ข… ๊ฒฐ๊ณผ๋ฌผ์˜ ํ™•๋ฅ  ์˜ˆ์ธกํ•ฉ๋‹ˆ๋‹ค.

AST ํ•ต์‹ฌ ์•„์ด๋””์–ด

  • Vision Transformer์™€ ์œ ์‚ฌํ•œ ๊ตฌ์กฐ๋ฅผ ๊ฐ€์ ธ Vision Transformer๋กœ ImageNet์„ ํ•™์Šตํ•œ Pre-trained ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ํ™œ์šฉํ•ฉ๋‹ˆ๋‹ค. ์ด๋•Œ, Transfer Learning์„ ์ˆ˜ํ–‰ํ–ˆ์„ ๋•Œ ์‘๊ธ‰ ์ƒํ™ฉ ๋ฐ์ดํ„ฐ ๊ธฐ์ค€ 40 epoch 99.1% ์ •ํ™•๋„๋ฅผ ๋‹ฌ์„ฑํ•˜๋Š” ๊ฒƒ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  • Mixup ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์‚ฌ์šฉํ•˜์—ฌ ์„ ํƒ๋œ ๋ฐ์ดํ„ฐ์™€ ๋žœ๋คํ•œ ๋ฐ์ดํ„ฐ๋ฅผ beta ๋ถ„ํฌ๋กœ ์„ž์–ด ํ•™์Šต์— ํ™œ์šฉํ•ฉ๋‹ˆ๋‹ค.

How to Train

ํ•™์Šต์„ ์ˆ˜ํ–‰ํ•˜๊ธฐ ์ „, label indices.csv์™€ data.csv๋ฅผ ๋ฏธ๋ฆฌ ์ค€๋น„ํ•ฉ๋‹ˆ๋‹ค.
๊ทธ๋ฆฌ๊ณ  egs/emergency ๋””๋ ‰ํ† ๋ฆฌ์—์„œ run_emergency.sh ์Šคํฌ๋ฆฝํŠธ๋ฅผ ์‹คํ–‰ํ•ฉ๋‹ˆ๋‹ค. (./run_emergency.sh)

About

Real-time Audio Classification (with Emergency situation)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published