GitHub - CemRoot/Master-Uni-Project: Thesis deepfake detector: Real/Fake via TensorFlow/Keras (EfficientNet+Attention). Research use. National College Of Ireland

Deepfake Detection Web Application (Flask)

This repository contains a Flask web app for classifying images as Real or Fake using a TensorFlow/Keras model with an attention block. It offers a simple web UI and an HTTP endpoint.

TL;DR Quick Start

# 0) Prereqs: macOS 12+, Python 3.11.x

# 1) Create and activate a virtual environment
python3.11 -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate

# 2) Install dependencies
python -m pip install --upgrade pip
pip install -r requirements.txt

# 3) Add model assets (required to predict)
mkdir -p model
# Copy your trained files into ./model/
#  - best_model_effatt.h5
#  - label_transform.pkl

# 4) Run the app
python app.py  # opens http://127.0.0.1:5000/

What it does

Upload an image from the UI (/service) or send an HTTP POST (/deepfake).
Validates file type (PNG/JPG/JPEG) and resizes to 128×128.
Runs inference with a Keras model that includes an attention block and GAP rescaling.
Maps predicted index to label using model/label_transform.pkl (falls back to ["Fake", "Real"]).
Saves uploads under static/images/uploadedimage/ with unique filenames.

Repository Structure

FlaskWebApp/
  app.py                      # Flask server and inference logic
  requirements.txt            # Python dependencies
  model/                      # Place model files here (not tracked)
    best_model_effatt.h5      # Trained model weights (required)
    label_transform.pkl       # Label encoder (required)
  notebooks/                  # Jupyter notebooks used in the thesis (read-only)
  static/                     # Frontend assets (CSS/JS/images)
    images/uploadedimage/     # Upload destination (gitignored)
  templates/                  # Jinja2 templates (index.html, service.html)
  .gitignore                  # Excludes datasets, models, envs, upload outputs

Usage

Web UI: open http://127.0.0.1:5000/service, choose an image (PNG/JPG/JPEG), and submit.
Programmatic (HTTP):

curl -X POST \
  -F "file=@/path/to/image.jpg" \
  http://127.0.0.1:5000/deepfake

This returns the rendered service.html view. The app stores the uploaded image under static/images/uploadedimage/ with a unique filename (e.g., upload_<timestamp>_<uuid>.jpeg) and displays the predicted label with confidence.

How inference works

Preprocessing: OpenCV read → resize to 128×128 → float32 array
Model: H5 model loaded with custom_objects:
- attention_block(features, depth)
- RescaleGAP([gap_feat, gap_mask]) = gap_feat / gap_mask
Labels: from model/label_transform.pkl if present; otherwise ["Fake", "Real"].

Notes

Datasets, models, .venv, and upload outputs are ignored by git.
If model/labels are missing, the server runs but cannot predict.
On Apple Silicon, TensorFlow 2.18.0 requires macOS 12+ and Python 3.11.

Troubleshooting (quick)

Python must be 3.11.x. Create the venv inside FlaskWebApp/:

cd FlaskWebApp
python3.11 -m venv .venv && source .venv/bin/activate
python -m pip install --upgrade pip
pip install -r requirements.txt

VS Code import errors (Pylance):
- Cmd+Shift+P → Python: Select Interpreter → FlaskWebApp/.venv/bin/python
- Cmd+Shift+P → Python: Restart Language Server → (if needed) Developer: Reload Window
Model load error “bad marshal data (unknown type code)”:
- Likely wrong Python version/dependency mismatch. Recreate the venv with Python 3.11. The app first attempts to load the full H5 model; if that fails, it rebuilds the architecture in code and loads weights only.

Port 5000 in use:

lsof -ti :5000 | xargs kill -9
# or run on another port
python -c "from app import app; app.run(debug=True, use_reloader=False, port=5001)"

Conda conflicts:

conda deactivate || true
source FlaskWebApp/.venv/bin/activate

Same image appears repeatedly:
- The app saves with unique filenames and appends a cache-busting query param. If needed, hard-refresh the page (Cmd+Shift+R).

Notebooks

The notebooks/ folder contains the Jupyter notebooks used during experimentation and thesis work. They are not required to run the Flask app but are included for reference and reproducibility.

Academic Use

If you reference this application in your thesis or publications, consider citing your model and training methodology (e.g., EfficientNet variant + attention) and include a short description of preprocessing and evaluation protocol.

Introduction

Background

The rise of DeepFake technologies poses significant threats in media authenticity, misinformation, and digital identity misuse. Detecting and classifying DeepFake images is crucial for security, ethical AI use, and building public trust in digital content. This project explores deep learning techniques to accurately differentiate between real and manipulated (fake) images using various CNN architectures.

Aim of the Study

Design, implement, and evaluate deep learning-based models that classify images into Real or Fake with high accuracy, and interpret predictions using attention mechanisms and explainable AI.

Research Objectives

Preprocess and augment a balanced dataset of real and fake images.
Build and compare CNN-based models (CNN, VGG16, Xception, MobileNetV2, EfficientNetB7).
Introduce and evaluate a novel attention mechanism integrated with EfficientNetB7.
Assess with robust metrics and interpret with confusion matrices and visualizations.
Deploy the best model as a Flask web app for real-time predictions.

Research Question

Can attention-augmented deep learning models accurately and robustly classify DeepFake images compared to standard architectures, and how interpretable are these models in explaining the decision-making process?

Configuration of the System

Training used Kaggle Notebook with NVIDIA Tesla T4 or P100-PCIE-16GB GPU. Kaggle helps avoid RAM-related interruptions during long sessions.

Development Platform

Platform: Kaggle Notebook (GPU access, TensorFlow available, easy sharing/collaboration)

Hardware (GPU)

16 GB RAM
NVIDIA Tesla T4 or P100 GPU
~50 GB storage

Software and Libraries

Tool	Purpose
Python 3.11	Programming language
TensorFlow / Keras	Model building and training
OpenCV	Image loading and resizing
scikit-learn	Splitting and evaluation metrics
Matplotlib	Data visualization
Seaborn	Class distribution plots and heatmaps
Plotly	Interactive visualizations
LIME	Model explainability

Dataset Collection

The dataset comprises two classes: Real and Fake, stored in class-specific folders.

Dataset Source: https://www.kaggle.com/datasets/manjilkarki/deepfake-and-real-images
Original Source: https://zenodo.org/record/5528418

Data Summary

Image Size: 256×256 JPEG
Classes: Real (Authentic), Fake (Manipulated)
Structure: Two folders (Real/, Fake/)

Project Workflow

Library Installation & Import: install and import OpenCV, TensorFlow/Keras, scikit-learn, etc.

Dataset Loading: load and label images into NumPy arrays.

Preprocessing:
- Resize to 128×128
- Encode labels with LabelBinarizer
- Normalize pixel values to [0, 1]

Exploratory Data Analysis:
- Display sample images per class
- Plot class distributions
- Compute pixel statistics

Data Splitting:
- Train 80%, Validation 10%, Test 10%
- Stratified to keep class balance

Model Training (with EarlyStopping & ReduceLROnPlateau):
- CNN (custom baseline)
- VGG16 (ImageNet transfer learning)
- Xception (transfer learning)
- MobileNetV2 (lightweight)
- EfficientNetB7 (scalable)
- EfficientNetB7 + Attention (novelty)

Hyperparameter Tuning:
- Optimizers tried: Adam (best), SGD
- Epochs: up to 10 (early stopping)
- Batch sizes: 16, 32
- Data augmentation: tried; not beneficial in the final setting

Evaluation:
- Accuracy and loss curves
- Confusion matrix and classification report

Example classification report:

precision  recall  f1-score  support
0    0.96    0.97      0.97     1000
1    0.97    0.96      0.97     1000

accuracy 0.97 (N=2000)
macro avg 0.97 0.97 0.97 2000
weighted avg 0.97 0.97 0.97 2000

Conclusion

This repository implements a complete, reproducible pipeline for DeepFake image classification—covering dataset ingestion, preprocessing, stratified splitting (80/10/10), model training with EarlyStopping/ReduceLROnPlateau, and rigorous evaluation with interpretability. Across optimizers and backbones (custom CNN, VGG16, Xception, MobileNetV2, EfficientNetB7, and EfficientNetB7 + Attention), Adam consistently delivered the strongest performance (Accuracy ≈ 0.97; Precision/Recall ≈ 0.96–0.97; F1 ≈ 0.97). LIME and the custom attention module improved interpretability by highlighting discriminative facial regions. Future work: broader cross-domain validation (compression, occlusion, lighting), stronger augmentation/synthetic data, complementary explainability (Grad-CAM/Integrated Gradients), and deployment optimizations (pruning/quantization).

Screenshots

Place the following images under docs/images/ to render them here:

Türkçe Bölüm (Özet ve Kurulum)

Bu bölüm, uygulamanın Türkçe özeti ve hızlı kurulum talimatlarını içerir.

Hızlı Başlangıç

# 1) Sanal ortam oluşturun ve etkinleştirin
python3.11 -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate

# 2) Bağımlılıkları yükleyin
python -m pip install --upgrade pip
pip install -r requirements.txt

# 3) Model dosyalarını ekleyin (./model/ altına)
mkdir -p model
#  - best_model_effatt.h5
#  - label_transform.pkl

# 4) Uygulamayı çalıştırın
python app.py  # http://127.0.0.1:5000/

Ne Yapar?

Arayüzden (/service) ya da HTTP POST (/deepfake) ile görüntü yüklenir.
Dosya türü doğrulanır (PNG/JPG/JPEG) ve 128×128 boyutuna getirilir.
Dikkat (attention) bloğu içeren Keras modeli ile tahmin yapılır.
model/label_transform.pkl varsa etiket eşleme buradan yapılır; yoksa varsayılan sıra ["Fake", "Real"] kullanılır.
Yüklenen görseller static/images/uploadedimage/ altına benzersiz adlarla kaydedilir.

Depo Yapısı

FlaskWebApp/
  app.py
  requirements.txt
  model/
    best_model_effatt.h5
    label_transform.pkl
  notebooks/
  static/images/uploadedimage/
  templates/

Çıkarım Ayrıntıları

OpenCV okuma → 128×128 yeniden boyutlandırma → float32 dizi
Model H5 formatında yüklenir; custom_objects içinde:
- attention_block(features, depth)
- RescaleGAP([gap_feat, gap_mask]) = gap_feat / gap_mask

Sorun Giderme (Kısa)

Python sürümü 3.11 olmalı. Venv’i bu sürümle oluşturun ve requirements.txt kurun.
VS Code import hataları: Cmd+Shift+P → Python: Select Interpreter → FlaskWebApp/.venv/bin/python, ardından Python: Restart Language Server.
"bad marshal data" hatası: Yanlış Python/bağımlılık. Venv’i 3.11 ile yeniden kurun. Uygulama, tam modeli yükleyemezse mimariyi koddan kurup sadece ağırlıkları yükler.
5000 portu dolu: lsof -ti :5000 | xargs kill -9 ya da farklı portta çalıştırın.
Aynı görsel sorunu: Uygulama benzersiz adlarla kaydediyor; yine de gerekirse sert yenileme (Cmd+Shift+R).

Giriş

Arka Plan

DeepFake teknolojilerinin yükselişi; medya doğruluğu, yanlış bilgilendirme ve dijital kimlik kötüye kullanımı açısından ciddi riskler barındırıyor. Gerçek ve sahte (manipüle) görüntülerin ayrıştırılması; güvenlik, etik yapay zekâ ve dijital içerikte toplumsal güven için kritik önemdedir. Bu proje, farklı CNN mimarileri ile gerçek ve sahte görüntüleri yüksek doğrulukla ayırt etmeyi araştırır.

Çalışmanın Amacı

Görüntüleri Real/Fake olarak yüksek doğrulukla sınıflandıran derin öğrenme tabanlı modelleri tasarlamak, uygulamak ve değerlendirmek; tahminleri dikkat (attention) mekanizmaları ve açıklanabilir yapay zekâ ile yorumlamak.

Araştırma Hedefleri

Dengeli (real/fake) veri kümesinin ön işlenmesi ve artırılması.
CNN, VGG16, Xception, MobileNetV2, EfficientNetB7 modellerinin karşılaştırılması.
EfficientNetB7 ile entegre yeni bir attention mekanizmasının tanıtımı ve değerlendirilmesi.
Sağlam metriklerle değerlendirme; karışıklık matrisi ve görselleştirmelerle yorumlama.
En iyi modelin gerçek zamanlı tahmin için Flask web uygulaması olarak yaygınlaştırılması.

Araştırma Sorusu

Attention ile zenginleştirilmiş derin öğrenme modelleri, standart mimarilere kıyasla DeepFake görüntülerini ne kadar doğru ve sağlam sınıflandırabilir? Karar verme süreçleri ne kadar açıklanabilir?

Sistem Konfigürasyonu

Eğitim için Kaggle Notebook üzerinde NVIDIA Tesla T4 veya P100-PCIE-16GB GPU kullanıldı. Kaggle, RAM yetersizliğinden doğan oturum kesintilerini minimize eder.

Geliştirme Ortamı

Platform: Kaggle Notebook (GPU erişimi, TensorFlow kurulu, paylaşım kolaylığı)

Donanım (GPU)

16 GB RAM
NVIDIA Tesla T4 veya P100 GPU
~50 GB depolama

Yazılım ve Kütüphaneler

Araç	Amaç
Python 3.11	Programlama dili
TensorFlow / Keras	Model kurma ve eğitim
OpenCV	Görüntü okuma ve yeniden boyutlandırma
scikit-learn	Veri bölme ve değerlendirme metrikleri
Matplotlib	Veri görselleştirme
Seaborn	Sınıf dağılımı ve ısı haritaları
Plotly	Etkileşimli görselleştirmeler
LIME	Model açıklanabilirliği

Veri Kümesi

Veri kümesi, sınıf klasörleri halinde saklanan iki sınıftan oluşur: Real ve Fake.

Veri Kaynağı: https://www.kaggle.com/datasets/manjilkarki/deepfake-and-real-images
Orijinal Kaynak: https://zenodo.org/record/5528418

Özet

Görsel boyutu: 256×256 JPEG
Sınıflar: Real (Gerçek), Fake (Manipüle)
Yapı: İki klasör (Real/, Fake/)

Proje Akışı

Kütüphanelerin kurulumu ve içe aktarımı (OpenCV, TensorFlow/Keras, scikit-learn, vb.)
Veri yükleme: Görsellerin etiketlenip NumPy dizilerine alınması
Ön işleme:
- 128×128 yeniden boyutlandırma
- LabelBinarizer ile etiket kodlama
- Piksel değerlerinin [0, 1] aralığına ölçeklenmesi
Keşifsel Veri Analizi (EDA):
- Her sınıftan örnek görsellerin gösterimi
- Sınıf dağılımlarının grafikleri
- Piksel istatistikleri
Veri Bölme:
- Eğitim %80, Doğrulama %10, Test %10
- Sınıf dengesini korumak için katmanlı (stratified) bölme
Model Eğitimi (EarlyStopping & ReduceLROnPlateau ile):
- CNN (özel temel model)
- VGG16 (ImageNet transfer öğrenme)
- Xception (transfer öğrenme)
- MobileNetV2 (hafif model)
- EfficientNetB7 (ölçeklenebilir)
- EfficientNetB7 + Attention (yenilik)
Hiperparametre Ayarı:
- Optimizasyon: Adam (en iyi), SGD
- Epok: en fazla 10 (erken durdurma)
- Yığın boyutu: 16, 32
- Veri artırma: denendi; nihai ayarlarda fayda sağlamadı
Değerlendirme:
- Doğruluk ve kayıp eğrileri
- Karışıklık matrisi ve sınıflandırma raporu

Örnek sınıflandırma raporu:

precision  recall  f1-score  support
0    0.96    0.97      0.97     1000
1    0.97    0.96      0.97     1000

accuracy 0.97 (N=2000)
macro avg 0.97 0.97 0.97 2000
weighted avg 0.97 0.97 0.97 2000

Sonuç

Bu depo; veri alma, ön işleme, katmanlı bölme (80/10/10), EarlyStopping/ReduceLROnPlateau ile eğitim ve açıklanabilirlikle kapsamlı değerlendirmeyi içeren uçtan uca, çoğaltılabilir bir DeepFake sınıflandırma hattı sunar. Denenen omurgalar (özel CNN, VGG16, Xception, MobileNetV2, EfficientNetB7 ve EfficientNetB7 + Attention) ve optimizasyonlar arasında Adam en iyi performansı sağlamıştır (Doğruluk ≈ 0.97; Duyarlılık/Geri Çağırma ≈ 0.96–0.97; F1 ≈ 0.97). LIME ve özel attention modülü, ayrımcı yüz bölgelerini vurgulayarak yorumlanabilirliği artırmıştır. Gelecek işler: alanlar arası genelleme (sıkıştırma, örtülme, ışık değişimi), daha güçlü artırma/sentetik veri, ek açıklanabilirlik yöntemleri (Grad-CAM/Integrated Gradients) ve kaynak kısıtlı ortamlar için prunning/quantization.

Ekran Görüntüleri

docs/images/ altına aşağıdaki dosyaları eklediğinizde bu bölümde görüntülenecektir:

References

Agrawal, D.R., Haneef, F., 2025. Eye Blinking Feature Processing Using Convolutional Generative Adversarial Network for Deep Fake Video Detection. Trans. Emerg. Telecommun. Technol. 36, e70083. https://doi.org/10.1002/ett.70083
Al-Khazraji, S.H., Saleh, H.H., Khalid, A.I., Mishkhal, I.A., 2023. Impact of Deepfake Technology on Social Media: Detection, Misinformation and Societal Implications. Eurasia Proc. Sci. Technol. Eng. Math. 23, 429–441. https://doi.org/10.55549/epstem.1371792
Borji, A., 2023. Qualitative Failures of Image Generation Models and Their Application in Detecting Deepfakes. https://doi.org/10.48550/ARXIV.2304.06470
Goceri, E., 2023. Medical image data augmentation: techniques, comparisons and interpretations. Artif. Intell. Rev. 56, 12561–12605. https://doi.org/10.1007/s10462-023-10453-z
Heidari, A., Jafari Navimipour, N., Dag, H., Unal, M., 2024. Deepfake detection using deep learning methods: A systematic and comprehensive review. WIREs Data Min. Knowl. Discov. 14, e1520. https://doi.org/10.1002/widm.1520
Hernandez Aros, L., Bustamante Molano, L.X., Gutierrez Portela, F., Moreno Hernández, J.J., Rodríguez Barrero, M.S., 2023. Detection of financial fraud by applying ML techniques a RSL. https://doi.org/10.7910/DVN/CM8NVY
Jung, T., Kim, S., Kim, K., 2020. DeepVision: Deepfakes Detection Using Human Eye Blinking Pattern. IEEE Access 8, 83144–83154. https://doi.org/10.1109/ACCESS.2020.2988660
Lal, K., Saini, M.L., 2023. A study on deep fake identification techniques using deep learning. Presented at the RECENT ADVANCES IN SCIENCES, ENGINEERING, INFORMATION TECHNOLOGY & MANAGEMENT, Jaipur, India, p. 020155. https://doi.org/10.1063/5.0154828
Malik, A., Kuribayashi, M., Abdullahi, S.M., Khan, A.N., 2022. DeepFake Detection for Human Face Images and Videos: A Survey. IEEE Access 10, 18757–18775. https://doi.org/10.1109/ACCESS.2022.3151186
Masood, M., Nawaz, M., Malik, K.M., Javed, A., Irtaza, A., 2021. Deepfakes Generation and Detection: State-of-the-art, open challenges, countermeasures, and way forward. https://doi.org/10.48550/ARXIV.2103.00484
Mujawar, S.M., -, H.M.K., -, Y.S.P., -, K.S.K., -, A.P.T., 2025. Exploring AI/ML Techniques for Deepfake Detection: A Comprehensive Review. Int. J. Sci. Technol. 16, 3843. https://doi.org/10.71097/IJSAT.v16.i2.3843
Pishori, A., Rollins, B., Houten, N. van, Chatwani, N., Uraimov, O., 2020. Detecting Deepfake Videos: An Analysis of Three Techniques. https://doi.org/10.48550/arXiv.2007.08517
Rana, M.S., Nobi, M.N., Murali, B., Sung, A.H., 2022. Deepfake Detection: A Systematic Literature Review. IEEE Access 10, 25494–25513. https://doi.org/10.1109/ACCESS.2022.3154404
Stankov, I.S., Dulgerov, E.E., 2024. Detection of Deepfake Images and Videos Using SVM, CNN, and Hybrid Approaches, in: 2024 XXXIII International Scientific Conference Electronics (ET). Presented at the 2024 XXXIII International Scientific Conference Electronics (ET), IEEE, Sozopol, Bulgaria, pp. 1–5. https://doi.org/10.1109/ET63133.2024.10721497
Stroebel, L., Llewellyn, M., Hartley, T., Ip, T.S., Ahmed, M., 2023. A systematic literature review on the effectiveness of deepfake detection techniques. J. Cyber Secur. Technol. 7, 83–113. https://doi.org/10.1080/23742917.2023.2192888

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
docs/images		docs/images
notebooks		notebooks
static		static
templates		templates
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

CemRoot/Master-Uni-Project

Folders and files

Latest commit

History

Repository files navigation