This project is a multi-model image prediction tool that tags objects in a photo using datasets like COCO. It lets users upload or link an image and returns predicted labels in a simple, user-friendly interface.

phyulwin/ML-Image-Prediction-Tagging

Image Prediction & Tagging

Machine Learning Project

Author: Kelly Lwin
Course: CS 4200 Artificial Intelligence, CPP Spring 2025
Assignment: Course Project
Last Updated: April 2025


Project screenshot

Demo Video

Language

  • Python

Requirements

Install dependencies with:

pip install tensorflow numpy matplotlib nltk opencv-python fiftyone pillow
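After installing, it can help to confirm that each dependency is importable before opening the notebooks. The snippet below is a minimal sketch and not part of the project itself: the `REQUIRED` mapping and the `missing_packages` helper are hypothetical names introduced for illustration. It relies on the fact that two of the pip packages are imported under different module names (`opencv-python` as `cv2`, `pillow` as `PIL`).

```python
from importlib.util import find_spec

# pip distribution name -> importable top-level module name.
# Hypothetical mapping built from the install command above.
REQUIRED = {
    "tensorflow": "tensorflow",
    "numpy": "numpy",
    "matplotlib": "matplotlib",
    "nltk": "nltk",
    "opencv-python": "cv2",
    "fiftyone": "fiftyone",
    "pillow": "PIL",
}


def missing_packages(required):
    """Return the pip names whose module cannot be located in this environment."""
    # find_spec only locates a top-level module; it does not import it,
    # so the check stays fast even for heavy packages such as tensorflow.
    return [pip_name for pip_name, module in required.items()
            if find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages: pip install " + " ".join(missing))
    else:
        print("All listed dependencies were found.")
```

The check only confirms that each module can be located; it does not verify versions such as the TensorFlow release listed under Tools Used.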

Dataset References & Citations

Common Objects in Context (COCO), 80 Categories

A large-scale object detection, segmentation, and captioning dataset licensed under CC BY 4.0.
COCO Dataset

Food-101

Bossard, L., Guillaumin, M., & Van Gool, L. (2014). Food-101 – Mining Discriminative Components with Random Forests. European Conference on Computer Vision.

@inproceedings{bossard14,
  title     = {Food-101 -- Mining Discriminative Components with Random Forests},
  author    = {Bossard, Lukas and Guillaumin, Matthieu and Van Gool, Luc},
  booktitle = {European Conference on Computer Vision},
  year      = {2014}
}

ImageNet-R (200 Classes)

Hendrycks, D., Basart, S., Mu, N., Kadavath, S., Wang, F., Dorundo, E., Desai, R., Zhu, T., Parajuli, S., Guo, M., Song, D., Steinhardt, J., & Gilmer, J. (2021).
The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization. ICCV.

@article{hendrycks2021many,
  title   = {The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization},
  author  = {Hendrycks, Dan and Basart, Steven and Mu, Norman and Kadavath, Saurav and Wang, Frank and Dorundo, Evan and Desai, Rahul and Zhu, Tyler and Parajuli, Samyak and Guo, Mike and Song, Dawn and Steinhardt, Jacob and Gilmer, Justin},
  journal = {ICCV},
  year    = {2021}
}

Hugging Face Furniture-Dataset (7 Classes)

Arkan0ID/furniture-dataset


Tools Used

  • Programming Language: Python 3.12.10
  • IDE/Notebook: Jupyter Notebook, VS Code
  • Framework: TensorFlow 2.19.0
