The awesome tutorial of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching will be constantly updated for Preliminary Insight !
【2024.07.11】 Update 50+ papers; Due to limited time, LMMM section will be expanded later.
【2024.03.09】 A new section named [Large Multi-Modality Model] has been added.
【2023.05.25】 A new section named [Parameter-Efficient Finetuning] has been added.
【2021.07.10】 A new section named [Vision-Language Pretraining] has been added.
【2020.11.01】 A new section named [Conventional Image-Text Matching] has been added.
- Large Multi-Modality Model
- Parameter-Efficient Finetuning
- Vision-Language Pretraining
- Conventional Image-Text Matching
- Generic-Feature Extraction
- Cross-Modal Interaction
- Similarity Measurement
- Uncertainty Learning
- Noisy Correspondence
- Commonsense Learning
- Adversarial Learning
- Loss Function
- Un-/Semi-Supervised
- Zero-/Fewer-Shot
- Continual Learning
- Identification Learning
- Video-Text Learning
- Scene-Text Learning
- Related Works
- Posted in
- Peformance
- Other Resources
MIT license. If any questions, please contact me at r1228240468@gmail.com.