ytaek-oh

🤗

Youngtaek Oh ytaek-oh

🤗

PhD student at KAIST. Working on Vision and Language, Multimodality, Compositionality.

30 followers · 62 following

KAIST
Daejeon, South Korea
11:58 (UTC +09:00)
https://ytaek-oh.github.io
@ytaek_oh
in/young-taek-oh
https://huggingface.co/ytaek-oh

Achievements

Highlights

Stars

TencentARC / DiTCtrl

Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"

Python 123 1 Updated Dec 25, 2024

SpatialVision / Orient-Anything

Python 81 Updated Dec 25, 2024

fallenshock / FlowEdit

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 261 11 Updated Dec 18, 2024

dgist-cvlab / Flow4D

Forked from KTH-RPL/DeFlow

Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation

Python 12 Updated Jul 12, 2024

sudo-Boris / mr-Blip

Official Implementation of "The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval"

Python 69 1 Updated Dec 19, 2024

AtsuMiyai / Awesome-OOD-VLM

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey [Miyai+, arXiv2024]

75 3 Updated Aug 1, 2024

konanaif / MAF-DEMO

CSS 38 13 Updated Dec 27, 2024

konanaif / MAF

MSIT AI Fair(MAF)

Python 38 13 Updated Dec 12, 2024

jiwoohong93 / ai_dep

AI Development in Evolving Policy [AI DEP]

Python 46 21 Updated Dec 26, 2024

taewhankim / VIPCAP

3 Updated Dec 20, 2024

xfactlab / I0T

Python 2 Updated Dec 11, 2024

gritYCDA / jepa_action

Python 1 Updated Dec 19, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 19,947 1,503 Updated Dec 27, 2024

vision-x-nyu / thinking-in-space

Official repo and evaluation implementation of VSI-Bench

Python 234 11 Updated Dec 20, 2024

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 39,245 4,423 Updated Dec 28, 2024

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python 28,428 1,113 Updated Dec 21, 2024

Mehrdad-Noori / WATT

[NeurIPS 2024] WATT: Weight Average Test-Time Adaption of CLIP

Python 35 2 Updated Sep 26, 2024

necla-ml / SNLI-VE

Dataset and starting code for visual entailment dataset

Python 109 7 Updated Apr 21, 2022

google-deepmind / geckonum_benchmark_t2i

GeckoNum Benchmark for T2I Model Eval.

11 1 Updated Dec 5, 2024

MismatchQuest / MismatchQuest

Python 3 1 Updated Mar 7, 2024

microsoft / VISOR

HTML 44 5 Updated Oct 27, 2023

j-min / DallEval

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

Jupyter Notebook 138 6 Updated Nov 27, 2023

aszala / VPEval

VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)

Python 44 4 Updated Nov 29, 2023

google-deepmind / gecko_benchmark_t2i

7 1 Updated Jun 13, 2024

Lightning-AI / forked-pdb

Python pdb for multiple processes

Python 36 6 Updated Nov 5, 2022

dlsrbgg33 / video_kmax

Python 2 Updated Dec 18, 2024

Ziyang412 / VideoTree

Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"

Python 89 3 Updated Aug 6, 2024

illuin-tech / vidore-benchmark

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

Python 157 16 Updated Dec 17, 2024

huang-yh / Owl

32 Updated Dec 13, 2024

OpenGVLab / V2PE

[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Python 10 1 Updated Dec 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Youngtaek Oh ytaek-oh

Achievements

Achievements

Highlights

Block or report ytaek-oh

Stars

TencentARC / DiTCtrl

SpatialVision / Orient-Anything

fallenshock / FlowEdit

dgist-cvlab / Flow4D

sudo-Boris / mr-Blip

AtsuMiyai / Awesome-OOD-VLM

konanaif / MAF-DEMO

konanaif / MAF

jiwoohong93 / ai_dep

taewhankim / VIPCAP

xfactlab / I0T

gritYCDA / jepa_action

Genesis-Embodied-AI / Genesis

vision-x-nyu / thinking-in-space

All-Hands-AI / OpenHands

microsoft / markitdown

Mehrdad-Noori / WATT

necla-ml / SNLI-VE

google-deepmind / geckonum_benchmark_t2i

MismatchQuest / MismatchQuest

microsoft / VISOR

j-min / DallEval

aszala / VPEval

google-deepmind / gecko_benchmark_t2i

Lightning-AI / forked-pdb

dlsrbgg33 / video_kmax

Ziyang412 / VideoTree

illuin-tech / vidore-benchmark

huang-yh / Owl

OpenGVLab / V2PE