Open
Description
Feature request
To support https://github.com/openadaptai/SoM we need to implement a client.py using the Gradio Python client (https://www.gradio.app/docs/client). See:
- client.py: https://github.com/microsoft/SoM/pull/19/files#diff-1ebfaf6cb3592166b73835fa82333cb7109e7c624865c0039a7b22ff34aa27fa
- deploy.py: https://github.com/microsoft/SoM/pull/19/files#diff-5c9ed18af9a5f902219d12c3044ccb193c2c304a3748d02702889c2ca5703978
Motivation
https://github.com/openadaptai/SoM is state-of-the-art for visual understanding, but it only runs on Linux with CUDA.
Refer to system diagram:

Inference (SoM/SAM) must be done remotely.
We wish to implement:
- openadapt/adapters/som/client.py: a modified version of client.py from microsoft/SoM#19 ("Add Dockerfile and client.py; deploy to EC2 on AWS via Github Actions"), to support getting marked screenshots during analysis (visualization) and replay
- openadapt/adapters/som/server, which can be a git submodule containing https://github.com/OpenAdaptAI/SoM/
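
As a starting point, here is a rough sketch of what openadapt/adapters/som/client.py could look like. It is not the client.py from microsoft/SoM#19. The `SOM_SERVER_URL` environment variable, the default URL, the `/inference` endpoint name, and the function names are all placeholders I made up for illustration; the real endpoint name and input list depend on how the SoM Gradio demo is defined. The only Gradio calls used are `Client(url)` and `Client.predict(..., api_name=...)` from the `gradio_client` package.

```python
"""Hypothetical sketch of openadapt/adapters/som/client.py (all names are assumptions)."""

import os
from typing import Any, Optional

# Assumption: placeholder address; the deployed SoM server URL will differ.
DEFAULT_SERVER_URL = "http://localhost:6092"


def get_server_url() -> str:
    """Return the SoM server URL from the environment, without a trailing slash."""
    return os.environ.get("SOM_SERVER_URL", DEFAULT_SERVER_URL).rstrip("/")


def get_marked_screenshot(
    image_path: str,
    *extra_inputs: Any,
    api_name: str = "/inference",  # assumption: placeholder endpoint name
    client: Optional[Any] = None,
) -> Any:
    """Send a screenshot to the remote SoM server and return whatever it responds with.

    `extra_inputs` are forwarded unchanged, since the exact inputs the SoM demo
    expects are not specified here. Depending on the installed gradio_client
    version, file inputs may need to be wrapped before being passed to predict().
    """
    if client is None:
        # Imported lazily so the module can be loaded without gradio_client installed.
        from gradio_client import Client  # pip install gradio_client

        client = Client(get_server_url())
    return client.predict(image_path, *extra_inputs, api_name=api_name)
```

The optional `client` argument is there so analysis (visualization) and replay code can reuse one connection, and so the function can be exercised locally with a stand-in object when no Linux / CUDA server is available.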