Merge branch 'main' of github.com:fkryan/gazelle

fkryan · Dec 12, 2024 · dca34e0 · dca34e0
2 parents eca8032 + 4cb1518
commit dca34e0
Showing 1 changed file with 4 additions and 0 deletions.
diff --git a/README.md b/README.md
@@ -57,6 +57,10 @@ model, transform = torch.hub.load('fkryan/gazelle', 'gazelle_dinov2_vitl14_inout
 
 
 ## Usage
+### Colab Demo Notebook
+Check out our [Demo Notebook](https://colab.research.google.com/drive/1TSoyFvNs1-au9kjOZN_fo5ebdzngSPDq?usp=sharing) on Google Colab for how to detect gaze for all people in an image.
+
+### Gaze Prediction
 Gaze-LLE is set up for multi-person inference (e.g. for a single image, GazeLLE encodes the scene only once and then uses the features to predict the gaze of multiple people in the image). The input is a batch of image tensors and a list of bounding boxes for each image representing the heads of the people to predict gaze for in each image. The bounding boxes are tuples of form ```(xmin, ymin, xmax, ymax)``` and are in ```[0,1]``` normalized image coordinates. Below we show how to perform inference for a single person in a single image.
 ```
 from PIL import Image