Skip to content

Commit

Permalink
Merge branch 'main' of github.com:fkryan/gazelle
Browse files Browse the repository at this point in the history
  • Loading branch information
fkryan committed Dec 12, 2024
2 parents eca8032 + 4cb1518 commit dca34e0
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,6 +57,10 @@ model, transform = torch.hub.load('fkryan/gazelle', 'gazelle_dinov2_vitl14_inout


## Usage
### Colab Demo Notebook
Check out our [Demo Notebook](https://colab.research.google.com/drive/1TSoyFvNs1-au9kjOZN_fo5ebdzngSPDq?usp=sharing) on Google Colab for how to detect gaze for all people in an image.

### Gaze Prediction
Gaze-LLE is set up for multi-person inference (e.g. for a single image, GazeLLE encodes the scene only once and then uses the features to predict the gaze of multiple people in the image). The input is a batch of image tensors and a list of bounding boxes for each image representing the heads of the people to predict gaze for in each image. The bounding boxes are tuples of form ```(xmin, ymin, xmax, ymax)``` and are in ```[0,1]``` normalized image coordinates. Below we show how to perform inference for a single person in a single image.
```
from PIL import Image
Expand Down

0 comments on commit dca34e0

Please sign in to comment.