GeoRanker: Distance-Aware Ranking for Worldwide Image Geolocalization

This is the code, checkpoint, and dataset repository for GeoRanker: Distance-Aware Ranking for Worldwide Image Geolocalization

Environment

conda create -n georanker python=3.11
conda activate georanker

# Addtional modules
pip install git+https://github.com/huggingface/transformers accelerate
pip install qwen-vl-utils[decord]==0.0.8
pip install pandas geopy
pip install flash-attn --no-build-isolation
pip install scikit-learn deepspeed datasets peft torchvision wandb
conda install mpi4py

❗❗❗We also uploaded the YAML file for our environment, but the Transformers version used during our experiments was 4.52.0.dev0. You may want to set it to a newer official version when running this repository.

Quick Start

Run with your images (calculating rewards between a query and some candidates)

please first modify the image paths, candidate_gps_lis, gt_lat, and gt_lon in quick_start.py file, then run python quick_start.py to check the rewards and prediction.

Run with sampled im2gps3k data

You will need to first download the mp16-pro tar file and the tar_index.pkl file from Hugging Face. Additionally, please download the IM2GPS3K image dataset as described in the Dataset section. Then, modify the relevant paths in quick_start_im2gps3k.py and run python quick_start_im2gps3k.py.

Dataset

Evaluation Datasets

IM2GPS3K: images and metadata; YFCC4K: images and metadata; MP16-Pro: Huggingface

You can also find the meta data for IM2GPS3K, YFCC4K, retrieval checkpoints of G3, retrieval index in Huggingface

GeoRanking Dataset

We have uploaded the dataset to dataset/georanking

dataset = load_dataset("parquet", data_files="path_to_file", split="train")

>>> dataset
Dataset({
    features: ['img_id', 'gps', 'ref_gps', 'ref_img_id', 'ref_texts'],
    num_rows: 100000
})

img_id: ID of query image in MP16-Pro dataset
gps: gps of query image
ref_gps: gps list for candidates
ref_img_id: image id list for candidates
ref_texts: textual descriptions list for candidates

Checkpoints

The lora weights are put under checkpoints/.

File Structure

.
├── checkpoints/
│   ├── adapter_config.json
│   └── adapter_model.safetensors
├── dataset/
│   ├── im2gps3k/
│   │   ├── im2gps3k.csv
│   │   ├── im2gps3k_metadata_and_images_should_be_put_here
│   │   └── I.npy -> retrieval index results for im2gps3k
│   ├── mp16-pro/
│   │   └── mp16-pro_metadata_and_images_and_should_be_put_here
│   └── yfcc4k/
│       ├── yfcc4k.csv
│       ├── yfcc4k_metadata_and_images_should_be_put_here
│       └── I.npy -> retrieval index results for yfcc4k
├── deepspeed_config/
│   └── zero2.json
├── utils/
│   └── geo_ranker.py -> main file for georanker
├── compile_prediction_candidates.py -> compile retrieval and generated candidates to one file
├── evaluate.py
├── finetune_geo_ranker.py -> script for training georanker
├── environment.yml
└── lvlm_zs_predict.py -> script for generating candidates with lvlm

For MP16-Pro dataset, please refer to G3.

Running

Training GeoRanker

CUDA_VISIBLE_DEVICES=0,1,2,3 deepspeed --num_gpus 4 finetune_geo_ranker.py --model_path=Qwen/Qwen2-VL-7B-Instruct --model_save_path=xxx --group_size=7

Generating candidates with LVLM

python lvlm_zs_predict.py --api_key=sk-xxx --model_name=xxx --base_url=xxx --root_path=xxx/dataset/yfcc4k

Compiling generated and retrieval candidates to one file (we have uploaded the retrieval candidates and generated candidates for IM2GPS3K and YFCC4K under dataset folder). We have uploaded the index file I.npy for IM2GPS3K and YFCC4K.
```
python compile_prediction_candidates.py
```

Evaluation

# we recommend using larger batch size during inference
python evaluate.py --model_path=path_to_lora --dataset=im2gps3k --topn=12 --topn_zs=3 --batch_size=16

Citation

If you find our work interesting or helpful, we would really appreciate it if you could give us a star.

@article{jia2025georanker,
  title={GeoRanker: Distance-Aware Ranking for Worldwide Image Geolocalization},
  author={Jia, Pengyue and Park, Seongheon and Gao, Song and Zhao, Xiangyu and Li, Yixuan},
  journal={arXiv preprint arXiv:2505.13731},
  year={2025}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

GeoRanker: Distance-Aware Ranking for Worldwide Image Geolocalization

Environment

Quick Start

Run with your images (calculating rewards between a query and some candidates)

Run with sampled im2gps3k data

Dataset

Evaluation Datasets

GeoRanking Dataset

Checkpoints

File Structure

Running

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
checkpoints		checkpoints
dataset		dataset
deepspeed_config		deepspeed_config
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
compile_prediction_candidates.py		compile_prediction_candidates.py
environment.yml		environment.yml
evaluate.py		evaluate.py
finetune_geo_ranker.py		finetune_geo_ranker.py
lvlm_zs_predict.py		lvlm_zs_predict.py
quick_start.py		quick_start.py
quick_start_im2gps3k.py		quick_start_im2gps3k.py

License

Applied-Machine-Learning-Lab/GeoRanker

Folders and files

Latest commit

History

Repository files navigation

GeoRanker: Distance-Aware Ranking for Worldwide Image Geolocalization

Environment

Quick Start

Run with your images (calculating rewards between a query and some candidates)

Run with sampled im2gps3k data

Dataset

Evaluation Datasets

GeoRanking Dataset

Checkpoints

File Structure

Running

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages