-
Georgia Institute of Technology
- Atlanta, GA
- akshay-krishnan.github.io
Highlights
- Pro
Stars
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
Official inference repo for FLUX.1 models
An open-source impl. of Large Reconstruction Models
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Minimal solvers for calibrated camera pose estimation
[CVPR 2023] "Revisiting Rotation Averaging: Uncertainties and Robust Losses" by Ganlin Zhang, Viktor Larsson and Daniel Barath
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
COCO 2018 Panoptic Segmentation Task API (Beta version)
Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Official code release for VoxGRAF: Fast 3D-Aware Image Synthesis with Sparse Voxel Grids
A Unified Framework for Surface Reconstruction
A collection of various Sky Model implementations in OpenGL suitable for real-time rendering.
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
Curated list of publicly accessible machine learning engineering courses from CalTech, Columbia, Berkeley, MIT, and Stanford.
Open3D: A Modern Library for 3D Data Processing
Lightning fast C++/CUDA neural network framework
3d without friction (Torch, TF, Jax, Numpy)
A collaboration friendly studio for NeRFs
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization (Line Transformer)
End-to-end SFM framework based on GTSAM
GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse m…
code for solving global structure from motion problem in the ECCV'14 paper "Robust Global Translations with 1DSfM"
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
Pytorch code for ICLR-20 Paper "Learning to Explore using Active Neural SLAM"