-
@ServiceNow Research
- London, UK
-
04:31
(UTC +01:00)
-
PipelineRL Public
Forked from ServiceNow/PipelineRLA scalable asynchronous reinforcement learning implementation with in-flight weight updates.
Python Apache License 2.0 UpdatedMay 8, 2025 -
nano-aha-moment Public
Forked from McGill-NLP/nano-aha-momentSingle GPU, From Scratch (No RL Library), Efficient, Full Parameter Tuning Implementation of DeepSeek R1-Zero style training.
Jupyter Notebook UpdatedApr 3, 2025 -
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedFeb 6, 2025 -
AI-Researcher Public
Forked from TheBlewish/Automated-AI-Web-Researcher-OllamaAI researcher - with a single query determine focus areas to investigate, searching the web and scraping content from relevant websites to do research autonomously.
Python MIT License UpdatedNov 22, 2024 -
-
fastapi_text_sum Public
Text Summarisation using FastAPI
-
-
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Jupyter Notebook MIT License UpdatedJan 19, 2023 -
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webuiStable Diffusion web UI
Python UpdatedOct 26, 2022 -
-
-
thompson Public
Forked from andrecianflone/thompsonThompson Sampling Tutorial
Jupyter Notebook UpdatedMay 11, 2022 -
Kalman-and-Bayesian-Filters-in-Python Public
Forked from rlabbe/Kalman-and-Bayesian-Filters-in-PythonKalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filte…
Jupyter Notebook Other UpdatedApr 15, 2022 -
google-research Public
Forked from google-research/google-researchGoogle Research
Jupyter Notebook Apache License 2.0 UpdatedApr 9, 2022 -
d4rl Public
Forked from Farama-Foundation/D4RLA benchmark for offline reinforcement learning.
Python Apache License 2.0 UpdatedMar 25, 2022 -
-
fastapi-prophet Public
Stock Market predictions with Prophet and FastAPI
-
-
finetuner Public
Forked from jina-ai/finetunerFinetuning any DNN for better embedding on neural search tasks
Python Apache License 2.0 UpdatedDec 22, 2021 -
-
custom-py-docker Public
Dockerfile with custom python indtall
Dockerfile MIT License UpdatedNov 28, 2021 -
mjrl Public
Forked from aravindr93/mjrlReinforcement learning algorithms for MuJoCo tasks
Python Apache License 2.0 UpdatedNov 8, 2021 -
backend-microservices Public
Microservices using RabbitMQ as message broker
-
-
summariser_client Public
A multi-platform client to consume the Summariser API
Dart Apache License 2.0 UpdatedOct 11, 2021 -
-
-
blog Public
Forked from phiresky/blogSource code of my personal blog
TypeScript Other UpdatedMay 2, 2021 -
pytorch-a2c-ppo-acktr-gail Public
Forked from ikostrikov/pytorch-a2c-ppo-acktr-gailPyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
Python MIT License UpdatedApr 15, 2021