Skip to content
View pkuyym's full-sized avatar

Block or report pkuyym

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,182 43 Updated Jan 17, 2025

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Python 3,043 236 Updated Apr 14, 2024

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,420 382 Updated Jul 16, 2023

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,486 522 Updated Oct 16, 2024

Session-based Recommendation

Python 60 12 Updated Jun 20, 2024
Python 14 2 Updated Mar 29, 2022

[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark

Python 223 32 Updated Feb 13, 2024

TorchKGE: Knowledge Graph embedding in Python and PyTorch.

Python 382 41 Updated Apr 11, 2024

a pytorch lib with state-of-the-art architectures, pretrained models and real-time updated results

Python 862 123 Updated Dec 8, 2020

Multi-Task Deep Neural Networks for Natural Language Understanding

Python 2,244 412 Updated Mar 7, 2024

ACL 2020: A Re-evaluation of Knowledge Graph Completion Methods

Python 146 24 Updated May 22, 2023

Must-read papers on graph neural networks (GNN)

16,190 3,009 Updated Dec 20, 2023

The new Windows Terminal and the original Windows console host, all in the same place!

C++ 96,552 8,416 Updated Jan 23, 2025

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Python 14,103 1,819 Updated Jul 3, 2024

Ongoing research training transformer models at scale

Python 11,170 2,494 Updated Jan 23, 2025

Resource scheduling and cluster management for AI

JavaScript 2,646 549 Updated Jun 6, 2024

The ultimate vim distribution

Vim Script 15,559 3,616 Updated Nov 4, 2023

NumPy & SciPy for GPU

Python 9,708 872 Updated Jan 23, 2025

pytorch implementation of Attention is all you need

Python 239 56 Updated Jun 16, 2021

Transformer of "Attention Is All You Need" (Vaswani et al. 2017) by Chainer.

Jupyter Notebook 315 70 Updated Oct 3, 2017

Embedded and mobile deep learning research resources

744 166 Updated Mar 14, 2023

Visual Studio Code

TypeScript 166,513 30,156 Updated Jan 23, 2025

Elastic Deep Learning for deep learning framework on Kubernetes

Python 171 51 Updated Jul 5, 2023

Macro Continuous Evaluation Platform for Paddle.

Python 19 14 Updated Mar 11, 2020

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,804 2,241 Updated Jan 8, 2025

Caffe models in TensorFlow

Python 2,796 1,030 Updated Jul 18, 2019

Deep Learning Visualization Toolkit(『飞桨』深度学习可视化工具 )

HTML 4,806 631 Updated Jan 22, 2025

A TensorFlow Implementation of the Transformer: Attention Is All You Need

Python 4,315 1,303 Updated May 21, 2023

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,987 1,993 Updated Apr 16, 2024

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,754 3,528 Updated Jun 2, 2023
Next