The official GitHub page for the survey paper "A Survey of Large Language Models".
-
Updated
Mar 11, 2025 - Python
The official GitHub page for the survey paper "A Survey of Large Language Models".
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).
An Open-sourced Knowledgable Large Language Model Framework.
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Oscar and VinVL
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
[NeurIPS 2020] "Graph Contrastive Learning with Augmentations" by Yuning You, Tianlong Chen, Yongduo Sui, Ting Chen, Zhangyang Wang, Yang Shen
The repository of ET-BERT, a network traffic classification model on encrypted traffic. The work has been accepted as The Web Conference (WWW) 2022 accepted paper.
Multi-modality pre-training
Code for KDD'20 "Generative Pre-Training of Graph Neural Networks"
Code for our SIGKDD'22 paper Pre-training-Enhanced Spatial-Temporal Graph Neural Network For Multivariate Time Series Forecasting.
[NeurlPS D&B 2024] Generative AI for Math: MathPile
The official repo for [NeurIPS'23] "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model"
One-click training of your own GPT. Training a GPT has never been easier for beginners. / 一键预训练+SFT一个属于自己的LLM,0基础训练GPT原来可以这么简单?
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Add a description, image, and links to the pre-training topic page so that developers can more easily learn about it.
To associate your repository with the pre-training topic, visit your repo's landing page and select "manage topics."