Stars
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…
Awesome-LLM: a curated list of Large Language Model
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Source code for Twitter's Recommendation Algorithm
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Must-read papers on prompt-based tuning for pre-trained language models.
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
Matplotlib styles for scientific plotting
TuckER: Tensor Factorization for Knowledge Graph Completion
A project written in Python to get old tweets, it bypass some limitations of Twitter Official API.
ACL 2020: Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base Embeddings
Dataset containing Aggregated and anonymized queries from across the world with Coronavirus intent.
InferSent sentence embeddings
Implementation of AdaBoost algorithm in Python
Public facing notes page
Scripts to build your own IPsec VPN server, with IPsec/L2TP, Cisco IPsec and IKEv2
Fine tuning inception v3 on Kaggle dogs-vs-cats dataset
Must-read papers on network representation learning (NRL) / network embedding (NE)