robinsongh381
  • KaKao Brain
  • Korea

A quick guide (especially) for trending instruction finetuning datasets

2,473 159 Updated Nov 28, 2023
Python 286 16 Updated Jun 9, 2024
Python 94 8 Updated May 9, 2024

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Python 204 19 Updated Jan 13, 2024

Experiments with generating opensource language model assistants

HTML 97 12 Updated May 14, 2023

Aligning pretrained language models with instruction data generated by themselves.

Python 4,089 482 Updated Mar 27, 2023
Python 692 40 Updated Mar 6, 2023

COYO-700M: Large-scale Image-Text Pair Dataset

Python 1,143 36 Updated Nov 30, 2022

A framework for few-shot evaluation of language models.

Python 6,556 1,738 Updated Sep 28, 2024

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Jupyter Notebook 767 82 Updated Jan 3, 2024
Python 118 16 Updated Jan 3, 2024

Must-read papers on prompt-based tuning for pre-trained language models.

4,056 377 Updated Jul 17, 2023

Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets

Python 127 14 Updated Nov 12, 2022

Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

Python 1,701 299 Updated Apr 6, 2023

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Python 1,968 201 Updated Nov 16, 2023

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

Python 1,001 139 Updated Jan 30, 2024

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

27,189 3,674 Updated Jul 18, 2024

Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

Python 157 27 Updated Sep 21, 2022

This repository is a sub-branch of AI Knowledge Tree, mainly focused on Natural Language Processing.

Shell 28 8 Updated Jun 14, 2021

A collection of papers on neural-symbolic AI (mainly focused on NLP applications)

225 25 Updated Aug 17, 2024

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Python 1,278 226 Updated Mar 23, 2022

Translating natural language questions to a structured query language

Jupyter Notebook 221 52 Updated Jun 12, 2023

Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.

Python 96 13 Updated Aug 14, 2023

Aspect Based Sentiment Analysis, PyTorch Implementations.

Python 1,993 523 Updated Jun 12, 2023

Implementation of additional Masked Language Model (MLM) pretraining for BERT

Python 7 1 Updated Aug 10, 2020

Unofficial implementation of ConveRT model from PolyAI with no pre-trained encoder

Python 3 2 Updated Sep 2, 2020

A TensorFlow implementation of a GAN (specifically InfoGAN) applied to one-dimensional (1D) time series data.

Python 291 101 Updated Mar 24, 2024

Pretrained ELECTRA Model for Korean

Python 599 136 Updated Feb 19, 2024