Skip to content
View github-eliviate's full-sized avatar

Block or report github-eliviate

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Data

Train Data
13 repositories

Medical NLP Competition, dataset, large models, paper

2,238 418 Updated Dec 6, 2024

手工整理医疗行业词汇、术语等语料。可用于语音识别、对话系统等各类nlp模型训练。

114 39 Updated Apr 5, 2020

中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc

2,286 382 Updated Jan 17, 2024

Chinese medical dialogue data 中文医疗对话数据集

Python 1,368 259 Updated Aug 18, 2023

Chinese Medical Intent Dataset

5 28 Updated Nov 21, 2019

This is the dataset for Chinese community medical question answering.

98 20 Updated Oct 22, 2019

中医药古籍文本,近700项

852 364 Updated Sep 27, 2023

A Chinese medical question answering dataset

62 12 Updated Jan 14, 2020

PubMedQA: A Dataset for Biomedical Research Question Answering

Python 296 38 Updated Apr 18, 2023

Code and dataset for our Bioinformatics 2022 paper: "A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets"

Python 55 12 Updated Dec 24, 2022

Collection of awesome medical dataset resources.

673 57 Updated Jan 23, 2025

百度百科 500 万数据集

34 22 Updated Dec 1, 2023

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,761 263 Updated Mar 8, 2025