A medical transcription keyword extractor. Extracts the possible keywords from a doctor's medical transcript.

nadhirfr/medical_transcript_keyword_extract

Medical Transcription Keywords Extraction

The aim of this repository is to extract keywords from medical transcriptions. The dataset was obtained from an open medical transcription dataset.

Preprocessing

Remove symbols, stopwords, empty spaces after commas, multiple spaces, etc. Basically, it keeps only the words, separated by a single space. The clean dataset here
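A minimal sketch of what this cleaning step might look like. The function name `clean_text`, the tiny `STOPWORDS` set, and the lowercasing step are assumptions made for illustration; the repository's actual preprocessing code and stopword list may differ.

```python
import re

# Hypothetical, deliberately tiny stopword list (illustration only).
# A real run would likely use a fuller list from an NLP library.
STOPWORDS = {"a", "an", "the", "of", "and", "is", "was", "to", "in", "with"}


def clean_text(text: str) -> str:
    """Keep only words, separated by a single space."""
    # Assumption: lowercase so stopword matching is case-insensitive.
    text = text.lower()
    # Replace every character that is not a letter, digit, or whitespace.
    text = re.sub(r"[^a-z0-9\s]", " ", text)
    # split() with no arguments collapses runs of whitespace,
    # which also removes the empty space left after commas.
    words = [word for word in text.split() if word not in STOPWORDS]
    return " ".join(words)


print(clean_text("The patient,  was seen   with chest pain!"))
```

Splitting on whitespace and re-joining with a single space handles multiple spaces and leading or trailing spaces in one step.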

The model

  • Pipeline: word vectorizer -> TF-IDF transformer -> OneVsRestClassifier with an SGD classifier
  • Input: ['a string sentences', 'another string sentences']
  • Output: ['keywords separated by a single space', 'another extracted keywords']
  • Serialized model here here
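The pipeline described above could be sketched with scikit-learn roughly as follows. This is an illustration under assumptions, not the repository's code: the toy training data is made up, each space-separated keyword is treated as one label, and the use of `MultiLabelBinarizer` and the helper name `extract_keywords` are choices made for this example.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer
from sklearn.linear_model import SGDClassifier
from sklearn.multiclass import OneVsRestClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MultiLabelBinarizer

# Made-up toy data (illustration only): cleaned transcripts and their keywords.
texts = [
    "patient complains chest pain shortness breath",
    "chest pain radiating left arm",
    "patient reports knee pain after fall",
    "knee swelling pain after injury",
]
keywords = ["chest breath", "chest arm", "knee fall", "knee injury"]

# Assumption: each space-separated keyword is one label in a multi-label task.
binarizer = MultiLabelBinarizer()
y = binarizer.fit_transform([k.split() for k in keywords])

pipeline = Pipeline([
    ("vect", CountVectorizer()),      # vectorize the words
    ("tfidf", TfidfTransformer()),    # TF-IDF weighting
    ("clf", OneVsRestClassifier(SGDClassifier(random_state=0))),
])
pipeline.fit(texts, y)


def extract_keywords(sentences):
    """Map a list of strings to a list of space-separated keyword strings."""
    predicted = pipeline.predict(sentences)
    return [" ".join(labels) for labels in binarizer.inverse_transform(predicted)]


print(extract_keywords(["chest pain left arm", "knee pain after fall"]))
```

The input and output shapes match the ones listed above: a list of strings in, a list of space-separated keyword strings out. With so little training data the predicted keywords themselves are not meaningful.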

Credits

This project relies on a number of open source projects to work properly:

License

MIT

Free Software, Hell Yeah!
