Skip to content
@TurkuNLP

TurkuNLP Group - IT Department - University of Turku

Popular repositories Loading

  1. Turku-neural-parser-pipeline Turku-neural-parser-pipeline Public

    A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more than 50 languages. Top ranker in the CoNLL-18 Shared Task.

    Python 112 31

  2. FinBERT FinBERT Public

    BERT model trained from scratch on Finnish

    Shell 96 7

  3. Finnish-dep-parser Finnish-dep-parser Public

    The Finnish dependency parsing pipeline being developed by the Turku NLP group. Documentation:

    Python 49 10

  4. wikibert wikibert Public

    BERT models for many languages created from Wikipedia texts

    34 1

  5. Text_Mining_Course Text_Mining_Course Public

    Stuff for the Text Mining course

    Jupyter Notebook 28 9

  6. ocr-correction ocr-correction Public

    Post-processing OCR errors with seq2seq models

    Python 28 2

Repositories

Showing 10 of 124 repositories
  • vLLM-recipes Public

    Different vLLM setups on different machines

    TurkuNLP/vLLM-recipes’s past year of commit activity
    Python 0 0 0 0 Updated Nov 15, 2024
  • TurkuNLP/ATP_kurssi’s past year of commit activity
    Jupyter Notebook 4 4 0 0 Updated Nov 14, 2024
  • ocr-postcorrection-lm Public

    Code to try out ocr postcorrection with language models

    TurkuNLP/ocr-postcorrection-lm’s past year of commit activity
    Jupyter Notebook 0 0 1 0 Updated Nov 11, 2024
  • htr-annotations Public

    Handwritten text recognition annotations

    TurkuNLP/htr-annotations’s past year of commit activity
    0 0 0 0 Updated Nov 7, 2024
  • htr-table-pipeline Public

    Handwritten text recognition pipeline for table data

    TurkuNLP/htr-table-pipeline’s past year of commit activity
    Jupyter Notebook 0 Apache-2.0 0 0 0 Updated Nov 4, 2024
  • TurkuNLP/RAG-web-app’s past year of commit activity
    HTML 3 0 6 0 Updated Oct 31, 2024
  • TurkuNLP/pytorch-registerlabeling’s past year of commit activity
    Python 1 1 0 0 Updated Oct 29, 2024
  • Deep_Learning_in_LangTech_course Public

    Materials for the University of Turku course TKO_8965 Deep Learning in Human Language Technology (previously named TKO_2101 Natural Language Processing)

    TurkuNLP/Deep_Learning_in_LangTech_course’s past year of commit activity
    Jupyter Notebook 18 11 0 0 Updated Oct 15, 2024
  • ocr_errors_simulator Public

    Functions and codes used to determine probabilities on OCR errors and simulate them

    TurkuNLP/ocr_errors_simulator’s past year of commit activity
    Python 2 Apache-2.0 0 0 0 Updated Oct 10, 2024
  • situational-analysis-llm Public

    Code and data for multilingual situational analysis of web registers using LLMs.

    TurkuNLP/situational-analysis-llm’s past year of commit activity
    0 Apache-2.0 0 0 0 Updated Oct 4, 2024