Learnice

Intelligent Icelandic Language Processing System 🌐🇮🇸

An NLP-powered application designed to assist with Icelandic language learning and processing. This project integrates key functionalities such as Part-of-Speech (PoS) tagging, spelling and grammar suggestions, and bilingual translation, enabling users to seamlessly analyze, correct, and translate Icelandic and English text.

Hugging Face Model

https://huggingface.co/valgardg/learnice-pos-tagger

Overview

This project is a web application consisting of:

Vue Frontend: A Vue.js-based frontend that serves the user interface. FastAPI Backend: A Python FastAPI backend for handling the application's API. LLM Scripts: A set of scripts for fine-tuning a language model. To run the project, both the frontend and backend must be running simultaneously.

Requirements

Node.js: Version v16.15.0 Python: Version 3.9.18 with pip installed Dependencies: Installed via npm for the frontend and pip for the backend

Backend setup

create a python virtual environment (optional but recommended)

python3 -m venv venv
source venv/bin/active

install project dependancies

cd learnice-backend
IMPORTANT: replace '<path_to_project>' in line 50 of requirements.txt with the path to your project
pip3 install -r requirements.txt

start the backend project

uvicorn main:app --reload

The backend includes a .env file pre-configured with necessary credentials for running an AWS service. This file has been included to simplify setup and avoid requiring you to create and configure an AWS account. No changes are needed to this file.

Frontend setup

Navigate to project directory cd learnice-frontend
install project dependancies npm install
start the development server npm run dev

LLM scripts

The llm/ folder contains scripts for fine-tuning a language model (mbert-base-multilingual-cased). These scripts are standalone and do not require the frontend or backend to be running. To use these scripts:

Ensure Python is installed with the necessary libraries (requirements.txt). Run the scripts directly from the llm/ folder.

Usage

Start the backend with uvicorn main:app --reload. Start the frontend with npm run dev. Open the application in your browser at http://localhost:5173.

llm files

llms/:

finetune_mbert.py: The script where the fine-tuned model is trained on preprocessed dataset.
test_tuned_model.py: A script used to test and observed the predicted tags of the fine-tuned model on different sentences
evaluate_fine_tuned.py: script used to evaluate performance of the fine tuned model
MIM_GOLD_DESCRIPTION_EN_tagset.pdf: reference for the meaning of each tag within the MIM-Gold dataset

llms/lang-classification:

evaluate_lang_classifier.py: script used to evaluate the performance of the language classifier model
lang_classifier.py: script to train the language classifier used to classify a sentence as Icelandic or English

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
data		data
learnice-backend		learnice-backend
learnice-frontend		learnice-frontend
llms		llms
.gitignore		.gitignore
Fine-Tuning_mBERT_for_Icelandic_PoS_Tagging_and_Integrating_Multilingual_NLP_Tools.pdf		Fine-Tuning_mBERT_for_Icelandic_PoS_Tagging_and_Integrating_Multilingual_NLP_Tools.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learnice

Intelligent Icelandic Language Processing System 🌐🇮🇸

Hugging Face Model

Overview

Requirements

Backend setup

Frontend setup

LLM scripts

Usage

llm files

About

Uh oh!

Releases

Packages

Uh oh!

Languages

valgardg/learnice

Folders and files

Latest commit

History

Repository files navigation

Learnice

Intelligent Icelandic Language Processing System 🌐🇮🇸

Hugging Face Model

Overview

Requirements

Backend setup

Frontend setup

LLM scripts

Usage

llm files

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages