Morse

Morse is a collection of morphological taggers presented in the paper that you can train on your data.

Furthermore, Morse provides pre-trained models which are trained in Universal Dependencies and in TrMor datasets, so you can tag your sentences immediately.

Dependencies

Julia 1.1
Network connection

Installation

   git clone https://github.com/ekinakyurek/Morse.jl
   cd Morse.jl

Setup

Open Julia in Morse.jl folder, then type ] to activate pkg mode. After that run the below commands.

   (v1.1) pkg> activate .
   (v1.1) Morse> instantiate # only in the first time

Data (Optional)

   julia> using Morse
   julia> download(TRDataSet)
   julia> download(UDDataSet)

Experiments

To verify the results presented in the paper, you may run the scripts to train models an ablations. During training logs will be created at logs/ folder.

Detailed information about experiments can be found in scripts/

Note: Nvidia GPU is required to train on a reasonable time.

Tagging

Note: Limited Support

   julia> using Knet, KnetLayers, Morse
   julia> model, vocabulary, parser = trained(MorseModel, TRDataSet, vers="2018");
   julia> predictions = model("annem sana yardım edemez .", v=vocabulary, p=parser)
   annem anne+Noun+A3sg+P1sg+Nom
   sana sen+Pron+Pers+A2sg+Pnon+Dat
   yardım yardım+Noun+A3sg+Pnon+Nom
   edemez et+Verb^DB+Verb+Able+Neg+Aor+A3sg
   . .+Punct

Customized Training

Note: Nvidia GPU is required to train on a reasonable time.

   julia> using Knet, KnetLayers, Morse
   julia> config = Morse.intro(split("--logFile nothing --lemma --dataSet TRDataSet")) # you can modify the program arguments
   julia> dataFiles = ["train.txt", "test.txt"] # make sure you have theese files exists in the given path
   julia> data, vocab, parser = prepareData(dataFiles,TRDataSet) # or UDDataSet
   julia> data = miniBatch(data,vocab) # sentence minibatching is required for processing a sentence correctly
   julia> model = MorseModel(config,vocab)
   julia> setoptim!(model, SGD(;lr=1.6,gclip=60.0))
   julia> trainmodel!(model,data,config,vocab,parser) # can take hours or more depends to your data
   julia> predictions = model("Annem sana yardım edemez .", v=vocab, p=parser)

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
checkpoints		checkpoints
data		data
docs		docs
generations		generations
logs		logs
scripts		scripts
src		src
.gitlab-ci.yml		.gitlab-ci.yml
.travis.yml		.travis.yml
LICENSE		LICENSE
Manifest.toml		Manifest.toml
Project.toml		Project.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Morse

Dependencies

Installation

Setup

Data (Optional)

Experiments

Tagging

Customized Training

About

Releases

Packages

Languages

License

ekinakyurek/Morse_Personal

Folders and files

Latest commit

History

Repository files navigation

Morse

Dependencies

Installation

Setup

Data (Optional)

Experiments

Tagging

Customized Training

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages