In our work, we explore whether learned matching knowledge in the product matching domain can be transferred between languages. More specifically, we investigate whether high-resource languages, such as English, can be used to improve performance in scenarios where either no (zero-shot) or only a few (few-shot) examples are available in the target language. To that end, we also study the effect of cross-lingual pre-training with models such as mBERT or XLM-R, and compare them to monolingual Transformer architectures and simple baseline models.
Our datasets are available for research purposes only and can be requested by email at ralph@informatik.uni-mannheim.de.
How to use this Repository
Before installing, make sure Microsoft Build Tools for C++ is installed, as it is required by py_entitymatching.
Individual experiments can be configured using the .json settings files. The settings_template.json file provides an overview of the possible settings for the experiments.
Note that some settings apply only to the multi-class setup, while others apply only to the pair-wise case.
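
For illustration, here is a minimal sketch of what an experiment configuration might look like. The key names (setup, model, epochs, learning_rate) and their values are assumptions made for this example; the authoritative list of options is in settings_template.json.

```python
import json

# Hypothetical settings sketch -- key names are assumptions, not the
# repository's actual schema; consult settings_template.json for the
# real options.
settings = {
    "setup": "multi",             # assumed: "multi" (multi-class) or "pairwise"
    "model": "xlm-roberta-base",  # e.g. a cross-lingual Transformer
    "epochs": 10,
    "learning_rate": 2e-5,
}

# Write the configuration to a settings file for a new experiment.
with open("settings_my_experiment.json", "w", encoding="utf-8") as f:
    json.dump(settings, f, indent=2)
```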
To run an experiment, provide the path of the individual .json settings file as input argument. For instance, to run settings_baseline_multi.json, include the argument:
--i path_to_file\settings_baseline_multi.json
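
As a rough sketch of how an entry point might consume this flag (only the --i flag itself is taken from the usage above; the script structure and variable names here are assumptions):

```python
import argparse
import json

# Parse the --i flag pointing at the .json settings file.
parser = argparse.ArgumentParser(description="Run a matching experiment")
parser.add_argument("--i", dest="settings_path", required=True,
                    help="Path to the .json settings file")
args = parser.parse_args()

# Load the experiment configuration from the given settings file.
with open(args.settings_path, "r", encoding="utf-8") as f:
    settings = json.load(f)

print("Loaded settings:", sorted(settings.keys()))
```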