A Deep Learning Architecture for Semantic Address Matching

The code in this repository accompanies the paper: Yue Lin, Mengjun Kang, Yuyang Wu, Qingyun Du & Tao Liu (accepted). A deep learning architecture for semantic address matching. International Journal of Geographical Information Science.

To cite the code: Lin, Yue & Kang, Mengjun. (2019, October 8). yuelinnnnnnn/semantic_address_matching: Semantic address matching (Version v1.0). Zenodo. http://doi.org/10.5281/zenodo.3476673

Data are available at:

Shenzhen address corpus (part): http://doi.org/10.5281/zenodo.3477632
Labelled address dataset for semantic address matching: http://doi.org/10.5281/zenodo.3477006

Below is an overview of each file in this repository.

geo_config.py: Hyperparameter settings for the ESIM
geo_data_prepare.py: Tokenize the corpus and convert each address element into an index
geo_data_processor.py: Process the labeled address dataset and divide it into training, development and test sets
geo_ESIM.py: Implementation of the enhanced sequential inference model (ESIM)
geo_similarity.py: Calculate statistical characteristics of the labeled address dataset
geo_test.py: Output the predictions of the ESIM on the test set
geo_token.py: Tokenize with the Jieba library
geo_train.py: Train the ESIM and evaluate its accuracy on the development set
geo_word2vec.py: Train word vectors of address elements
other_CRF.py: Tokenize using a conditional random field (CRF) [Comber and Arribas-Bel (2019)]
other_crf_w2v.py: Train word vectors of address elements (CRF tokenizer)
other_string.py: String similarity-based address matching methods: measure the string relevance
other_w2v_cls.py: Use word2vec embeddings directly for classification: calculate cosine similarity
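
The two baseline ideas named above (string similarity and cosine similarity over word vectors) can be illustrated with a minimal sketch. This is not the code in other_string.py or other_w2v_cls.py: the function names, the toy 2-D embeddings, and the use of difflib's ratio as the string measure are all assumptions made for illustration, and the repository may use different measures and real word2vec vectors.

```python
import math
from difflib import SequenceMatcher


def string_similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1], using difflib's ratio (an assumed measure)."""
    return SequenceMatcher(None, a, b).ratio()


def mean_vector(tokens, embeddings):
    """Average the vectors of the tokens that have an embedding; [] if none do."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        return []
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine_similarity(u, v):
    """Cosine of the angle between u and v; 0.0 if either is empty or all zeros."""
    if not u or not v:
        return 0.0
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical 2-D embeddings; real word2vec vectors would be learned from the corpus.
toy_embeddings = {
    "shenzhen": [1.0, 0.0],
    "futian": [0.0, 1.0],
    "district": [1.0, 1.0],
}

# The same address elements in a different order.
addr_a = ["shenzhen", "futian", "district"]
addr_b = ["futian", "district", "shenzhen"]

score_vec = cosine_similarity(
    mean_vector(addr_a, toy_embeddings),
    mean_vector(addr_b, toy_embeddings),
)
score_str = string_similarity(" ".join(addr_a), " ".join(addr_b))
```

In this toy case the averaged-embedding score ignores element order, while the string score is penalized by it. That difference is one reason to compare the two families of baselines.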
