A curated list of resources for Cross-lingual Information Retrieval (CLIR).
Please feel free to email Rui Zhang (r.zhang@yale.edu).
- Cross-Language Information Retrieval [book]
- Cross-Lingual Word Representations: Induction and Evaluation (EMNLP 2017 Tutorial) [slides]
- A Survey of Cross-lingual Word Embedding Models (JAIR) (paper)
- Neural Networks for Information Retrieval (SIGIR 2017, ECIR 2018, WSDM 2018 Tutorial) [slides]
- Neural Models for Information Retrieval [video][paper][slides]
- An Introduction to Neural Information Retrieval [paper]
- Neural Text Embeddings for Information Retrieval (WSDM 2017) [slides]
- Neural Information Retrieval: At the End of the Early Years (Information Retrieval Journal 2018) [slides]
- Neural Information Retrieval: A Literature Review [paper]
- Deep Learning for Information Retrieval (SIGIR 2016 Tutorial) [paper]
- Learning to Rank for Information Retrieval and Natural Language Processing [book]
- Unsupervised Cross-Lingual Information Retrieval using Monolingual Data Only (SIGIR 2018) [paper][code]
- Monolingual and Cross-Lingual Information Retrieval Models Based on (Bilingual) Word Embeddings (SIGIR 2015) [paper]
- Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval (ACL 2018) [paper][code]
- Cross-lingual Learning-to-Rank with Shared Representations (NAACL 2018) [paper][data]
- Learning to Match using Local and Distributed Representations of Text for Web Search (WWW 2017) [paper][code]
- PACRR: A Position-Aware Neural IR Model for Relevance Matching (EMNLP 2017) [paper][code]
- Co-PACRR: A Context-Aware Neural IR Model for Ad-hoc Retrieval (WSDM 2018) [paper][code]
- Learning to Translate: A Query-Specific Combination Approach for Cross-Lingual Information Retrieval (EMNLP 2014) [paper]
- Representation Learning Using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval (NAACL 2015)[paper]
- A Dual Embedding Space Model for Document Ranking (WWW 2016)[paper]
- Query Expansion with Locally-Trained Word Embeddings (ACL 2016)[paper]
- A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval (CIKM 2014)[paper]
- Toward Incorporation of Relevant Documents in word2vec (NeuIR-SIGIR 2017)[paper]
- Using Word Embeddings for Automatic Query Expansion (NeuIR-SIGIR 2016)[paper]
- Query Expansion Using Word Embeddings (CIKM 2016)[paper]
- Improved Cross-Language Retrieval using Backoff Translation (HLT 2001)[paper]
- (MUSE) Word Translation Without Parallel Data (ICLR 2018) [paper][github][blog]
- (Babylon - fastText_multilingual) Offline bilingual word vectors, orthogonal transformations and the inverted softmax (ICLR 2017) [paper][github]
- Learning Crosslingual Word Embeddings without Bilingual Corpora (EMNLP 2016) [paper][github]
- Multilingual Training of Crosslingual Word Embeddings (EACL 2017)[paper]
- Unsupervised Multilingual Word Embeddings (EMNLP 2018)[paper][code]
- (multiCluster and multiCCA) Massively Multilingual Word Embeddings [paper]
- (bivec) Bilingual Word Representations with Monolingual Quality in Mind (NAACL 2015 workshop)[paper][github]
- Cross-lingual Models of Word Embeddings: An Empirical Comparison (ACL 2016) [paper]
- Bilingual Embeddings with Random Walks over Multilingual Wordnets (Knowledge-Based Systems, 2018) [paper]
- Cross-lingual Word Clusters for Direct Transfer of Linguistic Structure (NAACL 2012) [paper]
- Improving Cross-Lingual Word Embeddings by Meeting in the Middle (EMNLP 2018) [paper]
- CLUSE: Cross-Lingual Unsupervised Sense Embedding (EMNLP 2018) [paper][code]
- Unsupervised Cross-lingual Transfer of Word Embedding Spaces (EMNLP 2018) [paper]
- Cross-Lingual Morphological Tagging for Low-Resource Languages (ACL 2016) [paper]
- Phrase-Based & Neural Unsupervised Machine Translation (EMNLP 2018) [paper]
- Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision (EMNLP 2018) [paper]
- XL-NBT: A Cross-lingual Neural Belief Tracking Framework (EMNLP 2018) [paper]
- Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization (EMNLP 2018) [paper]
- Cross-lingual Lexical Sememe Prediction (EMNLP 2018) [paper]
- Joint Multilingual Supervision for Cross-lingual Entity Linking (EMNLP 2018) [paper]
- Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks (EMNLP 2018) [paper]
- Neural Cross-lingual Named Entity Recognition with Minimal Resources (EMNLP 2018) [paper]
- Multi-lingual Common Semantic Space Construction via Cluster-Consistent Word Embedding (EMNLP 2018) [paper]
- XNLI: Evaluating Cross-lingual Sentence Representations (EMNLP 2018) [paper]
- Morphological Word-Embeddings (NAACL 2015) [paper]
- Discriminative Reranking for Machine Translation (NAACL 2004) [paper]
- Discriminative Reranking for Natural Language Parsing (Computational Linguistics) [paper]
- Probabilistic Structured Query Methods (SIGIR 2003) [paper]
- Strong Baselines for Neural Semi-supervised Learning under Domain Shift (ACL 2018) [paper]
- Transfer Learning for Low-Resource Neural Machine Translation (EMNLP 2016) [paper]
- Meta-Learning for Semi-Supervised Few-Shot Classification (ICLR 2018) [paper]
- One-shot Learning with Memory-Augmented Neural Networks [paper]
- Low-Resource Semantic Role Labeling (ACL 2014) [paper]
- Universal Neural Machine Translation for Extremely Low Resource Languages (NAACL 2018) [paper]
- An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages (LREC 2018) [paper]