Stars
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
A developer-friendly Python library to interact with Apache HBase
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
This repository contains tool and collections dataset for detecting off-topic pages from Web archived collections.
Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models
An Efficient Lexical Analyzer for Chinese
RNNSharp is a toolkit of deep recurrent neural network which is widely used for many different kinds of tasks, such as sequence labeling, sequence-to-sequence and so on. It's written by C# language…
💻 Data Structures and Algorithms in Python
Rakuten MA - morphological analyzer (word segmentor + PoS Tagger) for Chinese and Japanese written purely in JavaScript.
Python implementation of GloVe word embedding algorithm (Pennington et al., 2014) for educational purposes
Reinforcement Learning environments based on the 1993 game Doom
Demonstration of recurrent neural network implemented with Theano
A simple interface to the Project Gutenberg corpus.
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
Benchmarks for several RNN variations with different deep-learning frameworks
fitting mixture models using mixture of polynomials
solve polynomial optimization and generalized moment problem
C++ implementation of the Brown word clustering algorithm.
Random notes on papers, likely a short-term repo.
An Open Source Machine Learning Framework for Everyone
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Universal Dependencies online documentation