#Natural Language Processing# Topic Modelling for Humans
#Natural Language Processing# Learn how to process, classify, cluster, and summarize text data, and how to understand its syntax, semantics, and sentiment, with the power of Python! This repository contains code and datasets used in my book, "Text ...
#Natural Language Processing# A fast, efficient universal vector embedding utility package.
#Natural Language Processing# 🦆 Contextually-keyed word vectors
#Natural Language Processing# Starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with PySpark, simple text preprocessing, pre...
Data repository for pretrained NLP models and NLP corpora.
Compute Sentence Embeddings Fast!
#Natural Language Processing# AraVec is a pre-trained distributed word representation (word embedding) open-source project that aims to provide the Arabic NLP research community with free-to-use and powerful word embedding models...
#Natural Language Processing# ADAM - A Question Answering System. Inspired by IBM Watson
Log Anomaly Detection - Machine learning to detect abnormal event logs
#Natural Language Processing# ML-based projects such as Spam Classification, Time Series Analysis, and Text Classification using Random Forest, Deep Learning, Bayesian methods, and XGBoost in Python
#Computer Science# The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
#Natural Language Processing# This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.
#Natural Language Processing# Toolkit to obtain and preprocess German text corpora, train models, and evaluate them with generated test sets. Built with Gensim and TensorFlow.
#Computer Science# A PyTorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Web-ify your word2vec: framework to serve distributional semantic models online
#Computer Science# An experiment in re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and a nice API. Written in Python and fu...