gensim · GitHub Topics

piskvorky / gensim

#Natural Language Processing#Topic Modelling for Humans

gensim topic-modeling information-retrieval machine-learning natural-language-processing data-science Python data-mining word2vec word-embeddings neural-network fasttext

Python 16.19 k

2 months ago

dipanjanS / text-analytics-with-python

#Natural Language Processing#Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text ...

text-classification Python natural-language natural-language-processing clustering sentiment semantic sentiment-analysis nltk stanford-nlp spaCy pattern scikit-learn gensim

Jupyter Notebook 1.68 k

5 years ago

explosion / sense2vec

#Natural Language Processing#🦆 Contextually-keyed word vectors

spaCy natural-language-processing word2vec Python sense2vec gensim gensim-word2vec machine-learning

Python 1.66 k

5 months ago

plasticityai / magnitude

#Natural Language Processing#A fast, efficient universal vector embedding utility package.

Python natural-language-processing machine-learning vectors embeddings word2vec fasttext glove gensim fast memory-efficient word-embeddings

Python 1.65 k

2 years ago

kavgan / nlp-in-practice

#Natural Language Processing#Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre...

natural-language-processing word2vec text-classification gensim machine-learning text-mining

Jupyter Notebook 1.18 k

5 years ago

piskvorky / gensim-data

Data repository for pretrained NLP models and NLP corpora.

dataset gensim pretrained-models

Python 1.03 k

8 years ago

oborchers / Fast_Sentence_Embeddings

Compute Sentence Embeddings Fast!

sentence-embeddings sentence-representation sentence-similarity gensim fasttext cython embeddings maxpooling fse

Jupyter Notebook 622

3 years ago

zake7749 / word2vec-tutorial

Chinese word vector training tutorial

gensim word2vec

Python 517

3 years ago

ThoughtRiver / lmdb-embeddings

Fast word vectors with little memory usage in Python

word vectors embeddings lmdb gensim memory speed text word2vec fasttext glove

Python 416

4 years ago

bakrianoo / aravec

#Natural Language Processing#AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models...

natural-language-processing gensim arabic text-mining word2vec

Jupyter Notebook 406

4 years ago

5hirish / adam_qas

#Natural Language Processing#ADAM - A Question Answering System. Inspired from IBM Watson

Python spaCy natural-language-processing question-answering adam scikit-learn gensim pandas wikipedia elasticsearch spacy-extension

Python 356

6 years ago

AICoE / log-anomaly-detector

Log Anomaly Detection - Machine learning to detect abnormal events logs

artificial-intelligence log anomaly-detection machine-learning word2vec som gensim stream-processing Kubernetes aiops

Jupyter Notebook 335

2 years ago

30lm32 / ml-projects

#Natural Language Processing#ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python

Keras Tensorflow random-forest gensim word2vec Docker timeseries-analysis imbalanced-data svm natural-language-processing machine-learning geolocation deep-learning text-classification tensorboard mlflow ab-testing

281

5 years ago

benedekrozemberczki / GEMSEC

#Computer Science#The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).

clustering deepwalk node2vec word2vec Tensorflow Facebook deezer community-detection matrix-factorization embedding neural-network unsupervised-learning gensim machine-learning network-embedding graph-embedding

Python 259

3 years ago

davidberenstein1957 / concise-concepts

#Natural Language Processing#This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

ner spaCy gensim natural-language-processing machine-learning Hacktoberfest

Python 245

2 years ago

devmount / GermanWordEmbeddings

#Natural Language Processing#Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.

neural-network word2vec word-embeddings model training evaluation deep-learning deep-neural-networks natural-language-processing gensim

Jupyter Notebook 239

1 year ago

akoksal / Turkish-Word2Vec

#Natural Language Processing#Pre-trained Word2Vec Model for Turkish

word2vec natural-language-processing gensim turkish

Python 216

7 years ago

benedekrozemberczki / Splitter

#Computer Science#A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).

deepwalk PyTorch node2vec gensim machine-learning word2vec factorization deep-learning deep-neural-networks graph-neural-network node-embedding community-detection clustering network-embedding graph-embedding graph-representation-learning

Python 212

2 years ago

alisonmitchell / Stock-Prediction

#Natural Language Processing#Technical and sentiment analysis to predict the stock market with machine learning models based on historical time series data and news article sentiment collected using APIs and web scraping.

Python machine-learning keras-tensorflow NumPy scikit-learn pandas seaborn matplotlib plotly SciPy mplfinance beautifulsoup nltk spaCy gensim natural-language-processing bert huggingface

Jupyter Notebook 205

1 month ago

akutuzov / webvectors

Web-ify your word2vec: framework to serve distributional semantic models online

gensim word2vec Web app Flask

Python 201

7 months ago