fasttext-embeddings · GitHub Topics

#自然语言处理#NLP 领域常见任务的实现，包括新词发现、以及基于pytorch的词向量、中文文本分类、实体识别、摘要文本生成、句子相似度判断、三元组抽取、预训练模型等。

textcnn bilstm-crf-model fasttext-embeddings transformer-pytorch seq2seq gpt2 text-classification glove 自然语言处理 PyTorch bert bert-ner electra

Python 527

2 年前

dccuchile / spanish-word-embeddings

#自然语言处理#Spanish word embeddings computed with different methods and from different corpora

自然语言处理 spanish word-embeddings fasttext-embeddings

358

6 年前

avidale / compress-fasttext

#自然语言处理#Tools for shrinking fastText models (in gensim format)

Python 自然语言处理 word-embeddings fasttext-embeddings fasttext

Jupyter Notebook 178

1 年前

thinkingmachines / christmAIs

#计算机科学#Text to abstract art generation for the holidays!

翻译 - 在假期给抽象艺术一代发短信！

机器学习 abstract-art perception fasttext-embeddings

Python 90

2 年前

ikergarcia1996 / MetaVec

A monolingual and cross-lingual meta-embedding generation and evaluation framework

embedding embedding-vectors embeddings word2vec fasttext fasttext-embeddings

Python 80

3 年前

ashalogic / Persian-Sentiment-Analyzer

#自然语言处理#Persian sentiment analysis ( آناکاوی سهش های فارسی | تحلیل احساسات فارسی )

lstm persian persian-nlp sentiment-analysis 机器学习 Python .NET JavaScript farsi fasttext fasttext-embeddings word2vec embeddings 教程 colab Tensorflow 自然语言处理

Jupyter Notebook 55

3 年前

hbahadirsahin / nlp-experiments-in-pytorch

#自然语言处理#PyTorch repository for text categorization and NER experiments in Turkish and English.

PyTorch 自然语言处理 text ner named-entity-recognition english-language fasttext-embeddings textcnn vdcnn transformer text-classification

Python 36

2 年前

cambridgeltl / ContrastiveBLI

Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

contrastive-learning self-learning PyTorch bilingual-lexicon-extraction word-embeddings fasttext-embeddings information-retrieval machine-translation

Python 34

3 个月前

ashokc / Word-Embeddings-and-Document-Vectors

An evaluation of word-embeddings for classification

fasttext-embeddings word2vec elasticsearch scikitlearn-machine-learning naive-bayes-classifier neural-networks

Python 32

6 年前

JoyeBright / DeepSentiPers

Repository for the experiments described in the paper named "DeepSentiPers: Novel Deep Learning Models Trained Over Proposed Augmented Persian Sentiment Corpus"

sentiment-analysis Keras 神经网络 lstm cnn fasttext-embeddings classification data-augmentation 深度神经网络 corpus dataset

Jupyter Notebook 32

2 年前

PlanTL-GOB-ES / lm-legal-es

Language Models for the legal domain in Spanish done @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

language-model roberta fasttext-embeddings spanish-language

2 年前

priyanshu2103 / Sanskrit-Hindi-Machine-Translation

Machine Translation from Sanskrit to Hindi using Unsupervised and Supervised Learning

machine-translation fasttext-embeddings

Jupyter Notebook 19

4 年前

tien02 / ensemble-roberta-fasttext-vietnamese

#自然语言处理#Ensemble PhoBERT with FastText Embedding to improve performance on Vietnamese Sentiment Analysis tasks.

bert fasttext fasttext-embeddings fine-tuning gensim-word2vec lstm 自然语言处理 PyTorch pytorch-lightning text-classification sentiment-analysis sentiment-classification

Python 16

2 年前

cambridgeltl / BLICEr

Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

bilingual-lexicon-extraction fasttext-embeddings PyTorch reranking self-learning word-embeddings xlm-roberta information-retrieval machine-translation

Python 13

2 年前

BlackKakapo / Romanian-Word-Embeddings

#自然语言处理#Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gensim library). The .vec and .model files are available for downl...

自然语言处理 word2vec cbow fasttext fasttext-embeddings vectors corpus words vocabulary

4 天前