sbert · GitHub Topics

MTEB: Massive Text Embedding Benchmark

benchmark clustering information-retrieval sentence-transformers sts text-embedding retrieval neural-search semantic-search sbert text-classification reranking

Jupyter Notebook 2.39 k

9 hours ago

beir-cellar / beir

#natural-language-processing#A Heterogeneous Benchmark for Information Retrieval. Easy to use; evaluate your models across 15+ diverse IR datasets.

natural-language-processing information-retrieval bert benchmark sentence-transformers retrieval elasticsearch sbert dataset colbert deep-learning PyTorch large-language-models rag

Python 1.77 k

2 months ago

ContextualAI / gritlm

#large-language-models#Generative Representational Instruction Tuning

embeddings grit information-retrieval instruction-tuning large-language-models llms retrieval sbert text-embedding embedding

Jupyter Notebook 617

1 month ago

sudharsan13296 / Getting-Started-with-Google-BERT

#natural-language-processing#Build and train state-of-the-art natural language processing models using BERT

bert transformer huggingface-transformers albert roberta electra natural-language-processing PyTorch bart sbert

Jupyter Notebook 220

4 years ago

DmitryKey / bert-solr-search

#vector-search-engine#Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU

elasticsearch solr vector-search bert-model sbert hnswlib semantic-search diversity

Jupyter Notebook 166

8 months ago

yuanzhoulvpi2017 / DocumentSearch

A document search tool built on sentence transformers and chatglm

chatglm-6b sbert transformers

Python 154

2 years ago

cpcdoy / rust-sbert

#natural-language-processing#Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)

Rust bert sbert sentence-transformers sentence-embeddings natural-language-processing

Rust 113

7 months ago

helliun / targetedSummarization

#natural-language-processing#TextReducer - A Tool for Summarization and Information Extraction

information-extraction natural-language-processing question-answering sbert summarization

Python 87

1 year ago

hellonlp / sentence-similarity

Text similarity, semantic vectors, text vectors, text-similarity, similarity, sentence-similarity, BERT, SimCSE, BERT-Whitening, Sentence-BERT, PromCSE, SBERT

Python 73

5 months ago

thiswillbeyourgithub / AnnA_Anki_neuronal_Appendix

#natural-language-processing#Using machine learning on your Anki collection to enhance the scheduling via semantic clustering and semantic similarity

umap kmeans clustering embedding bert natural-language-processing Anki flashcards machine-learning pca scheduler sentence-embeddings sbert artificial-intelligence

Python 64

6 months ago

ukairia777 / KoBERTopic

KoBERTopic is code that modifies the tokenizer and BERT so that BERTopic can be applied to Korean data.

bert lda topic-modeling sbert

Jupyter Notebook 61

3 years ago

yuanzhoulvpi2017 / questionAnswerSystem

#natural-language-processing#A bot that converts text to vectors, built on sentence-transformers

database encoding natural-language-processing NumPy pandas Python robot sbert question-answering FastAPI

Jupyter Notebook 45

3 years ago

wri-dssg-omdena / policy-data-analyzer

#web-crawler#Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible fr...

natural-language-processing sbert sentence-transformers huggingface machine-learning text-classification document-classification scraping policy data-science bert transformers spyder scrapy topic lda active-learning

Jupyter Notebook 35

3 years ago

UKPLab / useb

#natural-language-processing#Heterogeneous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.

sentence-embeddings unsupervised-learning benchmark domain-adaptation information-retrieval reranking sbert transformer PyTorch natural-language-processing

Python 32

3 years ago