#搜索#All-in-one 一站式 embedding 数据库,语义搜索、LLM 编排和语言模型workflows
#大语言模型#Retrieval and Retrieval-augmented LLMs
#自然语言处理#Leveraging BERT and c-TF-IDF to create easily interpretable topics.
翻译 - 利用BERT和基于类的TF-IDF创建易于理解的主题。
#自然语言处理#text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
#自然语言处理#[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
翻译 - SimCSE:句子嵌入的简单对比学习
#自然语言处理# A curated list of pretrained sentence and word embedding models
#自然语言处理#xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
SGPT: GPT Sentence Embeddings for Semantic Search
#自然语言处理#unified embedding model
#自然语言处理#Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
翻译 - 用于印度语言的Natural Language Toolkit旨在为应用程序开发人员可能需要的各种NLP任务提供开箱即用的支持
Compute Sentence Embeddings Fast!
#向量搜索引擎#A Python vector database you just need - no more, no less.
#自然语言处理#BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
#大语言模型#Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
#计算机科学#A Structured Self-attentive Sentence Embedding
Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
#自然语言处理#The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
#自然语言处理#A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.