#算法刷题#Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)
#计算机科学#TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
翻译 - TensorFlow Similarity 是一个 Python 包,专注于使相似性学习变得快速而简单。
#计算机科学#Blazing fast framework for fine-tuning similarity learning models
#计算机科学#Music recommender using deep learning with Keras and TensorFlow
#自然语言处理#Generating multiple choice questions from text using Machine Learning.
#算法刷题#A .NET port of java-string-similarity
#大语言模型#Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
Score documents using embedding-vectors dot-product or cosine-similarity with ES Lucene engine
📈This repo contains detailed notes and multiple projects implemented in Python related to AI and Finance. Follow the blog here: https://purvasingh.medium.com
#计算机科学#A python project for checking plagiarism of documents based on cosine similarity
Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.
#向量搜索引擎#Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows s...
#自然语言处理#A blazing-fast semantic search React component. Match by meaning, not just by letters. Search as you type without waiting (no debounce needed). Rank by cosine similarity.
#向量搜索引擎#Coltt is a vector database that supports Multi-Vector Search, high-performance HNSW, FLAT and quantization, and enables fast searches through sophisticated internal data shard design.
#人脸识别#Real-Time Face Recognition use SCRFD, ArcFace, ByteTrack and Similarity Measure
#自然语言处理#Information Retrieval algorithms developed in python. To follow the blog posts, click on the link:
#自然语言处理#Machine learning based text classification in JavaScript using n-grams and cosine similarity
#搜索#Fast similarity search using DuckDB