MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)
#自然语言处理#Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
A scalable nearest neighbor search library in Apache Spark
A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It implements Locality-sensitive Hashing (LSH) and multi index ha...
#向量搜索引擎#[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
#搜索#Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon m...
Near-duplicate image detection using Locality Sensitive Hashing
A resistome profiler for Graphing Resistance Out Of meTagenomes
A Clojure library for querying large data-sets on similarity
Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search.
LSH index for approximate set containment search
An implementation of locality sensitive hashing with Hadoop
Locality Sensitive Hashing, fuzzy-hash, min-hash, simhash, aHash, pHash, dHash。基于 Hash值的图片相似度、文本相似度
Quickly estimate the similarity between many sets