All-pair set similarity search on millions of sets in Python and on a laptop
翻译 - 在Python和笔记本电脑上的数百万个集合上进行全对集合相似性搜索
Finding all pairs of similar documents time- and memory-efficiently
Efficient set similarity search algorithms implemented in Go
Increases unit test coverage with fewer test cases using all-pairs and other covering arrays.
Solutions to some interesting questions on different data structure and algorithm concepts