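This post first presents an awesome list of papers on vector search, that is, approximate nearest neighbor search (ANN search, ANNS). The collection aims to gather high-quality research papers, articles, and resources that offer valuable insights and advances. The technique is a key component of vector databases, retrieval-augmented generation (RAG), large-scale information retrieval, recommender systems, drug discovery, image search, and related areas.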
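This post will be updated continuously. Follow-up posts will walk through papers in the vector search field in an accessible way, with the goal of making a field that has been made to look very large feel a bit smaller. Much of it is less complicated than it appears; after all, many of the best-known works in this area are heuristic.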
What Is Vector Search and Where Is It Applied
Some explanations:
- what-is-vector-search
- a-gentle-introduction-to-vector-search
- Explanation on Quora
- k-nn-vs-approximate-nearest-neighbors
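
To make the exact versus approximate distinction in the last link concrete, here is a minimal sketch in plain Python. It compares brute-force k-NN with a toy partition-based search that scans only a few partitions, then measures recall against the exact result. The partitioning scheme (random data points as centroids) and all function names are assumptions chosen for illustration; this is not the method of any paper listed below.

```python
import heapq
import random


def squared_l2(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def exact_knn(data, query, k):
    """Brute force: score every vector and return the ids of the k closest."""
    return heapq.nsmallest(
        k, range(len(data)), key=lambda i: squared_l2(data[i], query)
    )


def build_partitions(data, n_parts, rng):
    """Toy index (hypothetical): random data points act as centroids,
    and each vector is assigned to its nearest centroid."""
    centroids = rng.sample(data, n_parts)
    parts = [[] for _ in range(n_parts)]
    for i, v in enumerate(data):
        nearest = min(range(n_parts), key=lambda c: squared_l2(centroids[c], v))
        parts[nearest].append(i)
    return centroids, parts


def approx_knn(data, centroids, parts, query, k, n_probe):
    """Scan only the n_probe partitions whose centroids are closest to the query."""
    probe = heapq.nsmallest(
        n_probe, range(len(centroids)), key=lambda c: squared_l2(centroids[c], query)
    )
    candidates = [i for c in probe for i in parts[c]]
    return heapq.nsmallest(k, candidates, key=lambda i: squared_l2(data[i], query))


def recall(approx_ids, exact_ids):
    """Fraction of the true nearest neighbors that the approximate search found."""
    return len(set(approx_ids) & set(exact_ids)) / len(exact_ids)


if __name__ == "__main__":
    rng = random.Random(0)
    data = [[rng.gauss(0, 1) for _ in range(16)] for _ in range(2000)]
    query = [rng.gauss(0, 1) for _ in range(16)]
    centroids, parts = build_partitions(data, n_parts=32, rng=rng)
    truth = exact_knn(data, query, k=10)
    for n_probe in (1, 4, 32):
        found = approx_knn(data, centroids, parts, query, k=10, n_probe=n_probe)
        print(f"n_probe={n_probe:2d}  recall@10={recall(found, truth):.2f}")
```

Probing more partitions means scanning more candidates, so recall can only stay the same or increase; when `n_probe` equals the number of partitions, the search degenerates to brute force. This trade between work done and recall is the one that ANN methods try to improve.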
Applications:
Papers
| Title | Link | Rough Category | Notes |
|---|---|---|---|
| Approximate Nearest Neighbor Search on High Dimensional Data — Experiments, Analyses, and Improvement | Link | Survey | |
| Graph-based Nearest Neighbor Search: From Practice to Theory | Link | Theoretical | |
| FINGER: Fast Inference for Graph-based Approximate Nearest Neighbor Search | Link | Graph-based | |
| HVS: hierarchical graph structure based on Voronoi diagrams for solving approximate nearest neighbor search | Link | Graph-based | |
| DiskANN: Fast Accurate Billion-point Nearest Neighbor Search on a Single Node | Link | Graph-based | |
| Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs | Link | Graph-based | |
| SONG: Approximate Nearest Neighbor Search on GPU | Link | Graph-based | |
| Graph-based Nearest Neighbor Search: Promises and Failures | Link | Graph-based | |
| Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination | Link | Graph-based | |
| A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search | Link | Survey | |
| Fast approximate nearest neighbor search with the navigating spreading-out graph | Link | Graph-based | |
| Non-metric Similarity Graphs for Maximum Inner Product Search | Link | Graph-based | |
| Understanding and Improving Proximity Graph-based Maximum Inner Product Search | Link | Graph-based | |
| Learning to Route in Similarity Graphs | Link | Graph-based | |
| Optimization of Indexing Based on k-Nearest Neighbor Graph for Proximity Search in High-dimensional Data | Link | Graph-based | |
| Fast Approximate Nearest Neighbor Search with a Dynamic Exploration Graph using Continuous Refinement | Link | Graph-based | |
| Efficient Approximate Nearest Neighbor Search in Multi-dimensional Databases | Link | Graph-based | |
| Scaling Graph-Based ANNS Algorithms to Billion-Size Datasets: A Comparative Analysis | Link | Graph-based | |
| SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search | Link | Graph-based | |
| Hierarchical Clustering-Based Graphs for Large Scale Approximate Nearest Neighbor Search | Link | Graph-based | |
| Fusion of graph-based indexing and product quantization for ANN search | Link | Graph-based | |
| Towards Efficient Index Construction and Approximate Nearest Neighbor Search in High-Dimensional Spaces | Link | Graph-based | |
| Scaling Graph-Based ANNS Algorithms to Billion-Size Datasets: A Comparative Analysis | Link | Survey | |
| Automating Nearest Neighbor Search Configuration with Constrained Optimization | Link | Learning | |
| Approximate Nearest Neighbor Search under Neural Similarity Metric for Large-Scale Recommendation | Link | Graph-based | |
| Norm Adjusted Proximity Graph for Fast Inner Product Retrieval | Link | Graph-based | |
| On Efficient Retrieval of Top Similarity Vectors | Link | Graph-based | |
| SONG: Approximate Nearest Neighbor Search on GPU | Link | GPU | |
| RTNN: Accelerating Neighbor Search Using Hardware Ray Tracing | Link | GPU | |
| Billion-scale similarity search with GPUs | Link | GPU | |
| Fast neural ranking on bipartite graph indices | Link | Neural Rank | |
| Fast Item Ranking under Neural Network based Measures | Link | Neural Rank | |
| Non-metric Similarity Graphs for Maximum Inner Product Search | Link | MIPS | |
| Möbius Transformation for Fast Inner Product Search on Graph | Link | MIPS | |
| Understanding and Improving Proximity Graph-based Maximum Inner Product Search | Link | MIPS | |
| Reinforcement Routing on Proximity Graph for Efficient Recommendation | Link | Learning | |
| From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective | Link | Learning | |
| Constructing Tree-based Index for Efficient and Effective Dense Retrieval | Link | Learning | |
| Reverse Maximum Inner Product Search: Formulation, Algorithms, and Analysis | Link | MIPS | |
| FARGO: Fast Maximum Inner Product Search via Global Multi-Probing | Link | LSH | |
| SRS: solving c-approximate nearest neighbor queries in high dimensional Euclidean space with a tiny index | Link | LSH | |
| LazyLSH: Approximate Nearest Neighbor Search for Multiple Distance Functions with a Single Index | Link | LSH | |
| HD-index: pushing the scalability-accuracy boundary for approximate kNN search in high-dimensional spaces | Link | LSH | |
| Falconn++: A Locality-sensitive Filtering Approach for Approximate Nearest Neighbor Search | Link | LSH | |
| Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search | Link | LSH | |
| Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval | Link | LSH | |
| A Revisit of Hashing Algorithms for Approximate Nearest Neighbor Search | Link | Survey | |
Note that some entries may require access rights or a membership to view the full content.