Author’s articles

June 30, 2025
K-means for building vector indices
We discuss optimizing k-means to efficiently create high-quality vector indices.

May 31, 2025
Robust Optimized Scalar Quantization
We discuss a sparse preconditioner that, when applied to vectors, makes quantization performance more stable across data distributions.

April 7, 2025
Speeding up merging of HNSW graphs
Explore the work we’ve been doing to reduce the overhead of building multiple HNSW graphs, in particular the cost of merging them.

April 7, 2025
Speeding up HNSW graph merge
We describe a new strategy we developed that reduces HNSW merge time by up to 70% whilst maintaining similar graph quality.

December 23, 2024
Improve search results by calibrating model scoring in Elasticsearch
Learn how to leverage annotated data to calibrate semantic model scoring for better search results.

December 19, 2024
Understanding optimized scalar quantization
In this post, we explain a new form of scalar quantization we've developed at Elastic that achieves state-of-the-art accuracy for binary quantization.

December 5, 2024
Exploring depth in a 'retrieve-and-rerank' pipeline
Select an optimal re-ranking depth for your model and dataset.

November 25, 2024
Introducing Elastic Rerank: Elastic's new semantic re-ranker model
Learn how Elastic's new re-ranker model was trained and how it performs.

October 29, 2024
What is semantic reranking and how to use it?
An introduction to the concept of semantic reranking. Learn about the trade-offs of using semantic reranking in search and RAG pipelines.