Class Lucene90HnswGraphBuilder

java.lang.Object
org.apache.lucene.backward_codecs.lucene90.Lucene90HnswGraphBuilder

public final class Lucene90HnswGraphBuilder extends Object
Builder for HNSW graph. See Lucene90OnHeapHnswGraph for a gloss on the algorithm and the meaning of the hyperparameters.

This class is preserved here only for tests.

  • Field Details

  • Constructor Details

    • Lucene90HnswGraphBuilder

      public Lucene90HnswGraphBuilder(RandomAccessVectorValuesProducer vectors, VectorSimilarityFunction similarityFunction, int maxConn, int beamWidth, long seed) throws IOException
      Reads all the vectors from a VectorValues, builds a graph connecting them by their dense ordinals, using the given hyperparameter settings, and returns the resulting graph.
      Parameters:
      vectors - the vectors whose relations are represented by the graph - must provide a different view over those vectors than the one used to add via addGraphNode.
      maxConn - the number of connections to make when adding a new graph node; roughly speaking the graph fanout.
      beamWidth - the size of the beam search to use when finding nearest neighbors.
      seed - the seed for a random number generator used during graph construction. Provide this to ensure repeatable construction.
      Throws:
      IOException
  • Method Details

    • build

      Reads all the vectors from two copies of a random access VectorValues. Providing two copies enables efficient retrieval without extra data copying, while avoiding collision of the returned values.
      Parameters:
      vectors - the vectors for which to build a nearest neighbors graph. Must be an independet accessor for the vectors
      Throws:
      IOException
    • setInfoStream

      public void setInfoStream(InfoStream infoStream)
      Set info-stream to output debugging information *
    • addGraphNode

      void addGraphNode(float[] value) throws IOException
      Inserts a doc with vector value to the graph
      Throws:
      IOException
    • addDiverseNeighbors

      private void addDiverseNeighbors(int node, NeighborQueue candidates) throws IOException
      Throws:
      IOException
    • selectDiverse

      private void selectDiverse(Lucene90NeighborArray neighbors, Lucene90NeighborArray candidates) throws IOException
      Throws:
      IOException
    • popToScratch

      private void popToScratch(NeighborQueue candidates)
    • diversityCheck

      private boolean diversityCheck(float[] candidate, float score, Lucene90NeighborArray neighbors, RandomAccessVectorValues vectorValues) throws IOException
      Parameters:
      candidate - the vector of a new candidate neighbor of a node n
      score - the score of the new candidate and node n, to be compared with scores of the candidate and n's neighbors
      neighbors - the neighbors selected so far
      vectorValues - source of values used for making comparisons between candidate and existing neighbors
      Returns:
      whether the candidate is diverse given the existing neighbors
      Throws:
      IOException
    • diversityUpdate

      private void diversityUpdate(Lucene90NeighborArray neighbors) throws IOException
      Throws:
      IOException
    • findNonDiverse

      private int findNonDiverse(Lucene90NeighborArray neighbors) throws IOException
      Throws:
      IOException