Embeddings & Qdrant

Embedding generation

  • Embeddings are generated with the configured embedding model and provider. Text is chunked in a worker when one is available, with a fallback to main-thread chunking otherwise.
  • Text is sanitized before embedding to strip control characters.
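
As a rough illustration of the sanitize-then-chunk step, here is a minimal Python sketch. It is not the app's implementation: the function names, the character-based chunking, and the max_chars and overlap defaults are all assumptions chosen for the example, and the worker/main-thread split is not modeled.

```python
from __future__ import annotations

import re

# Control characters to drop. Tab, newline, and carriage return are kept
# (an assumption; the app may treat these differently).
_CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\x7f]")


def sanitize_text(text: str) -> str:
    """Remove control characters while keeping tabs and line breaks."""
    return _CONTROL_CHARS.sub("", text)


def chunk_text(text: str, max_chars: int = 500, overlap: int = 50) -> list[str]:
    """Split sanitized text into overlapping, fixed-size character chunks.

    Hypothetical helper: sizes are in characters, not tokens.
    """
    if max_chars <= 0 or not 0 <= overlap < max_chars:
        raise ValueError("need max_chars > 0 and 0 <= overlap < max_chars")
    clean = sanitize_text(text)
    step = max_chars - overlap
    chunks: list[str] = []
    start = 0
    while start < len(clean):
        chunks.append(clean[start:start + max_chars])
        # Stop once the current chunk reaches the end of the text.
        if start + max_chars >= len(clean):
            break
        start += step
    return chunks
```

Each chunk would then be sent to the embedding provider. A token-based splitter may suit some models better than character counts.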

Sync to Qdrant

  • Provide the Qdrant host and, optionally, an apiKey. Make sure the collection's vector size matches the embedding dimensions.
  • The app can hint at or update the vector size. Monitor sync status in the Embeddings view.
  • Sampling: load embeddings from Qdrant for visualization, paginating when the collection is large.
  • Recommendation: pair generation and judge workflows with Qdrant retrieval; the retrieved context and similarity checks can improve the quality of synthetic data.
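
The paginated sampling mentioned above can be sketched as a small generator. This is an illustration under assumptions, not the app's code: iter_points and fetch_page are hypothetical names, and fetch_page stands in for whatever call returns one page of points plus the offset of the next page (the Python qdrant-client's scroll method returns a pair of that shape). The in-memory fake_fetch exists only to make the example runnable without a server.

```python
from typing import Any, Callable, Iterator, List, Optional, Tuple

# One page of results: (points, next_offset). next_offset is None on the last page.
Page = Tuple[List[Any], Optional[Any]]


def iter_points(
    fetch_page: Callable[[Optional[Any], int], Page],
    page_size: int = 256,
    max_points: Optional[int] = None,
) -> Iterator[Any]:
    """Yield points page by page, stopping early once max_points is reached."""
    offset: Optional[Any] = None
    yielded = 0
    while True:
        points, offset = fetch_page(offset, page_size)
        for point in points:
            if max_points is not None and yielded >= max_points:
                return
            yield point
            yielded += 1
        # Stop when there is no next page or the sample is already full.
        if offset is None or (max_points is not None and yielded >= max_points):
            return


# In-memory stand-in for a Qdrant collection, for demonstration only.
_DATA = list(range(10))


def fake_fetch(offset: Optional[int], limit: int) -> Page:
    start = offset or 0
    end = start + limit
    return _DATA[start:end], (end if end < len(_DATA) else None)


# With qdrant-client, fetch_page could be wired roughly like this
# ("docs" is a hypothetical collection name):
#   lambda offset, limit: client.scroll(
#       collection_name="docs", offset=offset, limit=limit, with_vectors=True)
```

Capping the sample with max_points keeps a visualization responsive on large collections, since later pages are never requested.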

Dimension mismatch tips

  • A mismatch error means the collection's vector size differs from the embedding model's output dimension.
  • Fix it by recreating or updating the collection, or by switching to an embedding model whose output dimension matches.
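
A pre-flight check along these lines can surface a mismatch before any sync is attempted. This is a minimal sketch with a hypothetical helper name; how the collection's vector size is obtained is left to the caller.

```python
from typing import Optional, Sequence


def dimension_mismatch(collection_size: int, embedding: Sequence[float]) -> Optional[str]:
    """Return a description of the mismatch, or None if the sizes agree.

    collection_size is the vector size the collection was created with.
    With qdrant-client and a single unnamed vector config, it is typically
    available as client.get_collection(name).config.params.vectors.size.
    """
    actual = len(embedding)
    if actual == collection_size:
        return None
    return (
        f"collection expects vectors of size {collection_size}, "
        f"but the embedding model produced {actual} dimensions"
    )
```

For example, dimension_mismatch(4, [0.1, 0.2, 0.3]) returns a message naming both sizes, which makes it clear whether to recreate the collection or switch models.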