Short Explanation
Convert text into dense vectors for semantic search, matching, and clustering.
Dense semantic embeddings for retrieval, similarity, clustering, and reranking.
Core parameters, trigger timing, and visual before/after demo references.
Convert text into dense vectors for semantic search, matching, and clustering.
Prepare the scene, image, video, sensor stream, prompt, or configuration expected by the original project.
Read the produced visualization, prediction, map, trajectory, mask, grasp pose, or other documented artifact.
A quick-run style example for the documentation page.
Readable controls and the meaning of each returned artifact.
modeltextall-MiniLM-L6-v2Sentence-transformers model identifier.
inputpathPath to text lines or JSON records.
normalizetoggletrueWhether to L2-normalize vectors for cosine search.
embeddingsDense vectors for each input text item.
similarity_matrixOptional pairwise semantic similarity output.
Official resources, deployment steps, academic context, citation, and source-reported benchmark numbers.
python tools/sentence-transformers/run.py --model all-MiniLM-L6-v2 --input tools/sentence-transformers/examples/sentences.txt --out tools/sentence-transformers/runs/embeddings.npy
{
"tool": "sentence-transformers",
"status": "ok",
"results": [
{
"label": "Sentence embedding",
"score": 0.87,
"output": "Vector embeddings / similarity scores"
}
],
"timing": {
"runtime": "The official SBERT table reports 14,200 sentences/sec on V100, with an 80 MB model, 384-dimensional embeddings, and max sequence length 256.",
"device": "documented in source benchmark when available"
},
"artifacts": {
"visualization": "tools/sentence-transformers/runs/visualization.png",
"raw_predictions": "tools/sentence-transformers/runs/predictions.json"
}
}Paper identity and contribution summary.
@misc{sentencetransformers2019,
title={Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks},
author={Nils Reimers and Iryna Gurevych},
year={2019},
note={EMNLP-IJCNLP 2019 / arXiv:1908.10084},
url={https://arxiv.org/abs/1908.10084}
}Only compact, source-reported numbers are shown here.
| Dataset | Metric | Value | Runtime | Source |
|---|---|---|---|---|
| SBERT model table, 14 sentence-embedding datasets | all-MiniLM-L6-v2 sentence performance | 68.06 | 14,200 sentences/sec on V100 | Official SBERT pretrained model table |
| SBERT model table, 6 semantic-search datasets | all-MiniLM-L6-v2 semantic-search performance | 49.54 | 80 MB model, 384 dimensions, max sequence length 256 | Official SBERT pretrained model table |
| SBERT model table | Training scale | 1B+ training pairs | Mean pooling, normalized embeddings | Official SBERT pretrained model table |
Official SBERT pretrained model table, model card link, pretrained models, evaluation scripts, and embedding outputs.
Visual references from the original tool. Click any image to inspect the original size.