CohereLabs/Cohere-embed-multilingual-v3.0
Captured source
source ↗--- tags:
- mteb
model-index:
- name: embed-multilingual-v3.0
results:
- task:
type: Classification dataset: type: mteb/amazon_counterfactual name: MTEB AmazonCounterfactualClassification (en) config: en split: test revision: e8379541af4e31359cca9fbcf4b00f2671dba205 metrics:
- type: accuracy
value: 77.85074626865672
- type: ap
value: 41.53151744002314
- type: f1
value: 71.94656880817726
- task:
type: Classification dataset: type: mteb/amazon_polarity name: MTEB AmazonPolarityClassification config: default split: test revision: e2d317d38cd51312af73b3d32a06d1a08b442046 metrics:
- type: accuracy
value: 95.600375
- type: ap
value: 93.57882128753579
- type: f1
value: 95.59945484944305
- task:
type: Classification dataset: type: mteb/amazon_reviews_multi name: MTEB AmazonReviewsClassification (en) config: en split: test revision: 1399c76144fd37290681b995c656ef9b2e06e26d metrics:
- type: accuracy
value: 49.794
- type: f1
value: 48.740439663130985
- task:
type: Retrieval dataset: type: arguana name: MTEB ArguAna config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 55.105000000000004
- task:
type: Clustering dataset: type: mteb/arxiv-clustering-p2p name: MTEB ArxivClusteringP2P config: default split: test revision: a122ad7f3f0291bf49cc6f4d32aa80929df69d5d metrics:
- type: v_measure
value: 48.15653426568874
- task:
type: Clustering dataset: type: mteb/arxiv-clustering-s2s name: MTEB ArxivClusteringS2S config: default split: test revision: f910caf1a6075f7329cdf8c1a6135696f37dbd53 metrics:
- type: v_measure
value: 40.78876256237919
- task:
type: Reranking dataset: type: mteb/askubuntudupquestions-reranking name: MTEB AskUbuntuDupQuestions config: default split: test revision: 2000358ca161889fa9c082cb41daa8dcfb161a54 metrics:
- type: map
value: 62.12873500780318
- type: mrr
value: 75.87037769863255
- task:
type: STS dataset: type: mteb/biosses-sts name: MTEB BIOSSES config: default split: test revision: d3fb88f8f02e40887cd149695127462bbcf29b4a metrics:
- type: cos_sim_pearson
value: 86.01183720167818
- type: cos_sim_spearman
value: 85.00916590717613
- type: euclidean_pearson
value: 84.072733561361
- type: euclidean_spearman
value: 85.00916590717613
- type: manhattan_pearson
value: 83.89233507343208
- type: manhattan_spearman
value: 84.87482549674115
- task:
type: Classification dataset: type: mteb/banking77 name: MTEB Banking77Classification config: default split: test revision: 0fd18e25b25c072e09e0d92ab615fda904d66300 metrics:
- type: accuracy
value: 86.09415584415584
- type: f1
value: 86.05173549773973
- task:
type: Clustering dataset: type: mteb/biorxiv-clustering-p2p name: MTEB BiorxivClusteringP2P config: default split: test revision: 65b79d1d13f80053f67aca9498d9402c2d9f1f40 metrics:
- type: v_measure
value: 40.49773000165541
- task:
type: Clustering dataset: type: mteb/biorxiv-clustering-s2s name: MTEB BiorxivClusteringS2S config: default split: test revision: 258694dd0231531bc1fd9de6ceb52a0853c6d908 metrics:
- type: v_measure
value: 36.909633073998876
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackAndroidRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 49.481
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackEnglishRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 47.449999999999996
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackGamingRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 59.227
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackGisRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 37.729
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackMathematicaRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 29.673
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackPhysicsRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 44.278
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackProgrammersRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 43.218
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 40.63741666666667
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackStatsRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 33.341
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackTexRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 29.093999999999998
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackUnixRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 40.801
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackWebmastersRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 40.114
- task:
type: Retrieval dataset: type: BeIR/cqadupstack name: MTEB CQADupstackWordpressRetrieval config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 33.243
- task:
type: Retrieval dataset: type: climate-fever name: MTEB ClimateFEVER config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 29.958000000000002
- task:
type: Retrieval dataset: type: dbpedia-entity name: MTEB DBPedia config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 41.004000000000005
- task:
type: Classification dataset: type: mteb/emotion name: MTEB EmotionClassification config: default split: test revision: 4f58c6b202a23cf9a4da393831edf4f9183cad37 metrics:
- type: accuracy
value: 48.150000000000006
- type: f1
value: 43.69803436468346
- task:
type: Retrieval dataset: type: fever name: MTEB FEVER config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 88.532
- task:
type: Retrieval dataset: type: fiqa name: MTEB FiQA2018 config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 44.105
- task:
type: Retrieval dataset: type: hotpotqa name: MTEB HotpotQA config: default split: test revision: None metrics:
- type: ndcg_at_10
value: 70.612
- task:
type: Classification dataset: type: mteb/imdb…
Excerpt shown — open the source for the full document.