Alibaba unveils new open-source AI embedding models, a field it leads globally
The Chinese tech giant ranks top on Hugging Face’s benchmark for measuring the performance of text-embedding services

Alibaba, owner of the South China Morning Post, ranks third globally in the field of LLMs, according to the 2025 AI Index Report from Stanford University.
The new models, which come in various parameters, “support over 100 languages, including multiple programming languages, and provide robust multilingual, cross-lingual and code retrieval capabilities”, according to Alibaba.
In AI, an embedding model helps computers understand and process text by turning it into numerical representations. Since computers process data solely in numerical form, the embedding process enables them to grasp semantic data and questions more effectively, delivering more tailored results that do not rely solely on keywords.