Technical News

Mistral is launching a new model of code integration which surpasses Openai and Cohere in real world recovery tasks

Join our daily and weekly newsletters for the latest updates and the exclusive content on AI coverage. Learn more


With the demand for an increased generation of corporate recovery (RAG), the opportunity is ripe for model suppliers to offer their point of view on integration models.

The French AI Company Mistral threw its hat into the ring with codestral integration, its first model of incorporation, which, according to him, surpasses existing integration models on references like Swe-Bench.

The model specializes in code and “works particularly well for the recovery of use cases on real code data”. The model is available for developers for $ 0.15 per million tokens.

The company declared that the integration of codestral “considerably surpasses the fund code funds” as the travel code 3, Cohere Embed V4.0 and the Openai incorporation model, the text incorporated 3.

Codestral integration, which is part of the codestral family of Mistral of coding models, can make interests that transform code and data into digital representations for RAG.

“Codestral integration can produce integrations with different dimensions and details, and the figure below illustrates the compromises between the quality of recovery and storage costs,” said Mistral in a blog article. “The integration of codestral with dimension 256 and int8 precision still works better than any model of our competitors. The dimensions of our interests are controlled by relevance. For any whole target dimension n, you can choose to keep the N dimensions n for a fluid compromise between quality and cost. ”

Mistral tested the model on several landmarks, including Swe-Bench and Github Text2Code. In both cases, the company said that Codestrral Integration has outperformed the main integration models.

SWE BANC

Text2 code

Use case

Mistral said that codestral integration is optimized for “high performance code recovery” and semantic understanding. The company said that the code works better for at least four types of use cases: cloth, searches of semantic code, search for similarity and code analysis.

Integration models generally target cases of use of rags, as they can facilitate the recovery of information faster for tasks or agent processes. Therefore, it is not surprising that the integration of codestral focuses on this.

The model can also carry out a semantic code search, allowing developers to find code extracts using natural language. This use case works well for developer tool platforms, documentation systems and coding co-pilotes. Codestral Embed can also help developers identify duplicated code segments or similar code channels, which can be useful for companies with policies concerning the reused code.

The model supports semantic clustering, which involves the grouping of the code according to its functionality or structure. This use case would help to analyze the benchmarks, categorize and find models in code architecture.

Competition increases in the incorporation space

Mistral was on a momentum with the release of new models and agent tools. He published Mistral Medium 3, an average version of his flagship model of large language (LLM), which currently feeds his platform focused on the company Le Chat Enterprise.

He also announced the API of agents, which allows developers to access the tools to create agents who perform real tasks and orchestrate several agents.

Mistral movements to offer more model options to developers have not gone unnoticed in developer spaces. Some on X note that the moment of Mistral in the release of codestral integration “arrives in the heels of increased competition”.

However, Mistral must prove that the integration of codestral works well and not only in reference tests. Although it competes with more closed models, such as those of Openai and Cohere, the integration of codestral also faces open source of Qodo, including Qodo-Embed-1.5 B.

VentureBeat contacted Mistral on Codestral Embed license options.

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button