10 samples expanded to 242 languages, Adaption Labs aims to address AI multilingual shortcomings at the data level

robot
Abstract generation in progress

ME News Report, April 15 (UTC+8). According to Beating Monitoring, AI data platform Adaption Labs has released a new feature for Adaptive Data called “Expand Your World.” Starting from at least 10 samples in a single language, it can generate up to 2,420 high-quality training samples covering 242 languages and regional variants, without requiring any additional annotation process or data pipeline. This feature is now available to all Adaptive Data users.

Multilingual coverage is one of the main shortcomings of AI training data. Most datasets focus on a small number of high-resource languages, and models’ ability to handle low-resource languages and regional dialects is significantly weaker, making it difficult for later fine-tuning to fully make up for the gap.

Adaption Labs’ approach is to bring multilingual coverage forward to the data layer, solving distribution bias at the stage of generating training data.

Adaption Labs was co-founded by former Cohere Vice President of Research Sara Hooker and former Google AI infrastructure engineer Sudip Roy. This February, the company raised a $50 million seed round led by Emergence Capital, valuing the company at $1 billion.

The company’s core bet is to replace brute-force scaling with an efficient adaptive system, enabling models to continuously learn and evolve.

(Source: BlockBeats)

View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pin