Source link : https://tech365.info/qwenlong-l1-solves-long-context-reasoning-problem-that-stumps-present-llms/
Alibaba Group has introduced QwenLong-L1, a new framework that enables large language models (LLMs) to reason over extremely long inputs. This development could unlock a new wave of enterprise applications that require models to understand and draw insights from extensive documents such as detailed corporate filings, lengthy financial statements, or complex legal contracts.
The problem of long-form reasoning for AI
Recent advances in large reasoning models (LRMs), particularly through reinforcement learning (RL), have significantly improved their problem-solving capabilities. Research shows that when trained with RL fine-tuning, LRMs acquire skills similar to human “slow thinking,” where they develop sophisticated strategies to tackle complex tasks.
However, these improvements are primarily seen when models work with relatively short pieces of text, typically around 4,000 tokens. The ability of these models to scale their reasoning to much longer contexts (e.g., 120,000 tokens) remains a major challenge. Such long-form reasoning requires a robust understanding of the entire context and the ability to perform multi-step analysis. “This limitation poses a significant barrier to practical applications requiring interaction with external knowledge, such as deep research, where LRMs must collect and process information from knowledge-intensive environments,” the developers of QwenLong-L1 write in their paper.
The…
—-
Author : tech365
Publish date : 2025-05-31 02:30:00
Copyright for syndicated content belongs to the linked Source.
—-