Perplexity Unveils Post-Training Method for Enhanced Web Search Agent

News Flash 2026-04-23 12:52

Perplexity has disclosed its post-training process for a web search agent, utilizing open-source models Qwen3.5-122B-A10B and Qwen3.5-397B-A17B. The process involves a two-stage approach: supervised fine-tuning (SFT) for establishing deployment behaviors, followed by online policy reinforcement learning (RL) to enhance search accuracy and efficiency. The RL stage employs the GRPO algorithm, using a synthetic multi-hop QA dataset and general dialogue data to maintain instruction adherence and prevent behavioral degradation. The post-trained Qwen3.5-397B-SFT-RL model demonstrates superior performance on search benchmarks, achieving 57.3% accuracy on FRAMES with a single tool call, surpassing GPT-5.4 and Sonnet 4.6. With a moderate budget, its accuracy reaches 73.9% at $0.02 per query, outperforming competitors in both accuracy and cost-efficiency.

Share to:

This content is for informational purposes only and does not constitute investment advice.

Curated Series

SuperEx Popular Science Articles Column

SuperEx Popular Science Articles Column

This collection features informative articles about SuperEx, aiming to simplify complex cryptocurrency concepts for a wider audience. It covers the basics of trading, blockchain technology, and the features of the SuperEx platform. Through easy-to-understand content, it helps users navigate the world of digital assets with confidence and clarity.

Unstaked related news and market dynamics research

Unstaked related news and market dynamics research

Unstaked (UNSD) is a blockchain platform integrating AI agents for automated community engagement and social media interactions. Its native token supports governance, staking, and ecosystem features. This special feature explores Unstaked’s market updates, token dynamics, and platform development.

XRP News and Research

XRP News and Research

This series focuses on XRP, covering the latest news, market dynamics, and in-depth research. Featured analysis includes price trends, regulatory developments, and ecosystem growth, providing a clear overview of XRP's position and potential in the cryptocurrency market.

How do beginners trade options?How does option trading work?

How do beginners trade options?How does option trading work?

This special feature introduces the fundamentals of options trading for beginners, explaining how options work, their main types, and the mechanics behind trading them. It also explores key strategies, potential risks, and practical tips, helping readers build a clear foundation to approach the options market with confidence.

What are the risks of investing in cryptocurrency?

What are the risks of investing in cryptocurrency?

This special feature covers the risks of investing in cryptocurrency, explaining common challenges such as market volatility, security vulnerabilities, regulatory uncertainties, and potential scams. It also provides analysis of risk management strategies and mitigation techniques, helping readers gain a clear understanding of how to navigate the crypto market safely.