The post Microsoft Made GPT and Claude Work Together—And the Result Beats Every AI Research Tool Out There appeared on BitcoinEthereumNews.com. In brief MicrosoftThe post Microsoft Made GPT and Claude Work Together—And the Result Beats Every AI Research Tool Out There appeared on BitcoinEthereumNews.com. In brief Microsoft

Microsoft Made GPT and Claude Work Together—And the Result Beats Every AI Research Tool Out There

2026/03/31 05:31
Okuma süresi: 4 dk
Bu içerikle ilgili geri bildirim veya endişeleriniz için lütfen crypto.news@mexc.com üzerinden bizimle iletişime geçin.

In brief

  • Microsoft released two different modes that pair GPT and Claude to increase the quality of AI research.
  • Critique makes the models collaborate, whereas Council makes them work in parallel while a third judge finds the discrepancies.
  • This two-model workflow fixes hallucinations, weak citations, and other problems associated with mono-model AI research.

Deep research AI has been one of the hottest arms races in tech this year. Google announced its research agent for Gemini in December 2024, OpenAI released its own research agent in February 2025, xAI followed suit, Perplexity doubled down, and Anthropic’s Claude built a loyal following among professionals who need detailed, cited answers, introducing its agent in April of last year.

Every company has been trying to convince you that their single AI model is the smartest researcher in the room. Microsoft just said: Why pick one?

The company announced two new features on Monday for Copilot’s Researcher tool—called Critique and Council—that put OpenAI’s GPT and Anthropic’s Claude to work on the same research task in sequence. The result, according to Microsoft’s testing against an industry benchmark, scores higher than every system included in that test, including models from the top AI companies.

“Critique is a new multi model deep research system designed for complex research tasks. It separates generation from evaluation and utilizes a combination of models from Frontier labs, including Anthropic and OpenAI,” Microsoft explains. “One model leads the generation phase, planning the task, iterating through retrieval, and producing an initial draft, while a second model focuses on review and refinement, acting as an expert reviewer before the final report is produced.”

Here’s the basic problem Critique is designed to fix: Every AI research tool today works the same way. You ask a question, one model plans a search, scours sources, writes a report, and hands it back to you. That single model is doing everything with no one checking its work.

This can end up with some hallucinations slipping in, some errors in citations, fake or inaccurate claims, etc.

Critique breaks that workflow in two. GPT handles the first phase—it plans the research, pulls sources, and writes an initial draft. Then Claude steps in as a strict editor, reviewing the report for factual accuracy, citation quality, and whether the answer actually addressed what was asked. Only after that review does the final report reach the user. Microsoft says the roles can eventually run in the opposite direction too, with Claude drafting and GPT critiquing, though for now GPT goes first.

On the DRACO benchmark—a standardized test covering 100 complex research tasks across 10 domains including medicine, law, and technology—Copilot with Critique scored 57.4. points with Anthropic’s Claude Opus 4.6 by itself hitting 42.7. Microsoft’s combined system beats the next best result by nearly 14%.

Image: Microsoft

The biggest gains showed up in breadth of analysis and presentation quality, with factual accuracy also posting a significant improvement.

The second feature, Council, takes a different approach to the same problem. Instead of having one model review the other’s work, Council runs GPT and Claude simultaneously and puts their full reports side by side. A third “judge” model then reads both and writes a summary explaining where the two AIs agreed, where they diverged, and what unique angles each one caught that the other missed. Comparing AI research tools manually has been something users have had to do themselves until now.

In Critique, the models essentially collaborate with each other while in Council the models compete against each other.

Critique is the default experience in Researcher whereas Council requires you to select “Model Council” from the picker to activate the side-by-side mode. Both features are currently available to users enrolled in Microsoft’s Frontier program, the early-access channel for Copilot’s newest capabilities. A Microsoft 365 Copilot license ($30/user/month) is required, but users also need to be enrolled in Frontier to access them.

Image: Microsoft

OpenAI and Microsoft have a multibillion-dollar partnership, but Microsoft’s bet is that no single model stays on top for long, and that the real value is in the orchestration layer that routes tasks to whichever combination works best.

Daily Debrief Newsletter

Start every day with the top news stories right now, plus original features, a podcast, videos and more.

Source: https://decrypt.co/362805/microsoft-gpt-claude-work-together-ai-research

Piyasa Fırsatı
DeepBook Logosu
DeepBook Fiyatı(DEEP)
$0.027282
$0.027282$0.027282
-1.81%
USD
DeepBook (DEEP) Canlı Fiyat Grafiği
Sorumluluk Reddi: Bu sitede yeniden yayınlanan makaleler, halka açık platformlardan alınmıştır ve yalnızca bilgilendirme amaçlıdır. MEXC'nin görüşlerini yansıtmayabilir. Tüm hakları telif sahiplerine aittir. Herhangi bir içeriğin üçüncü taraf haklarını ihlal ettiğini düşünüyorsanız, kaldırılması için lütfen crypto.news@mexc.com ile iletişime geçin. MEXC, içeriğin doğruluğu, eksiksizliği veya güncelliği konusunda hiçbir garanti vermez ve sağlanan bilgilere dayalı olarak alınan herhangi bir eylemden sorumlu değildir. İçerik, finansal, yasal veya diğer profesyonel tavsiye niteliğinde değildir ve MEXC tarafından bir tavsiye veya onay olarak değerlendirilmemelidir.

Ayrıca Şunları da Beğenebilirsiniz

This U.S. politician’s suspicious stock trade just returned over 200% in weeks

This U.S. politician’s suspicious stock trade just returned over 200% in weeks

The post This U.S. politician’s suspicious stock trade just returned over 200% in weeks appeared on BitcoinEthereumNews.com. United States Representative Cloe Fields has seen his stake in Opendoor Technologies (NASDAQ: OPEN) stock return over 200% in just a matter of weeks. According to congressional trade filings, the lawmaker purchased a stake in the online real estate company on July 21, 2025, investing between $1,001 and $15,000. At the time, the stock was trading around $2 and had been largely stagnant for months. Receive Signals on US Congress Members’ Stock Trades Stocks Stay up-to-date on the trading activity of US Congress members. The signal triggers based on updates from the House disclosure reports, notifying you of their latest stock transactions. Enable signal The trade has since paid off, with Opendoor surging to $10, a gain of nearly 220% in under two months. By comparison, the broader S&P 500 index rose less than 5% during the same period. OPEN one-week stock price chart. Source: Finbold Assuming he invested a minimum of $1,001, the purchase would now be worth about $3,200, while a $15,000 stake would have grown to nearly $48,000, generating profits of roughly $2,200 and $33,000, respectively. OPEN’s stock rally Notably, Opendoor’s rally has been fueled by major corporate shifts and market speculation. For instance, in August, the company named former Shopify COO Kaz Nejatian as CEO, while co-founders Keith Rabois and Eric Wu rejoined the board, moves seen as a return to the company’s early innovative spirit.  Outgoing CEO Carrie Wheeler’s resignation and sale of millions in stock reinforced the sense of a new chapter. Beyond leadership changes, Opendoor’s surge has taken on meme-stock characteristics. In this case, retail investors piled in as shares climbed, while short sellers scrambled to cover, pushing prices higher.  However, the stock is still not without challenges, where its iBuying model is untested at scale, margins are thin, and debt tied to…
Paylaş
BitcoinEthereumNews2025/09/18 04:02
DigiByte Price Prediction 2026, 2027 and 2030: Is DGB Ready to See a Pump?

DigiByte Price Prediction 2026, 2027 and 2030: Is DGB Ready to See a Pump?

DigiByte DGB price prediction 2026–2030: $0.004, Arizona reserve bill, DigiDollar testnet, Taproot upgrade. Can DGB pump? Full honest analyst forecast 2026.
Paylaş
Blockchainreporter2026/04/02 05:00
Chris Burniske Forecasts Big Changes Coming to Cryptocurrency Market

Chris Burniske Forecasts Big Changes Coming to Cryptocurrency Market

TLDR Chris Burniske predicts that price flows will start driving crypto market narratives. Burniske foresees underperforming cryptocurrencies gaining more attention. Coinbase predicts growth in Q4 2025 driven by positive macroeconomic factors. Tom Lee suggests Bitcoin and Ethereum could benefit from potential Fed rate cuts. A major shift is looming in the cryptocurrency market, according to [...] The post Chris Burniske Forecasts Big Changes Coming to Cryptocurrency Market appeared first on CoinCentral.
Paylaş
Coincentral2025/09/18 00:17

Trade GOLD, Share 1,000,000 USDT

Trade GOLD, Share 1,000,000 USDTTrade GOLD, Share 1,000,000 USDT

0 fees, up to 1,000x leverage, deep liquidity