This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AI Built for Law Outperforms ChatGPT, Claude, and Gemini on Legal Reasoning Benchmark

DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and scored lower on legal reasoning quality.

We had a thesis that purpose-built legal AI produces meaningfully different results. Legal professionals deserve evidence. So we tested ourselves and published our methodology for anyone to replicate.”
— Kara Peterson, Co-Founder and CEO of Descrybe

BOSTON, MA, UNITED STATES, March 5, 2026 /EINPresswire.com/ — When AI gets a legal question wrong, the most dangerous failure isn’t an obvious error. It’s an answer that sounds authoritative: fluent, confident, well-structured, and yet applying the wrong legal standard. The error reads like competent lawyering.

Today, Descrybe launched DescrybeLM — an AI system built specifically for legal reasoning — and published a white paper with benchmark data to show what that difference looks like in practice.

Descrybe ran a controlled benchmark against ChatGPT 5.2, Claude Opus 4.5, and Gemini 3 Pro on 200 multistate bar exam questions. The study measured not just whether each system chose the correct answer, but whether the legal reasoning behind it was sound: Did it identify the right rule? Apply it correctly to the facts? Avoid the traps that produce persuasive but wrong analysis?

“We had a thesis that purpose-built legal AI produces meaningfully different results for legal reasoning tasks. Legal professionals deserve to make tool decisions based on real evidence. So we tested ourselves, published our methodology, and invite anyone to replicate it,” said Kara Peterson, Co-Founder and CEO of Descrybe.

What the benchmark showed

All four systems were tested under standardized, no-external-web conditions using the NCBE MBE Complete Practice Exam (Questions 1–200, no exclusions), producing 800 separate evaluation runs with blinded scoring.

When general-purpose models were wrong, they were confidently wrong. Among 52 incorrect outputs, 49 delivered assertive, well-structured reasoning that did not signal uncertainty — the failure mode that imposes the highest verification burden on practitioners. The dominant patterns were applying the wrong legal standard or misapplying the correct one, while the prose read like competent analysis.

Two models — Claude Opus 4.5 and Gemini 3 Pro — exhibited overconfident tone on correct outputs as well as incorrect ones. DescrybeLM and ChatGPT 5.2 received zero overconfidence flags across all 200 outputs. A system that sounds equally confident whether it is right or wrong gives practitioners no reliable signal from tone alone.

The study also found that cross-checking between general-purpose models is not a reliable substitute for getting the answer right. Across 200 questions, 40 were missed by at least one model, 11 by two or more, and only 1 by all three — meaning errors were largely unpredictable and non-overlapping.

What’s behind the results

DescrybeLM is built on a curated primary-law corpus of more than 100 million structured records, requiring more than 100 billion tokens of preparation.
“Most AI tools are built for general use and adapted for law. DescrybeLM was built differently: from the foundation up, specifically for legal reasoning, on more than 100 million structured records individually cleaned and organized for that purpose. That kind of data work is painstaking and takes years — but it’s the difference between a system that sounds right and one that is right,” said Richard DiBona, Co-Founder and CTO of Descrybe.

Why this matters

The headline problem in legal AI isn’t systems that obviously fail. It’s systems that fail invisibly, confidently, and in a way that reads like competent analysis. In a crowded market, sounding right is easy to mistake for being right. Legal professionals need real evidence to decide which tools to use for which purposes — which is why Descrybe published its methodology and invites independent replication.

“It’s rare to see something that genuinely stops you in your tracks. When I saw DescrybeLM answer all 200 multistate bar exam questions correctly while ChatGPT, Claude, and Gemini each missed double digits — that’s not a marginal difference. That’s a different category of tool,” said Ken Friedman, legal technology pioneer and advisor to Descrybe.

The full white paper, Beyond Confidently Wrong: How Purpose-Built AI Mitigates Legal Reasoning’s Hidden Risk, is available now.

Kara Peterson
Descrybe
+1 617-752-2020
email us here
Visit us on social media:
LinkedIn
YouTube

Descrybe demo

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

Infopro Learning Named to Training Industry’s 2026 Leadership Training Watch List

Infopro Learning Named to Training Industry’s 2026 Leadership Training Watch List

Infopro Learning has been named to the 2026 Leadership Training Watch List by Training Industry. NEW JERSEY, NJ, UNITED

March 9, 2026

Topaz Ventures Announces Successful Exit from Visitt and Expands PropTech Investment Platform

Topaz Ventures Announces Successful Exit from Visitt and Expands PropTech Investment Platform

Topaz Ventures Announces Successful Exit from Visitt Following Series B Financing and Expands Focus on Seed+ and

March 9, 2026

Rezervology and Titl Launch RezeLink AI Title Search and Pre-Exam Automation

Rezervology and Titl Launch RezeLink AI Title Search and Pre-Exam Automation

New AI solution automates title search, abstracting, and pre-examination, delivering structured data directly into TPS

March 9, 2026

EVERGRACE HOME’S RIDGELINE FLUFFY BUBBLE FAUX FUR THROW SELECTED AS A GOOD HOUSEKEEPING 2026 BEDDING AWARD WINNER

EVERGRACE HOME’S RIDGELINE FLUFFY BUBBLE FAUX FUR THROW SELECTED AS A GOOD HOUSEKEEPING 2026 BEDDING AWARD WINNER

NEW YORK, NY, UNITED STATES, March 9, 2026 /EINPresswire.com/ — Evergrace Home announced today that its Ridgeline

March 9, 2026

Disability Advocates Group Launches DAG Cares Initiative to Deepen Community Commitment Across Florida

Disability Advocates Group Launches DAG Cares Initiative to Deepen Community Commitment Across Florida

New community impact program expands firm’s Florida mission beyond its legal advocacy services through sponsorships,

March 9, 2026

Jamaica Encourages Local Investors to Collaborate with Government Through PPP Framework

Jamaica Encourages Local Investors to Collaborate with Government Through PPP Framework

Local entrepreneurs, developers, and financial stakeholders are urged to collaborate with the Jamaican Government

March 9, 2026

Magickitchen.com Expands Complete Meals Menu And Also Pureed Meal Items

Magickitchen.com Expands Complete Meals Menu And Also Pureed Meal Items

New Line of Delicious, Diet-Friendly Frozen Meals Delivered Right to Your Door At MagicKitchen.com, we believe great

March 9, 2026

TESCO Metering Introduces Model 749 Utility Meter Storage Rack for Electric Utility Meter Shops

TESCO Metering Introduces Model 749 Utility Meter Storage Rack for Electric Utility Meter Shops

High-capacity rack helps utilities organize, stage, and manage electric meter inventory while improving safety and

March 9, 2026

Remote Power Conditioner Control Enabled by New IoT AI Hub ‘NI Station V2’ Launching April 1

Remote Power Conditioner Control Enabled by New IoT AI Hub ‘NI Station V2’ Launching April 1

Japan-based Nobest introduces next-generation multi-connectivity IoT device with remote ON/OFF control, multi-carrier

March 9, 2026

Sunstone Digital Tech Drives Business Growth With Advanced Search Engine Optimization Services

Sunstone Digital Tech Drives Business Growth With Advanced Search Engine Optimization Services

Sunstone Digital Tech strengthens its digital marketing leadership by delivering search engine optimization strategies.

March 9, 2026

ESC of the Western Reserve Selects Footsteps2Brilliance® for State Grant to Strengthen Literacy Development

ESC of the Western Reserve Selects Footsteps2Brilliance® for State Grant to Strengthen Literacy Development

The Educational Service Center of the Western Reserve (ESCWR) has selected Footsteps2Brilliance® as a strategic partner

March 9, 2026

Infopro Learning Recognized as a 2026 Training Industry Sales Training and Enablement Watch List Company

Infopro Learning Recognized as a 2026 Training Industry Sales Training and Enablement Watch List Company

Infopro Learning has been named to the 2026 Training Industry Sales Training and Enablement Watch List. NEW JERSEY, NJ,

March 9, 2026

International Association of Top Professionals (IAOTP) Continues to Attract Global Leaders, Influencers, and World Icons

International Association of Top Professionals (IAOTP) Continues to Attract Global Leaders, Influencers, and World Icons

International Association of Top Professionals (IAOTP) Continues to Attract Global Leaders, Influencers, and Cultural

March 9, 2026

Intero Digital Releases Guide to Help Brands Measure Visibility in AI-Powered Search and Audit GEO Footprint

Intero Digital Releases Guide to Help Brands Measure Visibility in AI-Powered Search and Audit GEO Footprint

COLORADO SPRINGS, CO, UNITED STATES, March 9, 2026 /EINPresswire.com/ — Intero Digital, a full-service digital

March 9, 2026

New Research Reveals Loneliness Is the Strongest Predictor of Mental Health Distress in Myasthenia Gravis

New Research Reveals Loneliness Is the Strongest Predictor of Mental Health Distress in Myasthenia Gravis

A Bionews survey of 311 people living with the disease finds that three emotions explain 63% of all variation in mental

March 9, 2026

Infopro Learning Life Sciences Revenue Soars, Expands Footprint with LTEN Partnership

Infopro Learning Life Sciences Revenue Soars, Expands Footprint with LTEN Partnership

Infopro Learning accelerates Life Sciences growth with strong revenue gains, major enterprise client wins, & the

March 9, 2026

Expo 2031 Unveils USA Pavilion Vision, Names BRC Imagination Arts Partner, Appoints Shanna Woodbury Executive Director

Expo 2031 Unveils USA Pavilion Vision, Names BRC Imagination Arts Partner, Appoints Shanna Woodbury Executive Director

SYDNEY, Mar 9 (AP) Expo 2031 organizers unveiled the USA Pavilion vision and leadership for the first A1 International

March 9, 2026

San Francisco Unicorns Extend Major Partner Status with Qualys through 2027

San Francisco Unicorns Extend Major Partner Status with Qualys through 2027

Cloud-based IT, security, and compliance solution provider Qualys signs contract for front-of-shirt sponsorship, renews

March 9, 2026

MSI² Recognizes the Distinguished Career of its Senior Fellow, CDR José Adán Gutiérrez, USN (Ret.)

MSI² Recognizes the Distinguished Career of its Senior Fellow, CDR José Adán Gutiérrez, USN (Ret.)

His career spans more than three decades in U.S. naval intelligence, defense cooperation, and security strategy His

March 9, 2026

Yuno Appoints Former Mastercard Executive Mauricio Schwartzmann as Chief Banking and Financial Institutions Officer

Yuno Appoints Former Mastercard Executive Mauricio Schwartzmann as Chief Banking and Financial Institutions Officer

Payments industry executive to lead global banking partnerships as Yuno expands infrastructure for agentic commerce NEW

March 9, 2026

BCYW Foundation Champions Targeted Awareness, Research, and Early Detection on International Women’s Day 2026

BCYW Foundation Champions Targeted Awareness, Research, and Early Detection on International Women’s Day 2026

Empowering Tomorrow Through the Strength of Awareness DENVER, CO, UNITED STATES, March 9, 2026 /EINPresswire.com/ — On

March 9, 2026

Nijigen no Mori’s NARUTO & BORUTO Shinobi-Zato Announces ‘Shinobi-Zato 7th Anniversary Event’ Vol. 3

Nijigen no Mori’s NARUTO & BORUTO Shinobi-Zato Announces ‘Shinobi-Zato 7th Anniversary Event’ Vol. 3

Vol. 3: "Shinobi-Zato Official X 70,000 Followers Challenge" to Be Held AWAJI, JAPAN, March 9, 2026 /EINPresswire.com/

March 9, 2026

Michelle MaliZaki Releases ‘Nap Time!’ for National Napping Day 2026

Michelle MaliZaki Releases ‘Nap Time!’ for National Napping Day 2026

Japanese American comedian and musical artist celebrates National Napping Day with the release of “Nap Time!” I

March 9, 2026

Kent State School of Fashion to Induct Fern Mallis Into Fashion Hall of Fame

Kent State School of Fashion to Induct Fern Mallis Into Fashion Hall of Fame

‘Godmother of Fashion Week’ to be honored for transforming New York Fashion Week into a global institution Fern Mallis

March 9, 2026

PrepScholar Launches AI Learning Assistant for SAT Prep

PrepScholar Launches AI Learning Assistant for SAT Prep

CAMBRIDGE, MA, UNITED STATES, March 9, 2026 /EINPresswire.com/ — PrepScholar, a leading test prep and college prep

March 9, 2026

Renew Financial Sponsors Broward County Water Matters Day for the Third Consecutive Year

Renew Financial Sponsors Broward County Water Matters Day for the Third Consecutive Year

FORT LAUDERDALE, FL, UNITED STATES, March 9, 2026 /EINPresswire.com/ — Renew Financial, a leading provider and pioneer

March 9, 2026

Vine to Bar Grows National Footprint with Publix Super Markets to bring gut-friendly chocolate to thousands of shoppers

Vine to Bar Grows National Footprint with Publix Super Markets to bring gut-friendly chocolate to thousands of shoppers

World-Renowned Celebrity Chef Cat Cora Celebrates Distribution and Partners at Natural Products Expo West I admire

March 9, 2026

National Supply Chain Day® Returns April 29, 2026 | Celebrating the People and Stories Powering the Global Supply Chain

National Supply Chain Day® Returns April 29, 2026 | Celebrating the People and Stories Powering the Global Supply Chain

National Supply Chain Day returns April 29 with livestream registration open, official events nationwide, keynote Billy

March 9, 2026

AAIS Announces The Berwyn Group as Newest Partner, Expanding Data Intelligence Solutions for Member Carriers

AAIS Announces The Berwyn Group as Newest Partner, Expanding Data Intelligence Solutions for Member Carriers

Collaboration delivers advanced death audit and location intelligence tools that strengthen risk management and improve

March 9, 2026

Pervaziv AI Announces Cortex 2.5 to Advance Enterprise AI, Developer Tools and Cybersecurity

Pervaziv AI Announces Cortex 2.5 to Advance Enterprise AI, Developer Tools and Cybersecurity

Reimagine how developers approach tasks in an AI native workplace. Cortex 2.5 immensely expands its capabilities to

March 9, 2026

EnergySmart Institute releases RESNET Sampling Standard Course for ENERGY STAR Multifamily and HERS Index Generation

EnergySmart Institute releases RESNET Sampling Standard Course for ENERGY STAR Multifamily and HERS Index Generation

EnergySmart Institute releases RESNET Sampling Standard Course for ENERGY STAR Multifamily and HERS Index Generation

March 9, 2026

The Ridges Sanctuary Announces Festival of Nature on the Door Peninsula Featuring 65+ Field Trips

The Ridges Sanctuary Announces Festival of Nature on the Door Peninsula Featuring 65+ Field Trips

24th Annual Door County Festival of Nature Returns Memorial Day Weekend, May 21–24, 2026 We're excited to explore

March 9, 2026

Baltimore Annuity Contracting 2026 Expansion Through FMO BenaVest

Baltimore Annuity Contracting 2026 Expansion Through FMO BenaVest

BenaVest expands Baltimore annuity contracting for 2026, helping agents offer retirement income and asset protection

March 9, 2026

CrossPlans Celebrates 20 Years of Serving Retirement Plan Sponsors

CrossPlans Celebrates 20 Years of Serving Retirement Plan Sponsors

By helping employers, financial advisors & CPA’s deliver high-quality retirement plans, we support meaningful

March 9, 2026

Award-Winning Cicero Personal Injury Lawyer Announces Free Case Evaluations, Spanish-Language Services

Award-Winning Cicero Personal Injury Lawyer Announces Free Case Evaluations, Spanish-Language Services

Trial Lawyers College Graduate Serves Working Families in Predominantly Hispanic Suburb with No-Cost Legal

March 9, 2026

Elite Sports Slashes Wholesale BJJ Gi Prices Starting at $25

Elite Sports Slashes Wholesale BJJ Gi Prices Starting at $25

The Elite Sports BJJ Wholesale Program now offers plain BJJ gis, starting at just $25, with no minimum order and

March 9, 2026

Allen TX Commercial Irrigation Mandate Highlights System Efficiency Gaps Across DFW

Allen TX Commercial Irrigation Mandate Highlights System Efficiency Gaps Across DFW

Allen TX auditor finds most commercial systems run at 40-55% efficiency. Dedicated audit service now available starting

March 9, 2026

CYPHER Learning celebrates Customer of the Year winners for 2025

CYPHER Learning celebrates Customer of the Year winners for 2025

CYPHER Learning announces this year’s award-winning customers transforming learning and producing measurable business

March 9, 2026

Craters & Freighters of Portland Announces New Ownership

Craters & Freighters of Portland Announces New Ownership

New ownership reinforces custom crating, engineered packaging, and specialty shipping services for high-value and

March 9, 2026

Craters & Freighters Announces New Ownership in Jacksonville, Florida

Craters & Freighters Announces New Ownership in Jacksonville, Florida

New leadership strengthens custom crating, engineered packaging, and specialty shipping services for high-value assets

March 9, 2026