The "To Keep Up" Wiki

A collection of information we find useful

This page last changed 2025.06.07 11:30

Summary of Washington Post 6/4/2025 Article

Information summarized from The Washington Post article

The Washington Post “challenged AI helpers to decode legal contracts, simplify medical research, speed-read a novel, and make sense of Trump speeches.” It tested ChatGPT, Claude, Copilot, Meta AI, and Gemini. Responses varied from good to bad. Category scores are out of 10.

Literature

  • [7.8] ChatGPT. Best summary.  Failed to address slavery and the Civil War (as did most AI bots).
  • [7.3] Claude. Got facts right.
  • [4.3] Meta AI.
  • [3.5] Copilot.
  • [2.3] Gemini. Inaccurate, misleading, and sloppy. Like when Costanza watches the “Breakfast at Tiffany's” movie instead of reading the book.

Law

Understanding two common legal contracts.

  • [6.9] Claude. Most consistently decent answers and did well suggesting changes to their test rental agreement.
  • [6.1] Gemini
  • [5.4] Copilot
  • [5.3] ChatGPT. Tried to reduce complex parts of the contracts to one-line summaries and missed important points (key clauses).
  • [2.6] Meta AI. Tried to reduce complex parts of the contracts to one-line summaries. Skipped several sections and important points.

Health Science

Analyzing scientific research.

  • [7.7] Claude. Good summary of a paper on long Covid; scored low on another paper when accounting for racial differences.
  • [7.2] ChatGPT
  • [7.0] Copilot
  • [6.5] Gemini. Left out key descriptions of the research on Parkinson's disease and why it mattered.
  • [6.0] Meta AI

Politics

Analyzing Trump's speeches.

  • [7.2] ChatGPT. Impressive responses to half of the questions posed to it; accurately fact-checked Trump's claims about winning the 2020 election.
  • [6.2] Claude
  • [5.2] Meta AI. Said Trump never gave a number for jobs returning to Michigan, and highlighted what Trump said about auto jobs.
  • [5.0] Gemini
  • [3.7] Copilot. Incorrect on the number of jobs returning to Michigan. Didn't capture the charged nature of Trump's speech.

Overall Winner

According to The Washington Post (overall scores appear to be out of 100, not 10):

  • [69.9] Claude. The only model that never hallucinated.
  • [68.4] ChatGPT
  • [49.7] Gemini
  • [49.0] Copilot
  • [45.0] Meta AI
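
The article summary does not say how the overall scores were calculated. As a rough sanity check only, the sketch below assumes (this is our assumption, not the Post's stated method) that an overall score is roughly the mean of the four category scores listed on this page, rescaled to 100. The names `category_scores`, `published_overall`, and `rescaled_mean` are ours and are purely illustrative.

```python
# Category scores (out of 10), copied from the lists on this page.
# Order within each list: Literature, Law, Health Science, Politics.
category_scores = {
    "Claude":  [7.3, 6.9, 7.7, 6.2],
    "ChatGPT": [7.8, 5.3, 7.2, 7.2],
    "Gemini":  [2.3, 6.1, 6.5, 5.0],
    "Copilot": [3.5, 5.4, 7.0, 3.7],
    "Meta AI": [4.3, 2.6, 6.0, 5.2],
}

# Overall scores as published in the "Overall Winner" list.
published_overall = {
    "Claude": 69.9,
    "ChatGPT": 68.4,
    "Gemini": 49.7,
    "Copilot": 49.0,
    "Meta AI": 45.0,
}


def rescaled_mean(model: str) -> float:
    """Mean of the four category scores, rescaled from 10 to 100.

    ASSUMPTION: a plain unweighted mean. The Post's real method may differ.
    """
    values = category_scores[model]
    return round(10 * sum(values) / len(values), 2)


# Rank models by the assumed mean, highest first.
ranking = sorted(category_scores, key=rescaled_mean, reverse=True)

for model in ranking:
    estimate = rescaled_mean(model)
    gap = round(estimate - published_overall[model], 2)
    print(f"{model:8s} estimate={estimate:6.2f} "
          f"published={published_overall[model]:5.1f} gap={gap:+.2f}")
```

Under this assumption the ordering matches the published ranking, and each rescaled mean lands within half a point of the published overall score. The small gaps suggest the Post used unrounded scores or a somewhat different method.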
washpost20250604.txt · Last modified: by Steve Isenberg