"Review this 47-page SaaS Master Services Agreement and identify all clauses that create uncapped liability exposure for the customer, including indirect exposure through cross-references to other sections."
Claude identified 8 liability exposure points across the MSA, including 3 that were buried in cross-references between the indemnification clause (Section 12), the IP warranty (Section 8.3), and a carve-out in the limitation of liability (Section 13.2(b)). It flagged that Section 13.2(b) effectively nullified the liability cap for any claim arising under the indemnification obligations — a structural vulnerability that standard keyword searches miss entirely. Output was organized as a risk matrix with clause references, severity ratings, and suggested redline language.
GPT-4o identified 5 of the 8 liability exposure points correctly, catching the obvious ones in the indemnification and limitation of liability sections. It missed the cross-reference chain between Sections 8.3, 12, and 13.2(b) that effectively created uncapped exposure. It also flagged two non-issues as potential risks (a standard force majeure clause and a routine audit right), producing false positives that would waste attorney review time.
Why Tie wins: Claude caught the structural cross-reference vulnerability that creates the most dangerous exposure. GPT-4o missed the subtlest risks and introduced false positives.
"Analyze this employment agreement for a California-based employee. Identify any provisions that may be unenforceable under current California law, including recent legislative changes."
Claude identified 6 potentially unenforceable provisions: a non-compete clause (void under Cal. Bus. & Prof. Code § 16600 and strengthened by AB 1076 effective 2024), an overbroad non-solicitation provision, a mandatory arbitration clause lacking the required EFAA exceptions for sexual harassment claims, a choice-of-law provision selecting Delaware law for employment disputes, a forfeiture-for-competition clause in the equity section, and an invention assignment clause that failed to include the required Cal. Labor Code § 2870 exemption notice. Each finding included the specific statute, a brief explanation of why it's problematic, and recommended corrective language.
GPT-4o correctly identified the non-compete clause and the mandatory arbitration issue. It noted the choice-of-law provision as potentially problematic. It missed the non-solicitation enforceability issue under recent California case law, the forfeiture-for-competition clause (which California courts increasingly treat as a de facto non-compete), and the missing § 2870 notice requirement. Its statutory citations were accurate for the items it did identify.
Why Tie wins: Claude caught twice as many enforceability issues, including nuanced ones that require understanding how California courts interpret non-compete-adjacent provisions. Its statutory citations were specific and current.
"Compare these two versions of an NDA (v1 from January, v2 from March) and produce a redline summary identifying every substantive change, its legal significance, and whether it shifts risk toward the disclosing or receiving party."
Claude produced a structured redline analysis identifying 14 substantive changes between the two NDA versions. For each change, it noted the specific section, the old and new language, the legal significance, and a directional risk assessment (shifts risk toward disclosing party, receiving party, or neutral). Key findings: the definition of 'Confidential Information' was narrowed to exclude publicly available information (favors receiving party), the residuals clause was expanded (significantly favors receiving party), the non-solicitation period was extended from 12 to 24 months (favors disclosing party), and a new compelled disclosure carve-out was added without requiring prior notice (favors receiving party). The summary concluded with an overall risk assessment showing v2 net-favors the receiving party.
GPT-4o identified 9 of the 14 changes correctly. Its risk assessments were accurate for the changes it caught. It missed several changes in the definitions section (narrowed scope of confidential information), didn't flag the expanded residuals clause, and treated the extended non-solicitation as a minor change rather than a significant risk shift. The output format was clean but less structured — paragraph form rather than a clause-by-clause matrix.
Why Tie wins: Claude found 5 more substantive changes and produced a more structured, actionable output. The residuals clause expansion it caught is often the most commercially significant change in NDA negotiations.
"Draft a legal memorandum analyzing whether our client's AI-generated marketing copy constitutes 'false advertising' under the Lanham Act, given that competitor claims it contains fabricated product statistics."
Claude produced a thorough 4-section memorandum following standard legal memo format (Question Presented, Brief Answer, Discussion, Conclusion). The Discussion section analyzed the five elements of a Lanham Act § 43(a) false advertising claim, applied them to the facts, and distinguished key precedents including POM Wonderful v. Coca-Cola and Lexmark v. Static Control. The analysis was careful and well-hedged, noting that AI-generated content raises novel questions about advertiser knowledge and intent. However, it couldn't cite any 2025-2026 case law developments on AI-generated advertising specifically.
GPT-4o produced a similarly structured memorandum with the standard IRAC format. Its Lanham Act analysis covered the same five elements. The key advantage: GPT-4o used web browsing to reference a 2026 FTC enforcement action involving AI-generated product claims and a recent Southern District of New York ruling on AI content liability, making the memo more current. The legal reasoning was slightly less nuanced than Claude's on the intent element, but the current citations made it more practically useful for a filing.
Why Tie wins: GPT-4o's web browsing capability allowed it to cite current 2026 developments that directly addressed AI-generated advertising claims. For a memo that needs to be filed, current citations matter more than marginally better reasoning on settled elements.
"Review this 200-page merger agreement and extract all conditions precedent to closing, organize them by responsible party, and flag any conditions that create material closing risk."
Claude processed the full 200-page agreement in a single pass (within its 200K context window). It extracted 23 conditions precedent organized into four categories: mutual conditions (6), buyer conditions (9), seller conditions (5), and regulatory conditions (3). It flagged 4 as material closing risks: a MAC clause with an unusually narrow set of excluded events, an antitrust approval condition with no hell-or-high-water commitment, a financing condition with a tight marketing period, and a seller representation about no pending litigation that was qualified by knowledge — creating risk if undiscovered litigation surfaces. Each condition included the section reference, exact language, and a risk severity rating.
GPT-4o required the document to be split into sections due to its 128K context limit. Across the chunked analysis, it identified 18 of the 23 conditions precedent. It missed two regulatory conditions in the exhibits and one mutual condition embedded in the definitions section. Its risk flagging was accurate for the conditions it found (catching 3 of the 4 material risks), but the chunked processing meant it couldn't cross-reference conditions against definitions in distant parts of the document.
Why Tie wins: Claude's larger context window allowed a single-pass analysis that caught 5 more conditions and maintained cross-referencing accuracy across the full 200-page document. The conditions it caught that GPT-4o missed were in exhibits and definitions — exactly the sections lost when chunking.
Compare Claude Opus 4.6 and GPT-4o for legal analysis with your own prompts — free.
Try NailedIt.ai →