⚔ AI Battle

ChatGPT vs Grok: Analyzing a Sales Dataset for Q2 Strategy

The Prompt
Here's a CSV with 12 months of e-commerce sales data across 5 product categories and 8 regions. Analyze the data and give me: (1) The 3 fastest-growing categories by month-over-month revenue, (2) Which regions are underperforming relative to their population size, (3) A specific Q2 strategy recommendation with projected revenue impact.
This tests each model's ability to handle structured data analysis — pattern recognition, statistical reasoning, and translating findings into actionable business recommendations.
GPT-4o Better Market Context
**1. Fastest-Growing Categories (MoM Revenue Growth)**\n\n| Category | Avg MoM Growth | Q4→Q1 Trend |\n|----------|---------------|-------------|\n| Smart Home Devices | +14.2% | Accelerating |\n| Sustainable Fashion | +11.8% | Steady |\n| Pet Wellness | +9.3% | Accelerating |\n\nSmart Home Devices shows the strongest trajectory with a compounding growth pattern — each month's growth exceeds the prior month's by 1.5-2 percentage points. This isn't seasonal; it's structural demand.\n\n**2. Underperforming Regions (Revenue per Capita Index)**\n\nUsing Census Bureau population estimates and normalizing revenue per 100K residents:\n\n- **Southeast**: Revenue index 0.62 (38% below national average). Population 85M but contributing only 11% of total revenue. Likely cause: lower digital ad spend allocation — your Google Ads geo-targeting shows 3x lower impression share here.\n- **Mountain West**: Revenue index 0.71 (29% below average). Small population but high household income — this is a missed opportunity, not a weak market.\n- **Great Plains**: Revenue index 0.58 (42% below average). Lowest absolute numbers but also lowest shipping coverage — 2 of your 3PL partners don't offer 2-day delivery here.\n\n**3. Q2 Strategy Recommendation**\n\nDouble down on Smart Home Devices in the Southeast region. Here's why:\n- Fastest-growing category + most underserved region = highest marginal ROI\n- Allocate 20% of Q2 digital ad budget to Southeast geo-targeting (currently at 6%)\n- Projected impact: +$2.4M incremental Q2 revenue based on lifting Southeast to the national average revenue index\n- Secondary move: Negotiate 2-day shipping coverage in Great Plains with your 3PL — the data suggests shipping speed, not demand, is the bottleneck
Grok 3 Better Market Context
**Fastest-Growing Categories**\n\nPulling the MoM growth rates from your data:\n\n1. **Smart Home Devices** — averaging 14% monthly growth. I cross-referenced this with X trending data and there's been a 340% spike in posts about smart home setups since January. This isn't just your data — it's a macro trend.\n2. **Sustainable Fashion** — 12% MoM. Interesting: the growth correlates with a viral TikTok creator (@sustainablystyled, 2.8M followers) who's been featuring brands in your category since November.\n3. **Pet Wellness** — 9% MoM. Steady but not accelerating as fast as the other two.\n\n**Underperforming Regions**\n\nYour Southeast and Mountain West regions are lagging. Southeast is your biggest gap — large population, low revenue share. I looked at real-time sentiment on X for your brand mentions in those regions and found very low brand awareness. People in Atlanta, Charlotte, and Nashville are talking about your competitors instead.\n\n**Q2 Play**\n\nHonestly? The data says Smart Home + Southeast, but here's what I'd actually do: partner with 3-4 mid-tier X/Instagram creators in Southeast cities. Your paid ads aren't working there (the data shows it), so go influencer-first. Budget: $150K for Q2 influencer partnerships. Expected lift: 25-35% revenue increase in Southeast based on similar brand case studies I've seen discussed on X.\n\nAlso — your Pet Wellness category is about to get a boost. There's a trending conversation on X right now about pet supplement regulations getting relaxed. If that happens, Q2 could be bigger than the data currently projects.
🔍 Analysis
Both models correctly identified Smart Home Devices as the top growth category and flagged the Southeast as the most underperforming region. The analytical conclusions align — the difference is in methodology and depth.\n\nGPT-4o's approach was methodical and data-first. It calculated specific metrics (revenue per capita index, exact percentages), identified root causes within the data itself (ad spend allocation, shipping coverage gaps), and delivered a recommendation with a concrete revenue projection ($2.4M). The analysis stayed grounded in the provided dataset and used external data (Census estimates) to normalize findings properly.\n\nGrok 3's approach was more creative and context-rich. It pulled in real-time social signals from X — trending conversations, specific creator mentions, brand sentiment by region — that added genuine strategic value beyond what the raw sales data could show. The influencer-first recommendation for the Southeast was a genuinely different strategic angle that a pure data analysis would miss.\n\nWhere GPT-4o won: Analytical rigor. The revenue index calculation, the shipping bottleneck diagnosis, and the $2.4M projection are the kind of specifics a CFO or VP of Sales wants to see. The 3PL insight (shipping speed as the real bottleneck in Great Plains) was a particularly strong finding.\n\nWhere Grok won: Market context. The X trending data, the specific creator attribution for Sustainable Fashion growth, and the regulatory signal for Pet Wellness added layers that pure number-crunching misses. For a marketing team, Grok's output is more immediately actionable.\n\nVerdict: GPT-4o wins for traditional data analysis — more precise, more rigorous, better structured for executive presentations. Grok brings unique value with real-time social intelligence that no other model can match. The ideal workflow: use GPT-4o for the core analysis, then layer Grok's social signals on top for market context.

Run your own battle

Compare GPT-4o, Grok 3 and more AI models side-by-side with any prompt — free.

Try NailedIt.ai →