Watch Hilary try Codex live

A recording from Hilary Gridley's live video

I finally tried Codex today and figured it might be fun to do my first Substack Live so you could see my initial reactions. Overall, it did feel like a step up from Claude Code.

I wanted to test a few specific things:

  1. Does it have good judgment? Does it build on my work to make it better than what I would get if I did everything myself?

  2. Is it good at being delegated to? If I give it an underspecified task, can it figure out how to do it well? Does it “manage up” to me in a helpful way?

  3. Does the UX for creating and managing agents/skills/automations/whatever feel different/better than Claude Code?

To do this, I attempted a few tasks:

  1. Update Couch to 5K for AI to have a Codex path in addition to a Claude Code path

  2. Set up an invoice tracker + agent that creates invoices and manages client status for me

  3. Create a skill that helps me think more ambitiously

I was very impressed, and I understand why everyone at the bleeding edge is using Codex over Claude Code now.

Thanks to everyone who joined live! I had a lot of fun and hope to do more of these. What would you be interested in seeing?
