Ras Mic's best point in this long Calum Johnson Show walkthrough is not that everyone should copy his exact sponsor-report agent. It is that most people make agents too dramatic before they make them useful.
The productive path is much smaller: pick one repeated task, connect only the tools it needs, walk the model through one successful run, turn that run into a skill, and only then put it on a schedule.
Source Note
Credit for the episode goes to The Calum Johnson Show and guest Ras Mic. The public podcast listing describes Ras Mic as a full-stack engineer who teaches AI agents to beginners and frames the episode around using Codex in business, content, and life.
Product facts about Codex, skills, plugins, and automations are checked against OpenAI's Codex developer documentation. Claude Cowork and Claude Skills facts are checked against Anthropic's official product and platform pages. Ras Mic's preference for Codex over Claude Cowork in this episode is treated as commentary from the recording, not a timeless product verdict.
Link Map
| Resource | Use it for | Builder note |
|---|---|---|
| YouTube episode | Full Calum Johnson and Ras Mic conversation. | Main commentary source for the workflow examples. |
| Spotify episode listing | Public episode metadata and description. | Useful independent listing for title, guest, and timing. |
| Ras Mic on X | Guest credit and follow-up posts. | Follow the operator behind the workflow demo. |
| Calum Johnson on X | Host credit. | Follow the show source. |
| Codex app | Official Codex app surface and feature list. | Source for app capabilities, not the video transcript. |
| Codex Agent Skills | Reusable workflow instructions. | Best place to turn a successful run into repeatable context. |
| Codex plugins | Skills, app integrations, and MCP servers. | Use plugins when a workflow needs tool access. |
| Codex automations | Scheduled or recurring Codex work. | Schedule only after the manual run works. |
| Composio MCP with Codex | External app/tool access through MCP. | Useful when the native plugin directory does not cover a tool. |
| Claude Cowork | Anthropic's agent for knowledge work. | Relevant comparison point for non-technical workflows. |
| Claude Agent Skills | Anthropic's official skills docs. | Confirms the broader pattern: workflows become reusable skills. |
| Wispr Flow | Voice-to-text for long prompts. | Helpful when a workflow needs more context than you want to type. |
The Core Idea
The episode starts with a useful correction: bad agent outputs usually come from treating the model like a person who shares your assumptions. It does not. It can sound conversational, but it still needs explicit inputs, examples, constraints, and feedback.
Ras Mic's framing is simple: agents are smart enough to do useful work, but not wise enough to infer your whole operating process from a one-line prompt. The workflow has to carry the taste, rules, sequence, and success criteria.
That matters for business and content because the tasks worth automating are usually not glamorous. They are sponsor reports, inbox checks, weekly analytics, receipt cleanup, meeting summaries, social-performance briefs, support-question reports, or recurring research packets.
Why Your First Agent Probably Failed
The most common failure pattern is over-delegation. Someone says "make me a sponsor report" or "help with content" and expects the agent to know what a good report looks like, which metrics matter, which tools contain the data, what tone to use, and where the final output should go.
A better first version looks much more mechanical:
- Name the repeated task.
- Name the inputs: email, YouTube URL, analytics dashboard, receipts, calendar, meeting transcript, spreadsheet, or folder.
- Name the output: HTML report, spreadsheet, summary, email draft, checklist, file folder, or dashboard.
- Name the steps in order.
- Name what the agent must not do without approval.
- Run once manually, review the output, and correct it until it is genuinely useful.
This is why the episode's sponsor-report example works. It starts with a real task Ras Mic already does: pull YouTube views, find sponsor link clicks in Dub, calculate CTR, and render a clean report. That task has inputs, a formula, a deliverable, and a clear buyer: the sponsor.
Codex vs Claude Cowork
Ras Mic says he initially thought Claude Cowork was the easiest starting point for beginners, then shifted toward Codex after testing recent updates and rate-limit value. I would not turn that into a universal ranking. These tools are moving fast, and cost limits can change.
The more useful distinction is this:
| Tool | Best use | Watch out for |
|---|---|---|
| Codex | Agent work that mixes files, code, browser, plugins, skills, reports, app building, and scheduled routines. | It can feel technical until you learn the app, plugins, and permission model. |
| Claude Cowork | Non-technical knowledge work across desktop files, local documents, research, extraction, and deliverables. | Outputs still need coaching, examples, and review before you trust repeated runs. |
| Composio or MCP layer | Connecting agents to tools that the native plugin directory does not cover. | Every extra integration increases permission and audit risk. |
OpenAI's Codex docs support the architecture Ras Mic demonstrates: plugins can bundle skills, app integrations, and MCP servers; automations can use the same skills and plugins; and skills are the right place to store repeatable instructions. Anthropic's Claude Cowork page points in the same direction for knowledge work: outcome-based delegation still needs human oversight.
The Workflow Stack
The practical stack from the episode is not complicated:
- Codex as the work surface. Use it as the place where the task is described, run, reviewed, corrected, and eventually stored.
- Plugins for native app access. Start with the official plugin directory where possible.
- Composio for broader tool access. Use it as a tool router when the app you need is outside native coverage.
- Voice dictation for context. Ras Mic uses Wispr Flow-style dictation because longer prompts are easier to speak than type.
- Skills for repeatability. Only create the skill after a successful run.
- Automations for cadence. Schedule the repeat only after the output is stable.
This stack matters because it separates three jobs people often blend together. Connectors give access. Skills define how work should be done. Automations decide when it should run.
What To Automate First
The best first workflow is boring, repeated, and reviewable. Ras Mic gives examples from creator work: sponsor triage, sponsor reports, and missed email opportunities. The same pattern works for many small businesses.
| Workflow | Inputs | Output | Risk level |
|---|---|---|---|
| Sponsor or partnership triage | Email inbox, company website, funding/news search, notes | Ranked list of worth-replying opportunities | Medium: keep reply drafts human-approved |
| Content performance report | YouTube analytics, link tracker, publish calendar | HTML or PDF report with views, clicks, CTR, and caveats | Low: report-only if it does not send automatically |
| Weekly inbox rescue | Unread emails, starred threads, calendar, CRM | Missed-opportunity brief and draft replies | Medium: no sending without approval |
| Receipt and expense cleanup | Statements, receipts, rules, categories | Spreadsheet or review queue | Medium: finance needs review |
| Meeting brief | Transcript, notes, project docs | Decisions, action items, owner list, follow-up drafts | Low to medium depending on private data |
| Saved-post research sweep | X bookmarks, Instagram saves, newsletters, notes | Reusable ideas, hooks, and next actions | Low if it only summarizes |
If you are stuck, use this prompt:
I want to find one repeated business or content workflow worth turning into an AI agent skill.
Interview me first. Ask about:
- tasks I repeat every week
- tools I already use
- where the input data lives
- what output would actually save time
- what mistakes would be expensive
- what should require human approval
Then propose three candidate workflows ranked by time saved, clarity of inputs, reviewability, and risk.
Turn Wins Into Skills
One of the best distinctions in the episode is this: do not write a skill before the workflow works. Get one good run first. Then tell the agent to turn that successful run into a skill.
That matters because the thread now contains the examples, failed attempts, corrections, preferred output, and hidden assumptions that made the run work. A good skill is not a fantasy process. It is a working process compressed into reusable instructions.
The practical rule:
- If the first output is bad, correct it.
- If the connector fails, show the error and ask for a fix.
- If the style is wrong, give a concrete before/after.
- If the structure is wrong, provide the exact structure.
- When the output is good, say: "This run worked. Turn this into a skill I can reuse."
That is also aligned with OpenAI's own Codex best-practice direction: repeatable work should be packaged into skills instead of long prompts once the workflow is stable.
Permissions And Safety
The agent gets more useful as it gains access to tools, but every connector is also a permission decision. Email, calendar, drive, social accounts, analytics, payment tools, and CRMs are not harmless surfaces.
A safe starter path looks like this:
- Read-only first. Let the agent inspect and summarize before it edits, deletes, sends, or posts.
- Separate identity where possible. Agent-specific inboxes or accounts are easier to audit than your personal inbox.
- Approval before external action. Draft emails, reports, and posts first. Send later.
- Logs and artifacts. Keep the generated report, JSON payload, source list, and caveats.
- Limited scope. Do not connect every tool because you can. Connect what the workflow requires.
- Review the first three runs. If it cannot pass a human review three times, it is not ready for autonomy.
This is where small teams should be boring on purpose. The agent can help with high-leverage work, but the business still owns the permissions, the data, and the consequences.
Weekend Starter Plan
Here is the simple version I would actually run this weekend:
- Pick one workflow. Choose one repeated task you already do weekly.
- Write the current manual process. Inputs, steps, output, review criteria, and risks.
- Connect only the required tools. Start with native plugins; use Composio or another MCP route only when needed.
- Run it once in chat. Do not schedule anything yet.
- Correct the output. Tell the agent exactly what failed and what a good version looks like.
- Save the successful run as a skill. Name the inputs and expected output clearly.
- Run the skill again on a fresh example. If it works, then schedule it.
- Keep approvals on. No sending, deleting, posting, or money movement until the workflow earns trust.
The win is not "I have an agent." The win is "every Thursday, I get a clean report I used to dread making, and I can review it in two minutes."
Sources
- The Calum Johnson Show: AI Insider with Ras Mic
- Spotify episode listing
- Ras Mic on X
- Calum Johnson on X
- OpenAI Codex app documentation
- OpenAI Codex Agent Skills
- OpenAI Codex plugins
- OpenAI Codex automations
- OpenAI: save workflows as Codex skills
- OpenAI: manage your inbox with Codex
- Composio MCP integration with Codex
- Anthropic Claude Cowork
- Anthropic Agent Skills documentation
- Anthropic knowledge-work plugins
- Wispr Flow
- AgentMail
- Dub
- Cal.com
- Linear