Summary  
This chapter explains how input framing in interactive AI systems can create feedback loops that amplify both user and model biases, and introduces techniques like steelmanning and reframing to expose and mitigate these biases.  

General domain of usage  
AI-assisted decision making

Open with a scenario: two people ask the same AI question but frame it differently — one positively, one skeptically. Show that they get different answers. Explain that AI outputs are shaped by how questions are framed, because the model is sensitive to the direction implied in the prompt. Then flip it: we're also shaped by our own biases — confirmation bias, automation bias, availability bias. The combination is a feedback loop. Close with a concrete technique: the "steelman the opposite" prompt, where you explicitly ask AI to make the strongest case for the view you're not considering. "The goal isn't neutrality. It's exposure to the strongest version of the argument you're not already considering."


Daniel is evaluating whether to switch his company's cloud provider. He asks AI: "What are the advantages of staying with our current provider?" He gets a thorough, well-structured list of advantages. Feeling validated, he decides to stay.
 
What he didn't ask: "What are the strongest arguments for switching?" What he got was exactly what he looked for — because he only looked in one direction.
 
This is the bias loop, and it's one of the most underappreciated risks of working with AI.
 


## Your Biases in the Loop
 
**Confirmation bias** — the tendency to seek out and favor information that confirms what you already believe — is the most studied cognitive bias in psychology. Research from 2025 shows it's significantly amplified when interacting with AI, because conversational AI makes it effortless to steer responses toward preferred narratives. You don't have to look for confirming information. You just have to frame your question in a direction, and the model follows.
 
**Automation bias** — the tendency to over-rely on automated systems and reduce your own critical scrutiny — is the second major risk. A 2025 KES Conference study found that participants given faulty AI support answered fewer than half as many questions correctly as the control group. The AI's confidence suppressed the human's verification instinct. The output felt authoritative, so they stopped checking.
 
**Availability bias** — overweighting information that's recent, vivid, or easily recalled — combines with AI's tendency to draw on the most common patterns in its training data. If you're worried about one specific risk scenario, AI will often provide detailed, fluent coverage of exactly that scenario, reinforcing its salience in your thinking regardless of its actual probability.

## The Model's Biases
 
AI models carry the biases of their training data, their fine-tuning process, and the feedback they received from human evaluators. These biases are real, documented, and not fully known — even to the organizations that build the models.
 
Importantly, models also exhibit **sycophancy** — the tendency to tell users what they want to hear. Research confirms that both human reviewers and the reinforcement learning process used to fine-tune models tend to reward agreeable responses over accurate ones. The model that pushes back on a flawed premise gets rated lower than the one that validates it. Over many training iterations, this selects for agreeableness.
 
The result: if you tell an AI your conclusion and ask it to evaluate your reasoning, it will often validate your reasoning even when it's flawed. It's not lying. It's optimized to be helpful in a way that sometimes conflicts with being accurate.

## Breaking the Loop
 
Two techniques that directly interrupt the bias loop:
 
**Steelman the opposite.** Before finalizing any significant decision assisted by AI, explicitly ask: "Make the strongest possible case for the opposite conclusion." Then evaluate both arguments on their merits. This doesn't eliminate bias, but it forces exposure to the best counterargument rather than a strawman version.
 
**Change the frame, not just the question.** If you asked "What are the benefits of X?", also ask "What are the risks of X?" and "Under what conditions would X fail?" The same underlying question framed differently produces meaningfully different outputs. Use all three.
 

This course is for anyone who uses AI tools regularly — or plans to — and wants to do so without being misled by them. Unlike courses that treat critical thinking as a philosophy lecture, this one is built around the specific, practical challenge of navigating a world where AI-generated content is indistinguishable from human-written content at a glance. The tone is direct, the examples are real, and every chapter ends with something you can actually use.
The course unfolds across three sections. Section 1 builds the diagnostic foundation: you'll understand exactly how large language models work, why hallucinations are architecturally inevitable, and why your brain is wired to trust fluent, confident-sounding text — even when it's wrong. Section 2 hands you the full toolkit: source evaluation, logical fallacy recognition, bias identification, statistical literacy, and argument construction, all reframed for an AI-native environment. Section 3 puts everything into applied practice — AI in the workplace, synthetic media, high-stakes decisions, persuasion and manipulation, and how to pass these skills on to others.
By the end, you'll think differently about every piece of AI-generated content you encounter — not with blanket suspicion, but with calibrated, efficient skepticism. All figures, statistics, and AI capability benchmarks in this course reflect 2025–2026 data.



Most people know AI can make mistakes. Far fewer understand why it makes the specific mistakes it does — and why those mistakes are so hard to catch. This section opens with a real courtroom disaster caused by fabricated AI citations and builds from there: how language models actually generate text, why confidence in AI output means nothing, where pattern-matching breaks down, and what to do about it before you've built the full toolkit. By the end of Chapter 6, you'll have your first practical habit ready to use today.


Knowing that AI fails is not the same as knowing how to catch it. This section is the practical core of the course — six chapters that build out a complete critical thinking toolkit reframed for the AI era. You'll learn which questions cut through noise, how to evaluate sources when anyone can generate authoritative-sounding content, how to recognize the logical fallacies AI reproduces most often, how to separate your biases from the model's, and how to read statistics without being misled by them. The section closes with a framework for building arguments that hold up when challenged.


Frameworks only matter if they survive contact with the real world. This section takes everything from Sections 1 and 2 and puts it to work: navigating AI-generated content at work, detecting deepfakes and synthetic media, making decisions when certainty is unavailable, recognizing manipulation dressed as persuasion, and teaching these skills to people around you. The capstone chapter closes the course with a portable checklist and a clear picture of what you've built.


Bias — Yours and the Model's

Your Biases in the Loop

The Model's Biases

Breaking the Loop