Prompt Evaluation and Troubleshooting | Advanced Prompt Design and Refinement
Prompt Engineering Basics

Prompt evaluation means reviewing AI outputs to determine whether a prompt is clear and specific and whether it produces the intended results. Troubleshooting means identifying and fixing issues such as ambiguity, bias, or inconsistency. When you examine the answers an AI generates, your goal is to decide whether the prompt produces the kind of response you want. If the output is not what you expected, work out why and revise the prompt.

Definition

Prompt Evaluation is the process of assessing whether a prompt leads to accurate, relevant, and consistent AI responses.
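
One way to make the "consistent" part of this definition measurable is to run the same prompt several times and see how often the answers agree. The sketch below is only an illustration under stated assumptions: `consistency_score` is a hypothetical helper, the sample responses are made up, and exact matching after normalization only suits short, factual answers.

```python
from collections import Counter


def consistency_score(responses):
    """Return the fraction of responses that match the most common answer.

    Responses are compared after lowercasing and collapsing whitespace,
    so this only makes sense for short answers (a name, a label, a number).
    """
    if not responses:
        raise ValueError("need at least one response")
    normalized = [" ".join(r.lower().split()) for r in responses]
    most_common_count = Counter(normalized).most_common(1)[0][1]
    return most_common_count / len(normalized)


# Hypothetical outputs collected by sending the same prompt three times.
samples = ["Paris", "paris ", "Lyon"]
print(consistency_score(samples))
```

A score close to 1.0 suggests the prompt pins the answer down; a low score is a hint that the prompt leaves room for interpretation and may need troubleshooting.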

Definition

Prompt Bias occurs when a prompt unintentionally leads the AI to produce biased or unfair responses.

Example

If the prompt "Explain climate change" results in a generic answer, you might evaluate and revise it to "Explain the main causes of climate change in simple terms for a high school student." The revised prompt is more specific about the content and the intended audience, which helps the AI provide a more targeted and useful answer.
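
The revision in this example can be seen as filling in slots the vague prompt left empty. As a minimal sketch, assuming a hypothetical `build_prompt` helper and slot names chosen here for illustration (`topic`, `focus`, `style`, `audience`), the same revision might look like this:

```python
def build_prompt(topic, focus, style, audience):
    """Assemble a prompt that states the content focus, style, and audience explicitly."""
    return f"Explain {focus} of {topic} in {style} for {audience}."


# The original prompt names only the topic.
vague = "Explain climate change"

# The revised prompt fills in what the vague one left open.
revised = build_prompt(
    topic="climate change",
    focus="the main causes",
    style="simple terms",
    audience="a high school student",
)
print(revised)
```

Writing the slots out makes it easier to see which detail was missing when an answer comes back too generic.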

Note

Always check whether your prompt could lead to harmful, misleading, or biased outputs. Responsible prompt evaluation helps prevent these issues.

Tips for troubleshooting

  • Look for vague language;
  • Check for unclear instructions;
  • Identify any unintended assumptions;
  • Revise prompts to be more explicit and test again.
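
The first tips above can be partly automated as a rough first pass before a human review. The following is a heuristic sketch, not an established method: the `VAGUE_TERMS` word list, the `min_words` threshold, and both function names are assumptions made for illustration, and the check cannot detect unintended assumptions or bias.

```python
import re

# Hypothetical word list; extend or replace it for your own domain.
VAGUE_TERMS = {"something", "stuff", "things", "etc", "some", "good", "nice", "briefly"}


def find_vague_terms(prompt):
    """Return the vague words found in the prompt, sorted alphabetically."""
    words = re.findall(r"[a-z']+", prompt.lower())
    return sorted({w for w in words if w in VAGUE_TERMS})


def review_prompt(prompt, min_words=6):
    """Return a list of possible issues; an empty list means nothing was flagged."""
    issues = []
    vague = find_vague_terms(prompt)
    if vague:
        issues.append("vague language: " + ", ".join(vague))
    if len(prompt.split()) < min_words:
        issues.append("very short prompt; audience, scope, or format may be missing")
    return issues


for issue in review_prompt("Explain climate change"):
    print(issue)
```

An empty result does not mean the prompt is good; it only means these simple checks found nothing, so revising and testing again remains the real verification step.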

By carefully reviewing both the prompt and the AI's output, you can identify where things may have gone wrong and take steps to improve the results.

Which of the following is a sign that a prompt needs troubleshooting?
