Home / Glossary

AI Coding Glossary

Key terms in AI-assisted development, explained clearly with practical context.

エージェンティック・コーディング

エージェンティック・コーディングとは、AIエージェントがコードベースを自律的に読み込み、コードを書き、コマンドを実行し、手動のコピー＆ペーストなしに結果を反復改善するソフトウェア開発手法です。チャット型AIとは異なり、エージェントが開発環境で直接アクションを起こし、複数ステップのタスクを完了させます。

→

Claude Code

Claude CodeはAnthropicが開発したターミナルベースのAIコーディングエージェントで、開発環境に直接組み込まれて動作します。プロジェクト全体を読み込み、複数ファイルにまたがるコードを書き、シェルコマンドを実行し、gitワークフローを管理し、エラーを自律的に修正します。すべてコマンドラインから操作できます。

→

Model Context Protocol（MCP）

Model Context Protocol（MCP）はAnthropicが作成したオープン標準で、AIモデルを外部ツール・データソース・APIに接続するための統一的な方法を提供します。AIのためのUSBのような標準化されたインターフェースとして機能し、MCP対応のツールはMCP対応のAIエージェントであればどれとでも連携できます。

→

CLAUDE.md

CLAUDE.mdはプロジェクトのルートに置くMarkdown設定ファイルで、Claude Codeに永続的なプロジェクト固有の指示を提供します。コーディング規約・アーキテクチャ・よく使うコマンド・ルールをエージェントに伝え、そのプロジェクトのすべてのセッションに適用される長期的なメモリとして機能します。

→

AIペアプログラミング

AIペアプログラミングとは、人間の開発者がAIツールと並走してリアルタイムで協力してコードを書く開発ワークフローです。開発者が方向性・コンテキスト・判断を提供し、AIがコードの提案・バグの検出・繰り返しの実装タスクを担います。

→

コンテキストウィンドウ

コンテキストウィンドウとは、AIモデルが1回のやり取りで処理できるトークン（単語・コードの文字・記号）の最大数です。プロンプト・コード・モデルの応答を含む、AIが一度にメモリに保持できる情報量の上限を定義します。

→

コーディングエージェント

コーディングエージェントとは、ファイルを自律的に読み込み、コードを書き、ターミナルコマンドを実行し、プログラミングタスクを完了するために結果を反復改善できるAIツールです。受動的なコード提案ツールとは異なり、コーディングエージェントは設定された目標を達成するために開発環境で独立してアクションを起こします。

→

バイブコーディング

バイブコーディングとは、開発者が自然言語で欲しいものを説明してAIツールに実装の詳細を任せる、非公式なソフトウェア開発アプローチです。正確な仕様を書く代わりに、開発者がカジュアルな会話で意図を伝え、結果に基づいて反復します。

→

AIコードレビュー

AIコードレビューとは、人工知能を使ってソースコードのバグ・セキュリティ脆弱性・スタイルの不一致・品質問題を自動的に分析するプロセスです。AIレビュアーはプルリクエストを検査し、改善を提案し、疲労や時間的プレッシャーで人間のレビュアーが見落とすかもしれない問題を発見できます。

→

コード向けプロンプトエンジニアリング

コード向けプロンプトエンジニアリングとは、AIコーディングツールが正確で関連性の高い出力を生成できるよう、明確で具体的な指示を作成する技術です。適切なレベルのコンテキスト・制約・例を含めてリクエストを構造化し、AIが何を望んでいるか、どのように実現してほしいかを理解できるようにします。

→

ヘッドレスAIエージェント

ヘッドレスAIエージェントとは、人間向けのインターフェースやリアルタイムのやり取りなしに動作するコーディングエージェントです。バックグラウンドプロセス・CI/CDパイプライン・スケジュールされたジョブで自律的にタスクを実行します。コードを読み込み、変更を加え、テストを実行し、どのステップでも人間の入力を待たずに結果を報告します。

→

サブエージェント

サブエージェントとは、複雑なタスクの独立した部分を同時に処理するために、メインのAIコーディングエージェントによって生成される並列の子プロセスです。すべてを順次処理する代わりに、メインエージェントがサブタスクを専門の子エージェントに委任し、並列に実行して結果を親に報告します。

→

AI Code Completion

AI code completion is a feature in development tools that uses machine learning models to predict and suggest code as you type. It ranges from single-line autocomplete to multi-line function generation, analyzing the surrounding code context to offer relevant suggestions that match your intent and coding style.

→

Large Language Model (LLM)

A large language model (LLM) is a deep learning system with billions of parameters, trained on vast datasets of text and code to understand, generate, and reason about natural language and programming languages. LLMs like Claude, GPT-4, and Gemini are the foundation of modern AI coding tools.

→

Retrieval-Augmented Generation (RAG)

Retrieval-augmented generation (RAG) is an AI architecture that improves the accuracy of language model responses by retrieving relevant information from external knowledge sources before generating an answer. Instead of relying solely on what the model memorized during training, RAG fetches up-to-date, domain-specific data and includes it in the model's context.

→

Tool Use

Tool use (also called tool calling) is the capability of a large language model to invoke external functions, APIs, or system commands as part of generating a response. Instead of being limited to producing text, a model with tool use can read files, run code, query databases, and interact with services—making it the foundation of agentic AI systems.

→

Function Calling

Function calling is an AI model capability where the model generates structured JSON arguments to invoke external functions instead of producing plain text. This enables LLMs to interact with APIs, databases, file systems, and other tools in a reliable, programmatic way—turning a conversational model into one that can take real-world actions.

→

System Prompt

A system prompt is a set of instructions provided to an AI model before the user's message that defines the model's behavior, persona, constraints, and capabilities. It acts as a configuration layer that shapes every response the model produces, without the user needing to repeat these instructions in each message.

→

Temperature

Temperature is a parameter in large language models that controls the randomness of the output. A temperature of 0 makes the model deterministic, always choosing the most probable next token. Higher temperatures (up to 1.0 or 2.0) increase randomness, making less probable tokens more likely to be selected. For coding tasks, lower temperatures generally produce more reliable, consistent code.

→

Token

A token is the fundamental unit of text that a large language model processes. Tokenization splits text into chunks—sometimes whole words, sometimes subwords, sometimes individual characters—that the model can work with. In English text, one token is roughly 3-4 characters or 0.75 words. In code, tokens map to keywords, operators, variable names, and whitespace.

→

Fine-Tuning

Fine-tuning is the process of further training a pre-trained large language model on a smaller, task-specific dataset to adapt its behavior for a particular use case. The model's weights are updated to specialize in a domain—such as a specific programming language, codebase, or output format—while retaining its general capabilities from pre-training.

→

Code Generation

AI code generation is the process of using artificial intelligence to produce source code from natural language descriptions, specifications, or existing code context. Modern code generation powered by LLMs can write entire functions, classes, tests, and even full applications from high-level instructions, across virtually any programming language.

→

AI Refactoring

AI refactoring is the use of artificial intelligence to automatically restructure, simplify, and improve existing source code without changing its external behavior. AI refactoring tools analyze code for complexity, duplication, poor naming, and anti-patterns, then apply transformations that make the code cleaner, more maintainable, and easier to understand.

→

AI Testing

AI testing is the application of artificial intelligence to software testing workflows—including generating unit tests, integration tests, and end-to-end tests from source code; identifying untested edge cases; analyzing test failures; and suggesting fixes. AI testing tools understand code semantics to write meaningful tests that go beyond basic coverage.

→

Multi-Modal AI

Multi-modal AI refers to artificial intelligence systems that can process, understand, and generate multiple types of data—text, images, audio, video, and code—within a single model. Unlike single-modal models that only handle text, multi-modal models can analyze a screenshot of a UI, read the associated code, and generate modifications based on both visual and textual understanding.

→

Chain-of-Thought

Chain-of-thought (CoT) prompting is a technique that encourages a large language model to break down complex problems into intermediate reasoning steps before producing a final answer. Instead of jumping to a conclusion, the model "thinks out loud," explaining each step of its logic. This significantly improves accuracy on tasks that require multi-step reasoning, including debugging, algorithm design, and code architecture decisions.

→

Few-Shot Prompting

Few-shot prompting is a technique where you include a small number of example input-output pairs in your prompt to demonstrate the pattern you want the AI to follow. By showing the model 2-5 examples of the desired behavior, it learns the format, style, and logic you expect—without any model training or fine-tuning. This is one of the most effective techniques for getting consistent, formatted output from LLMs.

→

Zero-Shot Prompting

Zero-shot prompting is a technique where you instruct an AI model to perform a task without providing any examples of the desired input-output format. You describe what you want in natural language, and the model relies entirely on its pre-trained knowledge to produce the output. It is the most natural way to interact with AI—just tell it what to do.

→

Embeddings

Embeddings are dense numerical vectors (arrays of floating-point numbers) that represent text, code, or other data in a high-dimensional space where semantically similar items are positioned close together. They enable AI systems to measure similarity between pieces of code, search codebases by meaning rather than keywords, and power retrieval-augmented generation (RAG) systems.

→

Vector Database

A vector database is a specialized database designed to store, index, and search high-dimensional embedding vectors efficiently. Unlike traditional databases that match exact values or keywords, vector databases find the most similar vectors to a query vector—enabling semantic search, recommendation systems, and the retrieval component of RAG (retrieval-augmented generation) architectures.

→

Technical Debt

Technical debt is the implied cost of future rework caused by choosing a quick, expedient solution now instead of a better approach that would take longer. Like financial debt, it accumulates interest: the longer it remains unaddressed, the more time and effort future changes require. Common sources include rushed features, skipped tests, outdated dependencies, and inconsistent architecture.

→