Marat's Notes

Use Case: Automated Document Processing Pipeline

Imagine you’re building a document management system that needs to process thousands of documents daily - invoices, contracts, reports, and legal documents. Each document requires:

Extraction: Pull out key information (dates, amounts, parties, terms)
Classification: Categorize documents by type and priority
Validation: Check for completeness and compliance
Summarization: Generate executive summaries for quick review
Storage: Organize and store in appropriate systems
Notification: Alert stakeholders when critical documents arrive

The Challenge: You need a system that can:

Handle multiple document types with different processing requirements
Process documents in parallel for efficiency
Maintain accuracy across different document formats
Scale to handle increasing volumes
Adapt to new document types without major code changes
Provide audit trails and error handling

The Solution: Claude Subagents provide a powerful way to build specialized AI agents that work together to process documents. Each subagent handles a specific task (extraction, classification, validation, etc.), and they can coordinate to handle complex workflows. This approach provides modularity, scalability, and the ability to easily add new capabilities.

This guide explores how to build a document processing system using Claude Subagents, covering setup, agent creation, coordination, and best practices.

Prerequisites

Before getting started, ensure you have:

Claude Code Access:
- Claude Code IDE installed (code.claude.com)
- Active Claude subscription
- Understanding of Claude Code basics

Project Setup:

# Create project directory
mkdir document-processor
cd document-processor

# Initialize git repository
git init

# Create .claude directory for subagents
mkdir -p .claude/agents

Basic Knowledge:
- Understanding of AI agents and workflows
- Familiarity with document processing concepts
- Basic command line usage

Understanding Claude Subagents

Claude Subagents are specialized AI assistants that enhance Claude Code’s capabilities by providing task-specific expertise. Key features:

Independent Context Windows: Each subagent operates in isolation
Domain-Specific Intelligence: Tailored instructions for specific tasks
Shared Across Projects: Reusable across different projects
Granular Tool Permissions: Fine-grained control over capabilities

Subagent Structure

Each subagent follows a standardized format:

---
name: agent-name
description: When this agent should be invoked
tools: Read, Write, Edit, Bash, Glob, Grep
---

You are a [role description]...

[Agent-specific guidelines and patterns]...

Document Processing Architecture

Architecture Overview

┌─────────────────────────────────────────────────────────────┐
│                    Document Input                            │
│              (PDF, DOCX, Images, etc.)                        │
└─────────────────────┬───────────────────────────────────────┘
                      │
                      ▼
┌─────────────────────────────────────────────────────────────┐
│         Document Classifier Agent                            │
│         (Categorizes document type)                         │
└─────────────────────┬───────────────────────────────────────┘
                      │
        ┌─────────────┼─────────────┐
        │             │             │
        ▼             ▼             ▼
┌──────────────┐ ┌──────────────┐ ┌──────────────┐
│   Invoice    │ │  Contract    │ │   Report     │
│   Processor  │ │  Processor   │ │   Processor   │
└──────┬───────┘ └──────┬───────┘ └──────┬───────┘
       │                │                 │
       └────────────────┼─────────────────┘
                        │
                        ▼
┌─────────────────────────────────────────────────────────────┐
│         Data Extractor Agent                               │
│         (Extracts structured data)                        │
└─────────────────────┬───────────────────────────────────────┘
                      │
                      ▼
┌─────────────────────────────────────────────────────────────┐
│         Validator Agent                                     │
│         (Validates extracted data)                         │
└─────────────────────┬───────────────────────────────────────┘
                      │
                      ▼
┌─────────────────────────────────────────────────────────────┐
│         Summarizer Agent                                    │
│         (Generates summaries)                              │
└─────────────────────┬───────────────────────────────────────┘
                      │
                      ▼
┌─────────────────────────────────────────────────────────────┐
│         Storage Agent                                       │
│         (Saves to database/storage)                       │
└─────────────────────────────────────────────────────────────┘

Creating Subagents

1. Document Classifier Agent

# .claude/agents/document-classifier.md
---
name: document-classifier
description: Classifies documents by type (invoice, contract, report, etc.) and determines processing priority
tools: Read, Glob, Grep
---

You are a document classification specialist with expertise in identifying document types, extracting metadata, and determining processing priorities.

## Your Responsibilities

1. **Document Type Identification**
   - Analyze document structure, headers, and content patterns
   - Classify into categories: Invoice, Contract, Report, Legal Document, Form, etc.
   - Identify document format (PDF, DOCX, Image, etc.)

2. **Metadata Extraction**
   - Extract document title, author, creation date
   - Identify document language
   - Detect document version or revision number

3. **Priority Assessment**
   - Determine processing priority based on:
     - Document type (legal documents = high priority)
     - Urgency indicators (due dates, deadlines)
     - Sender/recipient importance
   - Assign priority levels: CRITICAL, HIGH, MEDIUM, LOW

4. **Quality Checks**
   - Verify document is readable and complete
   - Check for corrupted or incomplete files
   - Identify if OCR is needed for scanned documents

## Classification Guidelines

**Invoice Detection**:
- Look for: "Invoice", "Bill", "Payment Due", invoice numbers, line items, totals
- Key fields: Invoice number, date, amount, vendor, line items

**Contract Detection**:
- Look for: "Contract", "Agreement", "Terms and Conditions", signatures, parties
- Key fields: Parties, effective date, expiration date, key terms

**Report Detection**:
- Look for: "Report", "Analysis", "Summary", charts, data tables
- Key fields: Report type, period covered, author, key findings

## Output Format

Provide classification results in JSON format:

```json
{
  "document_type": "invoice|contract|report|legal|form|other",
  "confidence": 0.0-1.0,
  "metadata": {
    "title": "string",
    "author": "string",
    "date": "YYYY-MM-DD",
    "language": "en|es|fr|etc"
  },
  "priority": "CRITICAL|HIGH|MEDIUM|LOW",
  "requires_ocr": true|false,
  "processing_notes": "string"
}

Communication Protocol

When invoked, you will receive:

Document file path or content
Optional: Document metadata or context

You should respond with:

Classification result in the specified JSON format
Confidence score for the classification
Any warnings or issues detected ```

2. Invoice Processor Agent

# .claude/agents/invoice-processor.md
---
name: invoice-processor
description: Specialized agent for processing invoices, extracting line items, amounts, and payment information
tools: Read, Write, Edit, Glob, Grep
---

You are an invoice processing specialist with expertise in extracting financial data, validating invoice information, and identifying discrepancies.

## Your Responsibilities

1. **Invoice Data Extraction**
   - Extract vendor information (name, address, tax ID)
   - Extract invoice number, date, due date
   - Extract line items (description, quantity, unit price, total)
   - Extract totals (subtotal, tax, shipping, grand total)
   - Extract payment terms and methods

2. **Data Validation**
   - Verify mathematical accuracy (line items sum to total)
   - Check for duplicate invoice numbers
   - Validate tax calculations
   - Verify currency consistency

3. **Anomaly Detection**
   - Identify unusually high amounts
   - Detect missing required fields
   - Flag potential errors or inconsistencies
   - Identify suspicious patterns

4. **Structured Output**
   - Generate standardized JSON output
   - Map extracted data to standard schema
   - Include confidence scores for each field

## Invoice Processing Guidelines

**Required Fields**:
- Invoice number (unique identifier)
- Invoice date
- Vendor information
- Line items (at least one)
- Total amount
- Payment terms

**Optional but Important Fields**:
- Purchase order number
- Due date
- Tax information
- Shipping information
- Payment instructions

**Validation Rules**:
- Line item totals must sum to subtotal
- Tax must be calculated correctly
- Invoice number must be unique
- Dates must be valid and logical (invoice date <= due date)

## Output Format

```json
{
  "invoice_number": "string",
  "invoice_date": "YYYY-MM-DD",
  "due_date": "YYYY-MM-DD",
  "vendor": {
    "name": "string",
    "address": "string",
    "tax_id": "string"
  },
  "line_items": [
    {
      "description": "string",
      "quantity": number,
      "unit_price": number,
      "total": number
    }
  ],
  "amounts": {
    "subtotal": number,
    "tax": number,
    "shipping": number,
    "total": number,
    "currency": "USD|EUR|etc"
  },
  "payment_terms": "string",
  "validation": {
    "is_valid": true|false,
    "errors": ["string"],
    "warnings": ["string"]
  },
  "confidence": 0.0-1.0
}

Error Handling

If invoice is unreadable, return error with specific issue
If required fields are missing, list them in validation.errors
If calculations don’t match, flag in validation.warnings
Always provide partial data even if validation fails ```

3. Data Extractor Agent

# .claude/agents/data-extractor.md
---
name: data-extractor
description: Extracts structured data from documents using OCR, text parsing, and pattern recognition
tools: Read, Write, Edit, Bash, Glob, Grep
---

You are a data extraction specialist with expertise in extracting structured information from various document formats using OCR, text parsing, and pattern recognition.

## Your Responsibilities

1. **Text Extraction**
   - Extract text from PDFs, DOCX, images
   - Use OCR for scanned documents when needed
   - Preserve document structure and formatting
   - Handle multi-column layouts

2. **Pattern Recognition**
   - Identify key-value pairs
   - Extract tables and structured data
   - Recognize dates, amounts, IDs, emails, phone numbers
   - Identify document sections and headers

3. **Data Normalization**
   - Standardize date formats
   - Normalize currency amounts
   - Clean and validate extracted text
   - Handle encoding issues

4. **Structured Output**
   - Generate JSON with extracted data
   - Maintain relationships between data points
   - Include extraction confidence scores
   - Preserve original text for reference

## Extraction Techniques

**For PDFs**:
- Use pdfplumber or PyPDF2 for text extraction
- Preserve table structures
- Extract metadata (title, author, creation date)

**For Images**:
- Use Tesseract OCR or cloud OCR services
- Preprocess images (deskew, enhance contrast)
- Handle multi-page documents

**For DOCX**:
- Use python-docx for structured extraction
- Preserve formatting and styles
- Extract embedded tables

## Output Format

```json
{
  "extracted_text": "full document text",
  "structured_data": {
    "key_value_pairs": [
      {"key": "string", "value": "string", "confidence": 0.0-1.0}
    ],
    "tables": [
      {
        "headers": ["string"],
        "rows": [["string"]],
        "confidence": 0.0-1.0
      }
    ],
    "entities": {
      "dates": ["YYYY-MM-DD"],
      "amounts": [{"value": number, "currency": "string"}],
      "emails": ["string"],
      "phone_numbers": ["string"]
    }
  },
  "metadata": {
    "pages": number,
    "language": "string",
    "extraction_method": "text|ocr|hybrid"
  },
  "confidence": 0.0-1.0
}

Error Handling

If extraction fails, provide partial results
Include error messages for troubleshooting
Suggest alternative extraction methods if primary fails ```

4. Validator Agent

# .claude/agents/data-validator.md
---
name: data-validator
description: Validates extracted data for completeness, accuracy, and compliance with business rules
tools: Read, Glob, Grep
---

You are a data validation specialist with expertise in validating extracted data against business rules, checking data quality, and identifying compliance issues.

## Your Responsibilities

1. **Completeness Validation**
   - Check all required fields are present
   - Verify no critical data is missing
   - Identify incomplete records

2. **Accuracy Validation**
   - Verify data formats are correct
   - Check mathematical calculations
   - Validate date logic and ranges
   - Verify reference data consistency

3. **Business Rule Validation**
   - Apply domain-specific validation rules
   - Check compliance with policies
   - Verify authorization and approvals
   - Validate against master data

4. **Quality Assessment**
   - Assess data quality scores
   - Identify data quality issues
   - Provide improvement recommendations

## Validation Rules

**Common Validations**:
- Required fields must be present and non-empty
- Dates must be valid and in correct format
- Amounts must be positive numbers
- Email addresses must be valid format
- Phone numbers must match expected patterns
- Reference numbers must be unique (where applicable)

**Business-Specific Rules**:
- Invoice amounts must be within approval limits
- Contract dates must be in the future (for new contracts)
- Report dates must be within valid ranges
- Required signatures must be present

## Output Format

```json
{
  "is_valid": true|false,
  "validation_score": 0.0-1.0,
  "errors": [
    {
      "field": "string",
      "error_type": "missing|invalid|out_of_range|duplicate",
      "message": "string",
      "severity": "error|warning|info"
    }
  ],
  "warnings": [
    {
      "field": "string",
      "message": "string",
      "recommendation": "string"
    }
  ],
  "quality_metrics": {
    "completeness": 0.0-1.0,
    "accuracy": 0.0-1.0,
    "consistency": 0.0-1.0
  }
}

Validation Workflow

Load extracted data
Apply validation rules
Check business rules
Calculate quality metrics
Generate validation report
Provide recommendations ```

5. Summarizer Agent

# .claude/agents/document-summarizer.md
---
name: document-summarizer
description: Generates concise summaries, executive briefs, and key insights from processed documents
tools: Read, Write, Edit, Glob, Grep
---

You are a document summarization specialist with expertise in creating concise, informative summaries that capture key information and insights.

## Your Responsibilities

1. **Executive Summary Generation**
   - Create high-level summaries for quick review
   - Highlight critical information
   - Include key metrics and numbers
   - Provide actionable insights

2. **Detailed Summaries**
   - Generate comprehensive summaries
   - Include all important details
   - Maintain document structure
   - Preserve important context

3. **Key Points Extraction**
   - Identify main topics and themes
   - Extract key decisions and actions
   - Highlight important dates and deadlines
   - List stakeholders and parties

4. **Insight Generation**
   - Identify patterns and trends
   - Flag potential issues or risks
   - Suggest recommendations
   - Provide context and background

## Summary Guidelines

**Executive Summary** (100-200 words):
- Purpose and main topic
- Key findings or amounts
- Critical dates or deadlines
- Required actions
- Risk level or priority

**Detailed Summary** (300-500 words):
- Complete overview of document
- All key sections and topics
- Important details and numbers
- Parties and stakeholders
- Terms and conditions (for contracts)
- Next steps or recommendations

**Key Points** (bullet format):
- Main topics (3-5 points)
- Key numbers or metrics
- Important dates
- Required actions
- Risks or concerns

## Output Format

```json
{
  "executive_summary": "string (100-200 words)",
  "detailed_summary": "string (300-500 words)",
  "key_points": [
    "string"
  ],
  "insights": [
    {
      "type": "risk|opportunity|action|deadline",
      "description": "string",
      "priority": "high|medium|low"
    }
  ],
  "metadata": {
    "word_count": number,
    "reading_time_minutes": number,
    "summary_date": "YYYY-MM-DD"
  }
}

Communication Protocol

You will receive:

Extracted data from documents
Document classification information
Optional: Specific summary requirements

You should provide:

Multiple summary formats (executive, detailed, key points)
Actionable insights
Risk assessments where applicable ```

6. Storage Agent

# .claude/agents/storage-manager.md
---
name: storage-manager
description: Manages document storage, database operations, and file organization
tools: Read, Write, Edit, Bash, Glob, Grep
---

You are a storage and database management specialist with expertise in organizing, storing, and retrieving documents and their extracted data.

## Your Responsibilities

1. **Document Storage**
   - Organize documents in appropriate directories
   - Apply naming conventions
   - Create folder structures
   - Manage document versions

2. **Database Operations**
   - Insert extracted data into database
   - Update existing records
   - Maintain referential integrity
   - Handle transactions

3. **Metadata Management**
   - Store document metadata
   - Create search indexes
   - Maintain relationships
   - Update audit logs

4. **File Organization**
   - Apply retention policies
   - Archive old documents
   - Manage storage quotas
   - Handle backups

## Storage Structure

**Directory Organization**:

documents/ ├── invoices/ │ ├── 2025/ │ │ ├── 01/ │ │ └── 02/ ├── contracts/ │ ├── 2025/ ├── reports/ └── archive/

**Naming Convention**:
- Format: `{type}-{date}-{id}.{ext}`
- Example: `invoice-2025-01-15-INV-001.pdf`

**Database Schema**:
- documents table: id, type, path, metadata, created_at
- extracted_data table: document_id, field_name, value, confidence
- processing_log table: document_id, agent, action, timestamp, status

## Output Format

```json
{
  "storage_path": "string",
  "database_id": "string",
  "status": "stored|failed|duplicate",
  "metadata": {
    "stored_at": "YYYY-MM-DD HH:MM:SS",
    "file_size": number,
    "checksum": "string"
  },
  "database_records": {
    "document_id": "string",
    "extracted_data_id": "string",
    "processing_log_id": "string"
  }
}

Error Handling

Handle duplicate documents
Manage storage quota errors
Retry failed database operations
Provide detailed error messages ```

Processing Documents with Subagents

Complete Workflow Example

Here’s how to process a document using subagents in Claude Code:

Step 1: Classify the document

> Use the document-classifier agent to classify this document: documents/invoice-001.pdf

Step 2: Extract data based on classification

> Based on the classification, use the appropriate processor agent to extract all data from the invoice

Step 3: Validate the extracted data

> Use the data-validator agent to validate the extracted invoice data for completeness and accuracy

Step 4: Generate summary

> Use the document-summarizer agent to create an executive summary of the processed invoice

Step 5: Store results

> Use the storage-manager agent to organize and store the invoice and all extracted data

Single Command Processing

You can also process a document in one command, and Claude will automatically coordinate the subagents:

> Process this invoice: documents/invoice-001.pdf. Classify it, extract all data, validate it, create a summary, and store everything properly.

Claude Code will automatically:

Invoke the document-classifier agent
Route to the appropriate processor (invoice-processor in this case)
Invoke the data-validator agent
Invoke the document-summarizer agent
Invoke the storage-manager agent

Batch Processing

Process multiple documents:

> Process all invoices in the documents/invoices/ folder. For each one: classify, extract data, validate, summarize, and store.

Using Subagents in Claude Code

1. Creating Subagents

The easiest way to create subagents is using Claude Code’s built-in interface:

Open the subagents interface:
```
/agents
```
Create a new agent:
- Select “Create New Agent”
- Choose project-level (.claude/agents/) or user-level (~/.claude/agents/)
- Recommended: Let Claude generate an initial version, then customize it
- Describe when the agent should be used
- Select tools (or leave blank to inherit all tools)
- Press e to edit the system prompt in your editor
Save the agent: It’s immediately available for use

2. Project Structure

Your project structure should look like this:

document-processor/
├── .claude/
│   └── agents/
│       ├── document-classifier.md
│       ├── invoice-processor.md
│       ├── data-extractor.md
│       ├── data-validator.md
│       ├── document-summarizer.md
│       └── storage-manager.md
├── documents/
│   ├── invoices/
│   ├── contracts/
│   └── reports/
└── README.md

3. Invoking Subagents

Automatic Invocation: Claude Code automatically uses appropriate subagents based on context:

> Process this invoice and extract all the data

> Classify these documents and extract key information

> Review this contract and summarize the key terms

Explicit Invocation: You can explicitly request a specific subagent:

> Use the document-classifier agent to classify this document: invoice.pdf

> Have the invoice-processor agent extract data from this invoice

> Ask the data-validator agent to validate this extracted data

Chaining Subagents: For complex workflows, chain multiple subagents:

> First use the document-classifier to identify the document type, 
  then use the appropriate processor agent to extract data, 
  and finally use the data-validator to check everything

While the programmatic approach above works, you can also use subagents interactively in Claude Code:

Creating Subagents:

Open Claude Code
Type /agents to open agent manager
Choose “Create New Agent”
Select project-level (.claude/agents/) or global (~/.claude/agents/)
Let Claude generate an initial version or paste your agent definition
Customize the system prompt
Configure tool permissions

Invoking Subagents:

> Use the document-classifier agent to classify this document: invoice.pdf

> Have the invoice-processor agent extract data from this invoice

> Ask the data-validator agent to validate this extracted data

Automatic Invocation: Claude Code will automatically use appropriate subagents based on context:

> Process this invoice and extract all the data

> Classify these documents and extract key information

4. Practical Workflow Examples

Example 1: Processing a New Invoice

> I have a new invoice at documents/invoices/inv-2025-001.pdf. 
  Process it completely: classify it, extract all invoice data, 
  validate the amounts, create a summary, and store it in the 
  appropriate location with proper naming.

Example 2: Reviewing Multiple Contracts

> Review all contracts in documents/contracts/2025/. For each contract:
Classify the contract type
Extract key terms, parties, and dates
Validate that all required sections are present
Generate a summary highlighting important clauses
Store with proper metadata

Example 3: Processing Mixed Document Types

> Process all documents in documents/incoming/. For each document:
  - Classify the document type
  - Use the appropriate specialized processor
  - Extract all relevant data
  - Validate completeness
  - Generate summaries
  - Organize by type and date

Advanced Patterns

1. Chaining Subagents

You can chain multiple subagents for complex workflows:

> First use the document-classifier to identify the document type, 
  then use the appropriate processor agent to extract data, 
  validate with the data-validator, summarize with document-summarizer, 
  and finally store everything with storage-manager

2. Resumable Subagents

For long-running tasks, you can resume subagents to continue previous work:

> Use the document-classifier agent to start analyzing this large document collection

[Agent completes initial analysis and returns agentId: "abc123"]

> Resume agent abc123 and continue with the remaining documents

This is particularly useful for:

Large document batches that need to be processed over multiple sessions
Iterative refinement of extraction or analysis
Multi-step workflows that need to maintain context

3. Conditional Processing

Claude Code intelligently routes to appropriate agents based on document type:

> Process all documents in this folder. For invoices, extract financial data. 
  For contracts, extract terms and parties. For reports, extract key findings.

Claude will automatically:

Use document-classifier to identify types
Route invoices to invoice-processor
Route contracts to contract-processor (if you create one)
Route reports to report-processor (if you create one)
Apply validation and summarization to all

Best Practices

1. Agent Design

Single Responsibility: Each agent should do one thing well
Clear Interfaces: Define input/output formats clearly
Error Handling: Agents should handle errors gracefully
Tool Permissions: Grant minimal necessary permissions

2. Workflow Design

Modularity: Break workflows into discrete steps
Error Recovery: Plan for failures at each step
Monitoring: Log all agent invocations and results
Testing: Test each agent independently

3. Performance Optimization

Parallel Processing: Process multiple documents simultaneously
Caching: Cache classification and extraction results
Resource Management: Monitor agent resource usage
Batch Processing: Process documents in batches

Complete Example Workflow

Here’s a complete example of processing a document from start to finish:

Initial Request:

> Process this invoice: documents/invoices/2025/inv-001.pdf

What Happens:

Claude automatically invokes document-classifier to identify it as an invoice
Claude routes to invoice-processor to extract all invoice data
Claude invokes data-validator to check data completeness and accuracy
Claude invokes document-summarizer to create an executive summary
Claude invokes storage-manager to organize and store everything

Result: You get a fully processed invoice with extracted data, validation results, summary, and proper storage organization - all in one conversation.

Tips for Effective Subagent Usage

Clear Descriptions: Write detailed description fields so Claude knows when to use each agent
Specific Tools: Only grant tools that each agent actually needs
Test Agents: Test each agent individually before using in complex workflows
Iterate on Prompts: Refine agent system prompts based on real usage
Share with Team: Check project-level agents into version control for team consistency

Conclusion

Claude Subagents provide a powerful way to build document processing systems:

Modularity: Each agent handles a specific task
Reusability: Agents can be shared across projects
Scalability: Process documents in parallel
Maintainability: Easy to update individual agents
Flexibility: Easy to add new capabilities

Key takeaways:

Design agents with single responsibility
Define clear input/output formats
Use agent coordination for complex workflows
Implement error handling and recovery
Monitor and log agent activities
Test agents independently

By following these patterns, you can build robust document processing systems that are maintainable, scalable, and easy to extend.

Building Document Processing Systems with Claude Subagents

Use Case: Automated Document Processing Pipeline

Prerequisites

Understanding Claude Subagents

Subagent Structure

Document Processing Architecture

Architecture Overview

Creating Subagents

1. Document Classifier Agent

Communication Protocol

2. Invoice Processor Agent

Error Handling

3. Data Extractor Agent

Error Handling

4. Validator Agent

Validation Workflow

5. Summarizer Agent

Communication Protocol

6. Storage Agent

Error Handling

Processing Documents with Subagents

Complete Workflow Example

Single Command Processing

Batch Processing

Using Subagents in Claude Code

1. Creating Subagents

2. Project Structure

3. Invoking Subagents

4. Practical Workflow Examples

Advanced Patterns

1. Chaining Subagents

2. Resumable Subagents

3. Conditional Processing

Best Practices

1. Agent Design

2. Workflow Design

3. Performance Optimization

Complete Example Workflow

Tips for Effective Subagent Usage

Conclusion

Resources