Phase 3: Agentic AI & MCP Server
Status: Future phase — planned to build on Phases 1 & 2
Overview
Phase 3 leverages the rich asset data, observations, and alerts from Phases 1 and 2 to enable intelligent, AI-powered insights and actions. Instead of traditional work order planning, Phase 3 uses agentic AI and MCP (Model Context Protocol) servers to allow natural language interaction with your maintenance data, intelligent recommendations, and context-aware automation.
What is MCP?
The Model Context Protocol (MCP) is an open standard that enables AI models (LLMs) to securely connect to data sources and tools. In the context of Cyzag Blueprint, an MCP server exposes:
- Asset data (readings, attributes, types, locations)
- Historical trends and last known values
- Operator rounds and observations
- Phase 2 alerts and triage status
LLMs can then use these MCP servers to answer questions, analyze trends, and recommend actions — all with full context of your plant's operational data.
Core Capabilities
Natural Language Queries
Ask questions about your assets and operations in plain English:
- "What's the current discharge pressure on Pump P-101?"
- "Show me all pumps with bearing temperatures above 65°C in the last week"
- "Which assets in Building 3 have the most observations flagged as critical?"
- "What's the trend for vibration readings on Motor M-204 over the past month?"
The AI agent understands your domain model (assets, attributes, functional locations) and can query the data through the MCP server.
AI Agents with Full Context
Unlike simple chatbots, agentic AI in Phase 3 has access to:
- Asset structure: Types, attributes, relationships, and location hierarchy
- Historical readings: Trends, patterns, anomalies over time
- Observations: Operator notes, photos, and flagged issues from rounds
- Alerts: Threshold violations and abnormal conditions from Phase 2
- Maintenance history: Past observations, resolutions, and actions taken
This context allows agents to provide intelligent recommendations, identify root causes, and suggest priorities.
Intelligent Recommendations
The AI can analyze patterns and recommend actions:
- Prioritize maintenance: "Based on recent trends, these 5 assets should be inspected next"
- Root cause analysis: "The pressure drop correlates with increased vibration readings on the upstream pump — likely a bearing issue"
- Predictive insights: "Bearing temperature on Pump P-101 has been trending up for 3 weeks; recommend inspection before next scheduled round"
- Comparative analysis: "Assets in Zone A are showing similar degradation patterns; consider reviewing maintenance procedures"
MCP Server Architecture
The Cyzag Blueprint MCP server exposes tools (functions) that LLMs can invoke:
get_asset_by_id(asset_id)- Retrieve asset details and current attribute valuesquery_readings(asset_id, attribute_code, start_date, end_date)- Get historical readings for trendinglist_alerts(severity, location, status)- Query Phase 2 alerts by criteriaget_observations(round_id)- Retrieve operator observations from a roundsearch_assets(query, filters)- Find assets matching criteria
The MCP server handles authentication, data access control, and formatting results for LLM consumption.
Example: AI-Assisted Maintenance Workflow
- Operator completes round (Phase 1): Captures readings and notes unusual noise on Pump P-101
- Alert triggered (Phase 2): Vibration reading exceeds threshold, observation flagged as "medium" severity
- Supervisor queries AI agent (Phase 3): "What's going on with Pump P-101?"
- Agent responds with context:
- "Pump P-101 has a medium-severity observation from today's round: unusual noise reported by Jane Smith"
- "Vibration reading was 8.2 mm/s, which exceeds the threshold of 7.5 mm/s"
- "Bearing temperature has been trending up over the past 2 weeks (from 55°C to 68°C)"
- "Historical data shows similar patterns preceded a bearing failure in Pump P-103 last year"
- "Recommendation: Inspect bearings and lubrication on Pump P-101 within 48 hours"
The supervisor can then decide whether to schedule immediate inspection or continue monitoring.
Example: AI-Assisted Troubleshooting at Blueprint Paper Mill
To illustrate how Phase 3 provides intelligent insights, consider this scenario at Blueprint Paper Mill:
Scenario: Steam Boiler B-01 Efficiency Investigation
Date: July 10, 2025 Engineer: Carlos Martinez (Reliability Engineer) Context: Steam production costs have increased over the past month
Carlos suspects an issue with Steam Boiler B-01 but isn't sure where to start investigating.
Phase 3 AI Agent Query
Carlos opens the AI assistant and asks:
Carlos: "Why is Steam Boiler B-01 performance degrading?"
AI Agent Analysis (Using MCP Server)
The agent queries the MCP server for context:
MCP Server Tool Invocations:
1. get_asset_by_id("steam-boiler-b-01")
→ Returns: Asset details, current attribute values, location
2. query_readings("steam-boiler-b-01", "steam_pressure", "2025-06-10", "2025-07-10")
→ Returns: 30 days of steam pressure readings
3. query_readings("steam-boiler-b-01", "flue_gas_temperature", "2025-06-10", "2025-07-10")
→ Returns: 30 days of flue gas temperature readings
4. query_readings("steam-boiler-b-01", "feedwater_conductivity", "2025-06-10", "2025-07-10")
→ Returns: 30 days of conductivity readings
5. list_alerts(asset_id="steam-boiler-b-01", start_date="2025-06-10")
→ Returns: 3 observations flagged in past 30 days
6. get_observations(asset_id="steam-boiler-b-01")
→ Returns: Operator notes and photos from recent rounds
AI Agent Response
🤖 AI Analysis: Steam Boiler B-01 Performance Degradation
**Summary**:
Based on analysis of Steam Boiler B-01 data over the past 30 days,
I've identified two correlated issues that explain the performance
degradation:
**Issue 1: Increasing Flue Gas Temperature**
- Flue gas temperature has risen from 395°C (Jun 10) to 418°C (Jul 10)
- This is within the normal range (380–420°C) but trending toward
the upper limit
- Higher flue gas temperature indicates heat is escaping up the stack
rather than being transferred to water/steam
- **Impact**: Reduced thermal efficiency, higher fuel consumption
**Issue 2: Degrading Feedwater Conductivity**
- Feedwater conductivity has increased from 2.1 µS/cm (Jun 10) to
4.8 µS/cm (Jul 10)
- Still below the 5 µS/cm limit but trending upward
- Higher conductivity indicates dissolved solids in feedwater
- **Impact**: Scale buildup on boiler tubes reduces heat transfer
**Correlation Analysis**:
The flue gas temperature increase correlates strongly with the
conductivity trend (R² = 0.87), suggesting **tube fouling** is the
root cause. As scale builds on tube surfaces, heat transfer
decreases, causing more heat to exit via flue gas.
**Recent Operator Observations**:
- Jul 8: "Steam pressure slightly lower than usual despite same firing rate"
- Jul 5: "Noticed white deposits on sight glass during inspection"
**Recommendation**:
1. Schedule boiler tube cleaning to remove scale deposits
2. Inspect feedwater treatment system for malfunction
3. Test boiler water chemistry after cleaning
4. Monitor flue gas temperature—expect 10–15°C drop after cleaning
**Historical Context**:
Similar pattern occurred in March 2024 (6 months ago). Tube cleaning
at that time restored efficiency and dropped flue gas temperature
from 415°C to 392°C within one day.
**Priority**: Medium
**Estimated Cost Impact**: ~$5,000/month in excess fuel costs if not addressed
**Recommended Action Timeline**: Schedule cleaning within next 2 weeks
[View Trend Charts] [Create Work Order] [Export Report]
Carlos's Response
Carlos reviews the AI analysis and takes action:
- Validates findings: Checks trend charts and confirms correlation
- Schedules maintenance: Creates work order for tube cleaning during next planned outage (July 20)
- Investigates feedwater treatment: Assigns water treatment technician to inspect demineralizer system
- Monitors progress: Sets reminder to check flue gas temperature after cleaning to verify improvement
Outcome
- Root cause identified: Tube fouling due to feedwater quality degradation
- Time saved: 30 minutes of AI analysis vs. hours of manual data review
- Context provided: Historical comparison (March 2024) and cost impact estimate
- Actionable recommendations: Clear next steps with priority and timeline
This example demonstrates how Phase 3 transforms raw data into intelligent insights that engineers can act on immediately.
Comparison: Without vs. With Phase 3 AI
Without Phase 3 (Manual Investigation):
- Carlos exports 30 days of boiler data to spreadsheet (20 minutes)
- Creates trend charts in Excel (15 minutes)
- Reviews operator round notes manually (25 minutes)
- Searches maintenance history for similar issues (30 minutes)
- Consults with senior engineer (30 minutes)
- Writes analysis and recommendation (20 minutes)
Total time: ~2 hours 20 minutes
With Phase 3 AI:
- Carlos asks natural language question (30 seconds)
- AI agent queries MCP server and analyzes data (5 seconds)
- AI provides comprehensive analysis with recommendations (30 seconds)
- Carlos validates findings and takes action (5 minutes)
Total time: ~6 minutes
Time saved: ~2 hours 14 minutes (94% reduction)
Why Agentic AI?
Agentic AI with MCP provides:
- Flexibility: Natural language interaction without predefined workflows
- Intelligence: Context-aware recommendations based on trends and patterns
- Speed: Instant insights without navigating complex UIs
- Accessibility: Anyone can query data and get answers, not just trained planners
- Integration: MCP servers can connect to external tools for action execution if needed
Phase 3 augments operator rounds with intelligent, accessible insights that help teams make better decisions faster.
Technical Architecture
MCP Server Components
- Data Access Layer: Secure queries to Phase 1 asset data and Phase 2 observations/alerts
- Tool Registry: Defined functions (tools) available to LLMs
- Authentication: Scoped access based on user roles and environments
- Response Formatting: Convert database queries into LLM-friendly formats
AI Agent Integration
- Claude Code CLI: Agents can use the MCP server as a tool during conversations
- Custom Agents: Build specialized agents for maintenance workflows using the Claude Agent SDK
- Web Interface: Chat-based UI for supervisors and engineers to query data
- Mobile Integration: Voice-enabled queries on mobile devices for field personnel
Security & Privacy
- MCP server enforces same authentication and scoping as Phase 1/2 APIs
- No raw data exposed to external LLMs without user consent
- Audit logs track all AI agent queries and actions
- Sensitive data (photos, operator notes) can be excluded from MCP tools
Example Use Cases
Use Case 1: Shift Handover
Outgoing Supervisor: "Summarize critical issues from today's rounds in Building 2"
AI Agent:
- "3 critical observations flagged today in Building 2:"
- "1. Pump P-205: Discharge pressure low (35 bar, expected 40-50). Operator noted cavitation sounds."
- "2. Motor M-312: Running hours counter stuck at 8,472 — not incrementing. Possible sensor failure."
- "3. Heat Exchanger HX-401: Outlet temperature trending down over past 3 days (now 78°C, expected 85-90)."
Use Case 2: Trend Analysis
Reliability Engineer: "Show me all centrifugal pumps with increasing vibration trends over the past month"
AI Agent:
- "Found 7 pumps with upward vibration trends:"
- "P-101: +15% (6.8 → 7.8 mm/s)"
- "P-205: +22% (7.1 → 8.7 mm/s) — already flagged as medium severity"
- "P-307: +8% (5.2 → 5.6 mm/s)"
- "...would you like to see detailed charts or schedule inspections?"
Use Case 3: Root Cause Investigation
Engineer: "Why did we get an alert on Heat Exchanger HX-201 yesterday?"
AI Agent:
- "Alert triggered at 14:23 yesterday: Outlet temperature 72°C (below threshold of 80°C)"
- "Inlet temperature was normal (90°C), suggesting heat transfer issue"
- "Operator noted 'reduced flow' in observation"
- "Differential pressure on upstream Pump P-150 has been low for 3 days (25 bar, expected 30-35)"
- "Likely root cause: Pump P-150 performance degradation reducing flow through HX-201"
- "Recommendation: Inspect Pump P-150 impeller and check for blockages"
What Phase 3 Builds On
- Phase 1 Data: Asset structure, attributes, readings, and offline sync provide the foundation
- Phase 2 Intelligence: Observations and alerts give context for what needs attention
- Phase 3 AI: Connects the dots, identifies patterns, and makes the data accessible through natural language
Each phase adds value independently, but together they create an intelligent maintenance system that learns from your operations.
See Also:
- Product Roadmap - Full three-phase vision
- Phase 2 Features - Observations and alerts that feed AI insights