Platform Operations
Platform operations cover the activities required to ensure that the First Watch® platform itself remains reliable, observable, and fit for purpose during continuous industrial operation. These activities are designed to be non-intrusive and aligned with operational priorities.
Agent Lifecycle Management
Agents are core components of the platform's visibility and protection capabilities. Their lifecycle is managed in a controlled and transparent manner.
Operationally, this includes:
- Monitoring agent connectivity and operational state — ensuring all deployed agents are active and reporting
- Tracking agent versions and compatibility — maintaining awareness of deployed versions across the environment
- Detecting failed, degraded, or incompatible agents — identifying gaps in coverage before they impact protection
- Managing configuration updates and rollouts — coordinating changes across distributed agents in a controlled manner
Agent lifecycle events are surfaced through events and alarms, allowing operational teams to quickly identify gaps in coverage or degraded visibility.
Health and Connectivity Monitoring
The platform continuously monitors the health of its own components and communication paths.
This includes visibility into:
- Agent health and module status — operational state of each deployed agent and its functional modules
- Network connectivity between agents and platform services — ensuring reliable communication paths
- Data ingestion and processing continuity — confirming that collected data is being received and processed
Health monitoring ensures that operators can distinguish between:
- A quiet environment — no events, normal operations
- A loss of visibility — due to connectivity or component issues
This distinction is critical for maintaining confidence in monitoring and protection outcomes.