Information Technology Business Analyst
Company: Astreya
Location: Location not specified (Remote)
Type: Full-time
Remote: Yes
Posted: 2026-05-24
About this role
What this Job Entails:
The Business Analyst IV will provide solutions that help attain business outcomes. The Alert Management & Observability Standards Lead is responsible for rationalizing and governing all system alerts to ensure they align with department priorities, operational coverage models, and service reliability goals. This role defines alerting standards, reviews and approves alerts before they are routed to the 24x7 Eyes-on-Glass Operations team, and establishes a scalable approach to cataloging alert response instructions (runbooks/playbooks) so responders can take consistent, high-quality actions.
This position operates at the intersection of the IT Operations Command Center (OCC), engineering/application teams, platform/monitoring tool owners, and service owners, ensuring alerts are actionable, prioritized, and paired with clear response guidance.
Your Roles and Responsibilities:
1) Alert Rationalization & Prioritization (Core)
Establish and maintain a department-wide alert rationalization framework that evaluates alerts for:
- Business/service criticality and operational priority
- Actionability (clear operator action available)
- Signal-to-noise (duplicate/low-value alerts removed or suppressed)
- Ownership and escalation paths
Perform regular alert reviews (new + existing) to ensure alert quality, correct routing, and alignment with operational coverage.
Lead continuous improvement efforts to reduce alert fatigue while preserving detection of true incidents and high-impact degradation.
2) Standards, Policies, and Guardrails
Define and enforce alerting standards including:
- Severity definitions and thresholds
- Required metadata (service, CI, owner, runbook link, escalation)
- Naming conventions and tagging taxonomy
- Routing rules and “when to page vs. when to ticket”
Create a standardized Alert Design Checklist and approval workflow (e.g., “Definition of Done” for alert onboarding).
Partner with tool/platform owners to ensur...