Datacenter Operations Engineer – 24×7
Employment Type: Full-time | Onsite DC | Shift-based
Experience: 2–4 years in DC monitoring & operations
About the Role:
The engineer is responsible for monitoring and managing all IT infrastructure components in the datacenter, including servers, storage, network devices, firewalls, SSL certificates, and critical applications. The engineer also ensures that alerts are captured, validated, logged, and escalated as per SOPs.
Key Responsibilities (Detailed):
• Monitor server CPU, memory, disk partition usage, and availability.
• Monitor OS-level and application services (DB, middleware, AD, etc.).
• Monitor SAN/NAS & storage health parameters.
• Monitor network devices (routers, switches, firewalls) for availability.
• Track link status, bandwidth usage, packet drops, and interface issues.
• Monitor SSL certificate expiries and coordinate renewals.
• Identify and validate critical alerts, then raise tickets.
• Coordinate with L2/L3 towers for patching and maintenance activities.
• Ensure correct escalation for Sev1/Sev2 alerts.
• Maintain daily DC health reports and monthly MIS.
Required Skills & Experience:
• Good understanding of server OS (Windows/Linux).
• Knowledge of network fundamentals (routing, switching, firewalls).
• Familiarity with SAN/NAS storage.
• Experience with monitoring tools (SolarWinds, Nagios, OpManager).
• Strong analytical and alert-handling capability.
• Ability to work shifts in a 24×7 DC environment.