SRE Engineer

Posted on July 25, 2025

Apply Now

Job Description

  • Position: SRE Engineer
  • Experience : 6+
  • Location : onsite (Near Shore)
  • Budget : 1.1 LPM
  • Required Techniacal Skills:
  • .NET � Application support and performance debugging
  • ServiceNow � Incident, change, and problem management
  • AppDynamics � Application performance monitoring and alerting
  • Project Context :
  • Hands-on experience in incident and change management via ServiceNow.
  • Hands-on experience in monitoring tools, observability dashboards � AppDynamics.
  • Strong development, scripting experience for automation and self-healing.
  • Capability to implement alert filtering, dynamic thresholds and suppression rules � AppDynamics.
  • Adherence to RCA and incident tracking frameworks.
  • Experience with proactive incident trend analysis and log correlation.
  • Proven experience in application support and issue resolution in .NET-based production environments.
  • Experience with proactive monitoring, alert tuning, and noise suppression strategies.
  • Capability to build self-healing automation or recovery scripts for recurring issues.
  • Competence in analyzing telemetry/logs for behavior, event log management and pattern detection.
  • Familiarity with Splunk.
  • Experience with Postman and Swagger for testing APIs.
  • Basic understanding of test automation tools (e.g., Selenium, JMeter).
  • Light scripting experience (PowerShell, Bash, or Python).
  • Microsoft Azure certification (AZ-104 or similar).
  • Experience in the travel or cruise domain.
  • Roles and responsibilities:
  • Ensure system health through continuous monitoring and automated alert response & incident creation.
  • Integrate observability tools i.e. AppDynamics and create/maintain observability dashboards.
  • Automate ticket classification, triage, and escalation workflows.
  • Implement with proactive monitoring, alert tuning, dynamic thresholds and noise suppression strategies.
  • Analyze logs for patterns, event log management and contribute to proactive issue detection.
  • Maintain and version RCA and knowledge base entries.
  • Enable automated proactive restarts.
  • Build self-healing automation or recovery scripts for recurring issues.

Required Skills

.net servicenow appdynamics