SRE Production Support

Job Description:

We’re passionate about building software that solves problems. We count on our site reliability engineers (SREs) to give users a rich feature set, high availability, and stellar performance so they can pursue their missions. As we expand customer deployments, we’re seeking an experienced SRE to deliver insights from massive-scale data in real time. Specifically, we’re searching for someone who has fresh ideas and a unique viewpoint, and who enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences for every interaction.

Responsibilities:

• Monitoring and reporting on application behavior analytics; conducting smart triage by identifying, diagnosing, and coordinating resolution of performance problems before they impact end users; and participating in rapid root cause diagnosis of problems occurring within the application and infrastructure.
• Identifying the functional domain in which problems reside (server utilization, network saturation, application tuning).
• Participating in all Major Incident Management and Root Cause Analysis calls and providing expert troubleshooting support as needed.
• Understanding troubleshooting, incidents, and problems; working to resolve issues in a timely manner and to determine the fault or underlying issue; and working with both customer and vendor personnel.
• Monitoring high-value, business-centric transactions and managing response actions.
• Maintaining accurate documentation for the assigned workspace and procedures, and updating procedures covering, but not limited to, the software and hardware layers.
• Understanding and using de-escalation techniques when working with difficult customers.
• Monitoring application infrastructure and the network with monitoring tools such as Splunk, AppDynamics, and Dynatrace.
• Proactively detecting, reporting, logging, and responding to all network performance and availability problems in each part of the application.
• Following the incident, problem, and change management processes related to the technology infrastructure being supported, and reviewing system requirements and application dependencies to determine monitoring configuration.
• Providing 24×7 support on the production servers on a rotation basis and helping to create documentation.

Knowledge/Skills/Abilities:

  • Master’s degree in Computer Science or related discipline
  • Ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript
  • 5 to 6 years of experience in Production Support

Work Experience Requirements:

  • Minimum of 6 years of professional experience in SRE Production Support

Select Minds LLC. seeks Master’s + 6 yr. Exp/Equiv.: Network Engineer (SMSREPS22)  NFS, HDFS, Ceph, and Amazon S3, as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, Yarn). Mail resume with job ID # to HR: 39111 Six Mile Road, Suite 113 & 115, Livonia, MI 48152. Unanticipated worksite locations throughout U.S. Foreign Equiv. accepted.

Job Category: Programming & Design Software Developer
Job Type: Full Time
Job Location: Michigan USA
