About the Job The Sr. Site Reliability Engineer is responsible for driving the reliability, performance, and scalability of services with minimal instruction. This role involves tackling non-routine assignments and resolving moderately complex issues that directly impact the service’s stability and effectiveness. The Sr. Site Reliability Engineer applies a deep understanding of software and systems engineering principles to design and implement solutions that enhance service reliability. This position requires good judgment and the ability to prioritize work effectively while contributing to the overall goals of the SRE team and organization. This role may come into contact with confidential or sensitive customer information requiring special treatment in accordance with Red Hat policies and applicable privacy laws. What You Will Do Lead the development and implementation of robust code and automation scripts to improve service reliability and scalability Conduct thorough code reviews and testing processes to ensure the highest quality standards in the codebase Work to solve moderately complex issues, making decisions that impact the service's reliability and performance Mentor and guide junior engineers, fostering a collaborative environment focused on continuous improvement Engage in a regular on-call rotation, taking responsibility for critical incidents and ensuring timely resolution Lead incident response and postmortem processes, implementing solutions to prevent recurrence of issues Collaborate with cross-functional teams to design, develop, and refine SRE tools and systems that support service objectives Take ownership of tasks and projects, prioritizing them according to their impact on service health and team goals What You Will Bring Linux Systems Management: Extensive experience managing Linux servers, particularly Red Hat Enterprise Linux (RHEL), CentOS, or Fedora, within cloud environments such as AWS, GCP, or Azure; Includes advanced system administration, networking, and troubleshooting Automation and Scripting: Proficient in writing and maintaining scripts for automation and orchestration tasks using tools like Ansible, Terraform, or custom scripts, to enhance efficiency and reduce manual workload Monitoring and Observability: Expertise in setting up and managing enterprise monitoring and observability solutions (e.g., Prometheus, Grafana), enabling proactive detection and resolution of issues Configuration Management: In-depth experience with configuration management tools such as Puppet, Chef, or similar, ensuring consistent and reproducible system states across environments Incident Management: Proven ability to lead incident response efforts, from initial troubleshooting to root cause analysis and implementing preventative measures Service Delivery and Optimization: Understanding of service delivery processes, with a focus on optimizing performance, reliability, and availability of hosted services
Salesforce and Data Engineering Intern Country: United States of America Your Journey Starts Here: Santander is a global leader and innovator in the financial services industry. We believe that our employees are our greatest asset. Our focus is on fostering...
...Job Description School Professionals is recruiting for after school help, preferably with childcare experience, available to work in private and charter schools in Miami-Dade county in the current school year. The School Professionals advantage includes total control...
...cutting-edge technology, is seeking a Silicon Validation Engineer (Mid to Senior Level) to join... ...\t Run, triage, and maintain silicon system regressions and report results.\n\t... ...Masters degree in Electrical Engineering, Computer Engineering, Electronics and...
...over 660,000 square feet of retail, office, entertainment, hospitality space, and other related commercial uses. ARCHITECTURAL DESIGN INTERN POSITION; We are seeking a creative, passionate and energetic intern who is interested in gaining experience within the...
Job Description PCS Group is an integrated design firm located in Denver, CO that specializes in planning, landscape architecture, and 3D visualizations. At PCS Group, we actively collaborate with clients to uncover the purpose of place and develop a collective vision...