SENIOR SITE RELIABILITY ENGINEER

Miguel Hidalgo - Ciudad de México

Job description

Ideally you are an ex-application programmer who moved to SRE/DevOps out of a love for automation and to satisfy your curiosity about computer systems.
You will join and lead a team of passionate technologists dedicated to core SRE principles and building an exemplary technology organization.
The Senior Site Reliability Engineer is hands-on and mentors other team members on core SRE principles and tools.
The Senior SRE will participate in the end-to-end operational aspects of the production environment.
The Senior SRE will work on cloud systems, networks, and databases and help drive incident lifecycle management.
As a member of the SRE team, you will also work closely with the Architects, DevOps, Product, and development teams to ensure we get the most out of the software on the AWS platform.
This role requires a highly skilled technology professional with excellent communication skills, a strategic mindset, and strong analytical and troubleshooting skills on the AWS Cloud Platform.
Other responsibilities include working with internal business partners to gather requirements, prototyping, architecting, implementing/updating solutions, building and executing test plans, performing quality reviews, managing operations, and triaging and fixing operational issues.
Site Reliability Engineers must be able to adjust to constant business change; common types of changes include new requirements, evolving goals and strategies, and emerging technologies.

Key Responsibilities

  • Be hands-on and provide mentorship to a growing SRE team on core SRE principles and tools.
  • Foster an automation-first approach to issue resolution: everything possible should be automated, and people should get involved only when automation cannot resolve an issue.
  • Lead efforts to update production with new versions and infrastructure as they become available.
  • Lead capacity planning efforts in collaboration with Architects and DevOps engineers to determine the infrastructure changes needed to support new load and performance characteristics.
  • Lead engagement with software developers, DevOps, and other infrastructure engineers to integrate software development and delivery from inception to full operation, ensuring robust released software and systems.
  • Ensure the highest level of uptime to meet the customer SLA by implementing system-wide corrections to prevent recurrence of issues.
  • Mentor other SRE team members to further develop their soft and hard skills.
  • Triage, troubleshoot, and resolve issues using golden signals.
  • Go past golden signals with additional principles such as chaos engineering to detect failure points, game days to test the team’s resiliency in incident response and remediation, and synthetic monitoring.
  • Lead SRE team members to create and maintain recovery procedures and RCAs in collaboration with other engineering teams.
  • Ensure incidents assigned to the team are managed within agreed SLAs.
  • Ensure alarms are documented in up-to-date Knowledge Base Articles.
  • Ensure production infrastructure is up to date with server/security patches and certificates.
  • Continuously improve system and application monitoring and automation.
  • Identify and automate manual workarounds and process improvements.
  • Proactively monitor the availability, latency, scalability, and efficiency of all services.
  • Perform periodic on-call duty as part of the SRE team.

Qualifications

  • Skilled with cloud operations/administration in Amazon AWS.
  • Tax/Accounting domain experience.
  • Bachelor’s or Master’s degree in a Computer Science discipline.
  • 5+ years’ experience focused on Site Reliability Engineering or a related position on the AWS Cloud Platform.
  • AWS Certification is a plus.
  • Experience working with SQL, Windows Servers, load balancers, and Linux.
  • Deep experience with AWS services and Windows support.
  • Ability to program at a high level in at least one language such as Java, C#, JavaScript, Python, or Ruby.
  • Integration experience with PagerDuty, ServiceNow, Datadog, and CloudWatch.
  • Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms and tools, SLO management, incident resolution, and automation.
  • Ability to explain technical concepts in clear, non-technical language.
  • Working knowledge of infrastructure components (e.g., routers, load balancers, cloud products, container systems, compute, storage, and networks).
  • Knowledge of security and compliance standards such as SOC/PCI is a plus.

Job details

Company
  • Not specified
Municipality
Address
  • Not specified - Not specified
Publication date
  • 09/09/2024
Expiration date
  • 08/12/2024
Senior System Engineer
Omni payments

Candidate should have good communication skills, be a team player, and be able to lead a team... Job requirements: good knowledge of the architecture of the Tandem platform; good knowledge of handling TACL macros, obey files, and shell scripts; good knowledge of middleware configuration such as Pathway and MQ ......

Microsoft Dynamics Product Support Engineer _ Remote
Cliecon solution inc

Required/minimum skills/qualifications: minimum 2+ years of relevant experience as a technical/functional consultant or engineer; engineering or master’s degree in computer science/information technology (IT) or equivalent; relevant product certifications from Microsoft; excellent communication skills - verbal......

Full Stack .NET Engineer Remote
Sonatafy Technology

Work with cross-engineering staff, collaborating on hardware and system monitoring requirements to ensure expected performance and reliability of the application/system developed... As a full stack engineer assigned to the product/project, ensure performance, maintainability, and functional requirements......

Project Engineer
Eficacia en consultoria

A major Swiss multinational company is looking for your talent as: Project Engineer. Education: mechanical engineer (degree completed)... Advanced English (the interview is in English). Age: 26 to 30. Work experience: 3 to 4 years as a mechanical engineer or in project engineering......

Senior Structural Designer
Diseños en Corrugado y Publicidad

We are looking for you as a Senior Structural Designer... We (Torre) are helping Diseños en Corrugado y Publicidad find the best candidate to join their team full time in the role of Senior Structural Designer... Mz 001, México Nuevo, Ciudad López Mateos, Méx......

Senior tandem developer
Omni payments

Job requirements: good knowledge of the Tandem platform architecture; good knowledge of the COBOL85, C, and TAL programming languages; good knowledge of programming with Pathway middleware... We are looking for an enthusiastic and motivated “Senior Tandem Developer” in our development team......

Senior unisys developer
Omni payments

Candidate should have good communication skills, be a team player, and be able to lead a team... Job requirements: good knowledge of the Unisys platform architecture; good knowledge of the COBOL and ALGOL programming languages; good knowledge of DMSII and socket management programming......

Azure Machine Learning – Technical Support Engineer
Cliecon Solutions INC

Job title: Azure Machine Learning – Technical Support Engineer. Location: Guadalajara City, Mexico – initially remote. Job type: full-time. Job description: knowledge of Azure Machine Learning and how it works with associated Azure services... Responsible for providing technical support and expertise......

Sales and Project Engineer Jr
S-MEX, S.A. DE C.V.

Essential: graduate industrial engineer or related... Customer service; quotations and follow-ups; search for new customers; APQP administration; constant communication with international customers; prepare, schedule, coordinate, and monitor the assigned engineering projects; interact daily with the clients......

Senior Consultant TM
Acute Talent

At Acute Talent we are looking for a >>> Senior Consultant TM (Transportation Management) <<< for Madrid, Spain (on-site)... *Very important: must reside in Spain (on-site in Madrid and surrounding area)... ➡ Experience in requirements gathering, design, and configuration... ➡ Ability to work in a team......