Lead Site Reliability Engineer
hace 1 semana
Become a key member of our team as a Lead Site Reliability Engineer, focusing on advancing enterprise application infrastructure through expert DevOps practices and innovative cloud solutions.
You will lead efforts in designing robust, scalable systems utilizing Azure, AWS, Kubernetes, and Terraform. If you are prepared to leverage your leadership and technical skills to drive operational excellence and security, we encourage you to apply.
We accept CVs in English only.
Responsibilities
- Maintain and improve enterprise application infrastructure with DevOps methodologies
- Design and oversee CI/CD pipelines to optimize software deployment
- Administer Kubernetes environments to ensure system availability and efficiency
- Develop infrastructure as code leveraging Terraform for automation
- Monitor and enhance system performance to guarantee reliability
- Collaborate with teams to architect scalable cloud infrastructure
- Manage cloud security settings and IAM configurations
- Automate operational workflows to increase productivity and reduce manual tasks
- Diagnose and resolve infrastructure and application issues swiftly
- Handle operational requests and maintenance activities minimizing service interruptions
- Analyze system data to identify and address potential problems
- Ensure adherence to security policies and industry best practices
- Document infrastructure design and operational workflows
- Lead capacity planning and infrastructure scaling efforts
Requirements
- Minimum 5 years of experience in Site Reliability Engineering or DevOps
- Advanced proficiency in Python programming
- Strong expertise with AWS and Microsoft Azure including API, authentication, and serverless services
- In-depth knowledge of cloud networking, Kubernetes administration, security, IAM, and configuration automation
- Extensive experience with CI/CD pipelines, source control, containerization, and Terraform-based infrastructure management
- Proven skills in IaaS enablement and enhancement
- Background in enterprise-level software development and release management
- Thorough understanding of automation principles related to CI/CD and IaaS
- Excellent analytical and complex problem-solving capabilities
- Ability to manage operational requests and maintenance tasks effectively
- Strong communication abilities with English language proficiency at B2+ level
We offer
- Learning Culture - We want you to be the best version of yourself, that is why we offer unlimited access to learning platforms, a wide range of internal courses, and all the knowledge you need to grow professionally
- Health Coverage - Health and wellness are important, that is why we have you and up to four family members in a premiere health plan. We have a couple of options, so you can choose what is best for you and your family
- Visual Benefit - Seeing your work for us would be a sight for sore eyes. We want your vision to always be at 100% which is why we offer up to $ COP for any visual health expenses
- Life Insurance Plan - We have partnered with MetLife to offer a full-coverage Ife insurance plan. So, your family is covered, even if you are gone.
- Medical Leave Coverage - We are one of the few companies that cover 100% of your medical leave, for up to 90 days. Your health is the most important thing to us
- Professional Growth Opportunities - We have designed a highly competitive and complete development process, where you will have all the tools to get where you have always wanted to be, personally and professionally
- Stock Option Purchase Plan - As an EPAMer you can be more than just an employee, you will also have the opportunity to purchase stock at a reduced price and become a part owner of our organization
- Additional Income - Besides your regular salary, you will also have the chance to earn extra income by referring talent, being a technical interviewer, and many more ways
- Community Benefit - You will be part of a worldwide community of over 50,000 employees, where you can learn, challenge yourself, stand out, and share your knowledge and experience with multicultural teams
Please note that even though you are applying for this position, you may be offered other projects to join within EPAM.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
-
Site Reliability Engineer
hace 1 semana
Desde casa, Colombia Definity First A tiempo completoWe are seeking a skilled and motivated **Site Reliability Engineer** (SRE) to join our dynamic team. As an SRE at Definity First, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems. You will collaborate with cross-functional teams to design, build, and maintain our infrastructure, and you'll have the...
-
Site Reliability Engineer
hace 7 días
Desde casa, Colombia Gorilla Logic A tiempo completo**Mid-Level Site Reliability Engineer (SRE)** Gorilla Logic is looking for a Mid-Level Site Reliability Engineer (SRE) responsible for automation, instrumentation, and stability of our client's platforms to achieve operational health and performance. Our environment will require you to work effectively with your teammates, of course. But your real success...
-
Senior Site Reliability Engineer
hace 7 días
Desde casa, Colombia Gorilla Logic A tiempo completo**Senior Site Reliability Engineer (SRE)** Gorilla Logic is looking for a Senior Site Reliability Engineer (SRE) responsible for automation, instrumentation, and stability of our client's platforms to achieve operational health and performance. Our environment will require you to work effectively with your teammates, of course. But your real success will be...
-
Lead Site Reliability Engineer
hace 4 días
Desde casa, Colombia EPAM Systems A tiempo completoOur remote team is on the lookout for a Lead Site Reliability Engineer, with a specialization in cloud infrastructure provisioning and data migration. RESPONSIBILITIES - Collecting user requirements and devising solutions that meet their needs - Synchronizing with cross-functional teams, among which are the storage and networking groups, as you work towards...
-
Middle Site Reliability Engineer
hace 5 horas
Desde casa, Colombia EPAM Systems A tiempo completoEPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most...
-
Senior Site Reliability Engineer
hace 5 horas
Desde casa, Colombia EPAM Systems A tiempo completoEPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most...
-
Senior Site Reliability Engineer
hace 1 semana
Desde casa, Colombia EPAM Systems A tiempo completoJoin a dynamic team as a Senior Site Reliability Engineer, tasked with maintaining and evolving enterprise applications and infrastructure using Azure DevOps practices and cutting-edge tools.You will be instrumental in delivering robust, scalable solutions that drive company success. Apply if you are ready to contribute your engineering expertise and...
-
Senior Site Reliability Engineer
hace 4 días
Desde casa, Colombia EPAM Systems A tiempo completoJoin us as a Senior Site Reliability Engineer on our remote team. RESPONSIBILITIES - Creating data migration and connectivity solutions for Pharmaceutical Labs - Collaborating with diverse teams such as the storage and networking teams, aiming for successful project execution - Leading the team in crafting high-level, low-level end-to-end infra design using...
-
Site Care Partner
hace 2 semanas
Desde casa, Colombia Parexel A tiempo completoColombia, Remote **Job ID** R0000020545 **Category** Clinical Trials **ABOUT THIS ROLE**: Parexel FSP is hiring multiple Site Care Partners to support the Colombia region-these are great opportunities for experienced CRAs seeking career advancement! **Job Summary** The Site Care Partner is the main client point of contact for investigative sites...
-
Lead Cloud Engineer
hace 2 semanas
Desde casa, Colombia EPAM Systems A tiempo completoWe are seeking an experienced Lead Cloud Engineer to join our team. In this role, you will be responsible for ensuring the platform's reliability, performance, and efficiency, while also driving continuous improvement through service quality metrics. You will work closely with cross-functional teams to optimize the platform and keep it running...