Site Reliability Engineer
hace 5 días
Who We Are
At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities.
The Role
Join us as a Site Reliability Engineer (SRE) and embark on an exciting journey of ensuring reliability, resiliency, and innovation in our information systems and ecosystems. As an SRE at Kyndryl, you'll be at the forefront of driving continuous improvement and delivering exceptional service to our customers.
Your role goes beyond traditional engineering, as you'll have the opportunity to analyze business needs, tackle complex problems, and provide strategic advice and designs. You'll be involved in every stage of the software lifecycle, from building and testing to deploying changes and maintaining robust systems.
We're looking for a true visionary who can think strategically and help shape the future of our services. Your expertise in building trusted relationships with customers and partnering with them for success will be instrumental in driving our growth.
As an SRE, you'll have the unique opportunity to work on end-to-end services, spanning customer sites and platforms. Collaboration and proactivity are key as you work alongside a talented team of professionals, eager to make a difference. You'll embrace an entrepreneurial mindset, taking ownership of your responsibilities and constantly seeking innovative solutions.
With an unwavering focus on quality, robustness, and security, you'll be a driving force in implementing cutting-edge tools that enhance our operations, improve reliability, and gather valuable feedback on our platforms. Your ability to identify and mitigate common operational issues will play a crucial role in delivering seamless experiences to our customers.
If you're passionate about pushing the boundaries of technology, thrive in a collaborative environment, and are motivated by the opportunity to shape the future of reliability engineering, then we want to hear from you. Join our team and be part of a dynamic and forward-thinking organization that values innovation and excellence in everything we do.
Your Future at Kyndryl
Kyndryl has a global footprint, which means that as a Site Reliability Engineer at Kyndryl you will have opportunities to work on projects and collaborate with colleagues from around the world. This role is dynamic and influential - offering a wide range of professional and personal growth opportunities that you won’t find anywhere else.
Who You Are
You’re good at what you do and possess the required experience to prove it. However, equally as important - you have a growth mindset; keen to drive your own personal and professional development. You are customer-focused - someone who prioritizes customer success in their work. And finally, you’re open and borderless - naturally inclusive in how you work with others.
Required Skills and Experience
- 10+ years of experience in operational management, including incident management and escalations
- Experience implementing strategies to cap operations load and to handle overflow using appropriate tooling and metrics; defining service level indicators and objectives in collaboration with stakeholders, business, development, DevSecOps and Operations teams
- Solution and design experience in an enterprise environment: Windows server, Linux server (RHEL is preferred), UNIX (AIX, Solaris), -Windows server, storage, and Hyperscaler Cloud (AWS, Azure, Google Cloud Platform); public cloud platforms such as AWS, OpenShift, Azure or GCP
- Experience working with Data format and Scripting languages JSON, YAML, Bash and/or PowerShell
Preferred Skills and Experience
- BS degree in Computer Science, Engineering, or other highly technical, scientific discipline
- Expertise with Ansible, Terraform, and Python
- Experience with distributed technologies as well as dynamic resource management frameworks such as Kubernetes
- Expertise in leveraging open-source tooling such as Prometheus, Grafana, or Loki
Being You
Diversity is a whole lot more than what we look like or where we come from, it’s how we think and who we are. We welcome people of all cultures, backgrounds, and experiences. But we’re not doing it single-handily: Our Kyndryl Inclusion Networks are only one of many ways we create a workplace where all Kyndryls can find and provide support and advice. This dedication to welcoming everyone into our company means that Kyndryl gives you - and everyone next to you - the ability to bring your whole self to work, individually and collectively, and support the activation of our equitable culture. That’s the Kyndryl Way.
What You Can Expect
With state-of-the-art resources and Fortune 100 clients, every day is an opportunity to innovate, build new capabilities, new relationships
-
Site Reliability Engineer
hace 4 horas
Bogotá, Colombia Sur Global A tiempo completoSur Global, Bogota, D.C., Capital District, Colombia Site Reliability Engineer As the Site Reliability Engineer you will support and scale the infrastructure powering our secure, mission-critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production...
-
Site Reliability Engineer — Global Impact
hace 4 horas
Bogotá, Colombia Kyndryl A tiempo completoA global IT services company in Bogotá is seeking a Site Reliability Engineer to ensure system reliability and lead continuous improvements. Responsibilities include analyzing needs, managing application performance, and collaborating globally. Ideal candidates should have over 10 years of experience in operational management, expertise in cloud platforms,...
-
Remote - Edge Site Reliability Engineer (Sre)
hace 4 días
Bogotá, Cundinamarca, Colombia GSB A tiempo completoImportant company requires; **Edge Site Reliability Engineer (SRE) - Remote in Colombia** **Main Activities / Responsibilities**: - Guarantee the general system uptime, focus on availability to comply with the defined SLA, SLO and SLI. - Spend
-
Site Reliability Engineer
hace 2 semanas
Bogotá, Colombia Medtronic A tiempo completoAt Medtronic you can begin a life-long career of exploration and innovation, while helping champion healthcare access and equity for all. You’ll lead with purpose, breaking down barriers to innovation in a more connected, compassionate world. **A Day in the Life**:As a Site Reliability Engineer, you will be responsible for ensuring the reliability,...
-
Bogotá, Colombia Kyndryl A tiempo completoA leading IT services company in Bogotá is seeking a Site Reliability Engineer to ensure reliability and innovative solutions within their information systems. The ideal candidate will have over 10 years of experience and be adept in application monitoring, operational management, and various cloud platforms. This position offers opportunities for growth,...
-
Site Reliability Engineer Ii-2
hace 7 días
Bogotá, Colombia Mastercard A tiempo completo**Our Purpose** - Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation,...
-
Sr. Site Reliability Engineer
hace 4 días
Bogotá, Colombia Coupa A tiempo completoBogota, Colombia Development - Cloud Operations / Mid-Senior Level / Hybrid Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We...
-
Site Reliability Developer
hace 7 días
Bogotá, Colombia Oracle A tiempo completoSolve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate...
-
Lead Site Reliability Engineer
hace 2 semanas
Bogotá, Distrito Capital, Colombia Coupa Software A tiempo completoCoupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter,...
-
Senior SRE Lead: Reliability, Automation
hace 4 horas
Bogotá, Colombia Scotiabank A tiempo completoA leading international bank in Bogotá seeks a System Reliability Engineer to enhance regional system stability and reliability. This role involves collaborating across teams to implement best practices in Site Reliability Engineering, troubleshoot incidents, and improve operational efficiencies. The ideal candidate has significant IT experience, leadership...