Site Reliability Engineering Manager
hace 7 días
This is a world-class **devops engineering management** challenge, bringing together software engineering and product development, operations management, and team leadership in a single high-value role.
We work across the full stack, from bare metal to Kubernetes, including cloud and virtualisation. We also work across the full range of infrastructure, from public cloud to private cloud and edge. You will need to be a Linux and operations expert, as well as a great manager capable of leading a high-performance team, to excel in this role.
If you have an affinity for open source development and a passion for operations, software engineering, and new technology, then you will enjoy working with some of the best people in the industry at Canonical.
**Summary of role and responsibilities**:
The IS team at Canonical runs the services used by over 60 million Ubuntu users. We automate all of Canonical's production services with model-driven operations techniques and technology. We are part of Canonical's effort to raise the bar on ops technology, encapsulating real-world operational knowledge into reusable and composable software operations packages. We use our real-life operational experiences to contribute to product improvements.
From Kubernetes to the kernel and everything in-between, you'll be working with the latest technology in a fast-paced engineering environment. As an SRE Manager you will be responsible for the operations engineers in your time zone. This includes customer service management, managed services operations and consistent product improvement engineering. Collaboration with internal customers, product engineering, and development groups is critical to success.
**As an Engineering Manager in devops you will**:
- Lead your team in daily agile devops practices
- Optimise the quality and velocity of both development and operations
- Mentor engineers to improve their skills
- Identify and measure team health indicators
- Implement structured engineering and operations processes
- Ensure proper team focus on priorities, milestones, and deliverables
- Work to meet service level agreements with customer deployments around the globe
- Deliver quality managed services in a consistent, timely manner
- Represent the IS team to stakeholders, customers, and internal teams
- Bachelors (or equivalent) Degree level education in a technology field
- Proven experience of software delivery using Python, Go, C, C++, or Java
- Proven experience managing devops teams for SAAS or similar offerings
- Understanding of testing methodologies and maintainable code quality
- Experience with Ubuntu system administration
- Experience with agile software development methodologies
- Experience working in and managing distributed teams
- Technical aptitude for understanding complex distributed systems
- Experience with cloud topologies and technologies
- Ability to travel to global company events 10-15% of the time
**About Canonical**:
Canonical is a growing international software company that works with the open-source community to deliver Ubuntu - the world's #1 cloud operating system. Our mission is to realize the potential of free software in the lives of individuals and organisations. Our services help businesses worldwide to reduce costs, improve efficiency and enhance security with Ubuntu.
We offer:
- Learning and development
- Competitive salary
- Recognition rewards
- Priority Pass for travel
- Remote work-from-anywhere policy
- Canonical is proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background lead to a better environment for our employees and a better platform for our users and customers. This is something we value deeply and we encourage everyone to come be a part of the world of Ubuntu._
LI-Remote
stack
-
Senior Site Reliability Engineer
hace 1 semana
Colombia MAS Global Consulting A tiempo completoWho We AreAt MAS Global Consulting, we are a premium digital engineering partner delivering technology solutions to some of the world's most innovative companies — from high-growth startups to Fortune 500 enterprises.With a people-first culture and a commitment to excellence, we combine nearshore talent, agile delivery, and technical depth to build...
-
Junior Site Reliability Engineer
hace 1 semana
Colombia Sana Commerce A tiempo completoMedellín- - IT**Junior Site Reliability Engineer**: - Medellín IT - At Sana Commerce we're committed to an inclusive environment and recognize that our diverse work\force is one of our greatest strengths._ It all started in 2007, with a pizza and a plan. **Sana Commerce is an e-commerce platform designed to help manufacturers, distributors and...
-
Azure DevOps Engineer
hace 3 días
Colombia Axiom Path Inc A tiempo completo**Azure DevOps Engineer / Site Reliability Engineer** **Contract, 100% REMOTE** - In this role, you will leverage your DevOps expertise to design, automate, and streamline the software development lifecycle while playing a crucial role in maintaining website uptime. This role requires a strong ability to handle emergencies, troubleshoot website outages, and...
-
Colombia Kyndryl Colombia SAS A tiempo completo**Why Kyndryl** Kyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...
-
Reliability Engineer
hace 1 semana
Colombia Neostella A tiempo completo**TO BE CONSIDERED, CANDIDATES MUST HAVE A TOURIST VISA TO TRAVEL TO THE UNITED STATES** **What you’ll do**: Our manufacturing client is looking for a Reliability Engineer. The Reliability Engineer is a hands-on technical role that is critical in leading all engineering efforts in collaboration with the operating teams & unit operations across multiple...
-
Senior Site Reliability Engineer
hace 5 días
Colombia Yuxi Global A tiempo completoCompany Description Yuxi Global is an American company with high functional teams across Latin America. We stay updated with the most modern, edge practices and technologies. Our teams are versatile, adaptable and have expertise in a wide range of programming languages, databases and frameworks. This is your invitation to someone who loves working with the...
-
Senior Site Reliability Engineer/DevOps
hace 2 semanas
Colombia Wizeline A tiempo completoThe CompanyWizeline is a global digital services company helping mid-size to Fortune 500 companies build, scale, and deliver high-quality digital products and services. We thrive in solving our customer's challenges through human-centered experiences, digital core modernization, and intelligence everywhere (AI/ML and data). We help them succeed in building...
-
Site Reliability Engineer
hace 1 semana
Colombia Rockwell Automation A tiempo completoRockwell Automation is a global technology leader focused on helping the world’s manufacturers be more productive, sustainable, and agile. With more than 25,000 employees who make the world better every day, we know we have something special. Behind our customers - amazing companies that help feed the world, provide life-saving medicine on a global scale,...
-
Engineering Manager
hace 3 días
Colombia Devsu A tiempo completoOur client, an AI-powered financial technology company, is looking for an Engineering Manager. The company automates data extraction and financial modeling for investment professionals. They deliver accurate, auditable data and modern workflows to help analysts spend less time on rote work and more time on insight. Responsibilities:Lead, coach, and grow a...
-
Reliability Engineer
hace 3 días
Colombia, Huila Baker Hughes A tiempo completoRole Description **Reliability Engineer** **Summary** Can work with limited supervision on assigned tasks with standard techniques to build on basic knowledge and develop skills in specific practice areas. Interacts with clients and client organisations and has an understanding of how maintenance management is executed. Understands project management...