Site Reliability

hace 3 semanas


Bogota, Colombia Canonical - Jobs A tiempo completo

This role is an opportunity for a hands-on, but literally hands-off, technologist with a passion for Linux to build a career with Canonical and drive the success with those leveraging Ubuntu and open source products. If you have an affinity for operations automation and a passion for technology, then you will enjoy working with some of the best people in the industry at Canonical.

**Job Summary**:
The IS team at Canonical supports and maintains all of Canonical's IT production services. The team is in charge of running services used by over 60 million Ubuntu users.

As an SRE & Gitops engineer you'll be in a unique position to drive operations automation to the next level, both in our own private clouds as well as in the public clouds. We do this by utilizing the best of open source infrastructure as code software, software development practices such as CI/CD pipelines, and Canonical's leading products for software operation automation.

In addition to defining the infrastructure as code, you will improve Canonical products and the open-source technologies they're based on by providing critical feedback to developers on how their products operate at scale. This is done by submitting bugs (and sometimes writing pull requests) and collaborating on design and implementations with other teams within the company.

You'll be part of a global team of SREs that work together and support each other to provide the best possible services to our company, Canonical's customers and the Ubuntu Community.

**As a Site Reliability / Gitops Engineer engineer you will**:

- Automate software operations for reusability and consistency across private and public clouds, taking into consideration the complexities of distributed systems
- Develop infrastructure as code practice within IS by constantly increase automation and improve IaC processes
- Develop new features and improve the resilience and scalability of the existing cloud and container portfolio at Canonical
- Maintain operational responsibility for all of Canonical's core services, networks, and infrastructure
- Develop skills in troubleshooting, capacity planning, and performance investigation,
- Setting up, maintaining and using observability tools such as Prometheus, Grafana, and Elasticsearch; design, implement and maintain Implementing monitoring and alerting for various systems and services
- Collaborate with development teams to design service architecture, documentation, playbooks, policies and operational procedures
- Provide assistance and work with globally distributed engineering, operations, and support peers
- Be given uninterrupted software development time to focus on larger projects and automation of manual tasks
- Carry final responsibility for time-critical escalations
- Strong modern engineering background (peer-review, unit testing, SCM, CI/CD, Agile)
- Python software development experience, with large projects
- Practical knowledge of Linux networking, routing, and firewalls
- Affinity with various forms of Linux storage, from Ceph to Databases
- Hands-on experience administering enterprise Linux servers
- Extensive knowledge of cloud computing concepts and technologies
- Bachelor's degree or greater, preferably in computer science or related engineering field
- Motivated and able to troubleshoot from kernel to web, and willing to ask others when appropriate
- A willingness to be flexible and able to learn new things quickly
- Be inspired by the needs of fast-changing environments
- Happy to work within distributed teams
- Be passionate and familiarized about open-source, especially Ubuntu or Debian
- A residence in North-, Middle
- or South America

**What we offer you**:
Your base pay will depend on various factors including your geographical location, level of experience, knowledge and skills. Our compensation philosophy is to ensure equity right across our global workforce.
- Fully remote working environment - we've been working remotely since 2004
- Personal learning and development budget of 2,000USD per annum
- Annual compensation review
- Recognition rewards
- Annual holiday leave
- Parental Leave
- Employee Assistance Programme
- Opportunity to travel to new locations to meet colleagues at 'sprints'
- Priority Pass for travel and travel upgrades for long haul company events

**About Canonical**:
Canonical is a pioneering tech firm that is at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT and the cloud, we are changing the world on a daily basis. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do.

Canonical has been a remote-first company since its inception in 2004. Work at Canonical is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your ga



  • Bogota, Colombia Remoti A tiempo completo

    **Site Reliability Engineer **_**SRE**_ **(Amadeus)** Hello! Bogota, we are looking for a Site Reliability Engineer for one of the biggest software companies in travel and hotels called Amadeus: The Service Reliability engineer will actively interface with software developers, network engineers, systems, storage, project management, and database...


  • Bogota, Colombia Torre A tiempo completo

    We’re helping one of our clients, Agile Dream Team, hire a Site Reliability Engineering Leader. ‘‘We transform your vision into reality with our on-demand & fully managed Agile software development teams.’’ Compensation: To be agreed upon. Location: Remote (anywhere). Skills: Site reliability engineering, Azure DevOps Server, SQL and...


  • Bogota, Colombia Torre A tiempo completo

    We’re helping one of our clients, Agile Dream Team, hire a Site Reliability Engineering Leader. ‘‘We transform your vision into reality with our on-demand & fully managed Agile software development teams.’’ Compensation: To be agreed upon. Location: Remote (anywhere). Skills: Site reliability engineering, Azure DevOps Server, SQL and...


  • Bogota, Colombia Swiftline Technologies LLC A tiempo completo

    We are looking for a Site Reliability Engineer with experience in operations, platform automation, dev-ops, and infrastructure engineering. This role will work closely with the engineering team to harden resources, operationalize new infrastructure, automate deployments, architect backend systems, and more. We’re looking for someone who can take ownership...


  • Bogota, Colombia Sofvio A tiempo completo

    We're helping one of our clients, **Agile Dream Team**, hire for a **Site Reliability Engineering Leader.** “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation**:To be agreed upon.** Location**:Remote (anywhere).** Skills**:Proficient in Azure DevOps Server, and...

  • Site Reliability Engineer

    hace 2 semanas


    Bogota, Colombia Kyndryl A tiempo completo

    Who We Are Kyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...


  • Bogota, Colombia Torre A tiempo completo

    We're helping one of our clients, Agile Dream Team, hire for Site Reliability Engineering Leader. “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation: To be agreed upon. Location: Remote (anywhere). Skills: Proficient in Azure DevOps Server and SQL. Responsibilities and more: -...

  • Site Reliability Engineer

    hace 4 semanas


    Bogota, Colombia Kyndryl A tiempo completo

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...

  • Site Reliability Engineer

    hace 4 semanas


    Bogota, Colombia PayU A tiempo completo

    **About PayU** PayU, a leading payment and Fintech company in 50+ high-growth markets throughout Asia, Central and Eastern Europe, Latin America, the Middle East and Africa, part of Prosus group, one of the largest technology investors in the world is redefining the way people buy and sell online for our 300.000+ merchants and millions of consumers. As a...


  • Bogota, Colombia The Bridge Social A tiempo completo

    **Descripción**: **Somos The Bridge** The Bridge es la red de profesionales digitales más grande de LATAM con más de 230 mil rockstars. Startups, scale-ups, grandes empresas y consultorías de todo el mundo trabajan con nosotros para conseguir los proyectos más desafiantes e impactantes. ¿Estás listo para participar en el desafío? **La...


  • Bogota, Colombia Canonical - Jobs A tiempo completo

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...


  • Bogota, Colombia Wikimedia Foundation A tiempo completo

    **Summary** The Wikimedia Foundation is seeking a Senior Site Reliability Engineer (Databases). Our objective is to make the sum of all human knowledge available to everyone, and we persist most of this knowledge in MariaDB. Our project sites are some of the most highly trafficked on the internet, with more page views per engineer than any other site. As a...


  • Bogota, Colombia GfK A tiempo completo

    Country Colombia Job Family Technology We show the world what people want. Join GfK and help us shape tomorrow. As an NIQ company, we are the world's leading consumer intelligence firm, delivering the Full View on consumer behavior. We work to enable manufacturers and retailers better understand what consumers really want. Our name has inspired trust...

  • Site Reliability Engineer

    hace 3 semanas


    Bogota, Colombia Kyndryl A tiempo completo

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...

  • Site Reliability Engineer

    hace 3 semanas


    Bogota, Colombia Rappi A tiempo completo

    ¡Oye, es hora de que te unas a nosotros para mostrarle al mundo que somos la empresa que está cambiando paradigmas, donde revolucionamos las horas, los minutos y los segundos! ¿Quieres saber por qué Rappi? - ️ VEMOS OPORTUNIDADES donde otros ven problemas; - ️ VEMOS CERCANIA donde otros ven distancia; - ️ VEMOS ADRENALINA donde otros ven...

  • Site Reliability Engineer

    hace 3 semanas


    Bogota, Colombia IntegriChain A tiempo completo

    * Mission* **Duties** - Troubleshoot during production incidents - Operate with limited oversight - Demonstrate Practical Knowledge in two of the following areas: Release Management, Web Application Development, Linux Administration, Network/Data Center Operations, Source Control Management and Database Administration/Migration(NoSQL and SQL) - Contribute...


  • Bogota, Colombia Netskope A tiempo completo

    **About Netskope**: Today, there's more data and users outside the enterprise than inside, causing the network perimeter as we know it to dissolve. We realized a new perimeter was needed, one that is built in the cloud and follows and protects data wherever it goes, so we started Netskope to redefine Cloud, Network and Data Security. **About the role** The...


  • Bogota, Colombia NinjaOne, LLC A tiempo completo

    **About the Role** English resumes required **Location **- Ecuador, Colombia, Brazil and Mexico **What You’ll Be Doing**: - Spearhead the assurance of reliability, availability, and peak performance of our AWS-hosted databases, including RDS/Postgres, Elasticache, Aurora, and more. - Pioneer the establishment of cutting-edge monitoring, observability,...


  • Bogota, Colombia Infobip Ltd A tiempo completo

    At Infobip, we dream big. We value creativity, persistence, and innovation, passionately believing that it is through teamwork that we can all reach greater heights. Since 2006, we have been innovating at the edge of technological possibilities and are now shaping global communications of the future. Through 70+ offices on six continents, Infobip’s...


  • Bogota, Colombia Kyndryl Colombia SAS A tiempo completo

    **Why Kyndryl** Kyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...