Reliability Operations Engineer

2 weeks ago


Bogota, Colombia Infobip Ltd Full-time

At Infobip, we dream big. We value creativity, persistence, and innovation, passionately believing that it is through teamwork that we can all reach greater heights.

Since 2006, we have been innovating at the edge of technological possibilities and are now shaping global communications of the future. Through 70+ offices on six continents, Infobip’s platform is used by almost 70% of the population, making it the largest network of its kind and the only full-stack cloud communication platform globally.

Join us on our mission to create life-changing interactions between humans and online services with new and unseen solutions.

Your job will include improving the observability of our platform and collaborating with other engineers on common mitigation tactics. Automation is a big part of the job: we strive for meaningful alerting rather than being paged for every small glitch, so fine-tuning existing alerts and improving our processes are among our priorities.

Is your eye twitching when something breaks and you already have a list in your head of possible improvements? This is the place you're looking for.

What you will do:
- Monitor our products for issues, prioritize and triage them, and assess client impact
- Detect issues, identify them (affected systems, locations, responsible teams), and respond in a timely manner by utilizing runbooks
- Clearly communicate (summarize) and escalate platform incidents to responsible individuals
- Actively contribute to current runbooks and create new ones
- When an incident is reported, be the driver of the incident resolution (incident commander)
- Based on alerts, try to prevent an issue from becoming an incident

More about you:
- You have an engineering or support background and a passion for IT, with at least 1 year of prior experience in the same or a similar job
- You have experience with tools for monitoring systems (Grafana, Prometheus, New Relic, Graylog, Kibana, Elasticsearch, OpenSearch)
- You have a strong system-thinking and problem-solving mindset
- You are genuinely interested in how things work, and driven when they don’t
- You have strong analytical and investigative skills combined with the ability to navigate through substantial amounts of data to gather critical information in a timely manner
- You are genuinely interested in site reliability and want to learn about mitigation tactics
- Hands-on knowledge of system administration tasks is an advantage, but not a prerequisite
- You can speak fluently to clients and colleagues alike, and have a great command of English
- You can exhibit an advanced level of teamwork, excellent communication skills, and a high degree of independence
- You are efficient in execution and inclined toward continuous improvement, experimentation, and self-education

What kind of people we are looking for:
- tech savvy
- curious with attention to detail
- critical thinkers
- system-knowledge, holistic view
- enjoys troubleshooting
- responsible
- clear communicator
- problem solver
- willing to teach / mentor others

When you become a part of Infobip you can expect:
- Awesome clients - We serve and partner with the majority of the leading mobile operators, OTTs, brands, banks, social networks, aggregators and many more. Seriously, our clients are really cool. Work with the world’s leading companies and impact how they communicate with their users.
- Opportunity knocks. Often. - Being a part of a growing company in a growing industry - we challenge you not to grow. Whether it’s horizontal, vertical, or angular, we want to support the path that you want to carve.
- Learn as you grow - Starting with a fantastic onboarding program, to internal education, education resources, e-learning to external educations, we invest heavily in employee learning and development.
- Connect globally - Work with people from all over the world. We put the “global” in globalization.
- Pay & Perks - Competitive salary, a team taking care of all the equipment you need, team building and other organized activities... Talk about a balanced lifestyle.

Infobip employees are people with diverse backgrounds, characteristics, and experiences that share the same passion and talent that helps us achieve our mission. That's why Infobip is committed to creating a diverse workplace and is proud to be an equal-opportunity employer.

All qualified applicants will receive consideration for employment without regard to race, color, ancestry, religion, age, sex, sexual orientation, gender, gender identity, national origin, citizenship, disability, veteran status, or any other part of one's identity.



  • Bogota, Colombia Remoti Full-time

    **Site Reliability Engineer _SRE_ (Amadeus)** Hello, Bogota! We are looking for a Site Reliability Engineer for one of the biggest software companies in travel and hotels, Amadeus. The Service Reliability engineer will actively interface with software developers, network engineers, systems, storage, project management, and database...

  • Site Reliability Engineer

    2 weeks ago


    Bogota, Colombia Kyndryl Full-time

    Who We Are Kyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...


  • Bogota, Colombia Canonical - Jobs Full-time

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...


  • Bogota, Colombia Odisea | Cultsure Full-time

    **About the Role** **Locations**: Colombia only (remote) Come join us at Odisea and work with some of the most exciting start-ups in the US! Use your technology skills to develop a product that impacts the way communities’ hospitals, homes, sports stadiums, and schools are built across the world. Construction impacts the lives of nearly everyone in the...

  • Site Reliability Engineer

    4 weeks ago


    Bogota, Colombia Kyndryl Full-time

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...


  • Bogota, Colombia Swiftline Technologies LLC Full-time

    We are looking for a Site Reliability Engineer with experience in operations, platform automation, DevOps, and infrastructure engineering. This role will work closely with the engineering team to harden resources, operationalize new infrastructure, automate deployments, architect backend systems, and more. We’re looking for someone who can take ownership...

  • Site Reliability Engineer

    4 weeks ago


    Bogota, Colombia PayU Full-time

    **About PayU** PayU, a leading payment and fintech company in 50+ high-growth markets throughout Asia, Central and Eastern Europe, Latin America, the Middle East and Africa, and part of the Prosus group, one of the largest technology investors in the world, is redefining the way people buy and sell online for our 300,000+ merchants and millions of consumers. As a...


  • Bogota, Colombia Wikimedia Foundation Full-time

    **Summary** The Wikimedia Foundation is seeking a Senior Site Reliability Engineer (Databases). Our objective is to make the sum of all human knowledge available to everyone, and we persist most of this knowledge in MariaDB. Our project sites are some of the most highly trafficked on the internet, with more page views per engineer than any other site. As a...

  • Site Reliability Engineer

    3 weeks ago


    Bogota, Colombia Rappi Full-time

    Hey, it's time for you to join us and show the world that we are the company that is changing paradigms, where we revolutionize the hours, the minutes, and the seconds! Want to know why Rappi? - WE SEE OPPORTUNITIES where others see problems; - WE SEE CLOSENESS where others see distance; - WE SEE ADRENALINE where others see...


  • Bogota, Colombia Radware Full-time

    Security Operations Engineer - (2300005K) **Responsibilities Include** - Plan, implement, configure, and manage various security tools that protect the company’s infrastructure - Manage vulnerabilities and patching processes - Develop processes for improving operational security efficiencies - Work with the IT and Information Security teams to address...

  • Site Reliability

    3 weeks ago


    Bogota, Colombia Canonical - Jobs Full-time

    This role is an opportunity for a hands-on, but literally hands-off, technologist with a passion for Linux to build a career with Canonical and drive the success of those leveraging Ubuntu and open source products. If you have an affinity for operations automation and a passion for technology, then you will enjoy working with some of the best people in the...


  • Bogota, Colombia Torre Full-time

    We’re helping one of our clients, Agile Dream Team, hire a Site Reliability Engineering Leader. “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation: To be agreed upon. Location: Remote (anywhere). Skills: Site reliability engineering, Azure DevOps Server, SQL and...


  • Bogota, Colombia The Bridge Social Full-time

    **Description**: **We are The Bridge** The Bridge is the largest network of digital professionals in LATAM, with more than 230 thousand rockstars. Startups, scale-ups, large companies, and consultancies from all over the world work with us to land the most challenging and impactful projects. Are you ready to take part in the challenge? **The...


  • Bogota, Colombia GfK Full-time

    Country: Colombia. Job Family: Technology. We show the world what people want. Join GfK and help us shape tomorrow. As an NIQ company, we are the world's leading consumer intelligence firm, delivering the Full View on consumer behavior. We work to enable manufacturers and retailers to better understand what consumers really want. Our name has inspired trust...

  • Site Reliability Engineer

    3 weeks ago


    Bogota, Colombia Kyndryl Full-time

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...


  • Bogota, Colombia Sofvio Full-time

    We're helping one of our clients, **Agile Dream Team**, hire a **Site Reliability Engineering Leader.** “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation: **To be agreed upon.** Location: **Remote (anywhere).** Skills: **Proficient in Azure DevOps Server, and...


  • Bogota, Colombia NinjaOne, LLC Full-time

    **About the Role** English resumes required **Location** - Ecuador, Colombia, Brazil and Mexico **What You’ll Be Doing**: - Spearhead the assurance of reliability, availability, and peak performance of our AWS-hosted databases, including RDS/Postgres, Elasticache, Aurora, and more. - Pioneer the establishment of cutting-edge monitoring, observability,...


  • Bogota, Colombia Torre Full-time

    We're helping one of our clients, Agile Dream Team, hire a Site Reliability Engineering Leader. “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation: To be agreed upon. Location: Remote (anywhere). Skills: Proficient in Azure DevOps Server and SQL. Responsibilities and more: -...

  • Scada Engineer

    2 weeks ago


    Bogota, Colombia Capgemini Engineering Full-time

    **SCADA Engineer (Ignition)**: We are seeking a talented SCADA (Supervisory Control and Data Acquisition) Engineer with experience in Ignition to join our team. As a SCADA Engineer, you will be responsible for designing, developing, and maintaining SCADA systems using Ignition software to monitor and control industrial processes. RESPONSIBILITIES - Provide...