Reliability Operations Engineer

hace 4 días


Bogota, Colombia Infobip Ltd A tiempo completo

At Infobip, we dream big. We value creativity, persistence, and innovation, passionately believing that it is through teamwork that we can all reach greater heights.

Since 2006, we have been innovating at the edge of technological possibilities and are now shaping global communications of the future. Through 70+ offices on six continents, Infobip’s platform is used by almost 70% of the population, making it the largest network of its kind and the only full-stack cloud communication platform globally.

Join us on our mission to create life-changing interactions between humans and online services with new and unseen solutions.

Your job will include working on improving the observability of our platform, as well as collaboration with other engineers in common mitigation tactics. The automation is a big part of the job, as we strive to have meaningful alerting, rather than being triggered for every small glitch, so fine-tuning of existing alerts and improvements of the processes are one of our priorities.

Is your eye twitching when something breaks and you already have a list in your head of possible improvements? This is the place you're looking for.

What you will do:- Monitor our products for issues, prioritize, triage them, and assess client impact- Detect issues, identify them (affected systems, locations, responsible teams) and respond in a timely manner by utilizing runbooks- Clearly communicate (summarize) and escalate platform incidents to responsible individuals- Actively contribute to current runbooks and create a new ones- When an incident is reported, be the driver of the incident resolution (incident commander)- Based on alerts, try to prevent an issue becoming an incident

More about you:
- You have an engineering or support background and passion for IT with at least 1 year of prior experience in the same or similar jobs- You have an experience with tools for monitoring systems (Grafana, Prometheus, NewRelic, Graylog, Kibana, Elasticsearch, Opensearch )- You have a strong system-thinking and problem-solving mindset- You are genuinely interested into how things work, and driven when they don’t- You have strong analytical and investigative skills combined with the ability to navigate through substantial amounts of data to gather critical information in a timely manner- You are genuinely interested in site reliability and want to learn about mitigation tactics- Hands-on knowledge of a system administration tasks are an advantage, but not a prerequisite- You can speak fluently to clients, and colleagues alike, and have great command of English- You can exhibit an advanced level of teamwork, excellent communication skills and a high degree of independence- You are efficient in execution, prone to continuous improvements, experimentation, and self-education.

What kind of people we are looking for:
- tech savvy- curious with attention to detail- critical thinkers- system-knowledge, holistic view- enjoys troubleshooting- responsible- clear communicator- problem solver- willing to teach / mentor others

When you become a part of Infobip you can expect: Awesome clients - We serve and partner with the majority of the leading mobile operators, OTTs, brands, banks, social networks, aggregators and many more. Seriously, our clients are really cool. Work with the world’s leading companies and impact how they communicate with their users Opportunity knocks. Often. - Being a part of a growing company in a growing industry - we challenge you not to grow Whether it’s horizontal, vertical, or angular, we want to support the path that you want to carve. Learn as you grow - Starting with a fantastic onboarding program, to internal education, education resources, e-learning to external educations, we invest heavily in employee learning and development. Connect globally - Work with people from all over the world. We put the “global” in globalization. Pay & Perks - Competitive salary, a team taking care of all the equipment you need, team building and other organized activities... Talk about a balanced lifestyle
- Infobip employees are people with diverse backgrounds, characteristics, and experiences that share the same passion and talent that helps us achieve our mission. That's why Infobip is committed to creating a diverse workplace and is proud to be an equal-opportunity employer._
- All qualified applicants will receive consideration for employment without regard to race, color, ancestry, religion, age, sex, sexual orientation, gender, gender identity, national origin, citizenship, disability, veteran status, or any other part of one's identity._



  • Bogota, Colombia Canonical - Jobs A tiempo completo

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...


  • Bogota, Colombia Canonical - Jobs A tiempo completo

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...


  • Bogota, Colombia Odisea | Cultsure A tiempo completo

    **About the Role** **Locations**:Colombia only (remote) Come join us at Odisea and work with some of the most exciting start-ups in the US! Use your technology skills to develop a product that impacts the way communities’ hospitals, homes, sports stadiums, and schools are built across the world. Construction impacts the lives of nearly everyone in the...


  • Bogota, Colombia Remoti A tiempo completo

    One of the best technological partners in the travel industry is looking for a Database Reliability Engineer. They are a technological partner providing transaction processing power and technology solutions for the travel and tourism industry. They help over 1.5 billion people a year to connect to the travel ecosystem and they combine a deep understanding of...


  • Bogota, Colombia Remoti A tiempo completo

    One of the best technological partners in the travel industry is looking for a Site Reliability Engineer. They are a technological partner providing transaction processing power and technology solutions for the travel and tourism industry. They help over 1.5 billion people a year to connect to the travel ecosystem and they combine a deep understanding of how...

  • Site Reliability Engineer

    hace 2 semanas


    Bogota, Colombia Rappi A tiempo completo

    ¡Oye, es hora de que te unas a nosotros para mostrarle al mundo que somos la empresa que está cambiando paradigmas, donde revolucionamos las horas, los minutos y los segundos! ¿Quieres saber por qué Rappi? - ️ VEMOS OPORTUNIDADES donde otros ven problemas; - ️ VEMOS CERCANIA donde otros ven distancia; - ️ VEMOS ADRENALINA donde otros ven...

  • Site Reliability

    hace 2 días


    Bogota, Colombia Canonical - Jobs A tiempo completo

    This role is an opportunity for a hands-on, but literally hands-off, technologist with a passion for Linux to build a career with Canonical and drive the success with those leveraging Ubuntu and open source products. If you have an affinity for operations automation and a passion for technology, then you will enjoy working with some of the best people in the...

  • Site Reliability Engineer

    hace 2 semanas


    Bogota, Colombia Kyndryl A tiempo completo

    Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The...


  • Bogota, Colombia Sofvio A tiempo completo

    We're helping one of our clients, **Agile Dream Team**, hire for a **Site Reliability Engineering Leader.** “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation**:To be agreed upon.** Location**:Remote (anywhere).** Skills**:Proficient in Azure DevOps Server, and...


  • Bogota, Colombia NinjaOne, LLC A tiempo completo

    **About the Role** English resumes required **Location **- Ecuador, Colombia, Brazil and Mexico **What You’ll Be Doing**: - Spearhead the assurance of reliability, availability, and peak performance of our AWS-hosted databases, including RDS/Postgres, Elasticache, Aurora, and more. - Pioneer the establishment of cutting-edge monitoring, observability,...