Reliability Engineer

5 days ago


Barranquilla, Atlántico, Colombia · Toeshee · Full-time · $150,000 - $400,000 per year

We are seeking an experienced Database Reliability Engineer with a DBRE-focused background to join our expanding team, which applies DevOps/Reliability Engineering philosophies.

This role will be key in hardening, scaling, and optimizing our mission-critical databases to ensure world-class performance and reliability as we grow. As a Database Reliability Engineer, you will help us deliver automation, feature enhancements, and best-practice security on our existing hybrid cloud platforms. You'll also explore new technologies and integrations with other open-source products within our ecosystem.

This role demands database design expertise, query and schema tuning, and proactive monitoring to ensure the reliability and scale required to support our growing business needs. As part of this team, you will deliver services on top of robust database infrastructure, ensuring seamless scalability and uninterrupted service to our customers and end users.

Responsibilities

  • Improve the reliability and performance of the core database infrastructure that allows supported products to scale.
  • Deliver engineering solutions aligned to key business initiatives, ensuring they are scalable, stable, performant, and operationally efficient.
  • Ensure the highest level of uptime and Quality of Service (QoS) for our critical database environments through operational excellence.
  • Work with the Infrastructure and Engineering teams, including architects, on roadmap planning and architectural discussions to ensure our architectures can scale for the future.
  • Implement solutions for automating deployment, provisioning and managing large-scale database environments.
  • Deploy infrastructure as code using Terraform, and configure and maintain database instances so that high availability, disaster recovery, and backups are properly implemented.
  • Work closely with the Infrastructure team, engineering peers, security, and support teams to align database strategies with organizational goals.
  • Conduct regular performance analysis and tuning of database instances to ensure health and efficiency.
  • Collaborate with development teams to implement indexing strategies, improve schema design, and optimize application queries.
  • Contribute to the development and refinement of database standards, guidelines, and procedures.
  • Improve observability by implementing smart monitoring, tracing, and logging. Monitor overall database performance, adjusting configuration and resources to meet evolving needs.
  • Implement robust monitoring tools and proactive alerts to manage database instances effectively and mitigate issues early.
  • Implement and enforce stringent database security practices, including data encryption (at rest and in transit), certificate management, access controls, and comprehensive audit logging.
  • Act as the main point of contact for production incidents: perform root cause analysis, identify and resolve underlying problem patterns, and work towards automated, self-healing solutions.
  • Manage and maintain database clusters across multiple environments with high availability and cost efficiency.

What you need to succeed

  • Bachelor's degree in computer science or a related technical field.
  • At least 5 years of relevant production experience supporting highly available, mission-critical database environments at scale. Strong understanding of areas such as database backups, replication, security, DevOps for databases (IaC, CI/CD), observability, and disaster recovery.
  • Deep understanding of PostgreSQL architecture, including replication (logical and physical), WAL, vacuuming, checkpointing, and query planning.
  • Hands-on experience with high-availability solutions (Patroni, repmgr, or custom clustering).
  • Working knowledge of MySQL, including replication, performance tuning, and backup strategies.
  • Experience with cloud database technologies, including working with hyperscale cloud providers (AWS, Azure, and/or GCP) and running database environments at scale on virtual machines (Amazon EC2, Azure VM).
  • Experience planning, executing, and managing large-scale system deployments, ensuring high availability and performance.
  • Database migration experience.
  • Experience with configuration management and infrastructure-as-code tools such as Puppet, Terraform, and Ansible.
  • Proficiency with monitoring and alerting tools (Prometheus, Grafana, Icinga, Nagios)
  • Experience with CI/CD pipelines and Git-based workflows.
  • Good knowledge of storage, networking, and other systems directly impacting database performance.
  • Solid knowledge of Linux/Unix system administration.
  • Strong scripting and automation skills (Python, Bash…).
  • Demonstrated ability to work in a fast-paced, collaborative environment, with excellent problem-solving and communication skills.
