Site Reliability Engineer
hace 2 meses
Position Overview: This is for a "Follow the Sun" model with support in New Zealand, the Philippines and Columbia. We are seeking an experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have extensive experience in DevOps practices, continuous integration and continuous deployment (CI/CD) pipelines, and container orchestration with Kubernetes. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our integration platforms, with a focus on Java Spring applications. Key Responsibilities:
- Infrastructure Automation: Design, implement, and maintain infrastructure as code (IaC) using tools such as Terraform, Ansible, or Chef to automate the deployment and management of cloud infrastructure.
- CI/CD Pipeline Management: Develop and optimize CI/CD pipelines using GitHub Actions or other similar tools to automate build, test, and deployment processes for Java Spring applications.
- Kubernetes Orchestration: Deploy, configure, and manage Kubernetes clusters to orchestrate containerized workloads, ensuring high availability, scalability, and reliability.
- Monitoring and Alerting: Implement monitoring and alerting solutions using tools like Prometheus, Grafana, or ELK stack to proactively identify and address performance issues and service disruptions.
- Incident Response and Troubleshooting: Respond to and resolve incidents in a timely manner, conducting root cause analysis and implementing preventive measures to minimize the risk of recurrence.
- Performance Optimization: Identify opportunities for performance optimization and efficiency improvements in the infrastructure and application stack, collaborating with development teams to implement solutions.
- Security and Compliance: Implement security best practices and compliance standards (e.g., GDPR, HIPAA) in the infrastructure and application environments, ensuring data privacy and regulatory compliance.
- Documentation and Knowledge Sharing: Document system configurations, procedures, and troubleshooting steps, and share knowledge with the team to foster collaboration and continuous learning.
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- Extensive experience in DevOps practices, including infrastructure automation, configuration management, and CI/CD pipelines.
- Proficiency in GitHub pipelines and CI/CD practices, with hands-on experience in configuring and managing GitHub Actions.
- Strong expertise in container orchestration with Kubernetes, including cluster management, deployment, scaling, and monitoring.
- Solid programming skills in Java and experience with Java Spring framework.
- Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
- Knowledge of networking concepts, security principles, and best practices.
- Excellent problem-solving skills, attention to detail, and ability to work effectively in a fast-paced environment.
- Strong communication and collaboration skills, with the ability to work closely with cross-functional teams.
-
Site Reliability Engineer
hace 3 semanas
Colombia FullStack Labs, LLC A tiempo completoWe're seeking a skilled Site Reliability Engineer to join our team at FullStack Labs, LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our clients' cloud infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure solutionsCollaborate with...
-
Site Reliability Engineer
hace 3 semanas
Colombia FullStack Labs Inc. A tiempo completoThe Role:We're seeking a skilled Site Reliability Engineer to collaborate with our clients in a dynamic environment. You'll have the opportunity to work closely with our clients' teams, either by integrating directly into their teams or working on a FullStack product team to build and deliver a product.Key Responsibilities:5+ years of professional experience...
-
Site Reliability Engineer
hace 5 meses
Colombia, Huila Datavail A tiempo completo**About the Team** **Job: Site Reliability Engineer - Tier 2** **Experience: 2-5 years (Tier 2)** **Key Skills: Linux, AWS, Terraform** **Required Skills**: - At least 2 years of work experience with: - Linux, Windows, bash scripting, PowerShell and troubleshooting skills - We require at least one associate level cloud AWS certification - Able to...
-
Site Reliability Engineer
hace 1 mes
Colombia WIZELINE A tiempo completoAbout the RoleWizeline is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems and applications.Key ResponsibilitiesDesign, implement, and maintain scalable and highly available cloud...
-
Site Reliability Engineer
hace 3 semanas
Colombia FullStack Labs Inc. A tiempo completoThe Role:We're seeking a highly skilled Site Reliability Engineer to join our team at FullStack Labs Inc. As a Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our cloud infrastructure and applications.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure solutions.Collaborate with...
-
Site Reliability Engineer
hace 2 días
Colombia Captivate IO Ltd A tiempo completoPosition Overview: We are seeking an experienced Site Reliability Engineer to join our dynamic team at Captivate IO Ltd. The ideal candidate will have extensive experience in DevOps practices, continuous integration, and continuous deployment (CI/CD) pipelines. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance...
-
Site Reliability Engineer
hace 2 semanas
Colombia FullStack Labs Inc. A tiempo completoThe Role:We're seeking a skilled Site Reliability Engineer to join our team at FullStack Labs Inc. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems.Key Responsibilities:Design, implement, and maintain scalable and efficient systems.Collaborate with cross-functional teams to...
-
Site Reliability Engineer
hace 3 semanas
Colombia Captivate IO Ltd A tiempo completoJob Title: Site Reliability EngineerJob Summary:We are seeking an experienced Site Reliability Engineer to join our dynamic team at Captivate IO Ltd. The ideal candidate will have extensive experience in DevOps practices, continuous integration and continuous deployment (CI/CD) pipelines, and container orchestration with Kubernetes.Key...
-
Site Reliability Engineer
hace 2 semanas
Colombia FullStack Labs, LLC A tiempo completoAt FullStack Labs, LLC, we're on a mission to build exceptional software development teams for top companies. As a Site Reliability Engineer, you'll play a crucial role in ensuring the reliability and scalability of our clients' software systems.We're looking for a skilled engineer with a strong background in Golang or Rust, as well as experience in...
-
Azure DevOps Engineer
hace 5 meses
Colombia Axiom Path Inc A tiempo completo**Azure DevOps Engineer / Site Reliability Engineer** **Contract, 100% REMOTE** - In this role, you will leverage your DevOps expertise to design, automate, and streamline the software development lifecycle while playing a crucial role in maintaining website uptime. This role requires a strong ability to handle emergencies, troubleshoot website outages, and...
-
Site Reliability Engineer
hace 3 semanas
Colombia FullStack Labs, LLC A tiempo completoAt FullStack Labs, we're on a mission to build the most talented software development teams in the Americas. As a Site Reliability Engineer, you'll play a critical role in ensuring the smooth operation of our clients' distributed software systems. With a strong background in Golang or Rust, and experience in technologies like Python and Ruby, you'll be...
-
Senior Site Reliability Engineer
hace 3 semanas
Colombia Gorilla Logic A tiempo completoJob Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our SRE Cloud Space team. As a key member of our team, you will be responsible for developing and maintaining advanced observability solutions, enhancing blackbox and whitebox monitoring, and improving platform reliability across both...
-
Senior Site Reliability Engineer
hace 7 días
Colombia Wizeline A tiempo completoWizeline is a global digital services company that helps businesses build, scale, and deliver high-quality digital products and services. Our team thrives in solving customer challenges through human-centered experiences, digital core modernization, and intelligence everywhere.Your RoleAs a Senior Site Reliability Engineer at Wizeline, you will be...
-
Site Reliability Engineer
hace 2 semanas
Colombia Captivate IO Ltd A tiempo completoJob Overview: Captivate IO Ltd is seeking an experienced Site Reliability Engineer to join our team. The ideal candidate will have extensive experience in DevOps practices, continuous integration and continuous deployment (CI/CD) pipelines, and container orchestration with Kubernetes.Key Responsibilities:Infrastructure Automation: Design, implement, and...
-
Junior Site Reliability Engineer
hace 5 meses
Colombia Sana Commerce A tiempo completoMedellín- - IT**Junior Site Reliability Engineer**: - Medellín IT - At Sana Commerce we're committed to an inclusive environment and recognize that our diverse work\force is one of our greatest strengths._ It all started in 2007, with a pizza and a plan. **Sana Commerce is an e-commerce platform designed to help manufacturers, distributors and...
-
Senior Cloud Reliability Engineer
hace 2 semanas
Colombia WIZELINE A tiempo completoAt Wizeline, we're looking for a skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will play a key role in ensuring the reliability and scalability of our cloud-based systems.Key responsibilities include:Establishing and implementing observability requirements for monitoring, logging, and...
-
Senior Site Reliability Engineer
hace 3 semanas
Colombia Gorilla Logic A tiempo completoSenior Site Reliability Engineer As a Senior Site Reliability Engineer within the SRE Cloud Space team, you will be at the forefront of developing and maintaining advanced observability solutions. This role focuses on enhancing blackbox and whitebox monitoring, implementing synthetic tests, and improving platform reliability across both on-premise and GCP...
-
Site Reliability Engineer
hace 2 meses
Colombia WIZELINE A tiempo completoWizeline is a global digital services company helping mid-size to Fortune 500 companies build, scale, and deliver high-quality digital products and services. We thrive in solving our customer’s challenges through human-centered experiences, digital core modernization, and intelligence everywhere (AI/ML and data). We help them succeed in building digital...
-
Senior Site Reliability Engineer
hace 3 semanas
Colombia Wizeline A tiempo completoWizeline is a global digital services company helping mid-size to Fortune 500 companies build, scale, and deliver high-quality digital products and services. We thrive in solving our customer’s challenges through human-centered experiences, digital core modernization, and intelligence everywhere (AI/ML and data). We help them succeed in building digital...
-
Site Reliability Engineer
hace 3 semanas
Colombia WIZELINE A tiempo completoJob DescriptionWizeline is a global digital services company that helps mid-size to Fortune 500 companies build, scale, and deliver high-quality digital products and services. We thrive in solving our customers' challenges through human-centered experiences, digital core modernization, and intelligence everywhere (AI/ML and data). We help them succeed in...