Data Engineer
hace 5 días
As founders of Nexus Repository and stewards of Maven Central, the world's largest repository of Java open-source software, we are software pioneers and our open source expertise is unmatched. We empower innovation with an unparalleled commitment to build faster, safer software and harness AI and data intelligence to mitigate risk, maximize efficiencies, and drive powerful software development.
More than 2,000 organizations, including 70% of the Fortune 100 and 15 million software developers, rely on Sonatype to optimize their software supply chains.
We're looking for a Data Engineer to join our growing Data Platform team. You'll play a key role in designing and scaling the infrastructure and pipelines that power the product features, analytics, and machine learning across Sonatype.
You'll work closely with stakeholders across product, engineering, and business teams to ensure data is reliable, accessible, and actionable. This role is ideal for someone who thrives on solving complex data challenges at scale and enjoys building high-quality, maintainable systems. What you'll do:
- Design, build, and maintain scalable data pipelines and ETL processes
- Architect and optimize data models and storage solutions for analytics and operational use
- Collaborate with other data engineers to deliver trusted, high-quality datasets
- Own and evolve parts of our data platform, specifically the streaming pipeline and Data Lake
- Implement observability, alerting, and data quality monitoring for critical pipelines
- Drive best practices in data engineering, including documentation, testing, and CI/CD
- Contribute to the design and evolution of our next-generation data lakehouse architecture
- 4+ years of experience as a Data Engineer or Backend engineering role
- Strong programming skills in Java and Python
- Proficient in writing complex SQL and optimizing queries for performance
- Proficient in English, and strong communication skills, including the ability to speak to other engineers, analysts, and demo or explain new features to non-engineers
- Some experience using AWS cloud-native tools, like S3, SNS, SQS, EC2, or EMR
- Familiarity with streaming data pipelines or real-time processing
- Hands-on experience with distributed data tools like Hadoop, HDFS, and Spark
- Know your way around Docker containers and the Linux command line
- Exposure to DynamoDB or similar NoSQL data stores
- Experience using Databricks to write queries and notebooks
- Experience supporting data products in production
- An understanding of data privacy, security, and compliance best practices
- Data with purpose: Work on problems that directly impact how the world builds secure software
- Modern tooling: Leverage the best of open-source and cloud-native technologies, including very modern versions of Java
- Collaborative culture: Join a passionate team that values learning, autonomy, and impact
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
-
Lead Data Engineer
hace 1 semana
Remote, Colombia Fusemachines A tiempo completo US$80.000 - US$170.000 al añoAbout FusemachinesFusemachines is a leading AI strategy, talent, and education services provider. Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI. With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic and more than 400 full-time employees)....
-
Senior Data Engineer
hace 7 días
Remote (Colombia) AspenView Technology Partners A tiempo completo US$60.000 - US$120.000 al añoAbout the roleWe are looking for an experienced Data Engineer with 5+ years of experience designing and implementing scalable data solutions in Azure. The ideal candidate will have a solid understanding of Azure data services, security, and DevOps, contributing to end-to-end data flow architecture and automation.What you will do:Design, develop, and maintain...
-
Senior Data Engineer
hace 5 días
Remote, Colombia Fusemachines A tiempo completo US$85.000 - US$135.000 al añoAbout FusemachinesFusemachines is a 10+ year old AI company, dedicated to delivering state-of-the-art AI products and solutions to a diverse range of industries. Founded by Sameer Maskey, Ph.D., an Adjunct Associate Professor at Columbia University, our company is on a steadfast mission to democratize AI and harness the power of global AI talent from...
-
Senior BI Data Engineer
hace 3 días
Remote (Colombia) AspenView Technology Partners A tiempo completo US$80.000 - US$150.000 al añoAbout the roleWe are seeking an Senior Data Engineer with a strong background in Business Intelligence (BI) data modeling, data marts design and development in Power BI, and deep knowledge of managing slowly changing dimensions and historical data structures. The ideal candidate will have demonstrated expertise in integrating batch and near-real-time data...
-
Mid-Level Data Engineer
hace 15 horas
Remote, Colombia Lean Solutions Group A tiempo completoDescription Company Overview: Lean Tech is a rapidly expanding organization situated in Medellín, Colombia. We pride ourselves on possessing one of the most influential networks within software development and IT services for the entertainment, financial, and logistics sectors. Our corporate projections offer many opportunities for professionals to elevate...
-
Senior Data Engineer — Azure
hace 3 días
Remote (Colombia) AspenView Technology Partners A tiempo completo US$80.000 - US$120.000 al añoAbout the roleWe are seeking a Senior Data Engineer with strong experience in Azure Data Factory (ADF) and SQL Server to support a modernization initiative that transitions our current data integration landscape to Microsoft Fabric and Fabric Link. The ideal candidate will design and implement scalable data pipelines that bring together data from ERP...
-
Mid Data Engineer
hace 2 semanas
Remote, Colombia Lean Tech A tiempo completo $40.000 - $80.000 al añoDescriptionCompany Overview:Lean Tech is a rapidly expanding organization situated in Medellín, Colombia. We pride ourselves on possessing one of the most influential networks within software development and IT services for the entertainment, financial, and logistics sectors. Our corporate projections offer many opportunities for professionals to elevate...
-
Senior Data Engineer
hace 2 semanas
Colombia, Remote Sezzle A tiempo completo US$60.000 - US$114.000 al añoThe salary range for this role is $5,000 - $9,500 per month (Gross in USD)About Sezzle:With a mission to financially empower the next generation, Sezzle is revolutionizing the shopping experience beyond payments, blending cutting-edge tech with seamless, interest-free installment plans that make shopping smarter and more accessible. We're not just...
-
Data Engineer
hace 15 horas
Colombia Moabits A tiempo completo**Description** We are looking for an experienced **Data Engineer** who will be responsible for designing, building, and maintaining the infrastructure required for processing and storing the data of the company and its services. You will to join our team in Latam (Remote) to help us to improve Moabits business strategy. **What you will be doing**: -...
-
Data Engineer
hace 3 días
Colombia Pangea Consultants A tiempo completo**Skills** - Previous experience as a Data engineer or in a similar role - ETL tools - Azure Synapse analytics, Databricks, Azure Machine Learning - Technical expertise with data models, data mining, and segmentation techniques - Knowledge of programming languages (e.g. Java and Python) - Hands-on experience with SQL database design - Degree in Computer...