Data Engineer

1 week ago


Colombia, Huila Datavail Full-time

The Data Engineer is responsible for designing, building, and maintaining the infrastructure and systems required for collecting, storing, and processing large datasets efficiently.

**Education**: Bachelor's degree in computer science, with 8+ years of experience

**Experience**:

- Technical Skills
  - Programming Languages: Proficiency in Python, SQL, Java, or Scala for data manipulation and pipeline development.
  - Data Processing Frameworks: Experience with tools like Apache Spark, Hadoop, or Apache Kafka for large-scale data processing.
- Data Systems and Platforms
  - Databases: Knowledge of both relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra).
  - Data Warehousing: Experience with platforms like Snowflake, Amazon Redshift, and Azure Synapse.
  - Cloud Platforms: Familiarity with AWS and Azure for deploying and managing data pipelines. Good experience with Fabric is an advantage.
  - Experience working with distributed computing systems such as Hadoop HDFS, Hive, or Spark.
  - Managing and optimizing data lakes and delta lakes for structured and unstructured data.
- Data Modeling and Architecture
  - Expertise in designing efficient data models (e.g., star schema, snowflake schema) and maintaining data integrity.
  - Understanding of modern data architectures like Data Mesh or Lambda Architecture.
- Data Pipeline Development
  - Building and automating ETL/ELT pipelines for extracting data from diverse sources, transforming it, and loading it into target systems.
  - Monitoring and troubleshooting pipeline performance and failures.
- Workflow Orchestration
  - Hands-on experience with orchestration tools such as Azure Data Factory, AWS Glue jobs, DMS, or Prefect to schedule and manage workflows.
- Version Control and CI/CD
  - Utilizing Git for version control and implementing CI/CD practices for data pipeline deployments.

**Key Skills**:

- Proficiency in programming languages such as Python, SQL, and optionally Scala or Java.
- Proficiency in data processing frameworks like Apache Spark and Hadoop is crucial for handling large-scale and real-time data.
- Expertise in ETL/ELT tools such as Azure Data Factory (ADF) and Fabric Data Pipeline is important for creating efficient and scalable data pipelines.
- A solid understanding of database systems, including relational databases like MySQL and PostgreSQL, as well as NoSQL solutions such as MongoDB and Cassandra, is fundamental.
- Experience with cloud platforms, including AWS and Azure, and with cloud data services such as S3, BigQuery (Google Cloud), and Azure Data Factory, is highly valuable.
- Data modeling skills, including designing star or snowflake schema, and knowledge of modern architectures like Lambda and Data Mesh, are critical for building scalable solutions.

**Role and Responsibilities**:

- Responsible for designing, developing, and maintaining data pipelines and infrastructure to support our data-driven decision-making processes.
- Design, build, and maintain data pipelines to extract, transform, and load data from various sources into our data warehouse and data lake.
- Proficient in Databricks: creating notebooks, working with catalogs and native SQL, creating clusters, parameterizing notebooks, and administering Databricks. Define security models and assign roles as required.
- Responsible for creating data flows in Synapse Analytics: integrating external source systems, creating external tables and data flows, and building data models. Schedule pipelines using jobs and triggers.
- Design and develop data pipelines using Fabric pipelines and Spark notebooks that access multiple data sources. Proficient in developing Databricks notebooks and in data optimization.
- Develop and implement data models to ensure data integrity and consistency. Manage and optimize data storage solutions, including databases and data warehouses.
- Develop and implement data quality checks and validation procedures to ensure data accuracy and reliability.
- Design and implement data infrastructure components, including data pipelines, data lakes, and data warehouses.
- Collaborate with data scientists, analysts, and other stakeholders to understand business requirements and translate them into technical solutions.
- Monitor Azure and Fabric data pipelines and Spark jobs, and work on fixes according to request priority.
- Responsible for data monitoring activities; good knowledge of reporting tools such as Power BI and Tableau is required.
- Responsible for understanding client requirements and architecting solutions on both the Azure and AWS cloud platforms.
- Monitor and optimize data pipeline performance and scalability to ensure efficient data processing.


  • Data Engineer BI

    1 week ago


    Colombia, Huila Blossom Full-time

    **Join Blossom!** Blossom is a growing ecosystem of fully integrated digital banking solutions designed by and for credit unions. We’re on a mission to empower credit unions with the tools they need to thrive in the digital era. We are currently looking for a **Data Engineer BI** to join our team. This role is essential to our continued growth and...

  • Data Engineer

    2 weeks ago


    Colombia, Huila Uberall Full-time

    **Help us bring people and businesses together** Our SaaS platform enables multi-location brands and businesses to boost their online presence in a rapidly evolving world. From big to small, from Adidas to ZenPark, our client base contains some of your favourite brands as well, we bet. **The Past, Present & Future** **The Past**: Founded in 2013 by David...

  • Data Engineer Sr

    2 weeks ago


    Colombia, Huila Tangelo Full-time

    Hello! We are Tangelo. We combine technology and data analysis to create financial products with efficient, user-focused processes. We are a diverse company that works as a team to create modern, innovative financial products while expanding credit opportunities to those who are outside the system...

  • Data Engineer

    1 week ago


    Colombia ttg Talent Solutions Full-time

    **JOB TITLE**: Data Engineer **LOCATION**: Medellín - Colombia **TYPE**: Full-Time **SCHEDULE**: Hybrid (3 days on-site / 2 days remote per week) **DESCRIPTION**: The Data Engineer III will be responsible for the design, development, and maintenance of our data infrastructure, with a strong emphasis on SQL development for automation and detailed analysis, cloud...


  • Colombia Emprego CO Full-time

    Greetings! At Smart Talent we have an opening for an “Azure Data Engineer” that may interest you or one of your colleagues, so please share it. 100% remote work. English C1. Salary COP $13,000,000; minimum 5 years of experience. **Profile**: We are looking for an Azure Data Engineer who will be responsible for expanding and...

  • Associate Data Engineer

    2 weeks ago


    Colombia, Huila TaskUs Full-time

    **About TaskUs**: TaskUs is a provider of outsourced digital services and next-generation customer experience to fast-growing technology companies, helping its clients represent, protect and grow their brands. Leveraging a cloud-based infrastructure, TaskUs serves clients in the fastest-growing sectors, including social media, e-commerce, gaming, streaming...

  • Data Engineer

    2 days ago


    Colombia Kualty Full-time

    **What we're looking for** Looking for a Data Engineer with 4+ years of experience. **Experience and knowledge** - At least 4 years of relevant experience; - Data warehousing experience, specifically with AWS: Glue, Redshift, Athena, S3, Spectrum, PySpark, Lambdas. - Applies ONLY to LATAM **Compensation & Benefits** - Work 100% Remote - Work from anywhere - We...

  • Data Engineer

    1 week ago


    Colombia Nearshore Business Solutions Full-time

    Job Title: Data Engineer (Snowflake & dbt Specialist) Location: Remote – Latin America Preferred Type of Contract: Contractor, Full-Time Salary Range: 4K to 6K USD/month Language Requirements: English – Advanced (Written & Spoken) We are seeking an experienced Data Engineer with strong expertise in Snowflake and dbt (Data Build Tool) to join our dynamic...

  • Data Engineer

    6 days ago


    Colombia, Huila FENARC Full-time

    **The company**: **Amphora** is a logistics solution for e-commerce. It has a network of warehouses and carriers, plus integrated technology, to optimize your supply chain, improving delivery times, shipping costs, and the customer experience. Its team will provide you with the tools you need so that you don't have to store,...

  • Data Intern

    1 week ago


    Colombia, Huila KIWI CAMPUS S.A.S. Full-time

    This is a remote position. At Kiwibot we are building the largest robotic last-mile delivery network to support operations in several environments from college campuses to cities across the US, Middle East & Asia. We believe that the future will be powered by clean and effective technological solutions and that everyone should have the access to receive...