Direct · 17 days ago

Data Engineer

Brainlabs
Hybrid · Full-time
Semi-Senior · 2+ years
Negotiable
This listing was originally published in English, so you will probably need English for this role. The description may have been machine-translated into Spanish; if in doubt, check the original listing via the apply button.

Brainlabs is looking for a Data Engineer with Google Cloud Platform (GCP) experience to design, build, and manage scalable data solutions, including pipelines for AI and GenAI applications. Experience with Python, SQL, and GCP tools is required.

Why apply

If you enjoy working with data and are interested in AI and GenAI, this role at Brainlabs is for you. You will design and build scalable solutions on Google Cloud while gaining experience with pipelines for cutting-edge applications.

Job description

<div class="content-intro"><p>Brainlabs is the media agency built to answer one question: what's actually driving profit? Founded in 2012 by Daniel Gilbert, we were built by engineers before we were a media agency. Today, 1,000+ Brainlabbers across five continents use our proprietary agents, built on 32 media tools and over 2,500 logged experiments, to help brands connect every channel they plan and buy to one thing: the bottom line.</p></div><p>We are looking for a motivated and detail-oriented <strong>Data Engineer</strong> with 2+ years of experience in designing, building, and managing scalable data solutions on Google Cloud Platform (GCP). The ideal candidate will have a strong background in data engineering and cloud-based architectures, and proficiency in implementing data pipelines to transform raw data into actionable insights. Experience building or supporting AI and GenAI data workflows, including pipelines for LLM applications and AI/ML model training, is a strong plus.&nbsp;</p> <p><strong>What you do</strong></p> <ul> <li><strong>Data Pipeline Development:</strong></li> <ul> <li>Design, develop, and maintain ETL/ELT pipelines using GCP tools like Cloud Functions, Cloud Run, Dataflow, Dataproc, or Cloud Data Fusion.</li> <li>Ensure data pipelines are scalable, efficient, and optimised for performance.</li> </ul> <li><strong>AI &amp; GenAI Process Development:</strong></li> <ul> <li>Build and manage data pipelines that support LLM and GenAI applications, including Retrieval-Augmented Generation (RAG) architectures, vector data stores, and prompt context assembly workflows.</li> <li>Curate and prepare datasets for AI/ML model training, covering feature engineering, labeling pipeline oversight, and data versioning using tools like Vertex AI Feature Store or DVC.</li> </ul> <li><strong>Data Integration and Storage:</strong></li> <ul> <li>Integrate data from various sources into GCP services such as BigQuery, Cloud Storage, and Cloud SQL.</li> <li>Design and 
implement data warehouse/mart solutions using BigQuery for analytics and reporting.</li> </ul> <li><strong>Data Transformation &amp; Optimization:</strong></li> <ul> <li>Build transformation logic using SQL, Python, or Spark for preparing clean and structured data.</li> <li>Optimise query performance and storage cost in BigQuery or other GCP storage systems.</li> </ul> <li><strong>Data Quality &amp; Monitoring:</strong></li> <ul> <li>Develop processes to ensure data quality, integrity, and consistency across the pipeline.</li> <li>Implement monitoring and logging systems using tools like Stackdriver or Looker.</li> </ul> </ul> <ul> <li><strong>Requirement Understanding</strong>:&nbsp;</li> <ul> <li>Understand and interpret business and technical requirements to support data development tasks.&nbsp;</li> <li>Assist in building, testing, and maintaining data pipelines while ensuring alignment with project objectives and stakeholder needs.</li> </ul> </ul> <ul> <li><strong>Collaboration &amp; Communication:</strong></li> <ul> <li>Work closely with cross-functional teams, including data analysts, data scientists, and business stakeholders, to understand requirements.</li> <li>Provide technical guidance on GCP best practices and tools.</li> </ul> <li><strong>Documentation &amp; Maintenance:</strong></li> <ul> <li>Maintain clear documentation of processes, workflows, and data architecture.</li> <li>Ensure regular maintenance and version control of pipelines and scripts.</li> </ul> </ul> <p><strong>Who you are</strong></p> <p><strong>Mandatory Skills:</strong></p> <ul> <li> <ul> <li>Hands-on experience with GCP services like Cloud Functions, Cloud Run, Cloud Scheduler, BigQuery, Dataflow, Pub/Sub, and Cloud Storage.</li> <li>Strong programming skills in Python and SQL.</li> <li>Knowledge of data modelling, schema design, and query optimization techniques.</li> <li>Experience in building batch and streaming data pipelines.</li> <li>Excellent communication and collaboration 
skills.</li> <li>Ability to work in a fast-paced and dynamic environment.</li> </ul> </li> </ul> <p><strong>Preferred Skills:</strong></p> <ul> <li> <ul> <li>Familiarity with orchestration tools like Apache Airflow, Cloud Composer, or similar.</li> <li>Working experience with another cloud stack for ETL (AWS or Azure) is a plus.</li> <li>Experience with GCP’s AI/ML platform (Vertex AI, BigQuery ML, or AutoML) for building, evaluating, or serving models is a strong advantage.</li> <li>Hands-on experience building or supporting LLM/GenAI pipelines using frameworks such as LangChain, LlamaIndex, or Vertex AI Agent Builder.</li> <li>Familiarity with AI/ML data preparation practices, including feature engineering, dataset curation, and data versioning for model training workflows.</li> <li>Knowledge of CI/CD practices and tools like Git, Jenkins, or Terraform for pipeline deployments.</li> <li>Understanding of data security, governance, and compliance practices on GCP.</li> </ul> </li> </ul> <h3>Education &amp; Certifications:</h3> <ul> <li>Bachelor’s degree in Computer Science or Engineering</li> <li>GCP Data Engineer or Associate Cloud Engineer certification (preferred but not mandatory)</li> </ul> <p><strong>How you succeed</strong></p> <ul> <li>You successfully deliver complex projects on time and within scope, with clear stakeholder alignment throughout</li> <li>Your strategic recommendations and measurement frameworks directly influence client business decisions and outcomes</li> <li>Clients view you as a trusted advisor who deeply understands their business and data landscape</li> <li>You actively develop the skills and careers of team members under your mentorship</li> <li>You drive innovation through reusable frameworks, templates, and process improvements that benefit the broader team</li> <li>You maintain high client satisfaction</li> </ul> <p><strong>What's in it for you ✨</strong></p> <ul> <li>This is a full-time job (<em>en relación de dependencia</em>).</li> 
<li>Hybrid salary (50% of net salary paid in USD).</li> <li>20 working days of vacation plus all Argentina public holidays.</li> <li>Private healthcare (OSDE 360).</li> <li>Adaptive/hybrid working from our offices in HIT Polo (Palermo).</li> <li>Free breakfast and lunch when in office.</li> <li>Access to learning and development opportunities.</li> <li>Mobility programmes: work from another country for up to 30 days!</li> </ul> <div class="content-conclusion"><p><strong>What happens next?</strong></p> <p>We know searching for a job is tough and that you want to find the best career and employer for you. We also want to ensure that this position is the best fit for both you and us. Therefore, you will participate in a comprehensive interview process that includes skills interviews with our team. The goal of this process is to allow you to get to know us as we learn more about you.</p> <hr> <p style="text-align: center;"><em>Brainlabs actively seeks and encourages applications from candidates with diverse backgrounds and identities. We are proud to be an equal opportunity workplace: we are committed to equal opportunity for all applicants and employees regardless of age, disability, sex, gender reassignment, sexual orientation, pregnancy and maternity, race, religion or belief, and marriage and civil partnership. If you have a disability or special need that requires accommodation during the application process, please let us know!</em></p> <p style="text-align: center;"><em>Please note that we will never ask you to transfer cash or make any other payment to us in order to apply for a role or to work for Brainlabs. Any such requests are fraudulent and should be reported to the appropriate authorities in your area.</em></p></div>

Responsibilities

  • Design, develop, and maintain ETL/ELT pipelines on GCP (Cloud Functions, Cloud Run, Dataflow, Dataproc, Cloud Data Fusion)
  • Build and manage data pipelines for LLM and GenAI applications (RAG, vector stores, prompt context assembly)
  • Curate and prepare datasets for AI/ML model training (feature engineering, data versioning)
  • Integrate data from various sources into GCP services (BigQuery, Cloud Storage, Cloud SQL)
  • Design and implement data warehouse/mart solutions in BigQuery
  • Build transformation logic with SQL, Python, or Spark
  • Optimise query performance and storage costs in BigQuery
  • Develop processes to ensure data quality, integrity, and consistency
  • Implement monitoring and logging systems (Stackdriver, Looker)
  • Interpret business and technical requirements
  • Collaborate with cross-functional teams (data analysts, data scientists, stakeholders)
  • Provide technical guidance on GCP best practices and tools
  • Maintain clear documentation of processes, workflows, and data architecture
  • Perform regular maintenance and version control of pipelines and scripts

Required skills

Data pipeline design · ETL/ELT · Data transformation · Data integration · Data warehousing · Data modeling · Schema design · Query optimization · Data quality · Data monitoring · Python · SQL · Batch processing · Streaming processing · Communication · Collaboration · Working in dynamic environments · Attention to detail · Problem solving · Adaptability · Innovation · Leadership (mentoring)

Benefits

  • Full-time employment (en relación de dependencia)
  • 50% of net salary paid in USD
  • 20 days of vacation
  • Argentine public holidays
  • OSDE 360 health coverage
  • Hybrid working
  • Free breakfast and lunch in the office
  • Learning and development opportunities
  • International mobility programmes