Ai data engineer

Werkgever:
NXTminds
Regio:
Houten
 
Functieomschrijving

About the vacancy

As an AI Data Engineer, you are the bridge between raw data and intelligent applications. You will take full end-to-end responsibility for datasets—covering ingestion, transformation, security, monitoring, and reliability. Working within a multidisciplinary, international AI team, the focus is on developing an innovative platform that supports technicians in diagnostics and maintenance, enabling maximum uptime and fast detection of potential vehicle issues. Responsibilities include designing scalable data pipelines, integrating diverse data sources (including IoT), and delivering data products ready for AI models. In close collaboration with data scientists, software engineers, and product owners, this role lays the groundwork for an AI-driven platform that has a tangible impact on daily operations.


Information about the organization

You will join an innovative unit within a company rooted in the industrial manufacturing sector. With a strong focus on digitalisation and AI, the organisation embraces technology to enable sustainable growth. The team operates with autonomy and a strong emphasis on experimentation, ownership, and collaboration. Technology is not an end in itself but a tool to empower skilled professionals and optimise complex processes. This is an environment where impactful AI solutions are built (and applied) directly in practice.

Job description


Deliverables

  • Design and maintain scalable data pipelines for batch and real-time processing
  • Ingest and integrate data from cloud and on-premise sources
  • Develop data products and knowledge graphs for AI applications
  • Optimise data flows for performance and cost-efficiency
  • Ensure secure, governed, and reliable data infrastructure
  • Monitor system performance and manage logging
  • Enable seamless data integration into a central web application
  • Apply CI/CD principles, version control, and documentation best practices
  • Work with tools such as Azure Data Factory, Azure Foundry, Databricks, Python, (No)SQL, Gremlin, and (Py)Spark


Requirements

  • Experience with Azure (Foundry, Azure ML), Databricks, and cloud-native data solutions
  • Knowledge of NoSQL databases (e.g., Cosmos DB) and graph technologies (e.g., Neo4j, Gremlin, GraphQL)
  • Proficiency in Python, (No)SQL, and (Py)Spark with proven skills in building robust data pipelines
  • Familiarity with CI/CD, version control (Azure DevOps or GitHub), and modern data workflows
  • Strong foundation in designing secure and scalable data flows
  • Plus: full-stack experience and familiarity with deployment pipelines
  • Proven interest in or experience with AI/ML, MLOps, deep learning, or LLMs
  • Or a strong background in software engineering (building APIs, applications, supporting deployment)
  • Socially adept, proactive, and solution-oriented within a small team

 Kernwoorden