AI Data Engineer

Employer:
NXTminds

Region:
Houten

Position description

About the vacancy

As an AI Data Engineer, you are the bridge between raw data and intelligent applications. You take end-to-end responsibility for datasets, covering ingestion, transformation, security, monitoring, and reliability. You work within a multidisciplinary, international AI team that is developing an innovative platform to support technicians in diagnostics and maintenance, enabling maximum uptime and fast detection of potential vehicle issues. Your responsibilities include designing scalable data pipelines, integrating diverse data sources (including IoT), and delivering data products ready for AI models. In close collaboration with data scientists, software engineers, and product owners, you lay the groundwork for an AI-driven platform that has a tangible impact on daily operations.

Information about the organization

You will join an innovative unit within a company rooted in the industrial manufacturing sector. With a strong focus on digitalisation and AI, the organisation embraces technology to enable sustainable growth. The team operates with autonomy and a strong emphasis on experimentation, ownership, and collaboration. Technology is not an end in itself but a tool to empower skilled professionals and optimise complex processes. This is an environment where impactful AI solutions are built and applied directly in practice.

Job description

Deliverables

Design and maintain scalable data pipelines for batch and real-time processing
Ingest and integrate data from cloud and on-premise sources
Develop data products and knowledge graphs for AI applications
Optimise data flows for performance and cost-efficiency
Ensure secure, governed, and reliable data infrastructure
Monitor system performance and manage logging
Enable seamless data integration into a central web application
Apply CI/CD principles, version control, and documentation best practices
Work with tools such as Azure Data Factory, Azure Foundry, Databricks, Python, (No)SQL, Gremlin, and (Py)Spark

Requirements

Experience with Azure (Foundry, Azure ML), Databricks, and cloud-native data solutions
Knowledge of NoSQL databases (e.g., Cosmos DB) and graph technologies (e.g., Neo4j, Gremlin, GraphQL)
Proficiency in Python, (No)SQL, and (Py)Spark with proven skills in building robust data pipelines
Familiarity with CI/CD, version control (Azure DevOps or GitHub), and modern data workflows
Strong foundation in designing secure and scalable data flows
Plus: full-stack experience and familiarity with deployment pipelines
Proven interest in or experience with AI/ML, MLOps, deep learning, or LLMs
Or a strong background in software engineering (building APIs, applications, supporting deployment)
Socially adept, proactive, and solution-oriented within a small team

Keywords

Utrecht