
Medior DevOps/AIOps Engineer
- Hybrid
- Amsterdam, Noord-Holland, Netherlands
- Core AI
Job description
About TKH
TKH Group is a global technology company, delivering innovative, client-centric solutions across high-growth markets. With a global presence, TKH operates through three divisions: Smart Vision, Smart Manufacturing, and Smart Connectivity. The TKH Artificial Intelligence Hub (TKH AI Hub) is dedicated to tackling complex AI challenges for our global operating companies.
The TKH AI team is seeking a skilled and motivated Medior DevOps/AIOps Engineer. In this role, you will help bridge the gap between development and operations, implementing and maintaining cloud infrastructure while incorporating AI-driven operations principles.
About the Role
As a Medior DevOps/AIOps Engineer, you will be responsible for designing, implementing, and maintaining cloud infrastructure on Microsoft Azure, with a strong focus on Kubernetes ecosystems. You will collaborate with development teams to automate deployment processes, enhance system security, and optimize application performance through modern DevOps practices and AI-powered operations.
Key Responsibilities
Design, implement and maintain cloud infrastructure on Microsoft Azure
Build and optimize Kubernetes-based deployments and services
Implement Infrastructure as Code (IaC) using Terraform and other tools
Automate CI/CD pipelines using GitLab, Azure DevOps, or GitHub
Configure and optimize monitoring solutions for applications and infrastructure
Implement security best practices and conduct regular vulnerability assessments
Troubleshoot and resolve infrastructure and application issues
Collaborate with development teams to improve system architecture and performance
Support AI model deployment and infrastructure requirements
Document technical processes and share knowledge with the team
Job requirements
Cloud Technologies
Strong experience with Azure Cloud services including:
Microsoft Entra ID (formerly Azure AD)
Azure Management Groups, Subscriptions, and Resource Hierarchy
Azure Policies, Advisor, and Billing management
Azure Monitoring (Application Insights, Container Insights, Network Insights, Alerts)
Azure Networking (Virtual Networks, Peering, NAT Gateway, Subnets, Firewalls)
Azure Security (Security Groups, Role-Based Access Control)
Azure Services (App Services, Function Apps, Load Balancer)
Azure Kubernetes Service (AKS)
Azure Storage (Storage Accounts, Disks, Container Registry)
Virtual Machines and Bastion
Kubernetes & Containers
Proficiency with Kubernetes resources (Deployments, DaemonSets, StatefulSets, Services, Ingress, etc.)
Experience with Pod Security Standards and RBAC
Knowledge of Kubernetes Operators and Controllers
Package management with Helm Charts and Kustomize
Containerization using OCI tools & runtimes such as Podman and Docker
Infrastructure & Networking
Strong Linux administration skills
Network architecture and troubleshooting capabilities
Infrastructure as Code using Terraform
GitOps workflow implementation
Development & Security
Programming experience in Python, Golang, C#, or similar languages
Understanding of REST APIs and Microservices architecture
Knowledge of security best practices and penetration testing
Experience with DevSecOps and shift-left security principles
Familiarity with CI/CD pipelines in Azure DevOps/GitLab/GitHub
Interpersonal Skills
Active listening and clear communication (both verbal and written)
Emotional intelligence and empathy when working with team members
Collaborative approach to problem-solving
Ability to provide and receive constructive feedback
Conflict resolution and diplomatic skills
Adaptability when working with different stakeholders and situations
Reliability and trustworthiness in handling responsibilities
Inclusive attitude that values diverse perspectives
Positive, solution-focused mindset
General Skills & Attributes
Experience with migration projects and system configuration
Understanding of system architecture and domain-driven design principles
Strategic thinking and hands-on technical leadership
Problem-solving skills with a logical approach
Experience with Agile methodologies and Scrum
Growth mindset and willingness to learn new technologies
Basic understanding of database design (particularly PostgreSQL)
Experience with application monitoring and performance optimization
Knowledge of test-driven development (TDD) and unit testing
Passion for exploring and implementing innovative AI solutions
AI Integration (a plus)
Knowledge of AI tooling and frameworks (TensorFlow, PyTorch, etc.)
Experience with AI model serving platforms (MLflow, vLLM, Hugging Face TGI, TensorFlow Serving, ONNX Runtime, etc.)
Familiarity with MLOps practices and tools
Understanding of infrastructure requirements for AI/ML workloads
Experience with GPU acceleration and optimization for AI models
Enthusiasm for AI technologies and their practical applications
Qualifications
Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
3-5 years of experience in DevOps, Site Reliability Engineering, or similar roles
Demonstrated experience with cloud platforms, preferably Azure
Proven track record of implementing and maintaining Kubernetes environments
Certifications in Azure, Kubernetes, or related technologies are a plus
What We Offer
Opportunity to work on diverse, high-impact AI projects
Collaborative environment with leading technology experts
Professional development and growth opportunities
One-year initial contract, with the potential for a permanent position after successful completion
Competitive salary and benefits package
Hybrid working environment with a minimum of 3 days per week in our Amsterdam office
Remote working days available