Mustafa Bozkaya

DevOps & MLOps Engineer

Cloud | Kubernetes | CI/CD | AI Model Deployment

Professional Summary

7+ years of experience in CI/CD automation, cloud infrastructure, LLM optimization, and AI model deployment. Specializing in building scalable AI/ML workflows, automating infrastructure, and optimizing cloud-based AI deployments.

Cloud Expertise

Extensive experience with AWS, GCP, and Azure cloud platforms, specializing in serverless architectures and containerized deployments.

AWS
GCP
Azure

AI/ML Deployment

Specialized in deploying and optimizing large language models with techniques like LoRA fine-tuning and quantization.

LLMs
PyTorch
TensorFlow

CI/CD Automation

Built and optimized CI/CD pipelines that reduced deployment times by up to 60% and improved code quality.

GitHub Actions
Jenkins
ArgoCD

Kubernetes

Designed and managed Kubernetes clusters for high-availability deployments with auto-scaling capabilities.

K8s
Helm
Operators

Infrastructure as Code

Implemented IaC solutions using Terraform and other tools to create reproducible and scalable infrastructure.

Terraform
Pulumi
AWS CDK

Data Engineering

Built ETL pipelines and data processing systems for large-scale industrial and AI applications.

ETL
PostgreSQL
MongoDB

Professional Experience

Co-Founder

July 2024 – Present

Premium AI

Remote

  • Enterprise AI model development (LLMs, RAG-based retrieval systems, optimized AI APIs)
  • AI model deployment in GCP and Kubernetes for ultra-low latency inference
  • Reduced deployment cycles by 60% with CI/CD processes
LLMs
Kubernetes
GCP
CI/CD

DevOps & MLOps Engineer

July 2023 – July 2024

AI Planet

Remote

  • Deployed GPT-4 and LLaMA models on Kubernetes and GCP with 10,000+ requests/second capacity
  • Achieved 15% accuracy improvement in LLM optimization and RAG workflows
  • Reduced model size by 40% using LoRA and 4-bit quantization
  • Implemented automated CI/CD processes with GitHub Actions and Jenkins, resulting in 40% faster deployments
GPT-4
LLaMA
Kubernetes
GCP
LoRA
Quantization
GitHub Actions
Jenkins

AI & Software Engineer

May 2022 – June 2023

EPİK ROBOTİK

Gaziantep, Turkey

  • Developed autonomous navigation systems with 95% accuracy using OpenCV, ROS Noetic, and PyTorch
  • Achieved 90% success rate in robotic command recognition using BERT-based NLP
  • Reduced inference time by 30% with CUDA acceleration and model distillation
OpenCV
ROS Noetic
PyTorch
BERT
CUDA
Model Distillation

AI R&D Engineer

March 2021 – May 2022

BOYAR KIMYA A.S

Turkey

  • Built predictive maintenance models using ML and cloud-based AI APIs
  • Designed ETL workflows to process large-scale industrial data
Predictive Maintenance
ML
ETL
Industrial Data

DevOps Engineer

2014 – 2020

Freelancer.com

Remote

  • Developed cloud-based applications with AWS Lambda, GCP Functions, and containerized Kubernetes deployments
  • Optimized CI/CD processes with GitHub Actions and Jenkins, reducing software release time by 50%
  • Developed e-commerce solutions with ERP integration (React, Node.js, PostgreSQL, Next.js)
AWS Lambda
GCP Functions
Kubernetes
GitHub Actions
Jenkins
React
Node.js
PostgreSQL
Next.js

Skills & Certifications

DevOps & CI/CD

Docker
Kubernetes
Terraform
Jenkins
GitHub Actions
GitLab CI

Cloud & Infrastructure

AWS
GCP
Azure
Kubernetes Clusters
Serverless
Infrastructure as Code

MLOps & AI Deployment

MLflow
TensorFlow
PyTorch
FastAPI
Hugging Face
Ray

Programming

Python
Bash
C++
Go
JavaScript
TypeScript

Monitoring & Logging

Prometheus
Grafana
ELK Stack
Datadog
New Relic
Jaeger

LLM Optimization

GPT-4
LLaMA
Claude.ai
LoRA
Quantization
RAG

Certifications

AWS Certified Solutions Architect – Associate

Google Cloud Professional Cloud Architect

Machine Learning Specialization (Coursera – Andrew Ng)

Terraform for DevOps Engineers (LinkedIn Learning)

Certified Kubernetes Administrator (CKA)

Deep Learning Specialization (Coursera)

