Senior Platform Engineer (M/F)
Closer Consulting
08.05.2025 | | Referência: 2264299

PARTILHAR
Empresa:
Closer Consulting
Descrição da Função
(You must be living in Portugal at the moment)
The ideal candidate will play a crucial role in the operational management and continuous development of the NNDIP platform, ensuring its reliability and efficiency in the rapid deployment of AI solutions across NN.
Key Responsibilities:
- Lead the operational management and ongoing development of NNDIP, aligning with business needs for the fast deployment of AI solutions, and overseeing strategic planning and execution.
- Install, configure, and manage key platform components such as Seldon, Istio, Elasticsearch, Prometheus, and Grafana, ensuring seamless integration and optimal performance.
- Work closely with the infrastructure team to optimize resources and configurations, ensuring compatibility with Seldon and related machine learning workloads.
- Develop and enhance application components, contributing to the platform's evolution and expansion of its functionalities. Monitor, diagnose, and resolve platform issues while documenting operational procedures, best practices, and guidelines. Stay current with advancements in Kubernetes and MLOps to drive continuous improvement.
- Write and maintain Python code for platform enhancements and custom solutions. Operate and maintain the NNDIP in a production environment, ensuring high availability, performance, and security. Provide hands-on MLOps support to data scientists for model deployment.
Required Skills and Qualifications:
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- Solid experience in Kubernetes management; expertise in Seldon is a plus.
- Proficient in CI/CD, (Azure) DevOps, application security, and performance monitoring.
- Familiarity with tools such as ArgoCD, Istio, Terraform, Opensearch, Prometheus, and Grafana.
- Strong knowledge of Python, FastAPI, general API concepts, and unit testing.
- Proficiency in MLOps practices (MLFlow preferred), with experience supporting data scientists in machine learning model deployments.
- Strong understanding of cloud services, preferably AWS, and experience maintaining large-scale platforms.
Minimum Skills for Pre-selection:
- Proficiency in Python and FastAPI.
- Experience with Kubernetes, AWS, and Docker.
- Understanding of CI/CD processes and tools like ArgoCD.
- MLOps expertise: ability to build and manage machine learning models using Seldon and MLFlow.
- Knowledge of defensive programming, unit testing, and authorization flows.

Observações
Not Specified (Portugal)