Full Time | Valencia, Spain or Remotely within the CET/GMT time zone
If you like this offer, please send your CV mentioning the job title to: [email protected]
Location: Valencia, Spain, or Remote working on the CET time zone
Teleworking option: Yes
The resource MUST have the following skills and experience:
- Proven experience in deploying and managing Prometheus and Grafana in on-premises setupwith
multi-tenant and several support teams design.
- Strong understanding of observability concepts and best practices.
- Experience with related technologies (e.g., Kubernetes, Docker, Mimir, Loki, Tempo, Thanos, on premises infrastructure).
- Proficiency in scripting and automation (e.g., Bash, Python).
- Familiarity with infrastructure-as-code tools (e.g., Ansible, Terraform).
- Experience with log management and tracing solutions (e.g., Loki, ELK stack, Jaeger).
- Excellent problem-solving and analytical skills.
- Ability to work independently and as part of a team.
- Strong English communication skills for effective collaboration and training.
The resource SHOULD have the following skills and experience:
- Experience in Canonical Observability Stack (COS) tools and juju is highly desirable.
- Experience with Elastic (ELK Stack) for logging and analytics in on-premises setup is desirable.
- Knowledge of other monitoring tool is a plus, especially Elastic, SCOM and Checkmk.
- Programming skills in .NET C#, Python, Juju also desirable.
- DevOps practice is desirable.
Soft skills:
- Customer facing experience and oral communication skills
- Ability to write documentation & reports
- Creativity / ability to find innovative solutions
- Willingness to learn on the job
- Conflict management & cooperation
- Experience and understanding complex enterprise environments
- Critical thinking and produce results
Desirable certifications:
- Relevant industry certifications
- Prometheus, Grafana certification
- Other Monitoring tools certifications
- Programming language, preferably either .NET C#, Python, Juju