Prometheus & Grafana Expert

Detalles del empleo

Full Time | Valencia, Spain or Remotely within the CET/GMT time zone

If you like this offer, please send your CV mentioning the job title to: [email protected]

Location: Valencia, Spain, or Remote working on the CET time zone

Teleworking option: Yes

The resource MUST have the following skills and experience:

Proven experience in deploying and managing Prometheus and Grafana in on-premises setupwith

multi-tenant and several support teams design.

Strong understanding of observability concepts and best practices.
Experience with related technologies (e.g., Kubernetes, Docker, Mimir, Loki, Tempo, Thanos, on premises infrastructure).
Proficiency in scripting and automation (e.g., Bash, Python).
Familiarity with infrastructure-as-code tools (e.g., Ansible, Terraform).
Experience with log management and tracing solutions (e.g., Loki, ELK stack, Jaeger).
Excellent problem-solving and analytical skills.
Ability to work independently and as part of a team.
Strong English communication skills for effective collaboration and training.

The resource SHOULD have the following skills and experience:

Experience in Canonical Observability Stack (COS) tools and juju is highly desirable.
Experience with Elastic (ELK Stack) for logging and analytics in on-premises setup is desirable.
Knowledge of other monitoring tool is a plus, especially Elastic, SCOM and Checkmk.
Programming skills in .NET C#, Python, Juju also desirable.
DevOps practice is desirable.

Soft skills:

Desirable certifications:

Teleworking Option:

On-call requirements: