About Us
Kharidle Online Enterprise is a growing technology-driven company focused on delivering reliable, scalable, and high-performance online services. We are seeking an experienced Principal Site Reliability Engineer (SRE) to lead the design, operation, and continuous improvement of our infrastructure and platform reliability.
Job Summary
As a Principal Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, performance, and security of mission-critical systems. You will provide technical leadership, establish reliability standards, and collaborate closely with engineering teams to build resilient services that support our business growth.
Key Responsibilities
- Lead the architecture, implementation, and optimization of highly available cloud infrastructure.
- Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and reliability metrics.
- Drive automation initiatives to improve operational efficiency and reduce manual intervention.
- Design and maintain monitoring, logging, alerting, and incident response systems.
- Lead root cause analysis and post-incident reviews to prevent recurring issues.
- Collaborate with software engineering teams to improve system reliability throughout the development lifecycle.
- Develop disaster recovery, backup, and business continuity strategies.
- Mentor and guide engineers on reliability engineering best practices.
- Evaluate and implement new technologies that improve platform stability and performance.
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field, or equivalent practical experience.
- 8+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or Infrastructure Engineering.
- Strong experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
- Expertise in Linux system administration and networking concepts.
- Experience with Infrastructure as Code tools such as Terraform or CloudFormation.
- Strong knowledge of containerization and orchestration technologies, including Docker and Kubernetes.
- Experience with CI/CD pipelines and automation tools.
- Proficiency in scripting or programming languages such as Python, Go, Bash, or Java.
- Excellent troubleshooting, communication, and leadership skills.
Preferred Qualifications
- Experience leading large-scale distributed systems.
- Professional cloud certifications.
- Experience with observability platforms and advanced monitoring solutions.
- Knowledge of security best practices and compliance standards.
Benefits
- Competitive salary of €72,000 per year.
- Flexible working arrangements.
- Professional development and training opportunities.
- Collaborative and innovative work environment.
- Paid vacation and public holidays.
- Career growth opportunities within a growing organization.
Sueldo: 72.000,00€ al año
Beneficios:
- Programa de formación
- Seguro dental
- Seguro de vida
- Seguro médico privado
Ubicación del trabajo: Empleo presencial