Job Title : Mid-Level SRE -Site Reliability EngineerLocation : Remote
Duration: 6+ Months Contract
- Advanced Kubernetes Must have strong skills in Kubernetes at scale using one of GKE, AKS, EKS or RKE. Experience with Kubectl and
- Helm. Worked on EKS with Kubectl.
- Containers: Experience deploying Java (Spring Boot) microservices in dockerized environments.
- Observability Experience in setting up tools like Prom/Grafana, Datadog, AppDynamics, Splunk. to give actionable intel on a microservi environment including but not limited to synthetics, Application performance monitoring, logging and Alerting (Pagerduty/OpsGenie Integrations). Worked on elasticsearch and OpsGenie Integration.
- Good CI/CD expertise. Jenkins, Azure DevOps, Github Actions, ArgoCD, Artifactory, Azure container registry, Google container registry am other similar tooling. Worked on Jenkins, ArgoCD.
- SCM Working with tools like Github/Gitlab for source code management and well as experience with branching strategies like GitFlow and trunk based. Gitlab using GitFlow.
- Strong troubleshooting skills Be able to move all the way down to code level to give development teams a head start on application issues. Effectively be able to contribute to root cause analysis exercises post problem resolution.
- Good Communication Skills Respect. Active listening, verbal and non-verbal communication, Clarity and Concision, Confidence, Open-Mindedness,
- solution. Good Documentation skills Be able to effectively document any automation, technical efforts so as to ensure ease of adoptability of a
- Good collaboration skills Must be able to work effectively with Scrum/Dev teams with a push/pull (push back and prioritize work pulled in) philosophy in order to manage expectations and contribute to the stability and improvement of the platform.
Desirable skills/knowledge/experience:
- IAC Terraform Pulumi. Preferably developed modules in the past rather than just using them. – Terraform
- Security worked with encryption at rest, in transit patterns. Experience with tools like Azure Key vault, Hashicorp Vault, Google KMS.
- Security Experience with tools like Veracode, Blackduck for AppSec testing, Qualys scanners for infra testing and Twistlock/Aqua for
- container scanning. Qulays for OS Vulnerabilities scanning and fixing. Automation Must be able to identify toil and opportunities to reduce that within the team.
- Authentication/Authorization Familiarity with Authn/Authz schemes like OpenID, OAuth 2.0, SAML.
- Scripting and Programming Experience with Python, Powershell, Go, Java, Node.
- Event Driven/Event Sourcing Patterns. Familiarity with distributed event streaming platforms like Kafka, EventHub, RabbitMQ and patterns
- like CORS.
- Advanced Microservice Patterns: Familiarity Saga Choreography and Orchestration patterns.
