Senior Network Engineer
Job Description
Remote Senior Site Reliability Engineer
Roles and Responsibilities
Supporting business infrastructure to ensure service availability, even outside of regular business hours when necessary.
Optimizing services across the company to manage costs, including right-sizing and deprecating systems.
Contributing to the technology strategy by guiding production and development technical architecture, maintaining high-quality standards, and fostering a culture of long-term thinking and innovation.
Overseeing technical scoping and planning for the team, guiding and empowering the development approach.
Researching new technologies to address future deployment, monitoring, and scaling needs.
Managing and participating in 24x7 on-call rotations to ensure site reliability and performance.
Defining best practices for monitoring, alerting, and incident management.
Leading and participating in root cause analysis and documenting procedures.
What We Offer
100% Remote Work - Work From Anywhere
Opportunity To Learn & Develop New Skills
An Open & Collaborative Work Environment
Cutting Edge Technology and Implementations
Generous Compensation based on Industry Standards + Benefits
Working hours: 9 AM - 5 PM EST (flexibility required during upgrades or critical issues for on-call support)
Requirements
BS in Computer Science or a related field, or equivalent work experience
5+ years of experience working with cloud infrastructure (GCP preferred, AWS, Private Cloud, etc.) in a secure environment (ISO 27001, SOC 2 Type 2, GDPR, etc.).
4+ years of technical operations experience, with a background in SaaS and cloud-based platforms.
Experience with environments that use container orchestration tools such as Kubernetes.
Experience building scalable and fault-tolerant systems.
Experience successfully leading one or more DevOps projects (CI/CD, pipeline tools, operations management, etc.) to completion using tools such as Jenkins, Helm, and Terraform.
Experience with system health monitoring tools such as New Relic, OpsGenie, and Uptime Robot.
Experience with databases, including relational and non-relational. Proficiency in MySQL and MS SQL is a plus.
Proficiency with scripting and/or programming languages - Bash, Python, and Golang preferred.
What you bring to the role
Ability to maintain, design, and build development and deployment systems.
Experience actively managing hosting at scale at multiple companies, ensuring reliability, stability, scalability, and 24x7 uptime.
Experience migrating from data centers to cloud-based solutions and migrating solutions from other cloud providers.
Understanding of DevOps as a culture and practice in organizations of our size or larger.
Comfortable in a fast-paced development environment.
Familiarity with intranet tools and processes, including Confluence, Jira, and Microsoft Teams.
Excellent verbal and written communication skills.
Posted on: 13/02/2024
Application deadline: 13/03/2024