Cloud Computing Engineer - Remote
Description
SAIC is looking for a highly motivated and experienced Cloud Operations Implementation Engineer to join our internal Innovation Factory Cloud Operations team. This critical role combines operations management, continuous monitoring, advanced AWS cloud configuration, security best practices, Infrastructure as Code (IaC) development, and ITIL framework knowledge. The successful candidate will be instrumental in ensuring the optimal performance, reliability, and security of our cloud-based services. The selected candidate will be an expert in all aspects of Cloud Operations, Automation, Scripting, and Development who readily adapts to support a variety of challenging technical requirements in a dynamic and fast-paced environment.
This position is fully remote within the U.S., with up to 25% travel annually.
The Cloud Operations Implementation Engineer will provide technical expertise and commercial best practices to optimize Cloud resource consumption and costs for AWS and Azure. They will evolve and improve our Cloud Operations processes, procedures, common services, and operations and sustainment activities with the intent to reduce Cloud resource consumption and costs without sacrificing performance and availability. They will be responsible for sustaining our Cloud Services and offerings, capturing and analyzing incidents and events, optimizing system performance, and troubleshooting anomalies.
Key Responsibilities:
- Cloud Operations Management:
  - Implement, manage, and monitor all cloud-based infrastructure and systems to ensure high availability, performance, and scalability.
  - Work closely with development teams to support deployment and operation of applications.
  - Automate routine operational tasks and incident responses to ensure swift resolution of operational issues.
- Observability and Monitoring:
  - Develop and maintain robust monitoring solutions that provide visibility into our cloud applications and infrastructure.
  - Analyze system performance indicators and respond to alerts to identify and address issues proactively.
- AWS Cloud Configuration:
  - Configure and maintain AWS services, including EC2, RDS, S3, VPC, Lambda, and more as needed.
  - Apply best practices in the management of AWS resources to optimize performance and costs.
- Security:
  - Ensure the integrity and security of data in compliance with policy and industry best practices.
  - Manage identity and access controls, data encryption, and secure software deployment.
  - Regularly review and audit infrastructure to identify and remediate any vulnerabilities.
- Infrastructure as Code (IaC):
  - Develop and maintain infrastructure as code (IaC) using tools such as Terraform, AWS CloudFormation, or similar.
  - Promote code reuse and modularization to facilitate maintenance and scalability.
- ITIL Experience:
  - Utilize ITIL best practices for service management, change management, and incident management.
  - Contribute to continuous improvement of internal processes aligned with ITIL frameworks.