Career Techniques Inc
Description
Responsibilities:
- Architect and implement scalable cloud infrastructure using Terraform (or equivalent IaC tools)
- Design high-performance compute environments, including optimized networking and storage stacks
- Manage and optimize Kubernetes clusters in cloud environments
- Automate infrastructure workflows and build complex CI/CD pipelines
- Build and maintain custom cloud images
- Implement robust monitoring and observability for performance-critical workloads
- Lead on-prem to cloud and cloud-to-cloud migrations of HPC and distributed workloads
- Collaborate with research/engineering teams to optimize compute, storage, and scheduling efficiency
Qualifications:
- A bachelor’s degree in Computer Science or a related field
- At least 5 years of experience
- Proficiency in Bash, Python, or Go for automation
- Strong expertise in HPC and distributed workloads, applied to designing scalable, high-performance cloud infrastructure
- Deep hands-on experience with Infrastructure as Code, Linux systems, cloud networking, and storage architectures
- Ability to build automated, production-grade cloud platforms for compute-intensive environments
- Deep understanding of cloud networking, storage architectures, and storage protocols (NFS, object, block, parallel/distributed storage)
- Expertise with Terraform or equivalent IaC tools
- Experience with HPC schedulers (Slurm/HTCondor)
- Experience with distributed computing frameworks such as Ray
Comp: $200-250K base + bonus
