We are seeking a highly motivated and experienced Storage Engineer to join our team and play a key role in managing multiple data storage infrastructures. The ideal candidate will have extensive experience with Weka and Ceph, or other parallel and object storage solutions. In this role, you will be responsible for various aspects of storage administration, including provisioning, configuration, monitoring, and troubleshooting. This is a customer-facing position.
Key Responsibilities
· Manage and maintain Weka and Ceph storage environments.
· Provision storage resources to meet application and user requirements.
· Configure and manage storage pools, volumes, and snapshots.
· Implement and maintain data protection strategies, including backups and replication.
· Monitor storage performance and capacity utilization.
· Identify and troubleshoot storage-related issues.
· Perform routine maintenance tasks.
· Stay up to date on the latest storage technologies and best practices.
· Document storage configurations and procedures.
· Participate in an on-call rotation to provide critical support for AI and HPC operations.
Work Conditions
Requirements
· Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
· 8+ years of experience in storage administration.
· Proven experience with Weka and Ceph or other parallel and object storage solutions (Panasas/Vdura, Lustre, VAST).
· Strong understanding of storage protocols (e.g., FC, iSCSI, NFS, S3).
· Experience with storage management software (e.g., Weka Home, VAST Management Console).
· Strong Linux systems administration skills and experience with open-source technologies.
· Excellent problem-solving and troubleshooting skills.
· Strong analytical and communication skills.
· Ability to work independently and as part of a team.
· Excellent time management and organizational skills.
Preferred Qualifications
· Experience with storage virtualization and container storage integration (e.g., NetApp SVM, Weka CSI).
· Experience with scripting and automation tools (e.g., Python, Bash, Ansible).
· Familiarity with storage over InfiniBand networks.
· Understanding of high-performance storage and parallel file systems used in HPC, AI, and cloud environments.
· Hands-on HPC cluster administration experience.
Benefits