Talentsafari

Senior Systems Engineer

Talentsafari Remote Today
engineering

About Share

Share is building Africa's next-generation ISP ecosystem, a fully automated, software-defined network that deploys fiber, wireless, and intelligent routing into a unified bandwidth & compute ecosystem for ISPs to grow on top of.

About the Role

We're looking for a Senior Systems Engineer who will own the compute, storage, and platform infrastructure that powers our network. You'll build the systems that enable automated provisioning, AI-driven network operations, and real-time telemetry across multiple markets.

This is greenfield work, you'll make foundational decisions about how we run databases, deploy services, and instrument everything for intelligent automation. If you're excited about applying AI to network operations and want to build infrastructure that learns and adapts, this role is for you.

What You'll Do

  • Platform Infrastructure

    • Design and deploy virtualization platforms (Proxmox, KVM) across distributed PoPs

    • Manage Kubernetes clusters for platform services and microservices deployments

    • Architect storage systems (Ceph, ZFS, TrueNAS) for network telemetry, logs, and operational data

    • Build and maintain AAA infrastructure (FreeRADIUS backed by Percona MySQL cluster) supporting thousands of concurrent subscribers

    • Implement core services: DNS, NTP, syslog, DHCP, IPAM

  • Database & Data Systems

    • Deploy and optimize time-series databases (TimescaleDB, InfluxDB) for network metrics at scale

    • Design data pipelines that feed AI/ML models for network automation

    • Implement backup, replication, and disaster recovery strategies

    • Design unified data architectures that bridge physical infrastructure (fiber, devices, PoPs), logical network state (IP allocations, routing, VLANs), and operational telemetry into coherent, queryable systems

    • Build data pipelines that correlate network events with billing and SLA compensation workflows

  • AI-Driven Network Operations

    • Build the telemetry and observability stack that enables intelligent network management

    • Instrument systems to generate training data for predictive maintenance and anomaly detection

    • Work with the team to deploy AI models that automate fault detection, capacity planning, and traffic optimization

    • Create feedback loops between network events and automated remediation systems

  • Automation & Monitoring

    • Develop infrastructure-as-code (Ansible, Terraform) for repeatable deployments across markets

    • Implement comprehensive monitoring (Zabbix, Prometheus, Grafana, LibreNMS, Graylog)

    • Build alerting systems that reduce NOC workload through intelligent triage

    • Create tooling for automated provisioning of partner ISP infrastructure

    • Design internal tools as extensible platforms - monitoring, provisioning, and analytics systems you build will be productized and offered to partner ISPs on our backbone

Must Have

  • 5+ years in systems engineering or infrastructure roles

  • Strong Linux administration (we run Debian/Ubuntu)

  • Kubernetes administration—deploying, scaling, and troubleshooting clusters

  • Deep experience with PostgreSQL and/or MySQL—query optimization, replication, clustering, backup strategies

  • Virtualization and bare-metal provisioning at scale

  • Proficiency with network troubleshooting (tcpdump, Wireshark)

  • Infrastructure-as-code experience (Ansible, Terraform, or similar)

  • Scripting in Python or Bash

Nice to Have

  • Experience with time-series databases (TimescaleDB, InfluxDB, Prometheus long-term storage)

  • Experience with graph databases (Neo4j, AWS Neptune, or similar) for modeling network topology and dependencies

  • Familiarity with ML/AI infrastructure—model serving, feature stores, data pipelines

  • ISP or telecommunications background

  • IP Network and Routing Protocol (BGP, OSPF) fundamentals.

  • BGP/OSPF fundamentals and how routing integrates with systems infrastructure

  • Understanding of network security principles (firewall management, ACLs, secure access, vulnerability assessment)

  • Experience with Ceph or distributed storage systems

  • Exposure to network automation tools (NAPALM, Netmiko, or similar)

What We Offer

  • A foundational role in building infrastructure for Africa's next-generation connectivity platform

  • Competitive salary

  • Equity in a mission-driven, investor-backed company

  • Private health and wellness benefits

  • High ownership and direct impact on how millions connect to the internet

Sponsored

Explore Engineering

Skills in this job

People also search for