Sr. Director, Network Operations Modernization (AI/ML)

US
Full-time
Remote
United States

We are looking for an experienced Sr. Director, Network Operations Modernization (AI/ML) to oversee and enhance the reliability and performance of our IT systems through strategic artificial intelligence and machine learning initiatives. This role involves leading a team of engineers, collaborating with cross-functional teams, and implementing best practices to ensure system resilience and efficiency.

PRIMARY RESPONSIBILITIES

  • Lead and mentor a team of IT reliability and automation engineers.
  • Develop an AI/ML solution for Operational Center activities.
  • Develop and implement strategies for automating repetitive tasks and improving system reliability.
  • Oversee the design, development, and maintenance of automation tools and scripts.
  • Collaborate with development, operations, and product teams to ensure seamless integration and deployment of new systems and features.
  • Monitor system performance and reliability, proactively identifying and addressing potential issues.
  • Establish and enforce best practices for system monitoring, incident response, and disaster recovery.
  • Analyze system failures and develop comprehensive solutions to prevent recurrence.
  • Maintain detailed documentation of system configurations, processes, and procedures.

REQUIRED KNOWLEDGE/SKILLS/ABILITIES

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Extensive experience in solving problems with AI and ML.
  • Proven leadership and team management skills.
  • Strong programming skills in languages such as Python, Go, or Java.
  • Experience with automation tools like Ansible, Puppet, or Chef.
  • Familiarity with monitoring tools such as Prometheus, Grafana, or Nagios.
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills.

PREFERRED KNOWLEDGE/SKILLS/ABILITIES

  • Experience with containerization technologies like Docker and Kubernetes.
  • Knowledge of cloud platforms such as AWS, Azure, or Google Cloud.
  • Understanding of CI/CD pipelines and tools like Jenkins or GitLab CI.