We are looking for an experienced Sr. Director, Network Operations Modernization (AI/ML) to oversee and enhance the reliability and performance of our IT systems through strategic Artificial Intelligence and Machine Learning initiatives. This role involves leading a team of engineers, collaborating with cross-functional teams, and implementing best practices to ensure system resilience and efficiency.
PRIMARY RESPONSIBILITIES
- Lead and mentor a team of IT reliability and automation engineers.
- Develop an AI/ML solution for Operations Center activities.
- Develop and implement strategies for automating repetitive tasks and improving system reliability.
- Oversee the design, development, and maintenance of automation tools and scripts.
- Collaborate with development, operations, and product teams to ensure seamless integration and deployment of new systems and features.
- Monitor system performance and reliability, proactively identifying and addressing potential issues.
- Establish and enforce best practices for system monitoring, incident response, and disaster recovery.
- Analyze system failures and develop comprehensive solutions to prevent recurrence.
- Maintain detailed documentation of system configurations, processes, and procedures.
REQUIRED KNOWLEDGE/SKILLS/ABILITIES
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Extensive experience solving problems with AI and ML.
- Proven leadership and team management skills.
- Strong programming skills in languages such as Python, Go, or Java.
- Experience with automation tools like Ansible, Puppet, or Chef.
- Familiarity with monitoring tools such as Prometheus, Grafana, or Nagios.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills.
PREFERRED KNOWLEDGE/SKILLS/ABILITIES
- Experience with containerization technologies like Docker and Kubernetes.
- Knowledge of cloud platforms such as AWS, Azure, or Google Cloud.
- Understanding of CI/CD pipelines and tools like Jenkins or GitLab CI.