Industry: Software as a Service (SaaS) in AI
Location: 100% Remote, 95% Async, 100% in Writing
About Us:
AIPRM stands at the cutting edge of AI-driven solutions, renowned for pushing boundaries and setting industry benchmarks. We champion innovation, but also recognize the unsung heroes of tech: those who ensure that our platforms run like well-oiled machines, day in and day out. At AIPRM, you're more than just an engineer; you're the backbone of our digital world.
Summary:
We seek a seasoned Site Reliability Engineer, one who measures success in millions of rows, nanoseconds, and petabytes. Your invaluable experience in overseeing both development environments and vast production systems for mass data processing is exactly what we're after. If you possess an intricate knowledge of IT system architecture combined with unmatched database and networking skills, we invite you to be the linchpin of our operations.
What You'll Do:
- Continuously analyze and enhance system performance metrics.
- Spearhead the specification, design, and implementation of cutting-edge data components and systems.
- Proactively monitor, optimize, and test ClickHouse clusters to ensure optimal performance and reliability.
- Review, refine, and optimize database backup & recovery processes.
- Investigate and harden security across all layers.
- Collaborate directly with our founders to develop high-performance AI-driven solutions.
- Design and implement efficient migration tools to seamlessly transition data while maintaining its integrity.
Basic Requirements:
- Minimum of 5 years' experience engineering automated systems for extensive data processing, spanning development environments to production landscapes.
- Profound understanding of operating systems, database systems, and networking fundamentals.
- Ability to independently investigate and resolve infrastructure issues in live production environments, including hardware problems and liaising with data centers.
- Hands-on experience with bare-metal servers.
- Active participation in on-call rosters.
- Comprehensive experience with Linux, Networking, Databases, and SQL.
- Exceptional English communication abilities, both written and spoken.
- Demonstrated experience in data modeling, ideally within an AI-focused environment.
- Hands-on experience designing and implementing data migration tools.
Preferred Qualities:
- Proven experience with ClickHouse clusters, including monitoring, optimization, and testing.
- Comprehensive knowledge of ClickHouse backup & recovery processes.
- Demonstrated ability to drive projects to completion with minimal supervision.
How to Apply:
We invite dynamic candidates to send their applications, including a resume and a detailed cover letter emphasizing their experience in DevOps and site reliability. Begin your application with the code AIPRMsre1, followed by your time zone.
Highlight any former roles that mirror the responsibilities described, specifying the duration and organization involved.
Join AIPRM, where every challenge becomes an opportunity, and every solution marks a milestone in technological evolution. We're excited to integrate your expertise into our journey towards AI-driven excellence.
Let's define the future, together!