- Deeply analyze artificial intelligence architectures using tools like AWS / Azure / GCP
- Analyze performance metrics including latency vs cost trade-offs.
- Break down utilization patterns across clusters.
- Optimize token usage while minimizing errors,
- Implement batching strategies to speed up processing without excessive resource allocation.,
- Make use of job queues as required.
Founding Pre-Sales Cloud - Kakinada - Cloudoku AI
Description
About Cloudoku AI
Cloudoku AI is a company working on solving complex problems, particularly reducing cloud & AI inference costs by 30–60% for high-growth companies. We operate at the intersection of LLM systems, GPU economics, cloud pricing models and architecture-level optimization.
The Role
We are looking for an experienced Cloud & AI Architect who can diagnose inefficiencies in infrastructure economics and support high-stakes technical sales conversations. You will work directly with our founder to design optimization roadmaps.