Kubernetes Container Platforms: Remote and Work-from-Home Jobs in Toronto ∙ Page 1
73 remote and work-from-home jobs online
Senior Technical Consultant, App Engine Platform Expert Services
ServiceNow · Toronto, Canada · Remote
Senior Solution Architect, Platform Expert Services
ServiceNow · Toronto, Canada · Remote
Site Reliability Engineer (SRE) – Azure & SaaS Platforms
Xplor · Toronto, Canada · Remote
Senior Software Developer - AI Platform
Caseware · Toronto, Canada · Hybrid
- Office in Toronto
What you will be doing:
- Design and build reusable platform components, including prompt/schema design, RAG pipelines, grounding connectors, and agentic execution patterns (task orchestration, tool invocation, and workflow runtime primitives), to deliver reliable, context-aware LLM interactions and accelerate product teams' ability to ship AI-powered features
- Build evaluation systems for LLM-based features, including LLM-as-a-judge, structured evals, regression suites, and automated reliability/safety checks, to ensure consistent behavior, measurable quality, and dependable customer outcomes
- Stay current with emerging AI and cloud technologies, lead proof-of-concepts, and translate findings into strategic guidance that informs platform roadmaps and long-term architectural decisions
- Take ownership of features and solutions across the entire software development lifecycle, from design and implementation to testing, deployment, and ongoing maintenance
- Provide technical mentorship to junior developers through code reviews, pair programming, and collaborative solution design
- Maintain clear, current technical architecture documentation and enforce development best practices to protect the integrity of the codebase
- Drive operational excellence by identifying recurring issues and eliminating root causes that impact customers and internal teams
- Partner with DevOps/DevSuccess to improve your team’s build processes, test automation, and CI/CD pipelines
- Participate in prioritizing and reducing technical debt in the systems your team manages
- Participate in the 24/7 production support rotation for your team’s systems, delivering thorough post-mortems and root cause analyses for major client-impacting incidents
What you’ll bring:
- 1–2+ years of practical experience developing LLM-powered systems, including retrieval-augmented generation (RAG), prompt/context engineering, agent orchestration, and tool use, with experience applying evaluation methods (e.g., LLM-as-a-judge, structured tests) and implementing security measures and guardrails such as safety filtering, validation, and input/output sanitization
- 5+ years of experience writing production-grade front-end applications using TypeScript/Angular (or other related modern front-end technology)
- 5+ years of experience with API microservice development using TypeScript/NestJS (or other related modern JavaScript server frameworks)
- Proven experience designing and deploying solutions in public cloud environments (preferably AWS), with an understanding of cloud-native services and infrastructure-as-code practices
- Demonstrated aptitude for writing effective LLM prompts and instructions, and a solid understanding of prompt engineering patterns in real-world use