- Professional
- Office in San Francisco
The Role
We're seeking an AI engineer to own the core models and prompts that power our product. Gamma weaves together text, image, and layout generation to automate all the drudgery of building presentations and websites.We use AI throughout our product, and we want you to help us to elevate quality, evaluate new models, and push the frontier with new features and modalities.
Note: we're looking for someone who is passionate about productizing existing models, not training new ones. The focus of this role is prompting, evaluating, and fine-tuning foundation models for maximum performance.
In this role, you'll be responsible for:
Prompt Engineering
Own our existing LLM & image prompts. Measure and continuously improve quality
Develop complex prompts for new features (we use AI JSX)
Evaluation
Build evals for our prompts and models
Monitor metrics + qualitative feedback to build better test sets
Drive roadmap based on quality gaps
Models
Constantly evaluating new frontier models and methods
Curate datasets for fine-tuning open source models
Launch new modalities (eg voice, video)
Ops
Build analytics & tracking
Own uptime, latency, and costs
Compensation
Salary: The cash compensation for this role ranges from $150k to $240k.
Equity: In addition to the base salary, equity is part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents.
Final offer amounts are determined by multiple factors such as experience and expertise in the requirements listed above.
About you
Prompt hacker
You are a tinkerer who loves seeing how far you can push the limits of a foundation model. You love learning new tricks of prompt engineering and building complex flows that solve new challenges. You have experience building and evaluating prompts at scale.
Software engineer
You're an experienced software developer comfortable in Typescript and/or Python. You're excited about mixing prompt engineering with traditional software engineering to up-level our AI capabilities.
Data-driven
You embrace the need to use data to raise the bar of AI quality. Writing evals, designing metrics, and turning qualitative feedback into quantitative measures excites you! You're self-sufficient in gathering and cleaning data.
AI Scale @ Gamma
AI Generated Presentations Daily
AI Images Generated Daily
tokens per minute on average of LLM usage
Our Stack
AI
Language Models: Claude 3.5 Sonnet, Gemini 1.5, GPT-4o,
Image Models: Flux, Imagen, Ideogram and many more.
Observability: Datadog
Prompting: AIJSX
Evals: Braintrust