August 31st 2024
New
We are thrilled to introduce a powerful new featureâAI Prompt Engineer, Ape in short. đĨŗ
Ape is an AI-driven solution that takes the hassle out of prompt engineering. It automatically refines and optimizes prompts using LLM-driven algorithms, helping to reduce the time and effort required for manual adjustments.
Weâve added a demo project on the GSM8K benchmark that you can use as a reference for prompts and datasets. You can find the demo under âGSM8K Ape Demoâ in the bottom left corner of your dashboard.
đĄ GSM8K, or Grade School Math 8K, is a dataset made up of 8,500 high-quality and linguistically diverse math word problems for grade school students. Itâs designed to evaluate and improve language modelsâ mathematical reasoning skills by challenging them with multi-step problem-solving tasks.
To help you get started even faster, weâve released a starter template on GitHub. This template provides the necessary structure to get started with Ape in your projects. You can find it here: Ape Starter Template.
Set up a prompt. You can either start with an existing one or have Ape create a new prompt from scratch.
Add data. You can use your current production data, import an existing dataset, or manually input specific examples.
Optimize the prompt. Add an evaluation metric to measure how well it performs, and include any instructions to guide the process. Ape then generates a prompt thatâs more accurate and cost-effective, based on the data and criteria youâve provided.
Iterate the process. As you gather more data and refine your goals, you can repeat these steps to continually improve your prompts.
The goal is to provide a more consistent and reliable way to achieve high-quality outputs, addressing some of the common challenges faced by engineers in the field.
Vanilla | CoT | DSPy | Ape | |
Optimization time | - | - | 15m 41.8s (20 steps) | 4m 32.5s (20 steps) |
Score | 54.5 | 87.5 | 90.0 | 94.5 |
(tested on GPT-4o)
If you need help with setup or onboarding, feel free to schedule a 15-minute chat with us or drop us an email. Also, join us on Discord to stay up-to-date with the latest updates, learn best practices, and be part of a growing community. We look forward to seeing you there đ
Onward and upward!
- HyunJie from Weavel