Skip to content

Please put base prompt in readme #73

@theaustinhatfield

Description

@theaustinhatfield

It took me a while to find the base prompt to do the evals. For this specific repo it would be extremely helpful to including to others to put the v2 prompt in the README.md please. I've copy and pasted the prompt here for convenience.


Prompt Example - v2
The following is an example of the prompt used for open_ai_o1_mini_20241217 and open_ai_o1_preview_20241217. The prior prompt in this repo was used for all other runs on 12/17/2024 and 12/18/2024.

Find the common rule that maps an input grid to an output grid, given the examples below.

Example 1:

Input: 0 2 0 0 0 2 5 2 2 0 5 2 5 5 0 2 2 5 2 2 5 5 0 2 0 0 2 0 0 0 5 0 0 5 2 2 5 2 5 0 0 2 2 5 5 2 2 5 0 5 2 0 0 0 5 0 5 5 0 2 5 0 2 2 8 8 8 8 8 8 8 5 0 2 4 4 4 4 5 0 0 2 3 3 3 3 3 0 0 2 0 5 0 5 8 8 8 8 8 8 8 2 0 0 4 4 4 4 0 0 2 0 3 3 3 3 3 0 2 0 5 0 5 0 8 8 8 8 8 8 8 2 2 0 4 4 4 4 2 2 0 2 3 3 3 3 3 5 0 5 0 0 0 5 8 8 8 8 8 8 8 2 0 0 4 4 4 4 0 0 2 2 3 3 3 3 3 0 0 2 0 0 0 2 5 5 5 2 2 0 0 0 2 5 0 5 2 0 2 0 5 0 5 2 0 2 0 5 5 2 0 0 2 2 5 5 0 0 2 0 5 0 5 0 0 0 2 2 0 0 2 0 0 0 2 0 2 0 0 0 0 2 0 2 0 0 0 0 2 0 2 0 2 0 5 2 0 0 0 5 2 0 5 2 0 0 5 2 0 0 0 2 0 2 0 0 2 0 0 0 2 5 2 0 0 2 0 0 2 0 2 0 0 0 2 0 5 0 5 0 0 2 2 2 1 1 1 1 1 2 2 2 3 3 3 3 3 3 3 0 0 7 7 7 7 7 0 0 5 0 0 0 0 2 1 1 1 1 1 0 5 0 3 3 3 3 3 3 3 2 0 7 7 7 7 7 2 5 5 5 0 0 5 2 1 1 1 1 1 5 2 0 3 3 3 3 3 3 3 0 2 7 7 7 7 7 0 2 5 2 2 5 0 2 1 1 1 1 1 2 0 0 3 3 3 3 3 3 3 2 5 7 7 7 7 7 0 0 0 0 0 0 0 2 0 0 5 0 2 2 2 0 3 3 3 3 3 3 3 0 0 7 7 7 7 7 2 0 2 2 0 0 2 0 0 5 0 2 0 2 0 5 5 0 0 2 0 5 2 2 2 2 0 5 2 0 0 2 2 0 0 0 5 2 0 0 2 0 5 0 0 0 0 5 0 0 0 2 2 0 0 0 0 5 5 0 2 0 0 5 0 2 2 0 8 8 8 8 8 0 2 0 5 4 4 4 4 4 2 0 0 2 0 0 5 0 0 0 2 0 0 0 2 0 8 8 8 8 8 2 2 5 0 4 4 4 4 4 0 2 5 0 1 1 1 1 1 2 0 2 2 2 0 0 8 8 8 8 8 5 0 0 0 4 4 4 4 4 0 0 5 5 1 1 1 1 1 5 0 0 2 5 5 0 8 8 8 8 8 0 5 0 5 4 4 4 4 4 0 5 0 2 1 1 1 1 1 0 0 0 2 0 0 0 8 8 8 8 8 0 0 0 5 2 5 0 0 2 5 0 2 2 1 1 1 1 1 0 0 0 0 5 2 5 5 2 2 0 2 0 0 2 5 0 5 0 0 5 0 0 0 0 1 1 1 1 1 0 0 0 2 0 0 0 2 5 0 0 5 5 2 0 2 2 0 0 5 5 0 0 0 5 0 2 0 5 0 0 2 5 0 0 5 0 0 0 0 2 0 5 5 0 2 5 0 0 0 2 0 2 0 0 5 0 0 0 0 0 0 5 0 2 0 2 0 5 2 5 0 5 2 0 0 0 0 0 0 5 2 2 5 2 0 0 0 0 0 5 5 0 0 0 0 5 5 0 2 2 2 0 0 2 0 2 0 0 5 2 0 2 2 0 0 0 0 0 0 2 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 2 2 0 2 2 0 0 0 0 5 2 2 2 0 0 0 5 2 2 2 0 0 0 0 2 0 5 5 0 0 0 5 0 2 0 5 0 0 0 5 0 2 0 2 2 2 5 5 0 0 2 2 5 2 2 0 0 0 0 2 5 0 2 0 5 0 0 5 5 5 0 0 2 0 0 0 5 Output: 8 4 3 1 3 7 8 4 1

Example 2:

Input: 0 2 0 0 0 2 0 8 0 0 0 2 0 2 0 2 0 0 2 8 0 0 2 0 8 0 0 0 0 0 0 0 0 3 3 3 3 3 3 0 0 0 1 1 1 1 1 1 1 1 2 8 8 2 0 0 0 0 0 0 8 0 2 3 3 3 3 3 3 0 0 2 1 1 1 1 1 1 1 1 0 0 0 9 9 9 9 9 0 0 8 0 8 3 3 3 3 3 3 2 2 2 1 1 1 1 1 1 1 1 8 0 8 9 9 9 9 9 8 8 2 8 0 3 3 3 3 3 3 8 8 0 1 1 1 1 1 1 1 1 0 0 2 9 9 9 9 9 0 0 8 0 0 3 3 3 3 3 3 0 0 2 2 2 8 8 8 8 0 2 8 2 0 9 9 9 9 9 0 0 0 0 0 8 0 0 8 0 0 2 8 2 0 0 2 0 0 0 0 0 0 8 0 9 9 9 9 9 8 8 0 8 8 8 0 0 2 0 8 0 0 0 2 8 8 0 0 0 8 0 2 0 2 0 8 0 0 8 8 0 0 0 0 0 0 0 0 0 0 2 2 2 0 0 2 8 8 2 0 0 2 0 0 2 0 0 8 2 8 0 8 0 0 0 0 0 8 2 8 2 8 0 0 0 0 0 0 2 8 2 0 0 0 0 0 8 0 0 0 0 0 0 2 6 6 6 6 0 8 0 0 4 4 4 4 4 4 2 0 0 0 8 0 0 2 0 0 0 2 0 8 0 8 6 6 6 6 0 8 0 8 4 4 4 4 4 4 2 0 2 2 2 0 1 1 1 1 1 8 0 0 2 0 6 6 6 6 8 0 2 2 4 4 4 4 4 4 8 0 8 0 0 0 1 1 1 1 1 0 2 0 2 8 6 6 6 6 8 0 8 0 4 4 4 4 4 4 0 8 2 2 0 2 1 1 1 1 1 0 8 0 0 2 6 6 6 6 0 0 0 2 4 4 4 4 4 4 0 0 8 0 8 8 1 1 1 1 1 8 0 0 0 0 6 6 6 6 0 0 2 8 0 8 8 2 8 0 8 0 0 0 0 0 1 1 1 1 1 0 2 2 8 0 6 6 6 6 0 2 0 0 0 0 2 8 0 0 0 2 8 0 0 2 0 0 0 0 0 0 0 0 0 8 0 2 0 0 0 0 0 8 0 0 0 2 8 0 0 0 0 0 0 0 0 8 2 0 0 0 2 0 0 2 0 8 0 0 0 2 8 0 8 0 0 0 8 0 8 8 8 0 8 0 0 8 0 2 2 0 2 8 0 0 0 0 0 8 8 2 2 8 0 8 2 2 8 0 0 0 0 8 0 2 0 8 0 0 0 8 2 2 2 0 0 0 0 2 8 0 8 0 0 2 2 8 0 0 2 0 0 0 2 2 2 0 0 0 2 2 8 0 8 8 0 0 8 8 0 8 0 8 0 0 0 0 0 0 0 0 0 2 2 0 0 0 0 8 2 0 0 0 0 2 8 2 0 2 0 0 8 0 0 0 2 0 8 0 0 0 2 8 8 0 8 0 2 0 0 0 8 2 0 0 0 0 0 0 0 8 8 0 2 0 8 0 0 0 0 0 0 2 2 0 0 2 0 0 8 8 0 8 2 0 0 0 8 0 8 0 8 2 0 0 0 8 0 0 8 0 2 0 0 8 0 2 2 8 0 0 0 0 8 0 2 2 8 2 8 0 2 2 0 0 0 2 2 2 2 2 2 0 0 0 8 0 8 0 0 8 2 0 0 2 8 2 8 0 0 0 0 0 0 0 0 8 0 0 2 0 2 2 0 0 8 0 2 0 0 8 8 0 0 0 0 8 0 0 0 8 0 2 8 0 0 0 0 0 0 0 0 0 0 2 8 2 8 0 0 8 0 8 2 0 2 8 8 0 0 0 2 0 0 0 8 8 0 8 0 0 0 8 2 8 8 0 2 8 2 2 2 2 0 8 8 0 0 0 8 0 0 8 0 8 0 0 0 8 0 2 0 0 8 0 8 0 0 2 8 0 0 Output: 3 1 9 6 4 1

Example 3:

Input: 1 0 0 0 9 1 1 0 1 9 1 0 9 0 0 1 0 1 0 0 0 0 1 9 0 1 1 9 9 9 0 0 0 0 9 1 0 0 0 1 1 0 1 0 0 1 1 1 1 0 9 9 0 0 1 1 1 1 9 0 1 1 1 0 0 1 1 9 1 0 1 0 4 4 4 4 4 4 1 1 0 0 1 0 1 0 0 0 1 9 0 1 9 0 0 0 0 1 0 0 1 1 4 4 4 4 4 4 0 9 0 0 8 8 8 8 1 0 1 0 0 0 1 1 0 9 0 9 0 0 0 9 4 4 4 4 4 4 9 0 1 1 8 8 8 8 0 1 9 0 1 1 0 8 8 8 8 8 8 1 0 0 4 4 4 4 4 4 1 0 0 0 8 8 8 8 1 0 9 0 1 0 9 8 8 8 8 8 8 0 0 9 4 4 4 4 4 4 0 0 1 9 8 8 8 8 1 0 1 0 9 0 0 8 8 8 8 8 8 0 0 0 0 0 0 9 9 0 9 0 0 1 0 1 9 1 0 0 9 1 0 9 1 1 0 1 9 1 0 1 0 9 1 0 0 0 9 9 1 0 1 1 0 0 0 0 0 9 0 1 1 1 0 9 9 0 0 9 0 0 0 0 7 7 7 7 1 1 1 0 1 0 3 3 3 3 3 0 1 0 0 1 0 0 3 3 3 1 9 1 0 0 7 7 7 7 0 1 0 9 0 0 3 3 3 3 3 1 1 9 1 0 1 1 3 3 3 1 0 0 1 0 7 7 7 7 0 0 9 0 0 0 3 3 3 3 3 0 1 0 0 1 1 0 3 3 3 9 0 1 0 9 1 1 0 0 0 1 9 1 1 1 3 3 3 3 3 0 0 9 0 0 0 1 0 9 9 9 0 9 9 1 9 9 0 0 1 0 1 0 0 9 0 0 0 0 9 0 9 0 0 1 0 1 0 9 1 0 1 9 1 9 0 0 1 0 0 0 0 0 0 9 9 9 9 0 9 9 1 0 1 0 9 0 1 9 0 0 0 0 9 9 1 1 1 9 0 1 9 1 4 4 4 4 4 9 0 1 0 0 9 0 0 0 9 0 9 0 0 9 0 0 9 0 0 0 1 0 0 9 4 4 4 4 4 0 1 0 0 0 9 0 9 2 2 2 2 2 9 9 1 9 8 8 8 8 0 9 0 9 4 4 4 4 4 0 0 0 0 1 0 0 1 2 2 2 2 2 1 0 1 0 8 8 8 8 1 9 9 1 4 4 4 4 4 1 0 9 9 0 0 1 0 2 2 2 2 2 0 1 0 1 8 8 8 8 0 9 1 0 4 4 4 4 4 0 1 1 1 1 1 0 0 2 2 2 2 2 0 0 1 0 8 8 8 8 0 9 0 0 1 1 0 0 1 1 1 1 0 0 9 1 9 0 9 0 9 9 1 9 9 9 1 0 0 1 0 0 1 0 1 1 0 0 0 1 0 1 1 0 9 0 9 0 0 1 0 0 9 1 1 9 9 1 0 9 1 0 0 0 1 0 0 0 0 0 0 0 0 1 1 0 0 0 1 9 1 1 1 1 0 0 9 1 0 1 1 1 9 1 9 0 9 1 1 1 1 0 0 0 1 0 0 0 1 9 9 1 1 0 1 0 0 9 0 0 1 0 0 0 0 0 0 0 0 9 0 9 1 1 0 0 1 1 1 0 1 0 0 1 1 0 0 0 0 0 0 0 0 0 9 9 9 1 1 1 0 0 0 0 0 0 9 0 1 0 1 0 0 0 0 1 0 1 1 1 0 0 1 1 0 9 9 0 1 0 1 1 0 1 0 0 0 9 0 1 9 1 1 1 1 0 9 9 0 0 0 0 0 0 9 0 1 0 0 0 0 9 0 1 1 0 1 9 0 9 0 0 0 0 9 1 0 0 0 0 9 0 1 1 0 1 1 1 0 0 0 1 0 0 1 0 0 0 0 9 9 0 1 0 9 0 9 0 1 1 1 0 0 1 0 0 9 0 1 0 9 9 9 1 Output: 8 4 8 3 7 3 2 8 4

Below is a test input grid. Predict the corresponding output grid by applying the rule you found. Your final answer should just be the text output grid itself.

Input: 5 5 0 0 0 8 5 0 0 8 8 8 0 8 0 0 5 5 0 5 0 5 8 0 0 0 0 0 0 8 8 8 5 5 0 8 0 0 5 8 0 0 5 8 0 8 0 8 0 8 0 0 5 0 8 8 0 0 0 0 0 5 5 5 0 5 8 0 5 8 0 0 0 5 0 5 8 8 5 8 5 0 5 0 0 0 0 0 5 5 0 0 0 5 5 5 8 8 0 0 0 5 8 3 3 3 3 3 5 0 8 0 8 8 0 8 8 0 0 5 0 5 0 5 2 2 2 2 2 2 0 5 8 3 3 3 3 3 8 8 8 3 3 3 3 3 3 0 0 5 8 8 0 0 2 2 2 2 2 2 0 0 0 3 3 3 3 3 5 8 0 3 3 3 3 3 3 0 8 0 8 5 0 0 2 2 2 2 2 2 0 0 8 3 3 3 3 3 0 0 0 3 3 3 3 3 3 5 0 0 5 0 8 8 2 2 2 2 2 2 8 0 0 3 3 3 3 3 0 0 0 0 5 5 0 0 0 0 0 5 0 0 0 5 0 8 0 5 5 0 0 0 0 0 0 5 5 5 0 0 0 0 0 0 0 0 0 8 8 0 0 0 5 0 5 5 0 8 0 8 8 0 0 5 8 0 0 0 0 5 0 0 1 1 1 1 1 5 5 5 8 0 8 4 4 4 4 4 5 0 5 8 7 7 7 7 7 0 0 8 5 0 1 1 1 1 1 0 5 0 8 5 0 4 4 4 4 4 0 0 0 0 7 7 7 7 7 0 0 5 0 0 1 1 1 1 1 0 5 0 0 8 0 4 4 4 4 4 0 0 0 0 7 7 7 7 7 5 0 0 5 8 1 1 1 1 1 5 5 0 0 8 5 4 4 4 4 4 0 0 0 0 7 7 7 7 7 0 8 0 8 0 1 1 1 1 1 5 5 0 0 5 8 4 4 4 4 4 0 0 8 0 8 0 0 0 0 0 5 0 0 0 5 0 0 0 5 0 5 8 8 8 0 0 0 0 8 0 8 0 0 0 0 0 0 5 0 0 5 5 8 0 5 0 5 8 0 0 0 5 0 8 0 5 0 0 0 5 5 8 5 5 3 3 3 3 3 3 3 8 0 5 0 7 7 7 7 5 0 5 0 0 5 5 0 5 1 1 1 1 0 0 3 3 3 3 3 3 3 0 8 8 8 7 7 7 7 8 0 8 0 0 0 0 0 0 1 1 1 1 5 8 3 3 3 3 3 3 3 8 5 0 8 7 7 7 7 0 5 5 0 5 0 8 0 5 1 1 1 1 5 0 3 3 3 3 3 3 3 5 0 5 0 7 7 7 7 5 0 0 0 0 5 0 0 8 1 1 1 1 0 0 5 8 0 0 5 8 8 0 0 8 0 7 7 7 7 8 0 0 5 0 5 8 0 0 8 0 5 0 0 0 0 0 5 8 0 0 5 8 0 0 5 0 8 8 8 0 0 5 0 5 0 5 5 4 4 4 5 0 5 0 6 6 6 6 6 6 0 0 2 2 2 2 2 2 0 0 5 5 0 8 0 5 5 4 4 4 0 8 5 0 6 6 6 6 6 6 0 0 2 2 2 2 2 2 5 0 8 5 8 0 0 0 0 4 4 4 5 0 8 0 6 6 6 6 6 6 0 0 2 2 2 2 2 2 0 0 0 0 0 0 0 0 0 4 4 4 5 5 0 8 6 6 6 6 6 6 5 0 2 2 2 2 2 2 0 0 8 0 5 5 0 0 0 5 5 8 5 8 0 0 6 6 6 6 6 6 8 5 8 0 0 8 5 0 8 5 0 0 0 5 8 5 0 8 5 5 5 0 8 8 0 0 5 0 8 5 5 0 0 0 5 8 0 0 0 0 8 5 0 0 0 0 8 0 0 5 8 8 8 5 0 0 0 5 0 5 0 0 0 5 0 8 0 5 5 0 0 8 8 0 5 0 0 0 0 0 5 8 8 8 0 0 5 5 5 5 8 5 0 0 5 8 5 8 5 5 0 5

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions