Meta Prompting

Meta-prompting is an advanced prompt engineering technique in which large language models (LLMs) are used to create, adjust, and optimize their own instructions (prompts)^[1]. Unlike traditional approaches where a human manually writes detailed instructions, meta-prompting focuses on defining the problem-solving structure and the roles the model should perform. In other words, it describes how to solve a task, not what to do in each specific case^[2].

This approach allows LLMs to independently decompose complex problems, clarify queries, and iteratively improve their answers, paving the way for the creation of more autonomous and adaptive artificial intelligence systems.

Key Methods and Approaches

The term "meta-prompting" gained traction in scientific literature in late 2023 and early 2024, when several research groups introduced similar approaches to managing LLMs.

The "Scaffolding" Method

One of the key approaches was proposed by researchers from Stanford University and OpenAI^[3]. In this framework, called scaffolding, the same language model (GPT-4 in the experiments) plays several roles simultaneously:

Conductor: Receives a high-level meta-prompt and breaks down a complex task into a series of simpler subtasks.
Experts: The Conductor initializes several instances of itself as "experts," each of which solves a specific subtask according to specially provided instructions.
Integration: The Conductor coordinates the work of the "experts" and integrates their responses into a final solution.

This orchestration-based approach has significantly improved the efficiency of solving complex tasks. For example, in experiments, meta-prompting outperformed standard single-step queries by 17.1% and other advanced methods by 15–17% on task sets requiring multi-component reasoning (e.g., the Game of 24, chess puzzles)^[3]. Importantly, this method is task-agnostic and operates in a zero-shot manner, requiring no specific examples for each new task.

Theoretical Approach and Structural Patterns

Concurrently, a research group from Tsinghua University proposed its own concept of meta-prompting, shifting the focus from the content of the task to its syntax and the form of data representation^[4]. Building on a categorical model, they demonstrated that an abstract description of a problem's structure allows the model to construct reasoning close in depth to human reasoning and to effectively decompose complex questions into simpler steps.

Related Concepts

The ideas behind meta-prompting also appear in other methods:

Automatic Prompt Engineer (APE): A method where an LLM automatically generates and selects effective instructions for itself, evaluating them based on the quality of the resulting outputs^[5].
Self-Refine: An approach where the model iteratively improves its own response by critically analyzing the previous version and generating corrections based on this critique^[6].

Application and Advantages

Meta-prompting has shown high efficiency in tasks that require complex, multi-component reasoning, such as mathematical proofs, programming, and step-by-step puzzle solving. Key advantages of the method include:

Token Efficiency: Focusing on the general structure of a task instead of listing numerous examples reduces the prompt's length. A meta-prompt acts as a universal template, requiring fewer tokens^[7].
Stability and Impartiality: The method avoids reliance on specific examples, making the model less susceptible to bias from the particulars of the training samples^[7].
Dynamic Adaptation: Unlike a static prompt, meta-prompting allows for iterative improvement. The model can refine instructions, request missing information, and adjust its strategy during the problem-solving process^[2].
Generalization to New Tasks: High-level instructions are more easily transferable to new, unseen tasks, making meta-prompting an advanced form of the zero-shot approach.

Limitations and Risks

Computational Cost and Complexity: The method requires multiple calls to the model for a single task, which increases time and API request costs. Current implementations run sequentially, making parallelization difficult^[8].
Dependence on Model Capabilities: The effectiveness of meta-prompting is highly dependent on the quality of the base LLM. Studies have shown that GPT-3.5 gains almost no benefit from this method, whereas for GPT-4 and later models, the advantage is significantly greater^[8].
Fundamental LLM Limitations: Meta-prompting is not a panacea. If a task is outside the knowledge base of the underlying model, even a perfectly formulated meta-prompt does not guarantee a correct result. In such cases, either fine-tuning or integration with external tools (such as web search or a code interpreter) is required^[3].

Links

Literature

Suzgun, M.; Kalai, A. T. (2024). Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding. arXiv:2401.12954.
Zhang, Y.; Yuan, Y.; Yao, A. C.-C. (2023). Meta Prompting for AGI Systems. arXiv:2311.11482.
Zhou, Y. et al. (2022). Large Language Models Are Human-Level Prompt Engineers. arXiv:2211.01910.
Madaan, A. et al. (2023). Self-Refine: Iterative Refinement with Self-Feedback. arXiv:2303.17651.
Ning, X. et al. (2023). Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation. arXiv:2307.15337.
Chen, X. et al. (2023). Universal Self-Consistency for Large Language Model Generation. arXiv:2311.17311.
Wang, X. et al. (2022). Self-Consistency Improves Chain-of-Thought Reasoning in Language Models. arXiv:2203.11171.
Fernando, C. et al. (2023). PromptBreeder: Self-Referential Self-Improvement via Prompt Evolution. arXiv:2309.16797.
Zhou, P. et al. (2024). Self-DISCOVER: Large Language Models Self-Compose Reasoning Structures. arXiv:2402.03620.
Chen, J. et al. (2024). Thought-Augmented Reasoning with Large Language Models. arXiv:2406.04271.

Notes

↑ "Enhance your prompts with meta prompting". OpenAI Cookbook. [1]
↑ ^2.0 ^2.1 "Meta-Prompting: LLMs Crafting & Enhancing Their Own Prompts". Intuition Labs. [2]
↑ ^3.0 ^3.1 ^3.2 Suzgun, Mirac; Kalai, Adam T. (2024). "Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding". arXiv:2401.12954.
↑ Zhang, Yifan, et al. (2023). "Meta Prompting for AGI Systems". arXiv:2311.11482.
↑ Zhou, Y., et al. (2022). "Large Language Models Are Human-Level Prompt Engineers". arXiv:2211.01910.
↑ Madaan, A., et al. (2023). "Self-Refine: Iterative Refinement with Self-Feedback". arXiv:2303.17651.
↑ ^7.0 ^7.1 "Meta Prompting". Prompt Engineering Guide. [3]
↑ ^8.0 ^8.1 "AI within an AI: Meta-prompting can improve the reasoning capabilities of large language models". The Decoder.

[cookbook_openai-1] "Enhance your prompts with meta prompting". OpenAI Cookbook. [1]

[intuition_labs-2] 2.0 ^2.1 "Meta-Prompting: LLMs Crafting & Enhancing Their Own Prompts". Intuition Labs. [2]

[suzgun2024-3] 3.0 ^3.1 ^3.2 Suzgun, Mirac; Kalai, Adam T. (2024). "Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding". arXiv:2401.12954.

[zhang2024-4] Zhang, Yifan, et al. (2023). "Meta Prompting for AGI Systems". arXiv:2311.11482.

[ape2022-5] Zhou, Y., et al. (2022). "Large Language Models Are Human-Level Prompt Engineers". arXiv:2211.01910.

[self_refine2023-6] Madaan, A., et al. (2023). "Self-Refine: Iterative Refinement with Self-Feedback". arXiv:2303.17651.

[prompting_guide-7] 7.0 ^7.1 "Meta Prompting". Prompt Engineering Guide. [3]

[decoder_article-8] 8.0 ^8.1 "AI within an AI: Meta-prompting can improve the reasoning capabilities of large language models". The Decoder.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

Meta Prompting

Contents

Key Methods and Approaches

The "Scaffolding" Method

Theoretical Approach and Structural Patterns

Related Concepts

Application and Advantages

Limitations and Risks

Links

Literature

Notes

Navigation menu

Meta Prompting

Key Methods and Approaches

The "Scaffolding" Method

Theoretical Approach and Structural Patterns

Related Concepts

Application and Advantages

Limitations and Risks

Links

Literature

Notes

Navigation menu

Search