DiEP.github.io/index.md at main · Mongoosesyf/DiEP.github.io

Abstract

Recent planning methods based on LLMs typically employ the In-Context Learning paradigm. Complex long-horizon tasks require more context(including instructions and demonstrations) to guarantee generated plan can be executed. However, in such condition, LLMs may overlook(unfaithful) the rules in the given context, resulting in plans that can be invalid or even lead to dangerous actions. In this paper, we investigate the faithfulness of LLMs for complex long-horizon tasks. Inspired by human intelligence, we introduce a novel framework named DiEP. DiEP employs a language-based RNN structure to integrate task decomposition, memory management into LLM planning inference, which could effective improve the faithfulness of LLM and make planner more reliable. We conducted experiments in VirtualHome household tasks. Results show that our model significantly improves faithfulness and success rates for complex long-horizon tasks.

Video

Results

Example of our frameworks for long-horizon task planning:

Methodology

The framework uses the task description as input and outputs the task plan. Our framework consists of three stages:

Decompose a complex, long-horizon task into several simpler sub-tasks and formulate an abstract plan.
Represent the task goal, abstract plan, and instruction as long-term memory, while designating the selected sub-goal in the plan, demonstration, and sub-task specifics as short-term memory.
Input the combined long and short-term memories and the environment observation into the LLM to retrieve the sub-task plan. Update memory simultaneously and repeat the above steps until the task is complete.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Abstract

Video

Results

Methodology

BibTeX

FilesExpand file tree

index.md

Latest commit

History

index.md

File metadata and controls

Abstract

Video

Results

Methodology

BibTeX