Skip to content

"Feature Request: Support for resuming interrupted tasks" "如何从中断处继续运行?" #1202

@yuhuan1981

Description

@yuhuan1981

Hello RD-Agent Team,

Thank you for your great work on this promising project.

I would like to request a feature that allows the agent to resume from its last state if a task is interrupted.

Problem:
Currently, if a long-running task is interrupted for any reason (e.g., system reboot, network issue, or manual stop), it seems the only option is to restart the entire task from the beginning. This can be very time-consuming and inefficient, especially for complex R&D processes.

Proposed Solution:
It would be incredibly helpful if the agent could periodically save its state (create a checkpoint) to the --log_dir. When the agent is restarted with the same command or a specific --resume flag, it could check for a saved state in the log directory and continue the task from where it left off.

This feature would make the RD-Agent much more robust and practical for real-world, long-duration tasks.

Thank you for considering this request!

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions