The summarize-from-feedback repository implements the methods from the paper “Learning to Summarize from Human Feedback”. Its purpose is to train a summarization model that better aligns with human preferences by first collecting human feedback (comparisons between summaries) to train a reward model, and then fine-tuning a policy (summarizer) to maximize that learned reward. The code includes different stages: a supervised baseline (i.e. standard summarization training), the reward modeling component, and the reinforcement learning (or preference-based fine-tuning) phase. The repo also includes utilities for dataset handling, modeling architectures, inference, and evaluation. Because the codebase is experimental, parts of it may not run out-of-box depending on dependencies or environment, but it remains a canonical reference for how to implement summarization via human feedback.

Features

  • Supervised baseline summarization model to initialize performance
  • Reward model trained from human comparisons of summary pairs
  • Preference-based fine-tuning / RL stage to optimize summarizer toward human judgments
  • Dataset handling modules (loading, comparisons, splits)
  • Inference and evaluation scripts to generate and score summaries
  • Architecture layout files (e.g. model_layout.py) supporting modular model definitions

Project Samples

Project Activity

See All Activity >

Categories

Education

License

MIT License

Follow Summarize from Feedback

Summarize from Feedback Web Site

Other Useful Business Software
Professional Streaming and Video Hosting - GDPR Compliant - 3Q Icon
Professional Streaming and Video Hosting - GDPR Compliant - 3Q

Secure hosting, scalable streaming, and easy integration for internal and external communications

3Q offers a multifunctional video platform for hosting, managing and distributing video and audio content on all channels. Live and on-demand.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Summarize from Feedback!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Education Software

Registered

2025-10-03