tag:github.com,2008:https://github.com/RUCAIBox/TextBox/releases Release notes from TextBox 2022-12-27T15:06:10Z tag:github.com,2008:Repository/311009578/v2.0.0 2022-12-28T02:06:22Z TextBox 2.0 Release <p>TextBox 2.0 is an up-to-date text generation library based on Python and PyTorch focusing on building a unified and standardized pipeline for applying pre-trained language models to text generation:</p> <ul> <li>From a task perspective, we consider 13 common text generation tasks such as translation, story generation, and style transfer, and their corresponding 83 widely-used datasets.</li> <li>From a model perspective, we incorporate 47 pre-trained language models/modules covering the categories of general, translation, Chinese, dialogue, controllable, distilled, prompting, and lightweight models (modules).</li> <li>From a training perspective, we support 4 pre-training objectives and 4 efficient and robust training strategies, such as distributed data parallel and efficient generation.</li> </ul> <p>Compared with the previous version of TextBox, this extension mainly focuses on building a unified, flexible, and standardized framework for better supporting PLM-based text generation models. There are three advantages of TextBox 2.0:</p> <ul> <li>It is a significant innovation focusing on comprehensive tasks and PLMs.</li> <li>It is designed to be unified in implementation and interface.</li> <li>It can faithfully reproduce the results reported in existing work.</li> </ul> StevenTang1998 tag:github.com,2008:Repository/311009578/v0.2.1 2021-04-15T13:39:28Z TextBox v0.2.1 <h1>TextBox v0.2.1 Release Notes</h1> <p>The TextBox v0.2.1 release includes a number of wonderful new features, some bug fixes and code refactor. A few of the highlights include:</p> <ul> <li>We add 6 new models: HRED, CVAE, T5, ProphetNet, Context2Seq and Attribute2Seq.</li> <li>We add 3 new datasets: Persona Chat for dialog system, Amazon Electronic for attribute to text generation and Chinese Classical Poetry Corpus for poem generation.</li> <li>We support Distributed Data Parallel (DDP) for training with multiple GPUs conveniently.</li> <li>We refactor the codes of pretrained language models (PLMs) for improving performance.</li> <li>We refactor the <code>dataset</code> and <code>dataloader</code> to provide unified and convenient interface.</li> <li>We unify and simplify the <code>generate</code> function for each model.</li> <li>We unify the config parameters of different models and datasets.</li> </ul> StevenTang1998 tag:github.com,2008:Repository/311009578/v0.1.5 2021-01-11T12:54:55Z TextBox v0.1.5 <p>TextBox is an open-source library for building text generation system. It is developed based on Python and PyTorch.</p> turboLJY