<p>Release notes from TextBox (https://github.com/RUCAIBox/TextBox/releases), updated 2022-12-27T15:06:10Z</p>
<h1>TextBox 2.0 Release</h1>
<p>v2.0.0, updated 2022-12-28T02:06:22Z</p>
<p>TextBox 2.0 is an up-to-date text generation library based on Python and PyTorch focusing on building a unified and standardized pipeline for applying pre-trained language models to text generation:</p>
<ul>
<li>From a task perspective, we consider 13 common text generation tasks, such as translation, story generation, and style transfer, along with 83 widely used datasets for them.</li>
<li>From a model perspective, we incorporate 47 pre-trained language models/modules covering the categories of general, translation, Chinese, dialogue, controllable, distilled, prompting, and lightweight models (modules).</li>
<li>From a training perspective, we support 4 pre-training objectives and 4 efficient and robust training strategies, such as distributed data parallel and efficient generation.</li>
</ul>
<p>Compared with the previous version of TextBox, this extension mainly focuses on building a unified, flexible, and standardized framework for better supporting PLM-based text generation models. There are three advantages of TextBox 2.0:</p>
<ul>
<li>It is a significant innovation that covers a comprehensive range of text generation tasks and PLMs.</li>
<li>It is designed to be unified in implementation and interface.</li>
<li>It can faithfully reproduce the results reported in existing work.</li>
</ul>
<p>Author: StevenTang1998</p>
<h1>TextBox v0.2.1 Release Notes</h1>
<p>v0.2.1, updated 2021-04-15T13:39:28Z</p>
<p>The TextBox v0.2.1 release includes a number of new features, some bug fixes, and code refactoring. A few of the highlights:</p>
<ul>
<li>We add 6 new models: HRED, CVAE, T5, ProphetNet, Context2Seq and Attribute2Seq.</li>
<li>We add 3 new datasets: Persona Chat for dialogue systems, Amazon Electronic for attribute-to-text generation, and Chinese Classical Poetry Corpus for poem generation.</li>
<li>We support Distributed Data Parallel (DDP) for convenient multi-GPU training.</li>
<li>We refactor the code of the pretrained language models (PLMs) to improve performance.</li>
<li>We refactor the <code>dataset</code> and <code>dataloader</code> to provide a unified and convenient interface.</li>
<li>We unify and simplify the <code>generate</code> function for each model.</li>
<li>We unify the config parameters of different models and datasets.</li>
</ul>
<p>Author: StevenTang1998</p>
<h1>TextBox v0.1.5</h1>
<p>v0.1.5, updated 2021-01-11T12:54:55Z</p>
<p>TextBox is an open-source library for building text generation systems. It is built on Python and PyTorch.</p>
<p>Author: turboLJY</p>