Diffusion-LM Improves Controllable Text Generation

https://arxiv.org/pdf/2205.14217.pdf

Setup (uv)

uv sync

If you need MPI support (recommended for this repo), install MPICH and rebuild mpi4py:

sudo apt-get update
sudo apt-get install -y mpich libmpich-dev
MPICC=mpicc.mpich uv pip install --no-binary=mpi4py mpi4py

MPI quick check:

uv run python -c "from mpi4py import MPI; print(MPI.COMM_WORLD.Get_rank())"

Train Diffusion-LM:

You can run from the repo root; run_train.py handles paths and sets RDMAV_FORK_SAFE=1 to avoid EFA/libfabric fork crashes.

WANDB_MODE=offline uv run improved-diffusion/scripts/run_train.py \
  --diff_steps 2000 \
  --model_arch transformer \
  --lr 0.0001 \
  --lr_anneal_steps 200000 \
  --seed 102 \
  --noise_schedule sqrt \
  --in_channel 16 \
  --modality e2e-tgt \
  --submit no \
  --padding_mode block \
  --app "--predict_xstart True --training_mode e2e --vocab_size 821 --e2e_train ../datasets/e2e_data" \
  --notes xstart_e2e

WANDB_MODE=offline uv run improved-diffusion/scripts/run_train.py \
  --diff_steps 2000 \
  --model_arch transformer \
  --lr 0.0001 \
  --lr_anneal_steps 400000 \
  --seed 101 \
  --noise_schedule sqrt \
  --in_channel 128 \
  --modality roc \
  --submit no \
  --padding_mode pad \
  --app "--predict_xstart True --training_mode e2e --vocab_size 11043 --roc_train ../datasets/ROCstory" \
  --notes xstart_e2e \
  --bsz 64

Decode Diffusion-LM:

mkdir generation_outputs

python scripts/batch_decode.py {path-to-diffusion-lm} -1.0 ema

Controllable Text Generation

First, train the classsifier used to guide the generation (e.g. a syntactic parser)

python train_run.py --experiment e2e-tgt-tree --app "--init_emb {path-to-diffusion-lm} --n_embd {16} --learned_emb yes " --pretrained_model bert-base-uncased --epoch 6 --bsz 10

Then, we can use the trained classifier to guide generation. (currently, need to update the classifier directory in scripts/infill.py. I will clean this up in the next release.)

python python scripts/infill.py --model_path {path-to-diffusion-lm} --eval_task_ 'control_tree' --use_ddim True --notes "tree_adagrad" --eta 1. --verbose pipe

For details of the methods and results, please refer to our paper.

@article{Li-2022-DiffusionLM,
  title={Diffusion-LM Improves Controllable Text Generation},
  author={Xiang Lisa Li and John Thickstun and Ishaan Gulrajani and Percy Liang and Tatsunori Hashimoto},
  journal={ArXiv},
  year={2022},
  volume={abs/2205.14217}
}

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
datasets		datasets
improved-diffusion		improved-diffusion
tests		tests
transformers		transformers
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
train_run.py		train_run.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diffusion-LM Improves Controllable Text Generation

Setup (uv)

Train Diffusion-LM:

Decode Diffusion-LM:

Controllable Text Generation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Diffusion-LM Improves Controllable Text Generation

Setup (uv)

Train Diffusion-LM:

Decode Diffusion-LM:

Controllable Text Generation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages