https://arxiv.org/pdf/2205.14217.pdf
uv syncIf you need MPI support (recommended for this repo), install MPICH and rebuild mpi4py:
sudo apt-get update
sudo apt-get install -y mpich libmpich-dev
MPICC=mpicc.mpich uv pip install --no-binary=mpi4py mpi4pyMPI quick check:
uv run python -c "from mpi4py import MPI; print(MPI.COMM_WORLD.Get_rank())"You can run from the repo root; run_train.py handles paths and sets RDMAV_FORK_SAFE=1 to avoid EFA/libfabric fork crashes.
WANDB_MODE=offline uv run improved-diffusion/scripts/run_train.py \
--diff_steps 2000 \
--model_arch transformer \
--lr 0.0001 \
--lr_anneal_steps 200000 \
--seed 102 \
--noise_schedule sqrt \
--in_channel 16 \
--modality e2e-tgt \
--submit no \
--padding_mode block \
--app "--predict_xstart True --training_mode e2e --vocab_size 821 --e2e_train ../datasets/e2e_data" \
--notes xstart_e2eWANDB_MODE=offline uv run improved-diffusion/scripts/run_train.py \
--diff_steps 2000 \
--model_arch transformer \
--lr 0.0001 \
--lr_anneal_steps 400000 \
--seed 101 \
--noise_schedule sqrt \
--in_channel 128 \
--modality roc \
--submit no \
--padding_mode pad \
--app "--predict_xstart True --training_mode e2e --vocab_size 11043 --roc_train ../datasets/ROCstory" \
--notes xstart_e2e \
--bsz 64mkdir generation_outputs
python scripts/batch_decode.py {path-to-diffusion-lm} -1.0 ema
First, train the classsifier used to guide the generation (e.g. a syntactic parser)
python train_run.py --experiment e2e-tgt-tree --app "--init_emb {path-to-diffusion-lm} --n_embd {16} --learned_emb yes " --pretrained_model bert-base-uncased --epoch 6 --bsz 10
Then, we can use the trained classifier to guide generation. (currently, need to update the classifier directory in scripts/infill.py. I will clean this up in the next release.)
python python scripts/infill.py --model_path {path-to-diffusion-lm} --eval_task_ 'control_tree' --use_ddim True --notes "tree_adagrad" --eta 1. --verbose pipe
For details of the methods and results, please refer to our paper.
@article{Li-2022-DiffusionLM,
title={Diffusion-LM Improves Controllable Text Generation},
author={Xiang Lisa Li and John Thickstun and Ishaan Gulrajani and Percy Liang and Tatsunori Hashimoto},
journal={ArXiv},
year={2022},
volume={abs/2205.14217}
}