What AlphaFold2 settings were used to generate the ipTM, pLDDT, etc. scores before the 9 evaluation criteria are applied? I am trying to benchmark BindCraft-style evaluation criteria using the proteins experimentally tested in the paper, but I am getting inaccurate results because I am folding with different parameters. Thanks for any help!