Skip to content

Accelerate mmCIF parsing #2

@rhiju

Description

@rhiju

Templates for tRNAs (which are super common!) often reside in ribosome mmCIF files, which take forever to parse with Biopython.

  • Write parsers 'by hand' that rapidly grep for the _pdbx_poly_seq_scheme information and xyz coordinates. If we go this route, let's separate out the functions for doing this into something like cif_util.py so we can re-use in other codes.
  • Less elegant: could use pre-split PDB chains and, if needed, pre-split _pdbx_poly_seq_scheme information. If we go this route, let's add the python scripts and workflow for the pre-splitting

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions