You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've just been trying to make use of my own dataset through the API here, and have collected a few comments:
improve default rule for generating id field. At the moment we use precisely the relative filepath that the structure came from. I wonder if a better default would be attempting filepath.split("/")[-1].split(".")[0] (i.e., take just the filename). This is not guaranteed to be unique; rather than having some awkward config for specifying this, we could attempt to generate the unique set of IDs from first the filenames themselves, then the first folder up, then second folder up etc. and only fallback to the full file path when required. The immutable_id field (which we dont use atm) can be set by the relative filepath still. I will try experimenting with this...
we should set last_modified to be the datetime of extraction
the database metadata should include the optimade-maker version used
I've just been trying to make use of my own dataset through the API here, and have collected a few comments:
idfield. At the moment we use precisely the relative filepath that the structure came from. I wonder if a better default would be attemptingfilepath.split("/")[-1].split(".")[0](i.e., take just the filename). This is not guaranteed to be unique; rather than having some awkward config for specifying this, we could attempt to generate the unique set of IDs from first the filenames themselves, then the first folder up, then second folder up etc. and only fallback to the full file path when required. Theimmutable_idfield (which we dont use atm) can be set by the relative filepath still. I will try experimenting with this...last_modifiedto be the datetime of extraction