-
my config is
i'm index 100M files, embedding api have some problem when extract graph is finished, i'm tring to resume work
And just re-extract graph now, is something i did wrong? |
Beta Was this translation helpful? Give feedback.
Replies: 5 comments 6 replies
-
same problem |
Beta Was this translation helpful? Give feedback.
-
I have the same situation where I have only embedding pipeline failed and do not want to rerun everything else. I checked doc and code, I didn't find resume option. But I did find a "workflow" config where you can specify the pipelines. |
Beta Was this translation helpful? Give feedback.
-
I tried --resume and skip_workflows, both not working. But here is a hardcode way out, comment "_get_workflow_list" function |
Beta Was this translation helpful? Give feedback.
-
I added workflows configuration in the settings.yaml file, and successfully resolved! https://microsoft.github.io/graphrag/config/yaml/ ![]() |
Beta Was this translation helpful? Give feedback.
-
GraphRAG caches LLM calls aggressively, so re-running after a failure should skip over all the previous work. If that still takes a long time due to CPU processing as it runs through the workflows, the solution by @VanillaTY is correct: the |
Beta Was this translation helpful? Give feedback.
I added workflows configuration in the settings.yaml file, and successfully resolved! https://microsoft.github.io/graphrag/config/yaml/