@devin-ai-integration devin-ai-integration bot commented Nov 29, 2025

Summary

Fixes GitHub issue #3995, where users couldn't configure the HuggingFace embedder using the documented configuration format (api_key, model, api_url).

The root cause was that HuggingFaceProvider was using ChromaDB's HuggingFaceEmbeddingServer (which requires a url parameter for custom embedding servers) instead of HuggingFaceEmbeddingFunction (which uses the HuggingFace Inference API with api_key and model_name).

Changes:

  • Switch from HuggingFaceEmbeddingServer to HuggingFaceEmbeddingFunction
  • Add api_key, model_name, and api_key_env_var fields with appropriate validation aliases
  • Accept api_url/url for compatibility with docs but exclude from model_dump (not used by Inference API)
  • Update HuggingFaceProviderConfig TypedDict with new fields
  • Add comprehensive tests

Review & Testing Checklist for Human

  • Verify that silently ignoring api_url is acceptable - The field is accepted for compatibility with the documented config but excluded from the underlying API call. Users might expect it to configure a custom endpoint; consider logging a warning.
  • Check for breaking changes - The old provider required url and used HuggingFaceEmbeddingServer. Verify no existing users depend on that behavior (search for "provider": "huggingface" usage in production).
  • Review uv.lock changes - The lock file was regenerated due to corruption. Scan for any unexpected dependency version changes that could cause issues.
  • Test with real HuggingFace API - The automated tests don't call the actual API. Recommend testing with a real HF token:
    import os

    from crewai import Crew

    crew = Crew(
        agents=[...],
        tasks=[...],
        memory=True,
        embedder={
            "provider": "huggingface",
            "config": {
                "api_key": os.getenv("HF_TOKEN"),
                "model": "sentence-transformers/all-MiniLM-L6-v2"
            }
        }
    )

Notes

- Update HuggingFaceProvider to use HuggingFaceEmbeddingFunction instead of
  HuggingFaceEmbeddingServer for HuggingFace Inference API support
- Add api_key, model_name, and api_key_env_var fields to match documented config
- Accept api_url for compatibility but exclude from model_dump (not used by
  HuggingFace Inference API)
- Add validation aliases for model (maps to model_name) and environment variables
- Update HuggingFaceProviderConfig TypedDict with new fields
- Add comprehensive tests for HuggingFace provider configuration
- Regenerate uv.lock (was corrupted)

Fixes #3995

Co-Authored-By: João <[email protected]>

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

  • Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
  • Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

