Welcome to the WhiteLightning Model Library - a centralized repository for uploading, downloading, and managing trained machine learning models.
This repository serves as a comprehensive model library where you can:
- Upload your trained machine learning models
- Download pre-trained models for your projects
- Share models with team members and collaborators
- Access model metadata, training data, and configuration files
The repository is organized by model classification types:
whitelightning-model-library/
βββ Binary/ # Binary classification models
β βββ hate_speech_classifier/
β βββ sentiment_classifier/
β βββ spam_classifier/
βββ Multiclass/ # Multiclass classification models
β βββ customer_review_topic/
β βββ news_topic_clf/
βββ Multiclass(Sigmoid)/ # Multilabel classification models
βββ emotion_classifier/
βββ news_multilabel/
- Purpose: Detects hate speech in text content
- Type: Binary classification (hate speech / not hate speech)
- Files: Model weights (.h5, .onnx), training data, vocabulary, scaler
- Purpose: Analyzes sentiment in text
- Type: Binary classification (positive / negative sentiment)
- Files: Model weights (.h5, .onnx), training data, vocabulary, scaler
- Purpose: Identifies spam messages
- Type: Binary classification (spam / not spam)
- Files: Model weights (.h5, .onnx), training data, vocabulary, scaler
- Purpose: Categorizes customer reviews by topic
- Type: Multiclass classification
- Categories: Customer Service, Ease of Use, Price, and more
- Files: Model weights (.h5, .onnx), training data, vocabulary, scaler
- Purpose: Classifies news articles by topic
- Type: Multiclass classification
- Categories: Business, Health, Politics, and more
- Files: Model weights (.h5, .onnx), training data, vocabulary, scaler
- Purpose: Detects multiple emotions in text simultaneously
- Type: Multilabel classification with sigmoid activation
- Categories: Fear, Happy, Love, Sadness
- Unique Feature: Single text can belong to multiple emotion categories
- Files: Model weights (.keras, .onnx), training data, vocabulary, scaler
- Purpose: Categorizes news articles into multiple topics simultaneously
- Type: Multilabel classification with sigmoid activation
- Categories: Business, Environment, Health, Politics, Sports, Technologies, Wars
- Unique Feature: Single article can belong to multiple news categories
- Files: Model weights (.keras, .onnx), training data, vocabulary, scaler
Each model directory contains:
model_name/
βββ model.h5 # Keras model file
βββ model.onnx # ONNX model file for cross-platform deployment
βββ training_data.csv # Training dataset
βββ edge_case_data.csv # Edge case test data
βββ vocab.json # Vocabulary mapping
βββ scaler.json # Feature scaling parameters
βββ generation_config.json # Model generation configuration
βββ *_edge_case_predictions.csv # Edge case prediction results
βββ api_requests/ # API request logs and examples
βββ *.json # Request/response examples
-
Clone the repository:
git clone <repository-url> cd whitelightning-model-library
-
Navigate to desired model:
cd Binary/sentiment_classifier/ # Example
-
Load the model in your code:
# For Keras models from tensorflow.keras.models import load_model model = load_model('model.h5') # For ONNX models import onnxruntime as ort session = ort.InferenceSession('model.onnx')
-
Create a new directory under the appropriate classification type:
mkdir Binary/your_model_name/ # or mkdir Multiclass/your_model_name/ # or mkdir "Multiclass(Sigmoid)/your_model_name/"
-
Add required files:
model.h5- Keras modelmodel.onnx- ONNX model (optional but recommended)training_data.csv- Training datasetvocab.json- Vocabulary filescaler.json- Scaler parametersgeneration_config.json- Model configurationedge_case_data.csv- Test cases (optional)
-
Commit and push:
git add . git commit -m "Add new model: your_model_name" git push origin main
- Keras (.h5): Native TensorFlow/Keras format
- ONNX (.onnx): Cross-platform format for deployment
- JSON files: Configuration and vocabulary data
- CSV files: Training and test datasets
- Python 3.7+
- TensorFlow/Keras (for .h5 models)
- ONNX Runtime (for .onnx models)
- pandas, numpy (for data handling)
- Fork the repository
- Create a feature branch
- Add your model following the directory structure
- Submit a pull request with model description
WhiteLightning distills massive, state-of-the-art language models into lightweight, hyper-efficient text classifiers. It's a command-line tool that lets you create specialized models that run anywhereβfrom the cloud to the edgeβusing the universal ONNX format for maximum compatibility.
Need comprehensive guides and documentation? Check out our WhiteLightning Site - this repository hosts the official website for WhiteLightning at https://whitelightning.ai, a cutting-edge LLM distillation tool with detailed documentation, tutorials, and implementation guides.
Want to test your ONNX models across multiple programming languages? Check out our WhiteLightning Test Framework - a comprehensive cross-language testing suite that validates your models in:
- 8 Programming Languages: Python, Java, C++, C, Node.js, Rust, Dart, and Swift
- Performance Benchmarking: Detailed timing, memory usage, and throughput analysis
- Automated Testing: GitHub Actions workflows for continuous validation
- Real-world Scenarios: Test with custom inputs and edge cases
Perfect for ensuring your WhiteLightning models work consistently across all target platforms and deployment environments.
For questions or issue regarding the model library, please:
- Open an issues in this repository
- Contact the WhiteLightning team
This project is licensed under the MIT License.
Happy modeling! π