AI Model Deployment Toolkit

AUTHORS: Benjamin Lee and Jayanth Vennamreddy

Features

Summarization: Condense long documents
Translation: 50+ language pairs
Question Answering: Extract answers from context
Sentiment Analysis: Detect emotion strength

General Information

This toolkit provides four NLP capabilities through a user-friendly GUI:

Text summarization
Machine translation
Question answering
Sentiment analysis

DEPENDENCIES: Below are the dependencies needed to install in the terminal (we have used VScode) before running the program:

transformers
torch
sentencepiece
scikit-learn
accelerate
VaderSentiment
gradio
psutil

FOR EACH FEATURE:

Summarize (t5-base): place some text as input
Translate (Helsinki-NLP/opus-mt-{source}-{target}): place some text as input and choose source language and target language
Answer Question (t5-base): Line 1: context (text that contains the answer), Line 2: the question
Classify (sentiment analysis): place some sentence or paragraph as input

LOCAL TERMINAL INSTRUCTIONS (Milestone 2.1 folder)

pip install all the dependencies
Run python main_gui.py
Alternatively you can run the gradio or streamlit versions:

python main_gradio.py
python -m streamlit run main_streamlit.py

REMOTE HPC INSTRUCTIONS (Milestone 2.2 folder):

NOTE: This version is not up to date Differences: The program connects remotely to a HPC and runs the AI on there instead of on your local machine. It will display the result on your local machine. As this program is not published yet and is still in testing, some setup needs to be done:

SETUP INSTRUCTIONS

Generate an id_rsa public/private pair using the following command on your local machine:

ssh-keygen -t rsa -b 4096 -C "username@host"
Replace "username@host" with what you use to log on to your HPC cluster

If it's not there, put the public id_rsa in the .ssh folder of your hpc cluster: cd .ssh and use scp. It might be there already, double check just in case
Make a secret.py file. Include the following 5 variables:

HPC_USER = your username, whatever is before the @ in what you use to login
HPC_HOST = the host of the HPC, whatever is after the @ in what you use to login
REMOTE_INPUT_FILE = The absolute path to your desired prompt.txt location in your HPC
REMOTE_OUTPUT_FILE = The absolute path to your desired prompt.txt location in your HPC
HPC_JOB_SCRIPT = The path to your desired jobscript.sh location. Does not have to be the absolute path
Note: To get the absolute filepath you can use pwd or realpath NOTE: Put secret.py in the .gitignore

Clone the repository in the HPC
scp your secrets file into the HPC
run "python main_hpc.py" from the local directory

DEBUGGING: If you want error logs, add the following line to the start of your jobscript.sh on the HPC: #SBATCH -oReport-%j.out More of a note to self, but if you want to remove all your report logs, use "rm Report*" in the directory of your report logs

KNOWN ISSUES: "Answer Question" response not displaying properly when using the HPC version of the program

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
__pycache__		__pycache__
milestone 2.1		milestone 2.1
milestone_2_2		milestone_2_2
README.md		README.md
T5Base_cybershuttle.ipynb		T5Base_cybershuttle.ipynb
cybershuttle.yml		cybershuttle.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Model Deployment Toolkit

Features

General Information

LOCAL TERMINAL INSTRUCTIONS (Milestone 2.1 folder)

REMOTE HPC INSTRUCTIONS (Milestone 2.2 folder):

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Model Deployment Toolkit

Features

General Information

LOCAL TERMINAL INSTRUCTIONS (Milestone 2.1 folder)

REMOTE HPC INSTRUCTIONS (Milestone 2.2 folder):

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages