Lecture 24: Deployment and conclusion#

UBC 2024-25

Imports and LOs#

Imports#

import sys, os
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import cross_validate, train_test_split
from sklearn.pipeline import Pipeline, make_pipeline
DATA_DIR = os.path.join(os.path.abspath(".."), "data/")

Learning objectives#

  • Describe the goals and challenges of model deployment.



Model deployment#

❓❓ Questions for you#

Imagine you’ve created a machine learning model and are eager to share it with others. Consider the following scenarios for sharing your model:

  • To a non-technical Audience: How would you present your model to friends and family who may not have a technical background?

  • To a technical audience: How would you share your model with peers or professionals in the field who have a technical understanding of machine learning?

  • In an academic or research setting: How would you disseminate your model within academic or research communities?









Try out this moment predictor#

https://cpsc330-moment-predictor.onrender.com/

  • In this lecture, I will show you how to set up/develop this.

What is deployment?#

  • After we train a model, we want to use it!

  • The user likely does not want to install your Python stack, train your model.

  • You don’t necessarily want to share your dataset.

  • So we need to do two things:

  1. Save/store your model for later use.

  2. Make the saved model conveniently accessible.

We will use Joblib for (1) and Flask & render for (2).

Requirements (I already did these)#

  • render account. Register here.

More python installations:

pip install Flask
pip install Flask-WTF
pip install joblib

Demo: Deploying moment classification model#

Building a model#

Recall the multi-class classification problem using the HappyDB corpus.

df = pd.read_csv(DATA_DIR + "cleaned_hm.csv", index_col=0)
sample_df = df.dropna()
sample_df.head()
sample_df = sample_df.rename(
    columns={"cleaned_hm": "moment", "ground_truth_category": "target"}
)
sample_df.head()
wid reflection_period original_hm moment modified num_sentence target predicted_category
hmid
27676 206 24h We had a serious talk with some friends of our... We had a serious talk with some friends of our... True 2 bonding bonding
27678 45 24h I meditated last night. I meditated last night. True 1 leisure leisure
27697 498 24h My grandmother start to walk from the bed afte... My grandmother start to walk from the bed afte... True 1 affection affection
27705 5732 24h I picked my daughter up from the airport and w... I picked my daughter up from the airport and w... True 1 bonding affection
27715 2272 24h when i received flowers from my best friend when i received flowers from my best friend True 1 bonding bonding
sample_df["target"].value_counts()
target
affection           4810
achievement         4276
bonding             1750
enjoy_the_moment    1514
leisure             1306
nature               252
exercise             217
Name: count, dtype: int64

It’s a multiclass classification problem!

train_df, test_df = train_test_split(sample_df, test_size=0.2, random_state=123)
X_train_happy, y_train_happy = train_df["moment"], train_df["target"]
X_test_happy, y_test_happy = test_df["moment"], test_df["target"]
pipe_lr = make_pipeline(
    CountVectorizer(stop_words="english"),
    LogisticRegression(max_iter=2000),
)
pipe_lr.fit(X_train_happy, y_train_happy);
pipe_lr.score(X_train_happy, y_train_happy)
0.9530973451327434
pipe_lr.score(X_test_happy, y_test_happy)
0.8162831858407079

Training on the full corpus#

  • Ideally, you’ll try all different models, fine tune the most promising models and deploy the best performing model.

  • Sometimes before deploying a model people train it on the full dataset.

    • This is probably a good idea, because more data is better.

    • It’s also a little scary, because we can’t test this new model.

  • Here I’m just deploying the model trained on the training set.

Saving the model#

  • If we want to deploy a model, we need to save it.

  • We are using joblib for that.

import joblib

with open("../web_api/moment_predictor.joblib", "wb") as f:
    joblib.dump(pipe_lr, f)
with open("../web_application/moment_predictor.joblib", "wb") as f:
    joblib.dump(pipe_lr, f)

We’ll define a function that accepts input data as a dictionary and returns a prediction:

Loading our saved model#

Let’s write a function to get predictions.

def return_prediction(model, text):
    prediction = model.predict([text])[0]
    return prediction
model = joblib.load("../web_api/moment_predictor.joblib") # load the saved model
text = "I love my students!"
return_prediction(model, text)
'affection'

This function appears in the app.py that we’ll be using shortly.

(Optional) Setting up a directory structure and environment#

  • We need a specific directory structure to help us easily deploy our machine learning model.

  • This is already set up in this repo.

├── web_api
│   └── moment_predictor.joblib  # this is the machine learning model we have built locally
│   └── app.py  # the file that defines our flask API
│   └── Procfile  # required to help start flask app
│   └── requirements.txt  # file containing required packages
│   
└── web_application
    └── moment_predictor.joblib  # this is the machine learning model we have built locally
    └── app.py  # the file that defines our flask API
    └── Procfile  # required to help start flask app
    └── requirements.txt  # file containing required packages
    └── templates  # contains HTML templates to help us build the web application
    │   └── style.css  # css template to be used in web application
    └── static  # this subdirectory contains CSS style sheets
        └── home.html  # html template to be used in web application
        └── prediction.html  # html template to be used in web application

Model deployment#

We have two options for deploying our moment prediction model. We can:

  1. Build a web application (app) with a HTML user-interface that interacts directly with our model.

  2. Develop a RESTful (REST stands for REpresentational State Transfer) web API that accepts HTTP requests in the form of input data and returns a prediction.

We’ll explore both options below.

Building and deploying a web app#

on localhost (my laptop)

on server (the interwebs)

app

API

  • Flask can create entire web applications.

  • We need to link our code to some html and css to create our web application.

  • We will use Flask to create a html form, accept data submitted to the form, and return a prediction using the submitted data.

  • Again, I won’t go into too much detail here, but we can open up web_application/ and take a quick look.

  • Let’s try web_application/app.py that handles this part.

  • We can open it up here in Jupyter and take a look.

  • If we run python app.py we’ll bring it to life.

Web app on local server#

  1. Go to the terminal.

  2. Navigate to the web_application directory.

  3. Run the following to make the app alive: python app.py.

    • If you get an error, you may need to install those extra packages and make sure you have the environment loaded.

  4. Now you should be able to access the app at: http://127.0.0.1:5000/ or http://localhost:5000/.

Web app on a real server#

  • If you want people to use your app/model, you would probably want to put it on a real server and not your laptop so that it’s live all the time.

on localhost (my laptop)

on server (the interwebs)

app

API

  • We’ll use Render for this.

Render set-up (I already did these):#

  1. Go to render, log-in

  2. Click on “New” and create a web service.

Demo of creating a new web service#

  • Create an associated GitHub repository.

  • I’ve put the contents from web_application folder in this repository.

  • Then you connect that repository to your new web service in render.

  • Once you fill in all the information, it’ll create an environment for you using requirement.txt and if everything goes well, your web service will be live.

  • You’ll see a link at the top which you can share with your friends.

  • Try it out for your own machine learning applications!

  • This is nice! If you develop a model and you want your friends to try it out without installing anything on their local computers, you can do this.

API on the localhost#

  • Often you want other people to be able to use your models in their applications.

  • We can do this by creating an API.

  • If you don’t know what an API is, that’s OK.

    • For our purposes, it’s something that exists at a particular address, that can accept information and return information.

    • Sort of like a function but not Python-specific and potentially accessible by anyone on the internet.

on localhost (my laptop)

on server (the interwebs)

app

API

  • Go to the terminal.

  • Navigate to web_api folder in this repo.

  • Run the following to make the api alive: python app.py

(Note that for more complex applications, you may choose to containerize everything in a Docker container to deploy to render).

Sending a request to the API#

  • We have a RESTful (REST stands for REpresentational State Transfer) web API that accepts HTTP requests in the form of input data and returns a prediction.

  • Now you can send requests to the API and get predictions.

!curl -d '{"text":"The students will be submitting their last homework assignment soon!"}' -H "Content-Type: application/json" -X POST http://127.0.0.1:5000/predict
"achievement"
  • curl (stands for client URL) is a tool for transferring data using various network protocols.

Here’s what each part does in the command:

  • -d \(\rightarrow\) allows you to send a POST request with the specified data.

  • '{"text":"The students will be submitting their last homework assignment soon!"}' \(\rightarrow\) the JSON data you’re sending.

  • -H \(\rightarrow\) specifies a custom header, in this case, Content-Type: application/json to tell the server you’re sending JSON.

  • -X POST \(\rightarrow\) explicitly sets the HTTP method to POST.

  • http://localhost:5000/predict \(\rightarrow\) the URL where you’re sending the request.

Make sure you have app.py running at port 5000 when you execute the curl command.

Okay, so we have a working API running on localhost, but we don’t want to host this service on my laptop!

Deploying the API on a server (not covered)#

  • We now want to deploy it on a “real” server so others can send it requests.

  • You could use services such as AWS for this

on localhost (my laptop)

on server (the interwebs)

app

API

You can also use render for this. Follow the instructions here to try the API on server.

Discussion#

  • There are many ways to deploy a model; a RESTful API is very common and convenient.

  • As you can see, a simple deployment is fairly straight forward.

  • However, there may be other considerations such as:

    • Privacy/security

    • Scaling to large number of requests

    • Error handling

    • Real-time / speed

    • Low-resource environments

    • etc.

Break (~15 min)#





Course review / conclusion (~20 min)#

We started with these learning objectives.

  1. Identify problems that may be addressed with machine learning.

  2. Select the appropriate machine learning tool for a problem.

  3. Transform data of various types into usable features.

  4. Apply standard tools implementing supervised and unsupervised learning techniques.

  5. Describe core differences between training, validation, and testing regimes.

  6. Effectively communicate the results of a machine learning pipeline.

  7. Be realistic about the limitations of individual approaches and machine learning as a whole.

  8. Identify and avoid scenarios in which training and testing data are accidentally mixed (the “Golden Rule”).

  9. Employ good habits for applying ML, such as starting an analysis with a baseline estimator.

  • How did we do?

  • Hopefully OK.

  • Another important part of this is create reproducible workflows and pipelines.

  • There is a new course on this topic.

What did we cover?#

I see the course roughly like this (not necessarily in order):

Part 1: Supervised learning on tabular data

  • Overfitting, train/validation/test/deployment, cross-validation

  • Feature preprocessing, pipelines, imputation, OHE, etc

  • The Golden Rule, various ways to accidentally violate it

  • Classification metrics: confusion matrix, precision/recall, ROC, AUC

  • Regression metrics: MSE, MAPE

  • Feature importances, feature selection

  • Hyperparameter optimization

  • A bunch of models:

    • baselines

    • linear models (ridge, logistic regression)

    • KNNs and RBF SVMs

    • tree-based models (random forest, gradient boosted trees)

    • ensembles

Part 2: Other data types (non-tabular)

  • Clustering: K-Means, DBSCAN

  • Recommender systems

  • Computer vision with pre-trained deep learning models (high level)

  • Language data, text preprocessing, embeddings, topic modeling

  • Time series

  • Right-censored data / survival analysis

Part 3: Communication and Ethics

  • Communicating your results

  • Ethics for ML



❓❓ Questions for you#

For each of the scenarios below

  • Identify if ML is a good solution for a problem.

  • If yes

    • Frame the problem to a ML problem.

    • Discuss what kind of features you would need to effectively solve the problem

    • What would be a reasonable baseline?

    • Which model would be a suitable model for the given scenario?

    • What would be the appropriate success metrics.

App

Goal

QueuePredictor app

Inform callers how long they’ll wait on hold given the current call volume

To-doList App

Keep track of the tasks that a user inputs and organize them by date

SegmentSphere App

To segment customers to tailor marketing strategies based on purchasing behavior

Video app

Recommend useful videos

Dining app

Identify cuisine by a restaurant’s menu

Weather app

Calculate precipitation in six hour increments for a geographic region

EvoCarShare app

Calculate number of car rentals in four increaments at a particular Evo parking spot

Pharma app

Understand the effect of a new drug on patient survival time





Some key takeaways#

Some useful guidelines:

  • Do train-test split right away and only once

  • Don’t look at the test set until the end

  • Don’t call fit on test/validation data

  • Use pipelines

  • Use baselines

  • Do not be overconfident about your results

Difference between Statistics and Machine Learning#

  • There is no clear boundary.

  • But loosely

    • in statistics the emphasis is on inference (usually framed as hypothesis testing)

    • in machine learning the emphasis is on prediction and generalization. We are primarily interested in using our model on new unseen examples.

Recipe to approach a supervised learning problem with tabular data#

Understanding the problem#

  1. Have a long conversation with the stakeholder(s) who will be using your pipeline.

  2. Have a long conversation with the person(s) who collected the data.

  3. Think about

    • what do you want to accomplish?

    • whether you really need to use machine learning for the problem or not

    • how do you want to frame the problem as a supervised machine learning problem? What do you want to predict?

    • how do you plan to measure your success?

    • is there a baseline?

    • is there any operating point?

    • what will it buy the stakeholders if you improve over their current baseline?

  4. Think about the ethical implications - are you sure you want to do this project? If so, should ethics guide your approach?

Initial analysis, EDA, preprocessing#

  1. Random train-test split with fixed random seed; do not touch the test data until Step 16.

  2. Exploratory data analysis, outlier detection.

  3. Choose a scoring metric -> higher values should make you & your stakeholders happier.

  4. Feature engineering –> Create new features which might be useful to solve the prediction problem. This is typically a time-consuming process. Also, sometimes you get an idea of good features only after looking at your model results. So it’s an iterative process.

  5. Fit a baseline model, e.g. DummyClassifier or DummyRegressor.

  6. Create a preprocessing pipeline.

  7. (Optional) Incorporate feature selection.

Model building#

  1. Try a linear model, e.g. LogisticRegression for classification or Ridge; tune hyperparameters with CV.

  2. Try other sensible model(s), e.g. LightGBM; tune hyperparameters with CV.

  3. For each model, look at sub-scores from the folds of cross-validation to get a sense of “error bars” on the scores.

  4. Pick a few reasonable models. Best CV score is a reasonable metric, though you may choose to favour simpler models.

  5. Carry out hyperparameter optimization for these models, paying attention to whether you are not susceptible to optimization bias.

  6. Try averaging and stacking to examine whether you improve the results over your best-performing models.

Model transparency and interpretation#

  1. Look at feature importances.

  2. (optional) Perform some more diagnostics like confusion matrix for classification, or “predicted vs. true” scatterplots for regression.

  3. (optional) Try to calibrate the uncertainty/confidence outputted by your model.

  4. Test set evaluation.

  5. Model transparency. Take a random sample from the test data and try to explain your predictions using SHAP plots.

  6. Question everything again: validity of results, bias/fairness of trained model, etc.

  7. Concisely summarize your results and discuss them with stakeholders.

  8. (optional) Retrain on all your data.

  9. Deployment & integration.

PS: the order of steps is approximate, and some steps may need to be repeated during prototyping, experimentation, and as needed over time.

Note

The order of steps is approximate, and some steps may need to be repeated during prototyping, experimentation, and as needed over time.



What would I do differently?#

Lots of room for improvement. Here are some things on my mind.

  • Flipped classroom in a more effective way in the first part of the course.

  • More demos during the lecture time

  • Worksheets/practice questions during tutorials

  • Course project??

  • Add more interactive components in the lectures

  • Some material to cover: dealing with outliers, data collection, large language models

I’m sure you have other suggestions - feel free to drop me an email or drop them in the course evaluations.

What next?#

If you want to further develop your machine learning skills:

  • Practice!

    • Work on your own projects

  • Work hard and be consistent.

  • If you are interested in research in machine learning

    • Take CPSC 340. If you do not have the required prereqs you can try to audit it.

    • Get into the habit of reading papers and replicating results

330 vs. 340#

  • We are hoping lots of people will take both courses.

  • There is some overlap but not a crazy amount.

  • If you want to learn how these methods work under the hood, CPSC 340 will give you a lot of that, such as:

    • Implementing Ridge.fit() from scratch

    • Mathematically speaking, what is C in LogisticRegression?

    • How fast do these algorithms run in terms of the number of rows and columns of your dataset?

    • Etc.

  • There are also a bunch of other methods covered.

Conclusion & farewell#

That’s all, folks. We made it! Good luck on your final exam! When you get a chance, please let me know what worked for you and what didn’t work for you in this course.