How to Run PyTorch Models in the Browser With ONNX.js
Summary
TL;DR: In this video, the presenter demonstrates how to convert a PyTorch model for digit recognition (MNIST) into a JavaScript-compatible format for use in the browser. The benefits of running machine learning models directly in the browser include low latency, offline functionality, easy scalability, and enhanced privacy. The tutorial covers using ONNX.js to load and run the model in the browser, troubleshooting common issues, and implementing data augmentation to improve model robustness. The video offers practical insights for integrating machine learning models into web applications with efficient and secure performance.
Takeaways
- Running machine learning models directly in the browser can eliminate network latency, allowing quick predictions without sending data to a server.
- Serving a model as static files makes scaling easy, since no back-end server or load balancing is needed to handle more users.
- Models running in the browser can work offline: once the app or website has loaded the necessary code and weights, predictions remain possible without a connection.
- Running models in the browser improves privacy, since user data doesn't need to be sent to a server, reducing interception risks.
- Large models that take a long time to download or require heavy computation are not suitable for the browser and should stay on the back-end server.
- Computation time can be an issue for certain models, especially on lower-powered devices, making server-hosted models a better option in some cases.
- Hosting a model on the back-end keeps the model itself private, since its structure and parameters stay hidden from the client side.
- To run a PyTorch model in the browser, it must first be converted to a format like ONNX, which JavaScript libraries such as ONNX.js can then use for inference.
- ONNX is an open format for machine learning models that enables cross-platform compatibility, letting models run across various frameworks and environments.
- When preparing a model for browser use, adjust it to use only operators supported by ONNX.js, for example replacing the unsupported 'log softmax' with 'softmax'.
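The operator swap from the last takeaway can be sketched in PyTorch. This is a minimal stand-in network, not the exact architecture from the video; the layer sizes are assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MnistNet(nn.Module):
    """Hypothetical MNIST classifier; the real architecture may differ."""

    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(28 * 28, 128)
        self.fc2 = nn.Linear(128, 10)

    def forward(self, x):
        x = x.view(-1, 28 * 28)          # flatten (batch, 1, 28, 28) -> (batch, 784)
        x = F.relu(self.fc1(x))
        # F.log_softmax is not supported by ONNX.js, so the exported model
        # uses F.softmax instead; if a loss such as NLLLoss needs
        # log-probabilities during training, take the log separately.
        return F.softmax(self.fc2(x), dim=1)
```

The swap changes only the final activation; the probabilities still order the classes the same way, so argmax predictions are unaffected.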
Q & A
What are the main benefits of running a machine learning model in the browser?
- The main benefits include reduced latency by avoiding server communication, ease of scaling with static files, offline functionality, and enhanced privacy, since no user data needs to be sent to a server.
Why might you not want to run a machine learning model in the browser?
- You may not want to run a model in the browser if the model is too large to download quickly, if the computation is too resource-intensive for the device, or if you want to keep the model private to protect intellectual property.
What is the ONNX format and why is it useful?
- ONNX (Open Neural Network Exchange) is an open format that defines a common set of operators for machine learning models. It allows models to be transferred between different frameworks and used in various tools, runtimes, and compilers, making it easier to load and run models across different environments.
What is the difference between using ONNX.js and TensorFlow.js?
- ONNX.js is simpler and more performant for inference tasks but does not support training. TensorFlow.js, on the other hand, allows both training and inference in the browser, but it may be more complex and resource-intensive.
How do you convert a PyTorch model into an ONNX model?
- To convert a PyTorch model into an ONNX model, you instantiate the model, load the trained weights, set it to evaluation mode, create a dummy input tensor, and then use the `torch.onnx.export()` method to save the model in the ONNX format.
What issue was encountered when trying to run the model in the browser and how was it fixed?
- The issue was that ONNX.js did not support the 'log softmax' operator, which was used in the PyTorch model. This was resolved by replacing 'log softmax' with 'softmax', which is supported by ONNX.js.
What is the role of data normalization in training the model?
- Data normalization is crucial when training models, as it standardizes the input data, bringing all pixel values into a similar range. This improves the model's ability to learn effectively and reduces biases that could result from varying input scales.
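As a concrete sketch, the usual MNIST normalization scales raw 0-255 pixel values to [0, 1] and then standardizes them with the dataset's mean and standard deviation (0.1307 and 0.3081 are the commonly quoted MNIST statistics; the video may use different values):

```python
# Commonly quoted MNIST dataset statistics.
MNIST_MEAN = 0.1307
MNIST_STD = 0.3081

def normalize_pixels(pixels):
    """Map raw 0-255 grayscale values to standardised floats."""
    return [((p / 255.0) - MNIST_MEAN) / MNIST_STD for p in pixels]
```

The same transformation must be applied at inference time in the browser, otherwise the model sees inputs on a different scale than it was trained on.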
How does the data augmentation technique improve the model's performance?
- Data augmentation improves model robustness by applying random transformations (like rotation, translation, scaling, and shearing) to the training data. This helps the model generalize better to different variations of input data and improves its accuracy in real-world scenarios.
What is the importance of testing the model after conversion to ONNX format?
- Testing the model after conversion ensures that the conversion process was successful and that the model produces the expected output in the new format. It also helps identify and fix any issues, such as unsupported operators or incorrect output shapes.
What changes were made to the model to handle input from a canvas element?
- To handle input from a canvas, the model's expected input shape was adjusted for. The original model expected 28x28 pixel images, but the canvas input was 280x280 with RGBA channels, so the canvas data had to be reshaped and normalized to match the model's expected format.
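The reshaping step can be sketched in pure Python: average each 10x10 block of the 280x280 canvas down to one of the 28x28 model pixels. This sketch assumes the digit is drawn onto a transparent canvas, so the alpha channel carries the stroke intensity; if the drawing is in the color channels instead, read those bytes rather than index 3:

```python
def canvas_to_model_input(rgba, size=280, target=28):
    """Downsample flat RGBA canvas data (size x size pixels, 4 bytes each)
    to a flat target x target grayscale list in [0, 1] by averaging the
    alpha channel over each block."""
    block = size // target  # 10x10-pixel blocks for 280 -> 28
    out = []
    for by in range(target):
        for bx in range(target):
            total = 0
            for y in range(by * block, (by + 1) * block):
                for x in range(bx * block, (bx + 1) * block):
                    # 4 bytes per pixel (R, G, B, A); index 3 is alpha.
                    total += rgba[(y * size + x) * 4 + 3]
            out.append(total / (block * block * 255.0))
    return out
```

The browser-side JavaScript performs the equivalent transformation on the `ImageData` returned by the canvas before feeding the tensor to ONNX.js, followed by the same normalization used in training.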