WebML (Web Machine Learning) aims to run machine learning model inference directly in the browser, using the native capabilities of the client machine's hardware. This significantly reduces the compute needed on the server and often improves the user experience. In particular, it makes real-time applications in the browser practical by eliminating the network latency inherent in inference calls to remote APIs. Because inference runs on the device, input data does not have to leave it, which opens the door to smarter, faster, and more private web applications without compromising performance.