Online-Ressource | |
Verfasst von: | Hapke, Hannes [VerfasserIn] |
Titel: | Advanced model deployments with TensorFlow Serving |
Institutionen: | Safari, an O’Reilly Media Company. |
Verf.angabe: | Hapke, Hannes |
Ausgabe: | 1st edition |
Verlagsort: | [Erscheinungsort nicht ermittelbar] |
Verlag: | O'Reilly Media, Inc. |
Jahr: | 2020 |
Umfang: | 1 online resource (1 video file, approximately 41 min.) |
Fussnoten: | Online resource; Title from title screen (viewed February 28, 2020) |
Abstract: | TensorFlow Serving is one of the cornerstones in the TensorFlow ecosystem. It has eased the deployment of machine learning models tremendously and led to an acceleration of model deployments. Unfortunately, machine learning engineers aren’t familiar with the details of TensorFlow Serving, and they’re missing out on significant performance increases. Hannes Hapke (SAP ConcurLabs) provides a brief introduction to TensorFlow Serving, then leads a deep dive into advanced settings and use cases. He introduces advanced concepts and implementation suggestions to increase the performance of the TensorFlow Serving setup, which includes an introduction to how clients can request model meta-information from the model server, an overview of model optimization options for optimal prediction throughput, an introduction to batching requests to improve the throughput performance, an example implementation to support model A/B testing, and an overview of monitoring your TensorFlow Serving setup. Prerequisite knowledge A basic understanding of Docker functionality and how HTTP requests work General knowledge of machine learning (useful but not required) What you'll learn Learn how to increase the TensorFlow Serving inference performance, increase the inference response time by tweaking the request payload, and run TensorFlow Serving with NVIDIA's TensorRT for further performance improvements Discover how to configure batch requests in TensorFlow Serving and how to configure TensorFlow Serving to provide A/B Testing capabilities This session is from the 2019 O'Reilly TensorFlow World Conference in Santa Clara, CA. |
ComputerInfo: | Mode of access: World Wide Web. |
URL: | Aggregator: https://learning.oreilly.com/library/view/-/0636920372608/?ar |
Datenträger: | Online-Ressource |
Sprache: | eng |
Sach-SW: | Electronic videos ; local |
K10plus-PPN: | 1693203847 |
Lokale URL UB: | Zum Volltext |
Bibliothek der Medizinischen Fakultät Mannheim der Universität Heidelberg | |
Bestellen/Vormerken für Benutzer des Klinikums Mannheim Eigene Kennung erforderlich | |
Bibliothek/Idn: | UW / m3612470175 |
Lokale URL Inst.: | Zum Volltext |