| Management number | 231977606 | Release Date | 2026/06/18 | List Price | $3.10 | Model Number | 231977606 | ||
|---|---|---|---|---|---|---|---|---|---|
| Category | |||||||||
Learn how to compile, optimize, and deploy deep learning models with Apache TVM across CPUs, GPUs, and microcontrollers.Moving a trained model from PyTorch or ONNX into a fast, reliable deployment artifact is not just a conversion step. You need to understand model import, IRModule structure, Relax graph optimization, TensorIR scheduling, target-specific builds, runtime packaging, and validation.Apache TVM: The Complete Guide to Deep Learning Compilation and Deployment gives you a practical path through the TVM workflow, from setup and model conversion to tuning, benchmarking, cross compilation, and production-ready deployment checks.Inside, you will learn how to:Set up a reliable TVM development environment for CPU, GPU, and microcontroller workflowsUnderstand how IRModule, Relax, and TensorIR fit into the compilation pipelineImport PyTorch exported programs and ONNX models into TVM with correct shapes, dtypes, and parametersBuild Relax optimization pipelines using constant folding, dataflow conversion, dead code elimination, fusion, and pattern rewritingRead and schedule TensorIR programs using split, reorder, fuse, vectorize, parallel, cache reads, cache writes, and GPU thread mappingUnderstand AutoTVM, AutoScheduler, MetaSchedule, tuning records, and practical auto-tuning workflowsCompile and deploy models on LLVM-based CPU targets with thread, vectorization, export, loading, and benchmarking guidanceBuild GPU workflows for CUDA, ROCm, Vulkan, OpenCL, and Metal targets while managing memory, device contexts, synchronization, and batch sizeUse DLight, TensorIR scheduling, and library dispatch concepts for GPU workloadsHandle cross compilation, RPC upload, remote benchmarking, microTVM, AOT execution, CRT, and Model Library FormatDiagnose import failures, unsupported operators, build problems, runtime errors, shared library issues, and target mismatchesCreate repeatable validation workflows for accuracy, latency, throughput, memory use, artifact size, and deployment readinessThis is a code-heavy technical guide with practical Python, Shell, CMake, TensorIR, Relax, CUDA-oriented, RPC, and microTVM-style examples that show how TVM concepts become real deployment workflows.Whether you are compiling ONNX models, exporting PyTorch models, optimizing kernels, preparing GPU inference, or testing embedded deployment paths, this book helps you build a clearer and more controlled TVM workflow.Grab your copy today and start building practical Apache TVM deployment skills. Read more
| ASIN | B0H198PW78 |
|---|---|
| XRay | Not Enabled |
| Language | English |
| File size | 870 KB |
| Page Flip | Enabled |
| Word Wise | Not Enabled |
| Print length | 420 pages |
| Accessibility | Learn more |
| Screen Reader | Supported |
| Publication date | May 10, 2026 |
| Enhanced typesetting | Enabled |
If you notice any omissions or errors in the product information on this page, please use the correction request form below.
Correction Request Form