Limited Time Sale: US$17.99 cheaper than the new price!!
| Management number | 220490535 |
|---|---|
| Release Date | 2026/05/03 |
| List Price | US$12.00 |
| Model Number | 220490535 |
| Category | |

Build faster AI training and HPC clusters with predictable low latency and high throughput.

Scaling GPUs and tightly coupled compute falls apart when the network adds jitter, stalls, or hidden bottlenecks. Engineers need clear guidance that translates InfiniBand theory into stable production fabrics, from host bring-up to cluster-wide policies.

This book shows how to design, configure, and operate InfiniBand for modern AI and scientific workloads. You will learn practical patterns for topology, RDMA, GPUDirect, QoS, partitioning, monitoring, and troubleshooting so your jobs run consistently at scale.

- Choose between InfiniBand and RoCE based on workload and operations
- Plan fat-tree, dragonfly, and mesh fabrics with clear port and spine math
- Install and align OFED drivers, firmware, and userspace libraries
- Configure HCAs, link modes, MTU, queue depths, and completion strategy
- Enable GPUDirect RDMA and map GPUs to HCAs for NUMA-aware throughput
- Tune NCCL and UCX for multi-GPU training with multi-rail selection
- Set up OpenSM routing, QoS, SL-to-VL mappings, and congestion control
- Use partitions and PKeys for safe multi-tenant isolation
- Bring up IPoIB child interfaces for clean control planes and storage
- Measure bandwidth and latency and read key counters with perfquery
- Diagnose link health with ibstat, iblinkinfo, ibnetdiscover, and ibdiagnet
- Run MPI over InfiniBand and optimize collectives and process placement
- Integrate storage: NVMe over Fabrics, Lustre, and GPFS over RDMA
- Apply kernel sysctl and CPU affinity settings for low-latency RDMA
- Deploy RDMA in Kubernetes and virtualization with SR-IOV and device plugins
- Use adaptive routing and service levels to keep control traffic responsive
- Plan for HDR and NDR generations, optics, cabling, power, and thermal limits

This is a code-forward guide with working Bash, Python, YAML, C, and XML snippets that map directly to production tasks, so you can replicate configurations and validate results quickly.

Get the field-tested playbook for reliable performance at scale. Grab your copy today.

| ISBN13 | 979-8269039381 |
|---|---|
| Language | English |
| Publisher | Independently published |
| Dimensions | 7 x 0.61 x 10 inches |
| Item Weight | 1.31 pounds |
| Print length | 267 pages |
| Publication date | October 8, 2025 |