Model Optimizer — Web Pulse coverage

Model Quantization: Turn FP8 Checkpoints into High-Performance Inference Engines with NVIDIA TensorRT :: https://wpnews.pro/news/model-quantization-turn-fp8-checkpoints-into-high-performance-inference-engines