QuantTess.h File
QuantTess Node — fused CVU kernel that quantizes FP32→INT8 then tessellates. More...
Included Headers
#include "builder/NodeContractConfigurable.h"
#include "builder/NodeContractProvider.h"
#include "builder/Node.h"
#include <nlohmann/json.hpp>
#include <memory>
#include <optional>
#include <string>
#include <vector>
Namespaces Index
| namespace | simaai |
| namespace | neat |
| namespace | nodes |
Classes Index
| struct | QuantTessOptions |
| class | QuantTess |
|
Fused CVU kernel Node: Quant (FP32→INT8) followed by Tess. More... | |
Description
QuantTess Node — fused CVU kernel that quantizes FP32→INT8 then tessellates.
Combines an INT8 quantize and a tessellate layout shuffle into a single CVU pass. Inserted by the route planner before the MLA when the model expects INT8 input and MLA-side tessellation is not part of the compiled MLA kernel — fusing the two stages saves a DDR round-trip versus running Quant and Tess back to back.
- See Also
"The dtype contract" page in /concepts/dtype_contract
Generated via doxygen2docusaurus 2.0.0 by Doxygen 1.9.8.