Intel® Advisor Help
Medium
Survey + Characterization (Trip Counts and FLOP with cache simulation and light data transfer simulation) + Performance Modeling with no assumed dependencies
After running the Offload Modeling perspective with Medium accuracy, you get an extended Offload Modeling report, which provides information about memory and cache usage and taxes of your offloaded application. In addition to the basic data, the result includes:
Offload Modeling perspective assumes a loop is parallel if its dependency type is unknown. It means that there is no information about a loop from a compiler or the loop is not explicitly marked as parallel, for example, with a programming model (OpenMP*, Data Parallel C++, Intel® oneAPI Threading Building Blocks).
If you had a report generated for a lower accuracy, all offload recommendations, metrics, and speed-up will be updated to be more precise taking into account new data.
In the Accelerated Regions tab of the Offload Modeling report, review the metrics about memory usage and data transfers.
Expand the Estimated Bounded By group to see a full picture of all time taxes paid for offloading the region to the target platform.
To learn more about data transfers estimated between host and target device for your application, run Offload Modeling with one the following properties:
Offloaded Objects pane shows a list of memory objects with data about each object aggregated between different instances of one region.
Analytics histogram shows the number of memory objects that the selected region accessed distributed by their size.
The result should have data transfer metrics in the Code Regions pane estimated with and without data reuse for each code region. Examine the metrics in the Estimated Bounded By and Estimated Data Transfer with Reuse columns to check if a code region can benefit from applying data reuse.
For code regions that can benefit from data reuse, you should see Apply Data Reuse guidance in the Recommendations tab. The guidance shows the data transfer estimated with and without data reuse and the performance gain from applying the data reuse. It also explains how you can apply the data reuse technique to your code.