SPRUIG3C January 2018 – August 2019 TDA4VM, TDA4VM-Q1
The collective design goal of the C7x ISA and the migration tool is to achieve cycle parity with VCOP on Kernel-C kernels when translated to C7x. The actual performance of a given kernel depends on a variety of factors:
Of these, only the ISA itself is relatively constant.
The most significant performance issues arise from three sources: LHT (lookup and histogram) operations, because of the overhead of copying the table into and out of L1D; OFFSET_NP1 and PDDA parallel scattering stores; and collating stores, which are not well supported on C7x.
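For readers less familiar with these access patterns, the following scalar C sketch shows the general behavior of a histogram update, a scatter store, and a collating store. It is a reference model only, written under the assumption that these operations follow their usual definitions. The function names (`histogram_ref`, `scatter_ref`, `collate_ref`) are hypothetical and are not part of Kernel-C, the migration tool, or the C7x intrinsics, and the sketch does not model the OFFSET_NP1 or PDDA addressing details.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Histogram update (hypothetical reference model): each input value selects
 * a table entry to increment. Several inputs may select the same entry, so
 * the updates are data dependent. */
static void histogram_ref(const uint8_t *index, size_t n,
                          uint32_t *table, size_t table_len)
{
    for (size_t i = 0; i < n; i++) {
        assert(index[i] < table_len);   /* guard against out-of-range bins */
        table[index[i]] += 1;
    }
}

/* Scatter store (hypothetical reference model): element i is written to a
 * per-element offset rather than to consecutive addresses. */
static void scatter_ref(const int16_t *src, const uint16_t *offset, size_t n,
                        int16_t *dst, size_t dst_len)
{
    for (size_t i = 0; i < n; i++) {
        assert(offset[i] < dst_len);    /* guard against out-of-range writes */
        dst[offset[i]] = src[i];
    }
}

/* Collating store (hypothetical reference model): only elements whose
 * predicate is nonzero are written, packed contiguously in the output.
 * Returns the number of elements written. */
static size_t collate_ref(const int16_t *src, const uint8_t *pred, size_t n,
                          int16_t *dst)
{
    size_t out = 0;
    for (size_t i = 0; i < n; i++) {
        if (pred[i]) {
            dst[out++] = src[i];
        }
    }
    return out;
}
```

In all three loops the destination address depends on the data rather than on the loop index alone, which is the property that makes these patterns harder to map onto wide vector stores.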
Having said that, for many kernels the goal of cycle parity is already achieved with current tools. We have established the following general expectations: