SPRAC21A June 2016 – June 2019 OMAP-L132 , OMAP-L138 , TDA2E , TDA2EG-17 , TDA2HF , TDA2HG , TDA2HV , TDA2LF , TDA2P-ABZ , TDA2P-ACD , TDA2SA , TDA2SG , TDA2SX , TDA3LA , TDA3LX , TDA3MA , TDA3MD , TDA3MV
Initiator/Operation | Source | Destination | Size (KB) | Bandwidth (MB/s) |
---|---|---|---|---|
M4 WR | CPU Register | DDR | 4096 | 268.03 |
M4 RD | DDR | CPU Register | 4096 | 93.59 |
M4 COPY | DDR | DDR | 4096 | 111.39 |
M4 WR | CPU Register | OCMC | 128 | 276.12 |
M4 RD | OCMC | CPU Register | 128 | 158.76 |
M4 COPY | OCMC | OCMC | 128 | 177.34 |
Initiator/
Operation |
Source | Destination | Size (KB) | Write-Back, Write Allocate | Write-Back, No-Write Allocate | Write-Through, Write Allocate | Write-Through,
No-Write Allocate |
---|---|---|---|---|---|---|---|
M4 WR | CPU Register | DDR | 4096 | 75.2 | 268.03 | 267.98 | 267.03 |
M4 RD | DDR | CPU Register | 4096 | 93.48 | 93.59 | 93.15 | 93.61 |
M4 COPY | DDR | DDR | 4096 | 80.56 | 111.39 | 110.44 | 111.42 |
M4 WR | CPU Register | OCMC | 128 | 148.78 | 276.12 | 269.51 | 269.6 |
M4 RD | OCMC | CPU Register | 128 | 158.73 | 158.76 | 155.07 | 158.73 |
M4 COPY | OCMC | OCMC | 128 | 141.7 | 177.34 | 174.21 | 177.78 |
Initiator/Operation | Word Size | Source | Destination | Size (KB) | Bandwidth (MB/s) |
---|---|---|---|---|---|
M4 WR | 32 bit | CPU Register | DDR | 4096 | 267.95 |
M4 RD | 32 bit | DDR | CPU Register | 4096 | 93.17 |
M4 COPY | 32 bit | DDR | DDR | 4096 | 110.42 |
M4 WR | 64 bit | CPU Register | DDR | 8192 | 179.88 |
M4 RD | 64 bit | DDR | CPU Register | 8192 | 78.09 |
M4 COPY | 64 bit | DDR | DDR | 8192 | 100.2 |
Table 36 shows the performance difference between with and without loop unroll. In case of without loop unroll (when there is only one read, write, and copy operation in the loop) gives inferior performance compared to with loop unroll as it checks for loop condition after every word transfer in case of without loop unroll. In case of with loop unroll, it checks for loop condition only after a 128 words (write or read) or a 32 words (copy) transfer.
Initiator/Operation | Source | Destination | Size (KB) | Bandwidth (MB/s) (with loop unroll) | Bandwidth (MB/s) (without loop unroll) |
---|---|---|---|---|---|
M4 WR | CPU Register | DDR | 4096 | 267.76 | 203.08 |
M4 RD | DDR | CPU Register | 4096 | 93.19 | 69.2 |
M4 COPY | DDR | DDR | 4096 | 110.6 | 104.24 |