Basic Thicket Tutorial: Thicket 101
Thicket is a Python-based toolkit for Exploratory Data Analysis (EDA) of parallel performance data. It supports performance optimization and helps users understand how applications perform on supercomputers. It bridges the gap between tools that consider only a single instance of a simulation run (e.g., single platform, single measurement tool, or single scale) and the need to find actionable insights in multi-dimensional, multi-scale, multi-architecture, and multi-tool performance datasets.
NOTE: An interactive version of this notebook is available in the Binder environment.
1. Import Necessary Packages
To explore the structure and capabilities of thicket components, we begin by importing the necessary packages. These include common Python libraries, hatchet, and thicket, which provides the statistical functions.
[1]:
import re
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from IPython.display import display
from IPython.display import HTML
import hatchet as ht
import thicket as tt
display(HTML("<style>.container { width:80% !important; }</style>"))
2. Read in Performance Profiles
For this notebook, we select profiles generated on lassen, a Lawrence Livermore National Laboratory (LLNL) machine. We create two thicket objects: one from three profiles that share a problem size of 1048576, and one that adds a profile with a problem size of 4194304, so it spans two problem sizes (1048576 and 4194304).
[2]:
# three runs at problem size 1048576
lassen1 = [f"../data/lassen/XL_BaseCuda_01048576_0{x}.cali" for x in range(1, 4)]
# one run at problem size 4194304 (plain string: no placeholders, so no f-prefix)
lassen2 = ["../data/lassen/XL_BaseCuda_04194304_01.cali"]
# generate thicket(s)
th_lassen = tt.Thicket.from_caliperreader(lassen1)
th_obj = tt.Thicket.from_caliperreader(lassen1 + lassen2)
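The file lists in this step follow a fixed naming pattern. As a minimal sketch, assuming the pattern `XL_BaseCuda_<problem size padded to 8 digits>_<run padded to 2 digits>.cali` inferred from the paths used in this notebook, the helper below builds the same lists without hard-coding each name. `profile_paths` and its `data_dir` parameter are hypothetical names introduced for illustration; they are not part of thicket.

```python
def profile_paths(problem_size, runs, data_dir="../data/lassen"):
    """Build profile paths for one problem size (hypothetical helper).

    Assumes the naming pattern seen in this notebook:
    XL_BaseCuda_<8-digit problem size>_<2-digit run>.cali
    """
    return [
        f"{data_dir}/XL_BaseCuda_{problem_size:08d}_{run:02d}.cali"
        for run in runs
    ]


# Three runs at problem size 1048576, one run at 4194304.
lassen1 = profile_paths(1048576, range(1, 4))
lassen2 = profile_paths(4194304, [1])

for path in lassen1 + lassen2:
    print(path)
```

The resulting lists can be passed to `tt.Thicket.from_caliperreader` in the same way as the hand-written ones.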
3. More Information on a Function
You can use Python's built-in help() function to see the documentation for a given object by typing help(object). For a function, the output lists its arguments and, where documented, what it returns. An example is below.
[3]:
help(tt.median)
Help on function median in module thicket.stats.median:
median(thicket, columns=None)
Calculate the median for each node in the performance data table.
Designed to take in a thicket, and append one or more columns to the
aggregated statistics table for the median calculation for each node.
Arguments:
thicket (thicket): Thicket object
columns (list): List of hardware/timing metrics to perform median calculation
on. Note, if using a columnar joined thicket a list of tuples must be passed
in with the format (column index, column name).
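Beyond reading help() output interactively, the same information can be retrieved programmatically. The sketch below uses only the Python standard library and takes `statistics.median` as a stand-in so that it runs without thicket installed; the assumption is that the same calls work on `tt.median`, since it is an ordinary Python function.

```python
import inspect
import pydoc
import statistics

# Stand-in target; replace with tt.median when thicket is available.
target = statistics.median

# The signature is the argument list that help() prints near the top.
sig = inspect.signature(target)
print(f"{target.__name__}{sig}")

# The first docstring line is usually a one-sentence summary.
summary = inspect.getdoc(target).splitlines()[0]
print(summary)

# render_doc returns the full help() text as a string instead of paging it.
help_text = pydoc.render_doc(target, renderer=pydoc.plaintext)
print(len(help_text.splitlines()), "lines of help text")
```

Capturing the text this way is convenient when you want to search the documentation or log it alongside analysis results.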
4. Thicket Components
4.1 Performance Data
The performance data table is a multi-dimensional, multi-indexed component of thicket. Each row is identified by a (node, profile) pair: a call tree node combined with the profile (i.e., execution) in which it was measured, so every node has one row per profile.
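To make the (node, profile) indexing concrete, here is a minimal pandas sketch that rebuilds a small slice of such a table by hand. The timing values are copied from the Avg time/rank column of the performance data table shown in this section. As a simplifying assumption, node labels are plain strings here, whereas the rendered table shows node entries such as `{'name': 'Base_CUDA', 'type': 'function'}`.

```python
import pandas as pd

# (node name, profile hash, Avg time/rank) taken from the table in this section.
rows = [
    ("Base_CUDA", 2118176828577267326, 1.892446),
    ("Base_CUDA", 5411856859942141273, 1.893661),
    ("Base_CUDA", 7599706706110163426, 1.885701),
    ("Algorithm", 2118176828577267326, 0.015058),
    ("Algorithm", 5411856859942141273, 0.015072),
    ("Algorithm", 7599706706110163426, 0.015026),
]
df = pd.DataFrame(rows, columns=["node", "profile", "Avg time/rank"])
df = df.set_index(["node", "profile"]).sort_index()

# All profiles of a single node: select on the first index level.
base = df.loc["Base_CUDA"]
print(base)

# A single profile across all nodes: take a cross-section on the second level.
one_profile = df.xs(2118176828577267326, level="profile")
print(one_profile)

# One aggregated value per node, computed across its profiles.
mean_per_node = df.groupby(level="node")["Avg time/rank"].mean()
print(mean_per_node)
```

The same selection patterns should apply to `th_lassen.dataframe`, with the difference that its node level is looked up by the node entries shown in the table rather than by strings.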
View performance data table:
[4]:
display(HTML(th_lassen.dataframe.to_html()))
node | profile | nid | spot.channel | Min time/rank | Max time/rank | Avg time/rank | Total time | Min time/rank (exc) | Max time/rank (exc) | Avg time/rank (exc) | Total time (exc) | name |
---|---|---|---|---|---|---|---|---|---|---|---|---|
{'name': 'Base_CUDA', 'type': 'function'} | 2118176828577267326 | 0 | regionprofile | 1.892446 | 1.892446 | 1.892446 | 1.892446 | 0.000642 | 0.000642 | 0.000642 | 0.000642 | Base_CUDA |
5411856859942141273 | 0 | regionprofile | 1.893661 | 1.893661 | 1.893661 | 1.893661 | 0.000636 | 0.000636 | 0.000636 | 0.000636 | Base_CUDA | |
7599706706110163426 | 0 | regionprofile | 1.885701 | 1.885701 | 1.885701 | 1.885701 | 0.000617 | 0.000617 | 0.000617 | 0.000617 | Base_CUDA | |
{'name': 'Algorithm', 'type': 'function'} | 2118176828577267326 | 8 | regionprofile | 0.015058 | 0.015058 | 0.015058 | 0.015058 | 0.000050 | 0.000050 | 0.000050 | 0.000050 | Algorithm |
5411856859942141273 | 8 | regionprofile | 0.015072 | 0.015072 | 0.015072 | 0.015072 | 0.000045 | 0.000045 | 0.000045 | 0.000045 | Algorithm | |
7599706706110163426 | 8 | regionprofile | 0.015026 | 0.015026 | 0.015026 | 0.015026 | 0.000048 | 0.000048 | 0.000048 | 0.000048 | Algorithm | |
{'name': 'Algorithm_MEMCPY', 'type': 'function'} | 2118176828577267326 | 19 | regionprofile | 0.005065 | 0.005065 | 0.005065 | 0.005065 | 0.000016 | 0.000016 | 0.000016 | 0.000016 | Algorithm_MEMCPY |
5411856859942141273 | 19 | regionprofile | 0.005063 | 0.005063 | 0.005063 | 0.005063 | 0.000016 | 0.000016 | 0.000016 | 0.000016 | Algorithm_MEMCPY | |
7599706706110163426 | 19 | regionprofile | 0.005072 | 0.005072 | 0.005072 | 0.005072 | 0.000016 | 0.000016 | 0.000016 | 0.000016 | Algorithm_MEMCPY | |
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} | 2118176828577267326 | 23 | regionprofile | 0.002440 | 0.002440 | 0.002440 | 0.002440 | 0.002440 | 0.002440 | 0.002440 | 0.002440 | Algorithm_MEMCPY.block_128 |
5411856859942141273 | 23 | regionprofile | 0.002439 | 0.002439 | 0.002439 | 0.002439 | 0.002439 | 0.002439 | 0.002439 | 0.002439 | Algorithm_MEMCPY.block_128 | |
7599706706110163426 | 23 | regionprofile | 0.002447 | 0.002447 | 0.002447 | 0.002447 | 0.002447 | 0.002447 | 0.002447 | 0.002447 | Algorithm_MEMCPY.block_128 | |
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} | 2118176828577267326 | 21 | regionprofile | 0.002609 | 0.002609 | 0.002609 | 0.002609 | 0.002609 | 0.002609 | 0.002609 | 0.002609 | Algorithm_MEMCPY.library |
5411856859942141273 | 21 | regionprofile | 0.002608 | 0.002608 | 0.002608 | 0.002608 | 0.002608 | 0.002608 | 0.002608 | 0.002608 | Algorithm_MEMCPY.library | |
7599706706110163426 | 21 | regionprofile | 0.002609 | 0.002609 | 0.002609 | 0.002609 | 0.002609 | 0.002609 | 0.002609 | 0.002609 | Algorithm_MEMCPY.library | |
{'name': 'Algorithm_MEMSET', 'type': 'function'} | 2118176828577267326 | 14 | regionprofile | 0.002783 | 0.002783 | 0.002783 | 0.002783 | 0.000014 | 0.000014 | 0.000014 | 0.000014 | Algorithm_MEMSET |
5411856859942141273 | 14 | regionprofile | 0.002788 | 0.002788 | 0.002788 | 0.002788 | 0.000016 | 0.000016 | 0.000016 | 0.000016 | Algorithm_MEMSET | |
7599706706110163426 | 14 | regionprofile | 0.002775 | 0.002775 | 0.002775 | 0.002775 | 0.000016 | 0.000016 | 0.000016 | 0.000016 | Algorithm_MEMSET | |
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} | 2118176828577267326 | 17 | regionprofile | 0.001488 | 0.001488 | 0.001488 | 0.001488 | 0.001488 | 0.001488 | 0.001488 | 0.001488 | Algorithm_MEMSET.block_128 |
5411856859942141273 | 17 | regionprofile | 0.001490 | 0.001490 | 0.001490 | 0.001490 | 0.001490 | 0.001490 | 0.001490 | 0.001490 | Algorithm_MEMSET.block_128 | |
7599706706110163426 | 17 | regionprofile | 0.001484 | 0.001484 | 0.001484 | 0.001484 | 0.001484 | 0.001484 | 0.001484 | 0.001484 | Algorithm_MEMSET.block_128 | |
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} | 2118176828577267326 | 15 | regionprofile | 0.001281 | 0.001281 | 0.001281 | 0.001281 | 0.001281 | 0.001281 | 0.001281 | 0.001281 | Algorithm_MEMSET.library |
5411856859942141273 | 15 | regionprofile | 0.001282 | 0.001282 | 0.001282 | 0.001282 | 0.001282 | 0.001282 | 0.001282 | 0.001282 | Algorithm_MEMSET.library | |
7599706706110163426 | 15 | regionprofile | 0.001275 | 0.001275 | 0.001275 | 0.001275 | 0.001275 | 0.001275 | 0.001275 | 0.001275 | Algorithm_MEMSET.library | |
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} | 2118176828577267326 | 11 | regionprofile | 0.004220 | 0.004220 | 0.004220 | 0.004220 | 0.000019 | 0.000019 | 0.000019 | 0.000019 | Algorithm_REDUCE_SUM |
5411856859942141273 | 11 | regionprofile | 0.004250 | 0.004250 | 0.004250 | 0.004250 | 0.000020 | 0.000020 | 0.000020 | 0.000020 | Algorithm_REDUCE_SUM | |
7599706706110163426 | 11 | regionprofile | 0.004223 | 0.004223 | 0.004223 | 0.004223 | 0.000018 | 0.000018 | 0.000018 | 0.000018 | Algorithm_REDUCE_SUM | |
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} | 2118176828577267326 | 13 | regionprofile | 0.002683 | 0.002683 | 0.002683 | 0.002683 | 0.002683 | 0.002683 | 0.002683 | 0.002683 | Algorithm_REDUCE_SUM.block_128 |
5411856859942141273 | 13 | regionprofile | 0.002696 | 0.002696 | 0.002696 | 0.002696 | 0.002696 | 0.002696 | 0.002696 | 0.002696 | Algorithm_REDUCE_SUM.block_128 | |
7599706706110163426 | 13 | regionprofile | 0.002678 | 0.002678 | 0.002678 | 0.002678 | 0.002678 | 0.002678 | 0.002678 | 0.002678 | Algorithm_REDUCE_SUM.block_128 | |
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} | 2118176828577267326 | 12 | regionprofile | 0.001518 | 0.001518 | 0.001518 | 0.001518 | 0.001518 | 0.001518 | 0.001518 | 0.001518 | Algorithm_REDUCE_SUM.cub |
5411856859942141273 | 12 | regionprofile | 0.001534 | 0.001534 | 0.001534 | 0.001534 | 0.001534 | 0.001534 | 0.001534 | 0.001534 | Algorithm_REDUCE_SUM.cub | |
7599706706110163426 | 12 | regionprofile | 0.001527 | 0.001527 | 0.001527 | 0.001527 | 0.001527 | 0.001527 | 0.001527 | 0.001527 | Algorithm_REDUCE_SUM.cub | |
{'name': 'Algorithm_SCAN', 'type': 'function'} | 2118176828577267326 | 9 | regionprofile | 0.002940 | 0.002940 | 0.002940 | 0.002940 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Algorithm_SCAN |
5411856859942141273 | 9 | regionprofile | 0.002926 | 0.002926 | 0.002926 | 0.002926 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Algorithm_SCAN | |
7599706706110163426 | 9 | regionprofile | 0.002908 | 0.002908 | 0.002908 | 0.002908 | 0.000007 | 0.000007 | 0.000007 | 0.000007 | Algorithm_SCAN | |
{'name': 'Algorithm_SCAN.default', 'type': 'function'} | 2118176828577267326 | 10 | regionprofile | 0.002932 | 0.002932 | 0.002932 | 0.002932 | 0.002932 | 0.002932 | 0.002932 | 0.002932 | Algorithm_SCAN.default |
5411856859942141273 | 10 | regionprofile | 0.002917 | 0.002917 | 0.002917 | 0.002917 | 0.002917 | 0.002917 | 0.002917 | 0.002917 | Algorithm_SCAN.default | |
7599706706110163426 | 10 | regionprofile | 0.002901 | 0.002901 | 0.002901 | 0.002901 | 0.002901 | 0.002901 | 0.002901 | 0.002901 | Algorithm_SCAN.default | |
{'name': 'Apps', 'type': 'function'} | 2118176828577267326 | 1 | regionprofile | 0.193195 | 0.193195 | 0.193195 | 0.193195 | 0.000093 | 0.000093 | 0.000093 | 0.000093 | Apps |
5411856859942141273 | 1 | regionprofile | 0.193273 | 0.193273 | 0.193273 | 0.193273 | 0.000085 | 0.000085 | 0.000085 | 0.000085 | Apps | |
7599706706110163426 | 1 | regionprofile | 0.193437 | 0.193437 | 0.193437 | 0.193437 | 0.000086 | 0.000086 | 0.000086 | 0.000086 | Apps | |
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} | 2118176828577267326 | 112 | regionprofile | 0.003169 | 0.003169 | 0.003169 | 0.003169 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_CONVECTION3DPA |
5411856859942141273 | 112 | regionprofile | 0.003174 | 0.003174 | 0.003174 | 0.003174 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_CONVECTION3DPA | |
7599706706110163426 | 112 | regionprofile | 0.003165 | 0.003165 | 0.003165 | 0.003165 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_CONVECTION3DPA | |
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} | 2118176828577267326 | 113 | regionprofile | 0.003161 | 0.003161 | 0.003161 | 0.003161 | 0.003161 | 0.003161 | 0.003161 | 0.003161 | Apps_CONVECTION3DPA.block_64 |
5411856859942141273 | 113 | regionprofile | 0.003166 | 0.003166 | 0.003166 | 0.003166 | 0.003166 | 0.003166 | 0.003166 | 0.003166 | Apps_CONVECTION3DPA.block_64 | |
7599706706110163426 | 113 | regionprofile | 0.003157 | 0.003157 | 0.003157 | 0.003157 | 0.003157 | 0.003157 | 0.003157 | 0.003157 | Apps_CONVECTION3DPA.block_64 | |
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} | 2118176828577267326 | 114 | regionprofile | 0.007015 | 0.007015 | 0.007015 | 0.007015 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_DEL_DOT_VEC_2D |
5411856859942141273 | 114 | regionprofile | 0.007069 | 0.007069 | 0.007069 | 0.007069 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_DEL_DOT_VEC_2D | |
7599706706110163426 | 114 | regionprofile | 0.007000 | 0.007000 | 0.007000 | 0.007000 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_DEL_DOT_VEC_2D | |
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} | 2118176828577267326 | 115 | regionprofile | 0.007006 | 0.007006 | 0.007006 | 0.007006 | 0.007006 | 0.007006 | 0.007006 | 0.007006 | Apps_DEL_DOT_VEC_2D.block_128 |
5411856859942141273 | 115 | regionprofile | 0.007060 | 0.007060 | 0.007060 | 0.007060 | 0.007060 | 0.007060 | 0.007060 | 0.007060 | Apps_DEL_DOT_VEC_2D.block_128 | |
7599706706110163426 | 115 | regionprofile | 0.006991 | 0.006991 | 0.006991 | 0.006991 | 0.006991 | 0.006991 | 0.006991 | 0.006991 | Apps_DEL_DOT_VEC_2D.block_128 | |
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} | 2118176828577267326 | 116 | regionprofile | 0.004936 | 0.004936 | 0.004936 | 0.004936 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_DIFFUSION3DPA |
5411856859942141273 | 116 | regionprofile | 0.004932 | 0.004932 | 0.004932 | 0.004932 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_DIFFUSION3DPA | |
7599706706110163426 | 116 | regionprofile | 0.004956 | 0.004956 | 0.004956 | 0.004956 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_DIFFUSION3DPA | |
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} | 2118176828577267326 | 117 | regionprofile | 0.004926 | 0.004926 | 0.004926 | 0.004926 | 0.004926 | 0.004926 | 0.004926 | 0.004926 | Apps_DIFFUSION3DPA.block_64 |
5411856859942141273 | 117 | regionprofile | 0.004924 | 0.004924 | 0.004924 | 0.004924 | 0.004924 | 0.004924 | 0.004924 | 0.004924 | Apps_DIFFUSION3DPA.block_64 | |
7599706706110163426 | 117 | regionprofile | 0.004946 | 0.004946 | 0.004946 | 0.004946 | 0.004946 | 0.004946 | 0.004946 | 0.004946 | Apps_DIFFUSION3DPA.block_64 | |
{'name': 'Apps_ENERGY', 'type': 'function'} | 2118176828577267326 | 118 | regionprofile | 0.039311 | 0.039311 | 0.039311 | 0.039311 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_ENERGY |
5411856859942141273 | 118 | regionprofile | 0.039254 | 0.039254 | 0.039254 | 0.039254 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_ENERGY | |
7599706706110163426 | 118 | regionprofile | 0.039317 | 0.039317 | 0.039317 | 0.039317 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_ENERGY | |
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} | 2118176828577267326 | 119 | regionprofile | 0.039303 | 0.039303 | 0.039303 | 0.039303 | 0.039303 | 0.039303 | 0.039303 | 0.039303 | Apps_ENERGY.block_128 |
5411856859942141273 | 119 | regionprofile | 0.039245 | 0.039245 | 0.039245 | 0.039245 | 0.039245 | 0.039245 | 0.039245 | 0.039245 | Apps_ENERGY.block_128 | |
7599706706110163426 | 119 | regionprofile | 0.039307 | 0.039307 | 0.039307 | 0.039307 | 0.039307 | 0.039307 | 0.039307 | 0.039307 | Apps_ENERGY.block_128 | |
{'name': 'Apps_FIR', 'type': 'function'} | 2118176828577267326 | 120 | regionprofile | 0.004163 | 0.004163 | 0.004163 | 0.004163 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_FIR |
5411856859942141273 | 120 | regionprofile | 0.004167 | 0.004167 | 0.004167 | 0.004167 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_FIR | |
7599706706110163426 | 120 | regionprofile | 0.004195 | 0.004195 | 0.004195 | 0.004195 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_FIR | |
{'name': 'Apps_FIR.block_128', 'type': 'function'} | 2118176828577267326 | 121 | regionprofile | 0.004154 | 0.004154 | 0.004154 | 0.004154 | 0.004154 | 0.004154 | 0.004154 | 0.004154 | Apps_FIR.block_128 |
5411856859942141273 | 121 | regionprofile | 0.004157 | 0.004157 | 0.004157 | 0.004157 | 0.004157 | 0.004157 | 0.004157 | 0.004157 | Apps_FIR.block_128 | |
7599706706110163426 | 121 | regionprofile | 0.004185 | 0.004185 | 0.004185 | 0.004185 | 0.004185 | 0.004185 | 0.004185 | 0.004185 | Apps_FIR.block_128 | |
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} | 2118176828577267326 | 122 | regionprofile | 0.037487 | 0.037487 | 0.037487 | 0.037487 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_HALOEXCHANGE |
5411856859942141273 | 122 | regionprofile | 0.037465 | 0.037465 | 0.037465 | 0.037465 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_HALOEXCHANGE | |
7599706706110163426 | 122 | regionprofile | 0.037805 | 0.037805 | 0.037805 | 0.037805 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_HALOEXCHANGE | |
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} | 2118176828577267326 | 123 | regionprofile | 0.037478 | 0.037478 | 0.037478 | 0.037478 | 0.037478 | 0.037478 | 0.037478 | 0.037478 | Apps_HALOEXCHANGE.block_128 |
5411856859942141273 | 123 | regionprofile | 0.037456 | 0.037456 | 0.037456 | 0.037456 | 0.037456 | 0.037456 | 0.037456 | 0.037456 | Apps_HALOEXCHANGE.block_128 | |
7599706706110163426 | 123 | regionprofile | 0.037796 | 0.037796 | 0.037796 | 0.037796 | 0.037796 | 0.037796 | 0.037796 | 0.037796 | Apps_HALOEXCHANGE.block_128 | |
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} | 2118176828577267326 | 124 | regionprofile | 0.009477 | 0.009477 | 0.009477 | 0.009477 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_HALOEXCHANGE_FUSED |
5411856859942141273 | 124 | regionprofile | 0.009440 | 0.009440 | 0.009440 | 0.009440 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_HALOEXCHANGE_FUSED | |
7599706706110163426 | 124 | regionprofile | 0.009135 | 0.009135 | 0.009135 | 0.009135 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_HALOEXCHANGE_FUSED | |
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} | 2118176828577267326 | 125 | regionprofile | 0.009467 | 0.009467 | 0.009467 | 0.009467 | 0.009467 | 0.009467 | 0.009467 | 0.009467 | Apps_HALOEXCHANGE_FUSED.block_128 |
5411856859942141273 | 125 | regionprofile | 0.009430 | 0.009430 | 0.009430 | 0.009430 | 0.009430 | 0.009430 | 0.009430 | 0.009430 | Apps_HALOEXCHANGE_FUSED.block_128 | |
7599706706110163426 | 125 | regionprofile | 0.009125 | 0.009125 | 0.009125 | 0.009125 | 0.009125 | 0.009125 | 0.009125 | 0.009125 | Apps_HALOEXCHANGE_FUSED.block_128 | |
{'name': 'Apps_LTIMES', 'type': 'function'} | 2118176828577267326 | 126 | regionprofile | 0.012112 | 0.012112 | 0.012112 | 0.012112 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_LTIMES |
5411856859942141273 | 126 | regionprofile | 0.012087 | 0.012087 | 0.012087 | 0.012087 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_LTIMES | |
7599706706110163426 | 126 | regionprofile | 0.012097 | 0.012097 | 0.012097 | 0.012097 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_LTIMES | |
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} | 2118176828577267326 | 127 | regionprofile | 0.012104 | 0.012104 | 0.012104 | 0.012104 | 0.012104 | 0.012104 | 0.012104 | 0.012104 | Apps_LTIMES.block_128 |
5411856859942141273 | 127 | regionprofile | 0.012079 | 0.012079 | 0.012079 | 0.012079 | 0.012079 | 0.012079 | 0.012079 | 0.012079 | Apps_LTIMES.block_128 | |
7599706706110163426 | 127 | regionprofile | 0.012088 | 0.012088 | 0.012088 | 0.012088 | 0.012088 | 0.012088 | 0.012088 | 0.012088 | Apps_LTIMES.block_128 | |
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} | 2118176828577267326 | 128 | regionprofile | 0.012095 | 0.012095 | 0.012095 | 0.012095 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_LTIMES_NOVIEW |
5411856859942141273 | 128 | regionprofile | 0.012091 | 0.012091 | 0.012091 | 0.012091 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Apps_LTIMES_NOVIEW | |
7599706706110163426 | 128 | regionprofile | 0.012106 | 0.012106 | 0.012106 | 0.012106 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Apps_LTIMES_NOVIEW | |
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} | 2118176828577267326 | 129 | regionprofile | 0.012085 | 0.012085 | 0.012085 | 0.012085 | 0.012085 | 0.012085 | 0.012085 | 0.012085 | Apps_LTIMES_NOVIEW.block_128 |
5411856859942141273 | 129 | regionprofile | 0.012080 | 0.012080 | 0.012080 | 0.012080 | 0.012080 | 0.012080 | 0.012080 | 0.012080 | Apps_LTIMES_NOVIEW.block_128 | |
7599706706110163426 | 129 | regionprofile | 0.012095 | 0.012095 | 0.012095 | 0.012095 | 0.012095 | 0.012095 | 0.012095 | 0.012095 | Apps_LTIMES_NOVIEW.block_128 | |
{'name': 'Apps_MASS3DPA', 'type': 'function'} | 2118176828577267326 | 130 | regionprofile | 0.001910 | 0.001910 | 0.001910 | 0.001910 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_MASS3DPA |
5411856859942141273 | 130 | regionprofile | 0.001920 | 0.001920 | 0.001920 | 0.001920 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_MASS3DPA | |
7599706706110163426 | 130 | regionprofile | 0.001912 | 0.001912 | 0.001912 | 0.001912 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_MASS3DPA | |
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} | 2118176828577267326 | 131 | regionprofile | 0.001902 | 0.001902 | 0.001902 | 0.001902 | 0.001902 | 0.001902 | 0.001902 | 0.001902 | Apps_MASS3DPA.block_25 |
5411856859942141273 | 131 | regionprofile | 0.001912 | 0.001912 | 0.001912 | 0.001912 | 0.001912 | 0.001912 | 0.001912 | 0.001912 | Apps_MASS3DPA.block_25 | |
7599706706110163426 | 131 | regionprofile | 0.001904 | 0.001904 | 0.001904 | 0.001904 | 0.001904 | 0.001904 | 0.001904 | 0.001904 | Apps_MASS3DPA.block_25 | |
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} | 2118176828577267326 | 2 | regionprofile | 0.007415 | 0.007415 | 0.007415 | 0.007415 | 0.000007 | 0.000007 | 0.000007 | 0.000007 | Apps_NODAL_ACCUMULATION_3D |
5411856859942141273 | 2 | regionprofile | 0.007454 | 0.007454 | 0.007454 | 0.007454 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_NODAL_ACCUMULATION_3D | |
7599706706110163426 | 2 | regionprofile | 0.007358 | 0.007358 | 0.007358 | 0.007358 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_NODAL_ACCUMULATION_3D | |
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} | 2118176828577267326 | 3 | regionprofile | 0.007408 | 0.007408 | 0.007408 | 0.007408 | 0.007408 | 0.007408 | 0.007408 | 0.007408 | Apps_NODAL_ACCUMULATION_3D.block_128 |
5411856859942141273 | 3 | regionprofile | 0.007445 | 0.007445 | 0.007445 | 0.007445 | 0.007445 | 0.007445 | 0.007445 | 0.007445 | Apps_NODAL_ACCUMULATION_3D.block_128 | |
7599706706110163426 | 3 | regionprofile | 0.007349 | 0.007349 | 0.007349 | 0.007349 | 0.007349 | 0.007349 | 0.007349 | 0.007349 | Apps_NODAL_ACCUMULATION_3D.block_128 | |
{'name': 'Apps_PRESSURE', 'type': 'function'} | 2118176828577267326 | 4 | regionprofile | 0.048260 | 0.048260 | 0.048260 | 0.048260 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_PRESSURE |
5411856859942141273 | 4 | regionprofile | 0.048222 | 0.048222 | 0.048222 | 0.048222 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_PRESSURE | |
7599706706110163426 | 4 | regionprofile | 0.048312 | 0.048312 | 0.048312 | 0.048312 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_PRESSURE | |
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} | 2118176828577267326 | 5 | regionprofile | 0.048251 | 0.048251 | 0.048251 | 0.048251 | 0.048251 | 0.048251 | 0.048251 | 0.048251 | Apps_PRESSURE.block_128 |
5411856859942141273 | 5 | regionprofile | 0.048212 | 0.048212 | 0.048212 | 0.048212 | 0.048212 | 0.048212 | 0.048212 | 0.048212 | Apps_PRESSURE.block_128 | |
7599706706110163426 | 5 | regionprofile | 0.048303 | 0.048303 | 0.048303 | 0.048303 | 0.048303 | 0.048303 | 0.048303 | 0.048303 | Apps_PRESSURE.block_128 | |
{'name': 'Apps_VOL3D', 'type': 'function'} | 2118176828577267326 | 6 | regionprofile | 0.005752 | 0.005752 | 0.005752 | 0.005752 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Apps_VOL3D |
5411856859942141273 | 6 | regionprofile | 0.005913 | 0.005913 | 0.005913 | 0.005913 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Apps_VOL3D | |
7599706706110163426 | 6 | regionprofile | 0.005993 | 0.005993 | 0.005993 | 0.005993 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Apps_VOL3D | |
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} | 2118176828577267326 | 7 | regionprofile | 0.005743 | 0.005743 | 0.005743 | 0.005743 | 0.005743 | 0.005743 | 0.005743 | 0.005743 | Apps_VOL3D.block_128 |
5411856859942141273 | 7 | regionprofile | 0.005903 | 0.005903 | 0.005903 | 0.005903 | 0.005903 | 0.005903 | 0.005903 | 0.005903 | Apps_VOL3D.block_128 | |
7599706706110163426 | 7 | regionprofile | 0.005985 | 0.005985 | 0.005985 | 0.005985 | 0.005985 | 0.005985 | 0.005985 | 0.005985 | Apps_VOL3D.block_128 | |
{'name': 'Basic', 'type': 'function'} | 2118176828577267326 | 16 | regionprofile | 0.361976 | 0.361976 | 0.361976 | 0.361976 | 0.000096 | 0.000096 | 0.000096 | 0.000096 | Basic |
5411856859942141273 | 16 | regionprofile | 0.332863 | 0.332863 | 0.332863 | 0.332863 | 0.000101 | 0.000101 | 0.000101 | 0.000101 | Basic | |
7599706706110163426 | 16 | regionprofile | 0.362643 | 0.362643 | 0.362643 | 0.362643 | 0.000097 | 0.000097 | 0.000097 | 0.000097 | Basic | |
{'name': 'Basic_DAXPY', 'type': 'function'} | 2118176828577267326 | 18 | regionprofile | 0.016935 | 0.016935 | 0.016935 | 0.016935 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Basic_DAXPY |
5411856859942141273 | 18 | regionprofile | 0.016840 | 0.016840 | 0.016840 | 0.016840 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Basic_DAXPY | |
7599706706110163426 | 18 | regionprofile | 0.016959 | 0.016959 | 0.016959 | 0.016959 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Basic_DAXPY | |
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} | 2118176828577267326 | 20 | regionprofile | 0.016924 | 0.016924 | 0.016924 | 0.016924 | 0.016924 | 0.016924 | 0.016924 | 0.016924 | Basic_DAXPY.block_128 |
5411856859942141273 | 20 | regionprofile | 0.016830 | 0.016830 | 0.016830 | 0.016830 | 0.016830 | 0.016830 | 0.016830 | 0.016830 | Basic_DAXPY.block_128 | |
7599706706110163426 | 20 | regionprofile | 0.016948 | 0.016948 | 0.016948 | 0.016948 | 0.016948 | 0.016948 | 0.016948 | 0.016948 | Basic_DAXPY.block_128 | |
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} | 2118176828577267326 | 22 | regionprofile | 0.016983 | 0.016983 | 0.016983 | 0.016983 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_DAXPY_ATOMIC |
5411856859942141273 | 22 | regionprofile | 0.016901 | 0.016901 | 0.016901 | 0.016901 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_DAXPY_ATOMIC | |
7599706706110163426 | 22 | regionprofile | 0.016956 | 0.016956 | 0.016956 | 0.016956 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_DAXPY_ATOMIC | |
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} | 2118176828577267326 | 24 | regionprofile | 0.016974 | 0.016974 | 0.016974 | 0.016974 | 0.016974 | 0.016974 | 0.016974 | 0.016974 | Basic_DAXPY_ATOMIC.block_128 |
5411856859942141273 | 24 | regionprofile | 0.016892 | 0.016892 | 0.016892 | 0.016892 | 0.016892 | 0.016892 | 0.016892 | 0.016892 | Basic_DAXPY_ATOMIC.block_128 | |
7599706706110163426 | 24 | regionprofile | 0.016947 | 0.016947 | 0.016947 | 0.016947 | 0.016947 | 0.016947 | 0.016947 | 0.016947 | Basic_DAXPY_ATOMIC.block_128 | |
{'name': 'Basic_IF_QUAD', 'type': 'function'} | 2118176828577267326 | 25 | regionprofile | 0.012748 | 0.012748 | 0.012748 | 0.012748 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_IF_QUAD |
5411856859942141273 | 25 | regionprofile | 0.012599 | 0.012599 | 0.012599 | 0.012599 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_IF_QUAD | |
7599706706110163426 | 25 | regionprofile | 0.012630 | 0.012630 | 0.012630 | 0.012630 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_IF_QUAD | |
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} | 2118176828577267326 | 26 | regionprofile | 0.012740 | 0.012740 | 0.012740 | 0.012740 | 0.012740 | 0.012740 | 0.012740 | 0.012740 | Basic_IF_QUAD.block_128 |
5411856859942141273 | 26 | regionprofile | 0.012590 | 0.012590 | 0.012590 | 0.012590 | 0.012590 | 0.012590 | 0.012590 | 0.012590 | Basic_IF_QUAD.block_128 | |
7599706706110163426 | 26 | regionprofile | 0.012622 | 0.012622 | 0.012622 | 0.012622 | 0.012622 | 0.012622 | 0.012622 | 0.012622 | Basic_IF_QUAD.block_128 | |
{'name': 'Basic_INDEXLIST', 'type': 'function'} | 2118176828577267326 | 27 | regionprofile | 0.004529 | 0.004529 | 0.004529 | 0.004529 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Basic_INDEXLIST |
5411856859942141273 | 27 | regionprofile | 0.004284 | 0.004284 | 0.004284 | 0.004284 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_INDEXLIST | |
7599706706110163426 | 27 | regionprofile | 0.004514 | 0.004514 | 0.004514 | 0.004514 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_INDEXLIST | |
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} | 2118176828577267326 | 28 | regionprofile | 0.004519 | 0.004519 | 0.004519 | 0.004519 | 0.004519 | 0.004519 | 0.004519 | 0.004519 | Basic_INDEXLIST.block_256 |
5411856859942141273 | 28 | regionprofile | 0.004276 | 0.004276 | 0.004276 | 0.004276 | 0.004276 | 0.004276 | 0.004276 | 0.004276 | Basic_INDEXLIST.block_256 | |
7599706706110163426 | 28 | regionprofile | 0.004506 | 0.004506 | 0.004506 | 0.004506 | 0.004506 | 0.004506 | 0.004506 | 0.004506 | Basic_INDEXLIST.block_256 | |
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} | 2118176828577267326 | 29 | regionprofile | 0.008203 | 0.008203 | 0.008203 | 0.008203 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INDEXLIST_3LOOP |
5411856859942141273 | 29 | regionprofile | 0.008037 | 0.008037 | 0.008037 | 0.008037 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INDEXLIST_3LOOP | |
7599706706110163426 | 29 | regionprofile | 0.008184 | 0.008184 | 0.008184 | 0.008184 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INDEXLIST_3LOOP | |
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} | 2118176828577267326 | 30 | regionprofile | 0.008194 | 0.008194 | 0.008194 | 0.008194 | 0.008194 | 0.008194 | 0.008194 | 0.008194 | Basic_INDEXLIST_3LOOP.block_256 |
5411856859942141273 | 30 | regionprofile | 0.008028 | 0.008028 | 0.008028 | 0.008028 | 0.008028 | 0.008028 | 0.008028 | 0.008028 | Basic_INDEXLIST_3LOOP.block_256 | |
7599706706110163426 | 30 | regionprofile | 0.008175 | 0.008175 | 0.008175 | 0.008175 | 0.008175 | 0.008175 | 0.008175 | 0.008175 | Basic_INDEXLIST_3LOOP.block_256 | |
{'name': 'Basic_INIT3', 'type': 'function'} | 2118176828577267326 | 31 | regionprofile | 0.028361 | 0.028361 | 0.028361 | 0.028361 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INIT3 |
5411856859942141273 | 31 | regionprofile | 0.028205 | 0.028205 | 0.028205 | 0.028205 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_INIT3 | |
7599706706110163426 | 31 | regionprofile | 0.028205 | 0.028205 | 0.028205 | 0.028205 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INIT3 | |
{'name': 'Basic_INIT3.block_128', 'type': 'function'} | 2118176828577267326 | 32 | regionprofile | 0.028352 | 0.028352 | 0.028352 | 0.028352 | 0.028352 | 0.028352 | 0.028352 | 0.028352 | Basic_INIT3.block_128 |
5411856859942141273 | 32 | regionprofile | 0.028197 | 0.028197 | 0.028197 | 0.028197 | 0.028197 | 0.028197 | 0.028197 | 0.028197 | Basic_INIT3.block_128 | |
7599706706110163426 | 32 | regionprofile | 0.028196 | 0.028196 | 0.028196 | 0.028196 | 0.028196 | 0.028196 | 0.028196 | 0.028196 | Basic_INIT3.block_128 | |
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} | 2118176828577267326 | 33 | regionprofile | 0.042035 | 0.042035 | 0.042035 | 0.042035 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INIT_VIEW1D |
5411856859942141273 | 33 | regionprofile | 0.036596 | 0.036596 | 0.036596 | 0.036596 | 0.000007 | 0.000007 | 0.000007 | 0.000007 | Basic_INIT_VIEW1D | |
7599706706110163426 | 33 | regionprofile | 0.041789 | 0.041789 | 0.041789 | 0.041789 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INIT_VIEW1D | |
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} | 2118176828577267326 | 34 | regionprofile | 0.042026 | 0.042026 | 0.042026 | 0.042026 | 0.042026 | 0.042026 | 0.042026 | 0.042026 | Basic_INIT_VIEW1D.block_128 |
5411856859942141273 | 34 | regionprofile | 0.036589 | 0.036589 | 0.036589 | 0.036589 | 0.036589 | 0.036589 | 0.036589 | 0.036589 | Basic_INIT_VIEW1D.block_128 | |
7599706706110163426 | 34 | regionprofile | 0.041780 | 0.041780 | 0.041780 | 0.041780 | 0.041780 | 0.041780 | 0.041780 | 0.041780 | Basic_INIT_VIEW1D.block_128 | |
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} | 2118176828577267326 | 35 | regionprofile | 0.042047 | 0.042047 | 0.042047 | 0.042047 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INIT_VIEW1D_OFFSET |
5411856859942141273 | 35 | regionprofile | 0.036612 | 0.036612 | 0.036612 | 0.036612 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INIT_VIEW1D_OFFSET | |
7599706706110163426 | 35 | regionprofile | 0.041804 | 0.041804 | 0.041804 | 0.041804 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_INIT_VIEW1D_OFFSET | |
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} | 2118176828577267326 | 36 | regionprofile | 0.042038 | 0.042038 | 0.042038 | 0.042038 | 0.042038 | 0.042038 | 0.042038 | 0.042038 | Basic_INIT_VIEW1D_OFFSET.block_128 |
5411856859942141273 | 36 | regionprofile | 0.036603 | 0.036603 | 0.036603 | 0.036603 | 0.036603 | 0.036603 | 0.036603 | 0.036603 | Basic_INIT_VIEW1D_OFFSET.block_128 | |
7599706706110163426 | 36 | regionprofile | 0.041795 | 0.041795 | 0.041795 | 0.041795 | 0.041795 | 0.041795 | 0.041795 | 0.041795 | Basic_INIT_VIEW1D_OFFSET.block_128 | |
{'name': 'Basic_MULADDSUB', 'type': 'function'} | 2118176828577267326 | 37 | regionprofile | 0.019727 | 0.019727 | 0.019727 | 0.019727 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Basic_MULADDSUB |
5411856859942141273 | 37 | regionprofile | 0.019762 | 0.019762 | 0.019762 | 0.019762 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Basic_MULADDSUB | |
7599706706110163426 | 37 | regionprofile | 0.019756 | 0.019756 | 0.019756 | 0.019756 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Basic_MULADDSUB | |
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} | 2118176828577267326 | 38 | regionprofile | 0.019716 | 0.019716 | 0.019716 | 0.019716 | 0.019716 | 0.019716 | 0.019716 | 0.019716 | Basic_MULADDSUB.block_128 |
5411856859942141273 | 38 | regionprofile | 0.019751 | 0.019751 | 0.019751 | 0.019751 | 0.019751 | 0.019751 | 0.019751 | 0.019751 | Basic_MULADDSUB.block_128 | |
7599706706110163426 | 38 | regionprofile | 0.019746 | 0.019746 | 0.019746 | 0.019746 | 0.019746 | 0.019746 | 0.019746 | 0.019746 | Basic_MULADDSUB.block_128 | |
{'name': 'Basic_NESTED_INIT', 'type': 'function'} | 2118176828577267326 | 39 | regionprofile | 0.020790 | 0.020790 | 0.020790 | 0.020790 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_NESTED_INIT |
5411856859942141273 | 39 | regionprofile | 0.018093 | 0.018093 | 0.018093 | 0.018093 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_NESTED_INIT | |
7599706706110163426 | 39 | regionprofile | 0.020679 | 0.020679 | 0.020679 | 0.020679 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_NESTED_INIT | |
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} | 2118176828577267326 | 40 | regionprofile | 0.020782 | 0.020782 | 0.020782 | 0.020782 | 0.020782 | 0.020782 | 0.020782 | 0.020782 | Basic_NESTED_INIT.block_128 |
5411856859942141273 | 40 | regionprofile | 0.018084 | 0.018084 | 0.018084 | 0.018084 | 0.018084 | 0.018084 | 0.018084 | 0.018084 | Basic_NESTED_INIT.block_128 | |
7599706706110163426 | 40 | regionprofile | 0.020670 | 0.020670 | 0.020670 | 0.020670 | 0.020670 | 0.020670 | 0.020670 | 0.020670 | Basic_NESTED_INIT.block_128 | |
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} | 2118176828577267326 | 41 | regionprofile | 0.124121 | 0.124121 | 0.124121 | 0.124121 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_PI_ATOMIC |
5411856859942141273 | 41 | regionprofile | 0.110341 | 0.110341 | 0.110341 | 0.110341 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_PI_ATOMIC | |
7599706706110163426 | 41 | regionprofile | 0.126655 | 0.126655 | 0.126655 | 0.126655 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_PI_ATOMIC | |
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} | 2118176828577267326 | 42 | regionprofile | 0.124113 | 0.124113 | 0.124113 | 0.124113 | 0.124113 | 0.124113 | 0.124113 | 0.124113 | Basic_PI_ATOMIC.block_128 |
5411856859942141273 | 42 | regionprofile | 0.110332 | 0.110332 | 0.110332 | 0.110332 | 0.110332 | 0.110332 | 0.110332 | 0.110332 | Basic_PI_ATOMIC.block_128 | |
7599706706110163426 | 42 | regionprofile | 0.126646 | 0.126646 | 0.126646 | 0.126646 | 0.126646 | 0.126646 | 0.126646 | 0.126646 | Basic_PI_ATOMIC.block_128 | |
{'name': 'Basic_PI_REDUCE', 'type': 'function'} | 2118176828577267326 | 43 | regionprofile | 0.002610 | 0.002610 | 0.002610 | 0.002610 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_PI_REDUCE |
5411856859942141273 | 43 | regionprofile | 0.002586 | 0.002586 | 0.002586 | 0.002586 | 0.000007 | 0.000007 | 0.000007 | 0.000007 | Basic_PI_REDUCE | |
7599706706110163426 | 43 | regionprofile | 0.002685 | 0.002685 | 0.002685 | 0.002685 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_PI_REDUCE | |
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} | 2118176828577267326 | 44 | regionprofile | 0.002602 | 0.002602 | 0.002602 | 0.002602 | 0.002602 | 0.002602 | 0.002602 | 0.002602 | Basic_PI_REDUCE.block_128 |
5411856859942141273 | 44 | regionprofile | 0.002579 | 0.002579 | 0.002579 | 0.002579 | 0.002579 | 0.002579 | 0.002579 | 0.002579 | Basic_PI_REDUCE.block_128 | |
7599706706110163426 | 44 | regionprofile | 0.002677 | 0.002677 | 0.002677 | 0.002677 | 0.002677 | 0.002677 | 0.002677 | 0.002677 | Basic_PI_REDUCE.block_128 | |
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} | 2118176828577267326 | 45 | regionprofile | 0.002468 | 0.002468 | 0.002468 | 0.002468 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_REDUCE3_INT |
5411856859942141273 | 45 | regionprofile | 0.002380 | 0.002380 | 0.002380 | 0.002380 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_REDUCE3_INT | |
7599706706110163426 | 45 | regionprofile | 0.002388 | 0.002388 | 0.002388 | 0.002388 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_REDUCE3_INT | |
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} | 2118176828577267326 | 46 | regionprofile | 0.002460 | 0.002460 | 0.002460 | 0.002460 | 0.002460 | 0.002460 | 0.002460 | 0.002460 | Basic_REDUCE3_INT.block_128 |
5411856859942141273 | 46 | regionprofile | 0.002372 | 0.002372 | 0.002372 | 0.002372 | 0.002372 | 0.002372 | 0.002372 | 0.002372 | Basic_REDUCE3_INT.block_128 | |
7599706706110163426 | 46 | regionprofile | 0.002380 | 0.002380 | 0.002380 | 0.002380 | 0.002380 | 0.002380 | 0.002380 | 0.002380 | Basic_REDUCE3_INT.block_128 | |
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} | 2118176828577267326 | 47 | regionprofile | 0.017720 | 0.017720 | 0.017720 | 0.017720 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_REDUCE_STRUCT |
5411856859942141273 | 47 | regionprofile | 0.016925 | 0.016925 | 0.016925 | 0.016925 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_REDUCE_STRUCT | |
7599706706110163426 | 47 | regionprofile | 0.016720 | 0.016720 | 0.016720 | 0.016720 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Basic_REDUCE_STRUCT | |
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} | 2118176828577267326 | 48 | regionprofile | 0.017712 | 0.017712 | 0.017712 | 0.017712 | 0.017712 | 0.017712 | 0.017712 | 0.017712 | Basic_REDUCE_STRUCT.block_128 |
5411856859942141273 | 48 | regionprofile | 0.016917 | 0.016917 | 0.016917 | 0.016917 | 0.016917 | 0.016917 | 0.016917 | 0.016917 | Basic_REDUCE_STRUCT.block_128 | |
7599706706110163426 | 48 | regionprofile | 0.016711 | 0.016711 | 0.016711 | 0.016711 | 0.016711 | 0.016711 | 0.016711 | 0.016711 | Basic_REDUCE_STRUCT.block_128 | |
{'name': 'Basic_TRAP_INT', 'type': 'function'} | 2118176828577267326 | 49 | regionprofile | 0.002603 | 0.002603 | 0.002603 | 0.002603 | 0.000007 | 0.000007 | 0.000007 | 0.000007 | Basic_TRAP_INT |
5411856859942141273 | 49 | regionprofile | 0.002601 | 0.002601 | 0.002601 | 0.002601 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Basic_TRAP_INT | |
7599706706110163426 | 49 | regionprofile | 0.002622 | 0.002622 | 0.002622 | 0.002622 | 0.000007 | 0.000007 | 0.000007 | 0.000007 | Basic_TRAP_INT | |
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} | 2118176828577267326 | 50 | regionprofile | 0.002596 | 0.002596 | 0.002596 | 0.002596 | 0.002596 | 0.002596 | 0.002596 | 0.002596 | Basic_TRAP_INT.block_128 |
5411856859942141273 | 50 | regionprofile | 0.002593 | 0.002593 | 0.002593 | 0.002593 | 0.002593 | 0.002593 | 0.002593 | 0.002593 | Basic_TRAP_INT.block_128 | |
7599706706110163426 | 50 | regionprofile | 0.002615 | 0.002615 | 0.002615 | 0.002615 | 0.002615 | 0.002615 | 0.002615 | 0.002615 | Basic_TRAP_INT.block_128 | |
{'name': 'Lcals', 'type': 'function'} | 2118176828577267326 | 51 | regionprofile | 0.430549 | 0.430549 | 0.430549 | 0.430549 | 0.000071 | 0.000071 | 0.000071 | 0.000071 | Lcals |
5411856859942141273 | 51 | regionprofile | 0.460418 | 0.460418 | 0.460418 | 0.460418 | 0.000080 | 0.000080 | 0.000080 | 0.000080 | Lcals | |
7599706706110163426 | 51 | regionprofile | 0.427691 | 0.427691 | 0.427691 | 0.427691 | 0.000074 | 0.000074 | 0.000074 | 0.000074 | Lcals | |
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} | 2118176828577267326 | 52 | regionprofile | 0.061532 | 0.061532 | 0.061532 | 0.061532 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_DIFF_PREDICT |
5411856859942141273 | 52 | regionprofile | 0.060504 | 0.060504 | 0.060504 | 0.060504 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_DIFF_PREDICT | |
7599706706110163426 | 52 | regionprofile | 0.061569 | 0.061569 | 0.061569 | 0.061569 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_DIFF_PREDICT | |
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} | 2118176828577267326 | 53 | regionprofile | 0.061523 | 0.061523 | 0.061523 | 0.061523 | 0.061523 | 0.061523 | 0.061523 | 0.061523 | Lcals_DIFF_PREDICT.block_128 |
5411856859942141273 | 53 | regionprofile | 0.060494 | 0.060494 | 0.060494 | 0.060494 | 0.060494 | 0.060494 | 0.060494 | 0.060494 | Lcals_DIFF_PREDICT.block_128 | |
7599706706110163426 | 53 | regionprofile | 0.061560 | 0.061560 | 0.061560 | 0.061560 | 0.061560 | 0.061560 | 0.061560 | 0.061560 | Lcals_DIFF_PREDICT.block_128 | |
{'name': 'Lcals_EOS', 'type': 'function'} | 2118176828577267326 | 54 | regionprofile | 0.022710 | 0.022710 | 0.022710 | 0.022710 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_EOS |
5411856859942141273 | 54 | regionprofile | 0.022695 | 0.022695 | 0.022695 | 0.022695 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_EOS | |
7599706706110163426 | 54 | regionprofile | 0.022684 | 0.022684 | 0.022684 | 0.022684 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_EOS | |
{'name': 'Lcals_EOS.block_128', 'type': 'function'} | 2118176828577267326 | 55 | regionprofile | 0.022701 | 0.022701 | 0.022701 | 0.022701 | 0.022701 | 0.022701 | 0.022701 | 0.022701 | Lcals_EOS.block_128 |
5411856859942141273 | 55 | regionprofile | 0.022685 | 0.022685 | 0.022685 | 0.022685 | 0.022685 | 0.022685 | 0.022685 | 0.022685 | Lcals_EOS.block_128 | |
7599706706110163426 | 55 | regionprofile | 0.022674 | 0.022674 | 0.022674 | 0.022674 | 0.022674 | 0.022674 | 0.022674 | 0.022674 | Lcals_EOS.block_128 | |
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} | 2118176828577267326 | 56 | regionprofile | 0.048464 | 0.048464 | 0.048464 | 0.048464 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_FIRST_DIFF |
5411856859942141273 | 56 | regionprofile | 0.048488 | 0.048488 | 0.048488 | 0.048488 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Lcals_FIRST_DIFF | |
7599706706110163426 | 56 | regionprofile | 0.048353 | 0.048353 | 0.048353 | 0.048353 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_FIRST_DIFF | |
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} | 2118176828577267326 | 57 | regionprofile | 0.048455 | 0.048455 | 0.048455 | 0.048455 | 0.048455 | 0.048455 | 0.048455 | 0.048455 | Lcals_FIRST_DIFF.block_128 |
5411856859942141273 | 57 | regionprofile | 0.048480 | 0.048480 | 0.048480 | 0.048480 | 0.048480 | 0.048480 | 0.048480 | 0.048480 | Lcals_FIRST_DIFF.block_128 | |
7599706706110163426 | 57 | regionprofile | 0.048344 | 0.048344 | 0.048344 | 0.048344 | 0.048344 | 0.048344 | 0.048344 | 0.048344 | Lcals_FIRST_DIFF.block_128 | |
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} | 2118176828577267326 | 58 | regionprofile | 0.048732 | 0.048732 | 0.048732 | 0.048732 | 0.000019 | 0.000019 | 0.000019 | 0.000019 | Lcals_FIRST_MIN |
5411856859942141273 | 58 | regionprofile | 0.079664 | 0.079664 | 0.079664 | 0.079664 | 0.000012 | 0.000012 | 0.000012 | 0.000012 | Lcals_FIRST_MIN | |
7599706706110163426 | 58 | regionprofile | 0.046295 | 0.046295 | 0.046295 | 0.046295 | 0.000013 | 0.000013 | 0.000013 | 0.000013 | Lcals_FIRST_MIN | |
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} | 2118176828577267326 | 59 | regionprofile | 0.048713 | 0.048713 | 0.048713 | 0.048713 | 0.048713 | 0.048713 | 0.048713 | 0.048713 | Lcals_FIRST_MIN.block_128 |
5411856859942141273 | 59 | regionprofile | 0.079652 | 0.079652 | 0.079652 | 0.079652 | 0.079652 | 0.079652 | 0.079652 | 0.079652 | Lcals_FIRST_MIN.block_128 | |
7599706706110163426 | 59 | regionprofile | 0.046282 | 0.046282 | 0.046282 | 0.046282 | 0.046282 | 0.046282 | 0.046282 | 0.046282 | Lcals_FIRST_MIN.block_128 | |
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} | 2118176828577267326 | 60 | regionprofile | 0.048529 | 0.048529 | 0.048529 | 0.048529 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_FIRST_SUM |
5411856859942141273 | 60 | regionprofile | 0.048525 | 0.048525 | 0.048525 | 0.048525 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_FIRST_SUM | |
7599706706110163426 | 60 | regionprofile | 0.048453 | 0.048453 | 0.048453 | 0.048453 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_FIRST_SUM | |
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} | 2118176828577267326 | 61 | regionprofile | 0.048519 | 0.048519 | 0.048519 | 0.048519 | 0.048519 | 0.048519 | 0.048519 | 0.048519 | Lcals_FIRST_SUM.block_128 |
5411856859942141273 | 61 | regionprofile | 0.048516 | 0.048516 | 0.048516 | 0.048516 | 0.048516 | 0.048516 | 0.048516 | 0.048516 | Lcals_FIRST_SUM.block_128 | |
7599706706110163426 | 61 | regionprofile | 0.048444 | 0.048444 | 0.048444 | 0.048444 | 0.048444 | 0.048444 | 0.048444 | 0.048444 | Lcals_FIRST_SUM.block_128 | |
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} | 2118176828577267326 | 62 | regionprofile | 0.048873 | 0.048873 | 0.048873 | 0.048873 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_GEN_LIN_RECUR |
5411856859942141273 | 62 | regionprofile | 0.049033 | 0.049033 | 0.049033 | 0.049033 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_GEN_LIN_RECUR | |
7599706706110163426 | 62 | regionprofile | 0.048874 | 0.048874 | 0.048874 | 0.048874 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_GEN_LIN_RECUR | |
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} | 2118176828577267326 | 63 | regionprofile | 0.048864 | 0.048864 | 0.048864 | 0.048864 | 0.048864 | 0.048864 | 0.048864 | 0.048864 | Lcals_GEN_LIN_RECUR.block_128 |
5411856859942141273 | 63 | regionprofile | 0.049024 | 0.049024 | 0.049024 | 0.049024 | 0.049024 | 0.049024 | 0.049024 | 0.049024 | Lcals_GEN_LIN_RECUR.block_128 | |
7599706706110163426 | 63 | regionprofile | 0.048865 | 0.048865 | 0.048865 | 0.048865 | 0.048865 | 0.048865 | 0.048865 | 0.048865 | Lcals_GEN_LIN_RECUR.block_128 | |
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} | 2118176828577267326 | 64 | regionprofile | 0.033847 | 0.033847 | 0.033847 | 0.033847 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_HYDRO_1D |
5411856859942141273 | 64 | regionprofile | 0.034110 | 0.034110 | 0.034110 | 0.034110 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_HYDRO_1D | |
7599706706110163426 | 64 | regionprofile | 0.033981 | 0.033981 | 0.033981 | 0.033981 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_HYDRO_1D | |
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} | 2118176828577267326 | 65 | regionprofile | 0.033838 | 0.033838 | 0.033838 | 0.033838 | 0.033838 | 0.033838 | 0.033838 | 0.033838 | Lcals_HYDRO_1D.block_128 |
5411856859942141273 | 65 | regionprofile | 0.034101 | 0.034101 | 0.034101 | 0.034101 | 0.034101 | 0.034101 | 0.034101 | 0.034101 | Lcals_HYDRO_1D.block_128 | |
7599706706110163426 | 65 | regionprofile | 0.033971 | 0.033971 | 0.033971 | 0.033971 | 0.033971 | 0.033971 | 0.033971 | 0.033971 | Lcals_HYDRO_1D.block_128 | |
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} | 2118176828577267326 | 66 | regionprofile | 0.022991 | 0.022991 | 0.022991 | 0.022991 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Lcals_HYDRO_2D |
5411856859942141273 | 66 | regionprofile | 0.023084 | 0.023084 | 0.023084 | 0.023084 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_HYDRO_2D | |
7599706706110163426 | 66 | regionprofile | 0.023005 | 0.023005 | 0.023005 | 0.023005 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_HYDRO_2D | |
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} | 2118176828577267326 | 67 | regionprofile | 0.022980 | 0.022980 | 0.022980 | 0.022980 | 0.022980 | 0.022980 | 0.022980 | 0.022980 | Lcals_HYDRO_2D.block_128 |
5411856859942141273 | 67 | regionprofile | 0.023074 | 0.023074 | 0.023074 | 0.023074 | 0.023074 | 0.023074 | 0.023074 | 0.023074 | Lcals_HYDRO_2D.block_128 | |
7599706706110163426 | 67 | regionprofile | 0.022995 | 0.022995 | 0.022995 | 0.022995 | 0.022995 | 0.022995 | 0.022995 | 0.022995 | Lcals_HYDRO_2D.block_128 | |
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} | 2118176828577267326 | 68 | regionprofile | 0.046886 | 0.046886 | 0.046886 | 0.046886 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_INT_PREDICT |
5411856859942141273 | 68 | regionprofile | 0.046884 | 0.046884 | 0.046884 | 0.046884 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_INT_PREDICT | |
7599706706110163426 | 68 | regionprofile | 0.046948 | 0.046948 | 0.046948 | 0.046948 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Lcals_INT_PREDICT | |
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} | 2118176828577267326 | 69 | regionprofile | 0.046876 | 0.046876 | 0.046876 | 0.046876 | 0.046876 | 0.046876 | 0.046876 | 0.046876 | Lcals_INT_PREDICT.block_128 |
5411856859942141273 | 69 | regionprofile | 0.046874 | 0.046874 | 0.046874 | 0.046874 | 0.046874 | 0.046874 | 0.046874 | 0.046874 | Lcals_INT_PREDICT.block_128 | |
7599706706110163426 | 69 | regionprofile | 0.046939 | 0.046939 | 0.046939 | 0.046939 | 0.046939 | 0.046939 | 0.046939 | 0.046939 | Lcals_INT_PREDICT.block_128 | |
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} | 2118176828577267326 | 70 | regionprofile | 0.002775 | 0.002775 | 0.002775 | 0.002775 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_PLANCKIAN |
5411856859942141273 | 70 | regionprofile | 0.002769 | 0.002769 | 0.002769 | 0.002769 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_PLANCKIAN | |
7599706706110163426 | 70 | regionprofile | 0.002764 | 0.002764 | 0.002764 | 0.002764 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_PLANCKIAN | |
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} | 2118176828577267326 | 71 | regionprofile | 0.002765 | 0.002765 | 0.002765 | 0.002765 | 0.002765 | 0.002765 | 0.002765 | 0.002765 | Lcals_PLANCKIAN.block_128 |
5411856859942141273 | 71 | regionprofile | 0.002759 | 0.002759 | 0.002759 | 0.002759 | 0.002759 | 0.002759 | 0.002759 | 0.002759 | Lcals_PLANCKIAN.block_128 | |
7599706706110163426 | 71 | regionprofile | 0.002754 | 0.002754 | 0.002754 | 0.002754 | 0.002754 | 0.002754 | 0.002754 | 0.002754 | Lcals_PLANCKIAN.block_128 | |
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} | 2118176828577267326 | 72 | regionprofile | 0.045139 | 0.045139 | 0.045139 | 0.045139 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_TRIDIAG_ELIM |
5411856859942141273 | 72 | regionprofile | 0.044582 | 0.044582 | 0.044582 | 0.044582 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_TRIDIAG_ELIM | |
7599706706110163426 | 72 | regionprofile | 0.044691 | 0.044691 | 0.044691 | 0.044691 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Lcals_TRIDIAG_ELIM | |
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} | 2118176828577267326 | 73 | regionprofile | 0.045129 | 0.045129 | 0.045129 | 0.045129 | 0.045129 | 0.045129 | 0.045129 | 0.045129 | Lcals_TRIDIAG_ELIM.block_128 |
5411856859942141273 | 73 | regionprofile | 0.044572 | 0.044572 | 0.044572 | 0.044572 | 0.044572 | 0.044572 | 0.044572 | 0.044572 | Lcals_TRIDIAG_ELIM.block_128 | |
7599706706110163426 | 73 | regionprofile | 0.044681 | 0.044681 | 0.044681 | 0.044681 | 0.044681 | 0.044681 | 0.044681 | 0.044681 | Lcals_TRIDIAG_ELIM.block_128 | |
{'name': 'Polybench', 'type': 'function'} | 2118176828577267326 | 74 | regionprofile | 0.623023 | 0.623023 | 0.623023 | 0.623023 | 0.000085 | 0.000085 | 0.000085 | 0.000085 | Polybench |
5411856859942141273 | 74 | regionprofile | 0.622754 | 0.622754 | 0.622754 | 0.622754 | 0.000086 | 0.000086 | 0.000086 | 0.000086 | Polybench | |
7599706706110163426 | 74 | regionprofile | 0.622254 | 0.622254 | 0.622254 | 0.622254 | 0.000088 | 0.000088 | 0.000088 | 0.000088 | Polybench | |
{'name': 'Polybench_2MM', 'type': 'function'} | 2118176828577267326 | 75 | regionprofile | 0.005957 | 0.005957 | 0.005957 | 0.005957 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_2MM |
5411856859942141273 | 75 | regionprofile | 0.005934 | 0.005934 | 0.005934 | 0.005934 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_2MM | |
7599706706110163426 | 75 | regionprofile | 0.005982 | 0.005982 | 0.005982 | 0.005982 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_2MM | |
{'name': 'Polybench_2MM.block_128', 'type': 'function'} | 2118176828577267326 | 76 | regionprofile | 0.005949 | 0.005949 | 0.005949 | 0.005949 | 0.005949 | 0.005949 | 0.005949 | 0.005949 | Polybench_2MM.block_128 |
5411856859942141273 | 76 | regionprofile | 0.005925 | 0.005925 | 0.005925 | 0.005925 | 0.005925 | 0.005925 | 0.005925 | 0.005925 | Polybench_2MM.block_128 | |
7599706706110163426 | 76 | regionprofile | 0.005974 | 0.005974 | 0.005974 | 0.005974 | 0.005974 | 0.005974 | 0.005974 | 0.005974 | Polybench_2MM.block_128 | |
{'name': 'Polybench_3MM', 'type': 'function'} | 2118176828577267326 | 77 | regionprofile | 0.008891 | 0.008891 | 0.008891 | 0.008891 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_3MM |
5411856859942141273 | 77 | regionprofile | 0.008893 | 0.008893 | 0.008893 | 0.008893 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_3MM | |
7599706706110163426 | 77 | regionprofile | 0.008894 | 0.008894 | 0.008894 | 0.008894 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_3MM | |
{'name': 'Polybench_3MM.block_128', 'type': 'function'} | 2118176828577267326 | 78 | regionprofile | 0.008883 | 0.008883 | 0.008883 | 0.008883 | 0.008883 | 0.008883 | 0.008883 | 0.008883 | Polybench_3MM.block_128 |
5411856859942141273 | 78 | regionprofile | 0.008885 | 0.008885 | 0.008885 | 0.008885 | 0.008885 | 0.008885 | 0.008885 | 0.008885 | Polybench_3MM.block_128 | |
7599706706110163426 | 78 | regionprofile | 0.008886 | 0.008886 | 0.008886 | 0.008886 | 0.008886 | 0.008886 | 0.008886 | 0.008886 | Polybench_3MM.block_128 | |
{'name': 'Polybench_ADI', 'type': 'function'} | 2118176828577267326 | 79 | regionprofile | 0.033510 | 0.033510 | 0.033510 | 0.033510 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_ADI |
5411856859942141273 | 79 | regionprofile | 0.033662 | 0.033662 | 0.033662 | 0.033662 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Polybench_ADI | |
7599706706110163426 | 79 | regionprofile | 0.033601 | 0.033601 | 0.033601 | 0.033601 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_ADI | |
{'name': 'Polybench_ADI.block_128', 'type': 'function'} | 2118176828577267326 | 80 | regionprofile | 0.033501 | 0.033501 | 0.033501 | 0.033501 | 0.033501 | 0.033501 | 0.033501 | 0.033501 | Polybench_ADI.block_128 |
5411856859942141273 | 80 | regionprofile | 0.033652 | 0.033652 | 0.033652 | 0.033652 | 0.033652 | 0.033652 | 0.033652 | 0.033652 | Polybench_ADI.block_128 | |
7599706706110163426 | 80 | regionprofile | 0.033592 | 0.033592 | 0.033592 | 0.033592 | 0.033592 | 0.033592 | 0.033592 | 0.033592 | Polybench_ADI.block_128 | |
{'name': 'Polybench_ATAX', 'type': 'function'} | 2118176828577267326 | 81 | regionprofile | 0.037496 | 0.037496 | 0.037496 | 0.037496 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_ATAX |
5411856859942141273 | 81 | regionprofile | 0.037376 | 0.037376 | 0.037376 | 0.037376 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_ATAX | |
7599706706110163426 | 81 | regionprofile | 0.037412 | 0.037412 | 0.037412 | 0.037412 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_ATAX | |
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} | 2118176828577267326 | 82 | regionprofile | 0.037487 | 0.037487 | 0.037487 | 0.037487 | 0.037487 | 0.037487 | 0.037487 | 0.037487 | Polybench_ATAX.block_128 |
5411856859942141273 | 82 | regionprofile | 0.037367 | 0.037367 | 0.037367 | 0.037367 | 0.037367 | 0.037367 | 0.037367 | 0.037367 | Polybench_ATAX.block_128 | |
7599706706110163426 | 82 | regionprofile | 0.037403 | 0.037403 | 0.037403 | 0.037403 | 0.037403 | 0.037403 | 0.037403 | 0.037403 | Polybench_ATAX.block_128 | |
{'name': 'Polybench_FDTD_2D', 'type': 'function'} | 2118176828577267326 | 83 | regionprofile | 0.037077 | 0.037077 | 0.037077 | 0.037077 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_FDTD_2D |
5411856859942141273 | 83 | regionprofile | 0.037133 | 0.037133 | 0.037133 | 0.037133 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_FDTD_2D | |
7599706706110163426 | 83 | regionprofile | 0.037119 | 0.037119 | 0.037119 | 0.037119 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Polybench_FDTD_2D | |
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} | 2118176828577267326 | 84 | regionprofile | 0.037069 | 0.037069 | 0.037069 | 0.037069 | 0.037069 | 0.037069 | 0.037069 | 0.037069 | Polybench_FDTD_2D.block_128 |
5411856859942141273 | 84 | regionprofile | 0.037124 | 0.037124 | 0.037124 | 0.037124 | 0.037124 | 0.037124 | 0.037124 | 0.037124 | Polybench_FDTD_2D.block_128 | |
7599706706110163426 | 84 | regionprofile | 0.037109 | 0.037109 | 0.037109 | 0.037109 | 0.037109 | 0.037109 | 0.037109 | 0.037109 | Polybench_FDTD_2D.block_128 | |
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} | 2118176828577267326 | 85 | regionprofile | 0.209528 | 0.209528 | 0.209528 | 0.209528 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_FLOYD_WARSHALL |
5411856859942141273 | 85 | regionprofile | 0.209062 | 0.209062 | 0.209062 | 0.209062 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_FLOYD_WARSHALL | |
7599706706110163426 | 85 | regionprofile | 0.208607 | 0.208607 | 0.208607 | 0.208607 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_FLOYD_WARSHALL | |
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} | 2118176828577267326 | 86 | regionprofile | 0.209520 | 0.209520 | 0.209520 | 0.209520 | 0.209520 | 0.209520 | 0.209520 | 0.209520 | Polybench_FLOYD_WARSHALL.block_128 |
5411856859942141273 | 86 | regionprofile | 0.209053 | 0.209053 | 0.209053 | 0.209053 | 0.209053 | 0.209053 | 0.209053 | 0.209053 | Polybench_FLOYD_WARSHALL.block_128 | |
7599706706110163426 | 86 | regionprofile | 0.208599 | 0.208599 | 0.208599 | 0.208599 | 0.208599 | 0.208599 | 0.208599 | 0.208599 | Polybench_FLOYD_WARSHALL.block_128 | |
{'name': 'Polybench_GEMM', 'type': 'function'} | 2118176828577267326 | 87 | regionprofile | 0.006751 | 0.006751 | 0.006751 | 0.006751 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Polybench_GEMM |
5411856859942141273 | 87 | regionprofile | 0.006751 | 0.006751 | 0.006751 | 0.006751 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Polybench_GEMM | |
7599706706110163426 | 87 | regionprofile | 0.006756 | 0.006756 | 0.006756 | 0.006756 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Polybench_GEMM | |
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} | 2118176828577267326 | 88 | regionprofile | 0.006741 | 0.006741 | 0.006741 | 0.006741 | 0.006741 | 0.006741 | 0.006741 | 0.006741 | Polybench_GEMM.block_128 |
5411856859942141273 | 88 | regionprofile | 0.006740 | 0.006740 | 0.006740 | 0.006740 | 0.006740 | 0.006740 | 0.006740 | 0.006740 | Polybench_GEMM.block_128 | |
7599706706110163426 | 88 | regionprofile | 0.006746 | 0.006746 | 0.006746 | 0.006746 | 0.006746 | 0.006746 | 0.006746 | 0.006746 | Polybench_GEMM.block_128 | |
{'name': 'Polybench_GEMVER', 'type': 'function'} | 2118176828577267326 | 89 | regionprofile | 0.008459 | 0.008459 | 0.008459 | 0.008459 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_GEMVER |
5411856859942141273 | 89 | regionprofile | 0.008441 | 0.008441 | 0.008441 | 0.008441 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_GEMVER | |
7599706706110163426 | 89 | regionprofile | 0.008433 | 0.008433 | 0.008433 | 0.008433 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_GEMVER | |
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} | 2118176828577267326 | 90 | regionprofile | 0.008451 | 0.008451 | 0.008451 | 0.008451 | 0.008451 | 0.008451 | 0.008451 | 0.008451 | Polybench_GEMVER.block_128 |
5411856859942141273 | 90 | regionprofile | 0.008432 | 0.008432 | 0.008432 | 0.008432 | 0.008432 | 0.008432 | 0.008432 | 0.008432 | Polybench_GEMVER.block_128 | |
7599706706110163426 | 90 | regionprofile | 0.008425 | 0.008425 | 0.008425 | 0.008425 | 0.008425 | 0.008425 | 0.008425 | 0.008425 | Polybench_GEMVER.block_128 | |
{'name': 'Polybench_GESUMMV', 'type': 'function'} | 2118176828577267326 | 91 | regionprofile | 0.037419 | 0.037419 | 0.037419 | 0.037419 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_GESUMMV |
5411856859942141273 | 91 | regionprofile | 0.037170 | 0.037170 | 0.037170 | 0.037170 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_GESUMMV | |
7599706706110163426 | 91 | regionprofile | 0.037532 | 0.037532 | 0.037532 | 0.037532 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Polybench_GESUMMV | |
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} | 2118176828577267326 | 92 | regionprofile | 0.037410 | 0.037410 | 0.037410 | 0.037410 | 0.037410 | 0.037410 | 0.037410 | 0.037410 | Polybench_GESUMMV.block_128 |
5411856859942141273 | 92 | regionprofile | 0.037161 | 0.037161 | 0.037161 | 0.037161 | 0.037161 | 0.037161 | 0.037161 | 0.037161 | Polybench_GESUMMV.block_128 | |
7599706706110163426 | 92 | regionprofile | 0.037524 | 0.037524 | 0.037524 | 0.037524 | 0.037524 | 0.037524 | 0.037524 | 0.037524 | Polybench_GESUMMV.block_128 | |
{'name': 'Polybench_HEAT_3D', 'type': 'function'} | 2118176828577267326 | 93 | regionprofile | 0.021835 | 0.021835 | 0.021835 | 0.021835 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_HEAT_3D |
5411856859942141273 | 93 | regionprofile | 0.021867 | 0.021867 | 0.021867 | 0.021867 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_HEAT_3D | |
7599706706110163426 | 93 | regionprofile | 0.021811 | 0.021811 | 0.021811 | 0.021811 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Polybench_HEAT_3D | |
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} | 2118176828577267326 | 94 | regionprofile | 0.021826 | 0.021826 | 0.021826 | 0.021826 | 0.021826 | 0.021826 | 0.021826 | 0.021826 | Polybench_HEAT_3D.block_128 |
5411856859942141273 | 94 | regionprofile | 0.021858 | 0.021858 | 0.021858 | 0.021858 | 0.021858 | 0.021858 | 0.021858 | 0.021858 | Polybench_HEAT_3D.block_128 | |
7599706706110163426 | 94 | regionprofile | 0.021801 | 0.021801 | 0.021801 | 0.021801 | 0.021801 | 0.021801 | 0.021801 | 0.021801 | Polybench_HEAT_3D.block_128 | |
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} | 2118176828577267326 | 95 | regionprofile | 0.077689 | 0.077689 | 0.077689 | 0.077689 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_JACOBI_1D |
5411856859942141273 | 95 | regionprofile | 0.077843 | 0.077843 | 0.077843 | 0.077843 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Polybench_JACOBI_1D | |
7599706706110163426 | 95 | regionprofile | 0.077730 | 0.077730 | 0.077730 | 0.077730 | 0.000012 | 0.000012 | 0.000012 | 0.000012 | Polybench_JACOBI_1D | |
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} | 2118176828577267326 | 96 | regionprofile | 0.077680 | 0.077680 | 0.077680 | 0.077680 | 0.077680 | 0.077680 | 0.077680 | 0.077680 | Polybench_JACOBI_1D.block_128 |
5411856859942141273 | 96 | regionprofile | 0.077832 | 0.077832 | 0.077832 | 0.077832 | 0.077832 | 0.077832 | 0.077832 | 0.077832 | Polybench_JACOBI_1D.block_128 | |
7599706706110163426 | 96 | regionprofile | 0.077718 | 0.077718 | 0.077718 | 0.077718 | 0.077718 | 0.077718 | 0.077718 | 0.077718 | Polybench_JACOBI_1D.block_128 | |
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} | 2118176828577267326 | 97 | regionprofile | 0.102498 | 0.102498 | 0.102498 | 0.102498 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_JACOBI_2D |
5411856859942141273 | 97 | regionprofile | 0.102730 | 0.102730 | 0.102730 | 0.102730 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Polybench_JACOBI_2D | |
7599706706110163426 | 97 | regionprofile | 0.102590 | 0.102590 | 0.102590 | 0.102590 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Polybench_JACOBI_2D | |
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} | 2118176828577267326 | 98 | regionprofile | 0.102489 | 0.102489 | 0.102489 | 0.102489 | 0.102489 | 0.102489 | 0.102489 | 0.102489 | Polybench_JACOBI_2D.block_128 |
5411856859942141273 | 98 | regionprofile | 0.102720 | 0.102720 | 0.102720 | 0.102720 | 0.102720 | 0.102720 | 0.102720 | 0.102720 | Polybench_JACOBI_2D.block_128 | |
7599706706110163426 | 98 | regionprofile | 0.102580 | 0.102580 | 0.102580 | 0.102580 | 0.102580 | 0.102580 | 0.102580 | 0.102580 | Polybench_JACOBI_2D.block_128 | |
{'name': 'Polybench_MVT', 'type': 'function'} | 2118176828577267326 | 99 | regionprofile | 0.035828 | 0.035828 | 0.035828 | 0.035828 | 0.000012 | 0.000012 | 0.000012 | 0.000012 | Polybench_MVT |
5411856859942141273 | 99 | regionprofile | 0.035806 | 0.035806 | 0.035806 | 0.035806 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_MVT | |
7599706706110163426 | 99 | regionprofile | 0.035699 | 0.035699 | 0.035699 | 0.035699 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Polybench_MVT | |
{'name': 'Polybench_MVT.block_128', 'type': 'function'} | 2118176828577267326 | 100 | regionprofile | 0.035816 | 0.035816 | 0.035816 | 0.035816 | 0.035816 | 0.035816 | 0.035816 | 0.035816 | Polybench_MVT.block_128 |
5411856859942141273 | 100 | regionprofile | 0.035797 | 0.035797 | 0.035797 | 0.035797 | 0.035797 | 0.035797 | 0.035797 | 0.035797 | Polybench_MVT.block_128 | |
7599706706110163426 | 100 | regionprofile | 0.035690 | 0.035690 | 0.035690 | 0.035690 | 0.035690 | 0.035690 | 0.035690 | 0.035690 | Polybench_MVT.block_128 | |
{'name': 'Stream', 'type': 'function'} | 2118176828577267326 | 101 | regionprofile | 0.268003 | 0.268003 | 0.268003 | 0.268003 | 0.000033 | 0.000033 | 0.000033 | 0.000033 | Stream |
5411856859942141273 | 101 | regionprofile | 0.268645 | 0.268645 | 0.268645 | 0.268645 | 0.000036 | 0.000036 | 0.000036 | 0.000036 | Stream | |
7599706706110163426 | 101 | regionprofile | 0.264033 | 0.264033 | 0.264033 | 0.264033 | 0.000033 | 0.000033 | 0.000033 | 0.000033 | Stream | |
{'name': 'Stream_ADD', 'type': 'function'} | 2118176828577267326 | 102 | regionprofile | 0.034015 | 0.034015 | 0.034015 | 0.034015 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Stream_ADD |
5411856859942141273 | 102 | regionprofile | 0.033807 | 0.033807 | 0.033807 | 0.033807 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Stream_ADD | |
7599706706110163426 | 102 | regionprofile | 0.033724 | 0.033724 | 0.033724 | 0.033724 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Stream_ADD | |
{'name': 'Stream_ADD.block_128', 'type': 'function'} | 2118176828577267326 | 103 | regionprofile | 0.034006 | 0.034006 | 0.034006 | 0.034006 | 0.034006 | 0.034006 | 0.034006 | 0.034006 | Stream_ADD.block_128 |
5411856859942141273 | 103 | regionprofile | 0.033798 | 0.033798 | 0.033798 | 0.033798 | 0.033798 | 0.033798 | 0.033798 | 0.033798 | Stream_ADD.block_128 | |
7599706706110163426 | 103 | regionprofile | 0.033715 | 0.033715 | 0.033715 | 0.033715 | 0.033715 | 0.033715 | 0.033715 | 0.033715 | Stream_ADD.block_128 | |
{'name': 'Stream_COPY', 'type': 'function'} | 2118176828577267326 | 104 | regionprofile | 0.043270 | 0.043270 | 0.043270 | 0.043270 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Stream_COPY |
5411856859942141273 | 104 | regionprofile | 0.043329 | 0.043329 | 0.043329 | 0.043329 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Stream_COPY | |
7599706706110163426 | 104 | regionprofile | 0.042980 | 0.042980 | 0.042980 | 0.042980 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Stream_COPY | |
{'name': 'Stream_COPY.block_128', 'type': 'function'} | 2118176828577267326 | 105 | regionprofile | 0.043261 | 0.043261 | 0.043261 | 0.043261 | 0.043261 | 0.043261 | 0.043261 | 0.043261 | Stream_COPY.block_128 |
5411856859942141273 | 105 | regionprofile | 0.043321 | 0.043321 | 0.043321 | 0.043321 | 0.043321 | 0.043321 | 0.043321 | 0.043321 | Stream_COPY.block_128 | |
7599706706110163426 | 105 | regionprofile | 0.042971 | 0.042971 | 0.042971 | 0.042971 | 0.042971 | 0.042971 | 0.042971 | 0.042971 | Stream_COPY.block_128 | |
{'name': 'Stream_DOT', 'type': 'function'} | 2118176828577267326 | 106 | regionprofile | 0.113664 | 0.113664 | 0.113664 | 0.113664 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Stream_DOT |
5411856859942141273 | 106 | regionprofile | 0.114367 | 0.114367 | 0.114367 | 0.114367 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Stream_DOT | |
7599706706110163426 | 106 | regionprofile | 0.110674 | 0.110674 | 0.110674 | 0.110674 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Stream_DOT | |
{'name': 'Stream_DOT.block_128', 'type': 'function'} | 2118176828577267326 | 107 | regionprofile | 0.113655 | 0.113655 | 0.113655 | 0.113655 | 0.113655 | 0.113655 | 0.113655 | 0.113655 | Stream_DOT.block_128 |
5411856859942141273 | 107 | regionprofile | 0.114358 | 0.114358 | 0.114358 | 0.114358 | 0.114358 | 0.114358 | 0.114358 | 0.114358 | Stream_DOT.block_128 | |
7599706706110163426 | 107 | regionprofile | 0.110665 | 0.110665 | 0.110665 | 0.110665 | 0.110665 | 0.110665 | 0.110665 | 0.110665 | Stream_DOT.block_128 | |
{'name': 'Stream_MUL', 'type': 'function'} | 2118176828577267326 | 108 | regionprofile | 0.043283 | 0.043283 | 0.043283 | 0.043283 | 0.000012 | 0.000012 | 0.000012 | 0.000012 | Stream_MUL |
5411856859942141273 | 108 | regionprofile | 0.043339 | 0.043339 | 0.043339 | 0.043339 | 0.000011 | 0.000011 | 0.000011 | 0.000011 | Stream_MUL | |
7599706706110163426 | 108 | regionprofile | 0.042951 | 0.042951 | 0.042951 | 0.042951 | 0.000010 | 0.000010 | 0.000010 | 0.000010 | Stream_MUL | |
{'name': 'Stream_MUL.block_128', 'type': 'function'} | 2118176828577267326 | 109 | regionprofile | 0.043271 | 0.043271 | 0.043271 | 0.043271 | 0.043271 | 0.043271 | 0.043271 | 0.043271 | Stream_MUL.block_128 |
5411856859942141273 | 109 | regionprofile | 0.043328 | 0.043328 | 0.043328 | 0.043328 | 0.043328 | 0.043328 | 0.043328 | 0.043328 | Stream_MUL.block_128 | |
7599706706110163426 | 109 | regionprofile | 0.042941 | 0.042941 | 0.042941 | 0.042941 | 0.042941 | 0.042941 | 0.042941 | 0.042941 | Stream_MUL.block_128 | |
{'name': 'Stream_TRIAD', 'type': 'function'} | 2118176828577267326 | 110 | regionprofile | 0.033738 | 0.033738 | 0.033738 | 0.033738 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Stream_TRIAD |
5411856859942141273 | 110 | regionprofile | 0.033767 | 0.033767 | 0.033767 | 0.033767 | 0.000008 | 0.000008 | 0.000008 | 0.000008 | Stream_TRIAD | |
7599706706110163426 | 110 | regionprofile | 0.033671 | 0.033671 | 0.033671 | 0.033671 | 0.000009 | 0.000009 | 0.000009 | 0.000009 | Stream_TRIAD | |
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} | 2118176828577267326 | 111 | regionprofile | 0.033730 | 0.033730 | 0.033730 | 0.033730 | 0.033730 | 0.033730 | 0.033730 | 0.033730 | Stream_TRIAD.block_128 |
5411856859942141273 | 111 | regionprofile | 0.033759 | 0.033759 | 0.033759 | 0.033759 | 0.033759 | 0.033759 | 0.033759 | 0.033759 | Stream_TRIAD.block_128 | |
7599706706110163426 | 111 | regionprofile | 0.033662 | 0.033662 | 0.033662 | 0.033662 | 0.033662 | 0.033662 | 0.033662 | 0.033662 | Stream_TRIAD.block_128 |
The metadata table stores information about each HPC run, such as the application’s build settings and execution context. Each row corresponds to a single execution of the application and is identified by a unique profile index (the large integer shown in the profile column).
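To make the one-row-per-profile layout concrete, here is a minimal sketch in plain pandas. It does not use thicket itself: the small `metadata` frame below is a hypothetical stand-in that copies three profile indices and a few column values from the output shown in this notebook, and the variable names are illustrative assumptions.

```python
import pandas as pd

# Hypothetical stand-in for a metadata table: one row per profile (one run).
# The index values and column values are copied from this notebook's output;
# the frame itself is built by hand for illustration only.
metadata = pd.DataFrame(
    {
        "cali.caliper.version": ["2.9.0-dev"] * 3,
        "variant": ["Base_CUDA"] * 3,
        "spot.format.version": [2, 2, 2],
    },
    index=pd.Index(
        [2118176828577267326, 5411856859942141273, 7599706706110163426],
        name="profile",
    ),
)

# Each profile index identifies exactly one run.
assert metadata.index.is_unique

# Look up the metadata of a single run by its profile index.
one_run = metadata.loc[5411856859942141273]
print(one_run["variant"])  # prints "Base_CUDA"

# List the columns whose value is identical across all runs.
constant_cols = [c for c in metadata.columns if metadata[c].nunique() == 1]
print(constant_cols)
```

Because the real metadata table is displayed as an ordinary DataFrame, the same kind of label-based lookup and column selection should carry over to `th_lassen.metadata`, although that is an assumption this sketch does not verify.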
View metadata table:
[5]:
display(HTML(th_lassen.metadata.to_html()))
spot.channels | spot.options | spot.format.version | spot.timeseries.metrics | spot.metrics | cali.caliper.version | variant | Algorithm_MEMCPY | Algorithm_MEMSET | Algorithm_REDUCE_SUM | Algorithm_SORTPAIRS | Algorithm_SORT | Algorithm_SCAN | Apps_VOL3D | Apps_PRESSURE | Apps_NODAL_ACCUMULATION_3D | Apps_MASS3DPA | Apps_LTIMES_NOVIEW | Apps_LTIMES | Apps_HALOEXCHANGE_FUSED | Apps_HALOEXCHANGE | Apps_FIR | Apps_ENERGY | Apps_DIFFUSION3DPA | Apps_DEL_DOT_VEC_2D | Apps_CONVECTION3DPA | Stream_TRIAD | Stream_MUL | Stream_DOT | Stream_COPY | Stream_ADD | Polybench_MVT | Polybench_JACOBI_2D | Polybench_JACOBI_1D | Polybench_HEAT_3D | Polybench_GESUMMV | Polybench_GEMVER | Polybench_GEMM | Polybench_FLOYD_WARSHALL | Polybench_FDTD_2D | Polybench_ATAX | Polybench_ADI | Polybench_3MM | Polybench_2MM | Lcals_TRIDIAG_ELIM | Lcals_PLANCKIAN | Lcals_INT_PREDICT | Lcals_HYDRO_2D | Lcals_HYDRO_1D | Lcals_GEN_LIN_RECUR | Lcals_FIRST_SUM | Lcals_FIRST_MIN | Lcals_FIRST_DIFF | Lcals_EOS | Lcals_DIFF_PREDICT | Basic_TRAP_INT | Basic_REDUCE_STRUCT | Basic_REDUCE3_INT | Basic_PI_REDUCE | Basic_PI_ATOMIC | Basic_NESTED_INIT | Basic_MULADDSUB | Basic_MAT_MAT_SHARED | Basic_INIT_VIEW1D_OFFSET | Basic_INIT_VIEW1D | Basic_INIT3 | Basic_INDEXLIST_3LOOP | Basic_INDEXLIST | Basic_IF_QUAD | Basic_DAXPY_ATOMIC | Basic_DAXPY | SizeMeaning | Tuning | systype_build | gpu_targets_block_sizes | cuda_compiler_version | Compiler_path_version | compiler | compiler_version | rajaperf_compiler_options | rajaperf_compiler | cmake_exe_linker_flags | cmake_cxx_flags | cmake_build_type | raja_version | perfsuite_version | cluster | cmdline | libraries | launchdate | user | cali.channel | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
profile | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
2118176828577267326 | regionprofile | 2 | min#inclusive#sum#time.duration,max#inclusive#sum#time.duration,avg#inclusive#sum#time.duration,sum#inclusive#sum#time.duration,min#sum#sum#time.duration,max#sum#sum#time.duration,avg#sum#sum#time.duration,sum#sum#sum#time.duration | 2.9.0-dev | Base_CUDA | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554432,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} | {'Problem size': 1135363,'Reps':100,'Iterations/rep': 1135363,'Kernels/rep': 1,'Bytes/rep': 36598760,'FLOPS/rep': 81746136} | {'Problem size': 1048576,'Reps':700,'Iterations/rep': 2097152,'Kernels/rep': 2,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} | {'Problem size': 1030301,'Reps':100,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 33464144,'FLOPS/rep': 9272709} | {'Problem size': 1048500,'Reps':50,'Iterations/rep': 1048500,'Kernels/rep': 1,'Bytes/rep': 16977632,'FLOPS/rep': 42518772} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} | {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 2,'Bytes/rep': 7491120,'FLOPS/rep': 0} | {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 156,'Bytes/rep': 
7491120,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':160,'Iterations/rep': 1048560,'Kernels/rep': 1,'Bytes/rep': 16777088,'FLOPS/rep': 33553920} | {'Problem size': 1048576,'Reps':130,'Iterations/rep': 6291456,'Kernels/rep': 6,'Bytes/rep': 226492416,'FLOPS/rep': 57671680} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 57409728,'FLOPS/rep': 115752960} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 50397216,'FLOPS/rep': 56623104} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 32244000,'FLOPS/rep': 60342272} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} | {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777232,'FLOPS/rep': 2097152} | {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 1048576} | {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16859200,'FLOPS/rep': 4202500} | {'Problem size': 1046529,'Reps':50,'Iterations/rep': 83722320,'Kernels/rep': 2,'Bytes/rep': 1342176000,'FLOPS/rep': 418611600} | {'Problem size': 1048574,'Reps':100,'Iterations/rep': 33554368,'Kernels/rep': 32,'Bytes/rep': 536870400,'FLOPS/rep': 100663104} | {'Problem size': 1000000,'Reps':20,'Iterations/rep': 40000000,'Kernels/rep': 40,'Bytes/rep': 659200000,'FLOPS/rep': 600000000} | {'Problem size': 1050625,'Reps':120,'Iterations/rep': 1025,'Kernels/rep': 1,'Bytes/rep': 16834600,'FLOPS/rep': 4205575} | {'Problem size': 1050625,'Reps':20,'Iterations/rep': 3152900,'Kernels/rep': 
4,'Bytes/rep': 33726600,'FLOPS/rep': 10507275} | {'Problem size': 1050625,'Reps':4,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 28085000,'FLOPS/rep': 3783300625} | {'Problem size': 1050625,'Reps':8,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 16810000,'FLOPS/rep': 1076890625} | {'Problem size': 1049600,'Reps':8,'Iterations/rep': 125952040,'Kernels/rep': 160,'Bytes/rep': 3359048960,'FLOPS/rep': 461619200} | {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16851000,'FLOPS/rep': 4202500} | {'Problem size': 1046529,'Reps':4,'Iterations/rep': 8184,'Kernels/rep': 8,'Bytes/rep': 402652800,'FLOPS/rep': 142327944} | {'Problem size': 1050625,'Reps':2,'Iterations/rep': 3151875,'Kernels/rep': 3,'Bytes/rep': 78269000,'FLOPS/rep': 6797543750} | {'Problem size': 1050625,'Reps':2,'Iterations/rep': 2101250,'Kernels/rep': 2,'Bytes/rep': 51988000,'FLOPS/rep': 5683881250} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554400,'FLOPS/rep': 2097150} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 4194304} | {'Problem size': 1048576,'Reps':400,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 92274688,'FLOPS/rep': 17825792} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145728,'Kernels/rep': 3,'Bytes/rep': 150667584,'FLOPS/rep': 45957296} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165832,'FLOPS/rep': 5242880} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 2,'Bytes/rep': 83886080,'FLOPS/rep': 6291456} | {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777208,'FLOPS/rep': 1048575} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388640,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 
1048576,'Kernels/rep': 1,'Bytes/rep': 16777224,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554488,'FLOPS/rep': 16777216} | {'Problem size': 1048576,'Reps':200,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 167772160,'FLOPS/rep': 9437184} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 10485760} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777264,'FLOPS/rep': 2097154} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 4194328,'FLOPS/rep': 1048577} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} | {'Problem size': 1030301,'Reps':1000,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 8242408,'FLOPS/rep': 3090903} | {'Problem size': 1048576,'Reps':350,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} | {'Problem size': 1048576,'Reps':5,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 2147483648} | {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145729,'Kernels/rep': 3,'Bytes/rep': 27263004,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 10485776,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':180,'Iterations/rep': 1048576,'Kernels/rep': 
1,'Bytes/rep': 41943040,'FLOPS/rep': 11534336} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} | Direct | default | blueos_3_ppc64le_ib_p9 | 128 | 11.2.152 | xl-2022.08.19 | xlc++_r-16.1.1.12 | 16.1.1.12 | /usr/tce/packages/cuda/cuda-11.2.0/bin/nvcc;/usr/tce/packages/xl/xl-2022.08.19/bin/xlc++_r | -qthreaded | Release | 2022.03.0 | 2022.3.0 | lassen | [./my-xl-block-128-build/bin/raja-perf.exe, --variants, Base_CUDA, --size, 1048576] | [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/workspace/asde/caliper-lassen/lib64/libcaliper.so.2, /usr/workspace/wsb/asde/adiak-lassen/lib/libadiak.so, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpiprofilesupport.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpi_ibm.so.3, /usr/tce/packages/xl/xl-2022.08.19/lib/libibmc++.so.1, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libstdc++.so.6, /lib64/libm.so.6, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libgcc_s.so.1, /lib64/libpthread.so.0, /lib64/libc.so.6, /lib64/ld64.so.2, /usr/tce/packages/cuda/cuda-10.1.243/nvidia/lib64/libnvToolsExt.so.1, /usr/tce/packages/cuda/cuda-10.1.243/bin/../extras/CUPTI/lib64/libcupti.so.10.1, /lib64/libpapi.so.5.2.0.0, /lib64/libdw.so.1, /lib64/libunwind.so.8, /lib64/libutil.so.1, /lib64/libz.so.1, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libhwloc_ompi.so.15, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent_pthreads-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-rte.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-pal.so.3, /lib64/libpfm.so.4, /lib64/libsensors.so.4, /lib64/libelf.so.1, 
/lib64/liblzma.so.5, /lib64/libbz2.so.1, /lib64/libnss_files.so.2] | 1668559508 | burgess9 | spot | ||||
5411856859942141273 | regionprofile | 2 | min#inclusive#sum#time.duration,max#inclusive#sum#time.duration,avg#inclusive#sum#time.duration,sum#inclusive#sum#time.duration,min#sum#sum#time.duration,max#sum#sum#time.duration,avg#sum#sum#time.duration,sum#sum#sum#time.duration | 2.9.0-dev | Base_CUDA | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554432,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} | {'Problem size': 1135363,'Reps':100,'Iterations/rep': 1135363,'Kernels/rep': 1,'Bytes/rep': 36598760,'FLOPS/rep': 81746136} | {'Problem size': 1048576,'Reps':700,'Iterations/rep': 2097152,'Kernels/rep': 2,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} | {'Problem size': 1030301,'Reps':100,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 33464144,'FLOPS/rep': 9272709} | {'Problem size': 1048500,'Reps':50,'Iterations/rep': 1048500,'Kernels/rep': 1,'Bytes/rep': 16977632,'FLOPS/rep': 42518772} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} | {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 2,'Bytes/rep': 7491120,'FLOPS/rep': 0} | {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 156,'Bytes/rep': 
7491120,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':160,'Iterations/rep': 1048560,'Kernels/rep': 1,'Bytes/rep': 16777088,'FLOPS/rep': 33553920} | {'Problem size': 1048576,'Reps':130,'Iterations/rep': 6291456,'Kernels/rep': 6,'Bytes/rep': 226492416,'FLOPS/rep': 57671680} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 57409728,'FLOPS/rep': 115752960} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 50397216,'FLOPS/rep': 56623104} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 32244000,'FLOPS/rep': 60342272} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} | {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777232,'FLOPS/rep': 2097152} | {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 1048576} | {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16859200,'FLOPS/rep': 4202500} | {'Problem size': 1046529,'Reps':50,'Iterations/rep': 83722320,'Kernels/rep': 2,'Bytes/rep': 1342176000,'FLOPS/rep': 418611600} | {'Problem size': 1048574,'Reps':100,'Iterations/rep': 33554368,'Kernels/rep': 32,'Bytes/rep': 536870400,'FLOPS/rep': 100663104} | {'Problem size': 1000000,'Reps':20,'Iterations/rep': 40000000,'Kernels/rep': 40,'Bytes/rep': 659200000,'FLOPS/rep': 600000000} | {'Problem size': 1050625,'Reps':120,'Iterations/rep': 1025,'Kernels/rep': 1,'Bytes/rep': 16834600,'FLOPS/rep': 4205575} | {'Problem size': 1050625,'Reps':20,'Iterations/rep': 3152900,'Kernels/rep': 
4,'Bytes/rep': 33726600,'FLOPS/rep': 10507275} | {'Problem size': 1050625,'Reps':4,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 28085000,'FLOPS/rep': 3783300625} | {'Problem size': 1050625,'Reps':8,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 16810000,'FLOPS/rep': 1076890625} | {'Problem size': 1049600,'Reps':8,'Iterations/rep': 125952040,'Kernels/rep': 160,'Bytes/rep': 3359048960,'FLOPS/rep': 461619200} | {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16851000,'FLOPS/rep': 4202500} | {'Problem size': 1046529,'Reps':4,'Iterations/rep': 8184,'Kernels/rep': 8,'Bytes/rep': 402652800,'FLOPS/rep': 142327944} | {'Problem size': 1050625,'Reps':2,'Iterations/rep': 3151875,'Kernels/rep': 3,'Bytes/rep': 78269000,'FLOPS/rep': 6797543750} | {'Problem size': 1050625,'Reps':2,'Iterations/rep': 2101250,'Kernels/rep': 2,'Bytes/rep': 51988000,'FLOPS/rep': 5683881250} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554400,'FLOPS/rep': 2097150} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 4194304} | {'Problem size': 1048576,'Reps':400,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 92274688,'FLOPS/rep': 17825792} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145728,'Kernels/rep': 3,'Bytes/rep': 150667584,'FLOPS/rep': 45957296} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165832,'FLOPS/rep': 5242880} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 2,'Bytes/rep': 83886080,'FLOPS/rep': 6291456} | {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777208,'FLOPS/rep': 1048575} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388640,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 
1048576,'Kernels/rep': 1,'Bytes/rep': 16777224,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554488,'FLOPS/rep': 16777216} | {'Problem size': 1048576,'Reps':200,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 167772160,'FLOPS/rep': 9437184} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 10485760} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777264,'FLOPS/rep': 2097154} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 4194328,'FLOPS/rep': 1048577} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} | {'Problem size': 1030301,'Reps':1000,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 8242408,'FLOPS/rep': 3090903} | {'Problem size': 1048576,'Reps':350,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} | {'Problem size': 1048576,'Reps':5,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 2147483648} | {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145729,'Kernels/rep': 3,'Bytes/rep': 27263004,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 10485776,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':180,'Iterations/rep': 1048576,'Kernels/rep': 
1,'Bytes/rep': 41943040,'FLOPS/rep': 11534336} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} | Direct | default | blueos_3_ppc64le_ib_p9 | 128 | 11.2.152 | xl-2022.08.19 | xlc++_r-16.1.1.12 | 16.1.1.12 | /usr/tce/packages/cuda/cuda-11.2.0/bin/nvcc;/usr/tce/packages/xl/xl-2022.08.19/bin/xlc++_r | -qthreaded | Release | 2022.03.0 | 2022.3.0 | lassen | [./my-xl-block-128-build/bin/raja-perf.exe, --variants, Base_CUDA, --size, 1048576] | [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/workspace/asde/caliper-lassen/lib64/libcaliper.so.2, /usr/workspace/wsb/asde/adiak-lassen/lib/libadiak.so, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpiprofilesupport.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpi_ibm.so.3, /usr/tce/packages/xl/xl-2022.08.19/lib/libibmc++.so.1, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libstdc++.so.6, /lib64/libm.so.6, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libgcc_s.so.1, /lib64/libpthread.so.0, /lib64/libc.so.6, /lib64/ld64.so.2, /usr/tce/packages/cuda/cuda-10.1.243/nvidia/lib64/libnvToolsExt.so.1, /usr/tce/packages/cuda/cuda-10.1.243/bin/../extras/CUPTI/lib64/libcupti.so.10.1, /lib64/libpapi.so.5.2.0.0, /lib64/libdw.so.1, /lib64/libunwind.so.8, /lib64/libutil.so.1, /lib64/libz.so.1, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libhwloc_ompi.so.15, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent_pthreads-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-rte.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-pal.so.3, /lib64/libpfm.so.4, /lib64/libsensors.so.4, /lib64/libelf.so.1, 
/lib64/liblzma.so.5, /lib64/libbz2.so.1, /lib64/libnss_files.so.2] | 1668559527 | burgess9 | spot | ||||
7599706706110163426 | regionprofile | 2 | min#inclusive#sum#time.duration,max#inclusive#sum#time.duration,avg#inclusive#sum#time.duration,sum#inclusive#sum#time.duration,min#sum#sum#time.duration,max#sum#sum#time.duration,avg#sum#sum#time.duration,sum#sum#sum#time.duration | 2.9.0-dev | Base_CUDA | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554432,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} | {'Problem size': 1135363,'Reps':100,'Iterations/rep': 1135363,'Kernels/rep': 1,'Bytes/rep': 36598760,'FLOPS/rep': 81746136} | {'Problem size': 1048576,'Reps':700,'Iterations/rep': 2097152,'Kernels/rep': 2,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} | {'Problem size': 1030301,'Reps':100,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 33464144,'FLOPS/rep': 9272709} | {'Problem size': 1048500,'Reps':50,'Iterations/rep': 1048500,'Kernels/rep': 1,'Bytes/rep': 16977632,'FLOPS/rep': 42518772} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} | {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 2,'Bytes/rep': 7491120,'FLOPS/rep': 0} | {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 156,'Bytes/rep': 
7491120,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':160,'Iterations/rep': 1048560,'Kernels/rep': 1,'Bytes/rep': 16777088,'FLOPS/rep': 33553920} | {'Problem size': 1048576,'Reps':130,'Iterations/rep': 6291456,'Kernels/rep': 6,'Bytes/rep': 226492416,'FLOPS/rep': 57671680} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 57409728,'FLOPS/rep': 115752960} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 50397216,'FLOPS/rep': 56623104} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 32244000,'FLOPS/rep': 60342272} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} | {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777232,'FLOPS/rep': 2097152} | {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 1048576} | {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16859200,'FLOPS/rep': 4202500} | {'Problem size': 1046529,'Reps':50,'Iterations/rep': 83722320,'Kernels/rep': 2,'Bytes/rep': 1342176000,'FLOPS/rep': 418611600} | {'Problem size': 1048574,'Reps':100,'Iterations/rep': 33554368,'Kernels/rep': 32,'Bytes/rep': 536870400,'FLOPS/rep': 100663104} | {'Problem size': 1000000,'Reps':20,'Iterations/rep': 40000000,'Kernels/rep': 40,'Bytes/rep': 659200000,'FLOPS/rep': 600000000} | {'Problem size': 1050625,'Reps':120,'Iterations/rep': 1025,'Kernels/rep': 1,'Bytes/rep': 16834600,'FLOPS/rep': 4205575} | {'Problem size': 1050625,'Reps':20,'Iterations/rep': 3152900,'Kernels/rep': 
4,'Bytes/rep': 33726600,'FLOPS/rep': 10507275} | {'Problem size': 1050625,'Reps':4,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 28085000,'FLOPS/rep': 3783300625} | {'Problem size': 1050625,'Reps':8,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 16810000,'FLOPS/rep': 1076890625} | {'Problem size': 1049600,'Reps':8,'Iterations/rep': 125952040,'Kernels/rep': 160,'Bytes/rep': 3359048960,'FLOPS/rep': 461619200} | {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16851000,'FLOPS/rep': 4202500} | {'Problem size': 1046529,'Reps':4,'Iterations/rep': 8184,'Kernels/rep': 8,'Bytes/rep': 402652800,'FLOPS/rep': 142327944} | {'Problem size': 1050625,'Reps':2,'Iterations/rep': 3151875,'Kernels/rep': 3,'Bytes/rep': 78269000,'FLOPS/rep': 6797543750} | {'Problem size': 1050625,'Reps':2,'Iterations/rep': 2101250,'Kernels/rep': 2,'Bytes/rep': 51988000,'FLOPS/rep': 5683881250} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554400,'FLOPS/rep': 2097150} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 4194304} | {'Problem size': 1048576,'Reps':400,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 92274688,'FLOPS/rep': 17825792} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145728,'Kernels/rep': 3,'Bytes/rep': 150667584,'FLOPS/rep': 45957296} | {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165832,'FLOPS/rep': 5242880} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 2,'Bytes/rep': 83886080,'FLOPS/rep': 6291456} | {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777208,'FLOPS/rep': 1048575} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388640,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 
1048576,'Kernels/rep': 1,'Bytes/rep': 16777224,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554488,'FLOPS/rep': 16777216} | {'Problem size': 1048576,'Reps':200,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 167772160,'FLOPS/rep': 9437184} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 10485760} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777264,'FLOPS/rep': 2097154} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 4194328,'FLOPS/rep': 1048577} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} | {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} | {'Problem size': 1030301,'Reps':1000,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 8242408,'FLOPS/rep': 3090903} | {'Problem size': 1048576,'Reps':350,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} | {'Problem size': 1048576,'Reps':5,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 2147483648} | {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 1048576} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145729,'Kernels/rep': 3,'Bytes/rep': 27263004,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 10485776,'FLOPS/rep': 0} | {'Problem size': 1048576,'Reps':180,'Iterations/rep': 1048576,'Kernels/rep': 
1,'Bytes/rep': 41943040,'FLOPS/rep': 11534336} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} | {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} | Direct | default | blueos_3_ppc64le_ib_p9 | 128 | 11.2.152 | xl-2022.08.19 | xlc++_r-16.1.1.12 | 16.1.1.12 | /usr/tce/packages/cuda/cuda-11.2.0/bin/nvcc;/usr/tce/packages/xl/xl-2022.08.19/bin/xlc++_r | -qthreaded | Release | 2022.03.0 | 2022.3.0 | lassen | [./my-xl-block-128-build/bin/raja-perf.exe, --variants, Base_CUDA, --size, 1048576] | [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/workspace/asde/caliper-lassen/lib64/libcaliper.so.2, /usr/workspace/wsb/asde/adiak-lassen/lib/libadiak.so, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpiprofilesupport.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpi_ibm.so.3, /usr/tce/packages/xl/xl-2022.08.19/lib/libibmc++.so.1, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libstdc++.so.6, /lib64/libm.so.6, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libgcc_s.so.1, /lib64/libpthread.so.0, /lib64/libc.so.6, /lib64/ld64.so.2, /usr/tce/packages/cuda/cuda-10.1.243/nvidia/lib64/libnvToolsExt.so.1, /usr/tce/packages/cuda/cuda-10.1.243/bin/../extras/CUPTI/lib64/libcupti.so.10.1, /lib64/libpapi.so.5.2.0.0, /lib64/libdw.so.1, /lib64/libunwind.so.8, /lib64/libutil.so.1, /lib64/libz.so.1, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libhwloc_ompi.so.15, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent_pthreads-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-rte.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-pal.so.3, /lib64/libpfm.so.4, /lib64/libsensors.so.4, /lib64/libelf.so.1, 
/lib64/liblzma.so.5, /lib64/libbz2.so.1, /lib64/libnss_files.so.2] | 1668559517 | burgess9 | spot |
Composing multiple Thickets:
Thicket’s columnar_join function composes several thickets side by side, placing each thicket’s columns under its own top-level header. In this example, we join four thickets, one per CUDA block size (128, 256, 512, and 1024), each built from profiles of two problem sizes (1M and 4M).
[6]:
problem_sizes = ["1M", "4M"]
# Map each block size to its profiles, one file per problem size
data = {
    "block_128": [f"../data/lassen/new-cali/Base_CUDA-block_128-{i}.cali" for i in problem_sizes],
    "block_256": [f"../data/lassen/new-cali/Base_CUDA-block_256-{i}.cali" for i in problem_sizes],
    "block_512": [f"../data/lassen/new-cali/Base_CUDA-block_512-{i}.cali" for i in problem_sizes],
    "block_1024": [f"../data/lassen/new-cali/Base_CUDA-block_1024-{i}.cali" for i in problem_sizes],
}
[7]:
block_128 = tt.Thicket.from_caliperreader(data["block_128"])
block_256 = tt.Thicket.from_caliperreader(data["block_256"])
block_512 = tt.Thicket.from_caliperreader(data["block_512"])
block_1024 = tt.Thicket.from_caliperreader(data["block_1024"])
[8]:
th_cj = tt.Thicket.columnar_join(
    thicket_list=[block_128, block_256, block_512, block_1024],
    header_list=["Block 128", "Block 256", "Block 512", "Block 1024"],
    column_name="ProblemSizeRunParam",
)
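To build intuition for the shape of the joined result, the following is a minimal sketch in plain pandas. It is an analogy only and makes no claim about how thicket implements columnar_join. The frame names and the timing values are hypothetical and made up for illustration; only the header labels, the metric name, and the index names mirror the call above.

```python
import pandas as pd

# Hypothetical toy data: one small frame per block size, indexed by
# (node, problem size). The values are invented for illustration.
index = pd.MultiIndex.from_product(
    [["RAJAPerf", "Basic_DAXPY"], [1048576, 4194304]],
    names=["node", "ProblemSizeRunParam"],
)
block_128_df = pd.DataFrame({"Avg time/rank": [1.0, 2.0, 3.0, 4.0]}, index=index)
block_256_df = pd.DataFrame({"Avg time/rank": [1.5, 2.5, 3.5, 4.5]}, index=index)

# Place the frames side by side. `keys` becomes the top column level,
# playing a role similar to header_list in the columnar_join call.
joined = pd.concat(
    [block_128_df, block_256_df],
    axis=1,
    keys=["Block 128", "Block 256"],
)

# Columns are now addressed by (header, metric) tuples.
print(joined[("Block 256", "Avg time/rank")])
```

With this two-level column layout, a metric from one block size can be compared against the same metric from another by selecting the two (header, metric) columns, since both share the same row index.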
[9]:
print(th_cj.tree())
__ __ __ __
/ /_ ____ _/ /______/ /_ ___ / /_
/ __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
/ / / / /_/ / /_/ /__/ / / / __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/ v2023.1.0
-1.000 RAJAPerf
├─ -1.000 Algorithm
│ ├─ -1.000 Algorithm_MEMCPY
│ ├─ -1.000 Algorithm_MEMSET
│ └─ -1.000 Algorithm_REDUCE_SUM
├─ -1.000 Apps
│ ├─ -1.000 Apps_DEL_DOT_VEC_2D
│ ├─ -1.000 Apps_ENERGY
│ ├─ -1.000 Apps_FIR
│ ├─ -1.000 Apps_HALOEXCHANGE
│ ├─ -1.000 Apps_HALOEXCHANGE_FUSED
│ ├─ -1.000 Apps_LTIMES
│ ├─ -1.000 Apps_LTIMES_NOVIEW
│ ├─ -1.000 Apps_NODAL_ACCUMULATION_3D
│ ├─ -1.000 Apps_PRESSURE
│ └─ -1.000 Apps_VOL3D
├─ -1.000 Basic
│ ├─ -1.000 Basic_DAXPY
│ ├─ -1.000 Basic_DAXPY_ATOMIC
│ ├─ -1.000 Basic_IF_QUAD
│ ├─ -1.000 Basic_INDEXLIST
│ ├─ -1.000 Basic_INDEXLIST_3LOOP
│ ├─ -1.000 Basic_INIT3
│ ├─ -1.000 Basic_INIT_VIEW1D
│ ├─ -1.000 Basic_INIT_VIEW1D_OFFSET
│ ├─ -1.000 Basic_MAT_MAT_SHARED
│ ├─ -1.000 Basic_MULADDSUB
│ ├─ -1.000 Basic_NESTED_INIT
│ ├─ -1.000 Basic_PI_ATOMIC
│ ├─ -1.000 Basic_PI_REDUCE
│ ├─ -1.000 Basic_REDUCE3_INT
│ ├─ -1.000 Basic_REDUCE_STRUCT
│ └─ -1.000 Basic_TRAP_INT
├─ -1.000 Lcals
│ ├─ -1.000 Lcals_DIFF_PREDICT
│ ├─ -1.000 Lcals_EOS
│ ├─ -1.000 Lcals_FIRST_DIFF
│ ├─ -1.000 Lcals_FIRST_MIN
│ ├─ -1.000 Lcals_FIRST_SUM
│ ├─ -1.000 Lcals_GEN_LIN_RECUR
│ ├─ -1.000 Lcals_HYDRO_1D
│ ├─ -1.000 Lcals_HYDRO_2D
│ ├─ -1.000 Lcals_INT_PREDICT
│ ├─ -1.000 Lcals_PLANCKIAN
│ └─ -1.000 Lcals_TRIDIAG_ELIM
├─ -1.000 Polybench
│ ├─ -1.000 Polybench_2MM
│ ├─ -1.000 Polybench_3MM
│ ├─ -1.000 Polybench_ADI
│ ├─ -1.000 Polybench_ATAX
│ ├─ -1.000 Polybench_FDTD_2D
│ ├─ -1.000 Polybench_FLOYD_WARSHALL
│ ├─ -1.000 Polybench_GEMM
│ ├─ -1.000 Polybench_GEMVER
│ ├─ -1.000 Polybench_GESUMMV
│ ├─ -1.000 Polybench_HEAT_3D
│ ├─ -1.000 Polybench_JACOBI_1D
│ ├─ -1.000 Polybench_JACOBI_2D
│ └─ -1.000 Polybench_MVT
└─ -1.000 Stream
  ├─ -1.000 Stream_ADD
  ├─ -1.000 Stream_COPY
  ├─ -1.000 Stream_DOT
  ├─ -1.000 Stream_MUL
  └─ -1.000 Stream_TRIAD
Legend (Metric: thicket_tree Min: -1.00 Max: -1.00)
█ -1.00 - -1.00
█ -1.00 - -1.00
█ -1.00 - -1.00
█ -1.00 - -1.00
█ -1.00 - -1.00
█ -1.00 - -1.00
name User code ◀ Only in left graph ▶ Only in right graph
[10]:
display(HTML(th_cj.dataframe.to_html()))
Block 128 | Block 256 | Block 512 | Block 1024 | name | ||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
nid | spot.channel | Min time/rank | Max time/rank | Avg time/rank | Total time | Bytes/Rep | Flops/Rep | Iterations/Rep | Kernels/Rep | ProblemSize | Reps | nid | spot.channel | Min time/rank | Max time/rank | Avg time/rank | Total time | Bytes/Rep | Flops/Rep | Iterations/Rep | Kernels/Rep | ProblemSize | Reps | nid | spot.channel | Min time/rank | Max time/rank | Avg time/rank | Total time | Bytes/Rep | Flops/Rep | Iterations/Rep | Kernels/Rep | ProblemSize | Reps | nid | spot.channel | Min time/rank | Max time/rank | Avg time/rank | Total time | Bytes/Rep | Flops/Rep | Iterations/Rep | Kernels/Rep | ProblemSize | Reps | |||
node | ProblemSizeRunParam | |||||||||||||||||||||||||||||||||||||||||||||||||
{'name': 'RAJAPerf', 'type': 'function'} | 1048576.0 | 0.0 | regionprofile | 1.772533 | 1.772533 | 1.772533 | 1.772533 | 3.359049e+09 | 6.797544e+09 | 125952040.0 | 160.0 | 1135363.0 | 2500.0 | 0 | regionprofile | 1.763211 | 1.763211 | 1.763211 | 1.763211 | 3.359049e+09 | 6.797544e+09 | 125952040.0 | 160.0 | 1135363.0 | 2500.0 | 0.0 | regionprofile | 1.836441 | 1.836441 | 1.836441 | 1.836441 | 3.359049e+09 | 6.797544e+09 | 125952040.0 | 160.0 | 1135363.0 | 2500.0 | 0.0 | regionprofile | 2.159525 | 2.159525 | 2.159525 | 2.159525 | 3.359049e+09 | 6.797544e+09 | 125952040.0 | 160.0 | 1135363.0 | 2500.0 | RAJAPerf |
4194304.0 | 0.0 | regionprofile | 6.716152 | 6.716152 | 6.716152 | 6.716152 | 1.342898e+10 | 3.576198e+10 | 503562280.0 | 160.0 | 4437343.0 | 2500.0 | 0 | regionprofile | 6.727046 | 6.727046 | 6.727046 | 6.727046 | 1.342898e+10 | 3.576198e+10 | 503562280.0 | 160.0 | 4437343.0 | 2500.0 | 0.0 | regionprofile | 6.807905 | 6.807905 | 6.807905 | 6.807905 | 1.342898e+10 | 3.576198e+10 | 503562280.0 | 160.0 | 4437343.0 | 2500.0 | 0.0 | regionprofile | 7.424382 | 7.424382 | 7.424382 | 7.424382 | 1.342898e+10 | 3.576198e+10 | 503562280.0 | 160.0 | 4437343.0 | 2500.0 | RAJAPerf | |
{'name': 'Algorithm', 'type': 'function'} | 1048576.0 | 58.0 | regionprofile | 0.006471 | 0.006471 | 0.006471 | 0.006471 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 61 | regionprofile | 0.005986 | 0.005986 | 0.005986 | 0.005986 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 58.0 | regionprofile | 0.006234 | 0.006234 | 0.006234 | 0.006234 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 59.0 | regionprofile | 0.006306 | 0.006306 | 0.006306 | 0.006306 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | Algorithm |
4194304.0 | 58.0 | regionprofile | 0.018733 | 0.018733 | 0.018733 | 0.018733 | 6.710886e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 61 | regionprofile | 0.016742 | 0.016742 | 0.016742 | 0.016742 | 6.710886e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 58.0 | regionprofile | 0.017144 | 0.017144 | 0.017144 | 0.017144 | 6.710886e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 59.0 | regionprofile | 0.017673 | 0.017673 | 0.017673 | 0.017673 | 6.710886e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | Algorithm | |
{'name': 'Algorithm_MEMCPY', 'type': 'function'} | 1048576.0 | 61.0 | regionprofile | 0.002441 | 0.002441 | 0.002441 | 0.002441 | 1.677722e+07 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 64 | regionprofile | 0.002444 | 0.002444 | 0.002444 | 0.002444 | 1.677722e+07 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 61.0 | regionprofile | 0.002502 | 0.002502 | 0.002502 | 0.002502 | 1.677722e+07 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 62.0 | regionprofile | 0.002497 | 0.002497 | 0.002497 | 0.002497 | 1.677722e+07 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | Algorithm_MEMCPY |
4194304.0 | 61.0 | regionprofile | 0.008628 | 0.008628 | 0.008628 | 0.008628 | 6.710886e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 64 | regionprofile | 0.008647 | 0.008647 | 0.008647 | 0.008647 | 6.710886e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 61.0 | regionprofile | 0.008661 | 0.008661 | 0.008661 | 0.008661 | 6.710886e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 62.0 | regionprofile | 0.008700 | 0.008700 | 0.008700 | 0.008700 | 6.710886e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | Algorithm_MEMCPY | |
{'name': 'Algorithm_MEMSET', 'type': 'function'} | 1048576.0 | 60.0 | regionprofile | 0.001485 | 0.001485 | 0.001485 | 0.001485 | 8.388616e+06 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 63 | regionprofile | 0.001248 | 0.001248 | 0.001248 | 0.001248 | 8.388616e+06 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 60.0 | regionprofile | 0.001248 | 0.001248 | 0.001248 | 0.001248 | 8.388616e+06 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 61.0 | regionprofile | 0.001248 | 0.001248 | 0.001248 | 0.001248 | 8.388616e+06 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | Algorithm_MEMSET |
4194304.0 | 60.0 | regionprofile | 0.004979 | 0.004979 | 0.004979 | 0.004979 | 3.355444e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 63 | regionprofile | 0.004082 | 0.004082 | 0.004082 | 0.004082 | 3.355444e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 60.0 | regionprofile | 0.004070 | 0.004070 | 0.004070 | 0.004070 | 3.355444e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 61.0 | regionprofile | 0.004083 | 0.004083 | 0.004083 | 0.004083 | 3.355444e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | Algorithm_MEMSET | |
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} | 1048576.0 | 59.0 | regionprofile | 0.002526 | 0.002526 | 0.002526 | 0.002526 | 8.388616e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 62 | regionprofile | 0.002276 | 0.002276 | 0.002276 | 0.002276 | 8.388616e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 59.0 | regionprofile | 0.002465 | 0.002465 | 0.002465 | 0.002465 | 8.388616e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 60.0 | regionprofile | 0.002543 | 0.002543 | 0.002543 | 0.002543 | 8.388616e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | Algorithm_REDUCE_SUM |
4194304.0 | 59.0 | regionprofile | 0.005103 | 0.005103 | 0.005103 | 0.005103 | 3.355444e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 62 | regionprofile | 0.003991 | 0.003991 | 0.003991 | 0.003991 | 3.355444e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 59.0 | regionprofile | 0.004393 | 0.004393 | 0.004393 | 0.004393 | 3.355444e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 60.0 | regionprofile | 0.004869 | 0.004869 | 0.004869 | 0.004869 | 3.355444e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | Algorithm_REDUCE_SUM | |
{'name': 'Apps', 'type': 'function'} | 1048576.0 | 47.0 | regionprofile | 0.173218 | 0.173218 | 0.173218 | 0.173218 | 2.264924e+08 | 8.174614e+07 | 6291456.0 | 156.0 | 1135363.0 | 700.0 | 50 | regionprofile | 0.171198 | 0.171198 | 0.171198 | 0.171198 | 2.264924e+08 | 8.174614e+07 | 6291456.0 | 156.0 | 1135363.0 | 700.0 | 47.0 | regionprofile | 0.170593 | 0.170593 | 0.170593 | 0.170593 | 2.264924e+08 | 8.174614e+07 | 6291456.0 | 156.0 | 1135363.0 | 700.0 | 48.0 | regionprofile | 0.175296 | 0.175296 | 0.175296 | 0.175296 | 2.264924e+08 | 8.174614e+07 | 6291456.0 | 156.0 | 1135363.0 | 700.0 | Apps |
4194304.0 | 47.0 | regionprofile | 0.548273 | 0.548273 | 0.548273 | 0.548273 | 9.059697e+08 | 3.194887e+08 | 25165824.0 | 156.0 | 4437343.0 | 700.0 | 50 | regionprofile | 0.564474 | 0.564474 | 0.564474 | 0.564474 | 9.059697e+08 | 3.194887e+08 | 25165824.0 | 156.0 | 4437343.0 | 700.0 | 47.0 | regionprofile | 0.564357 | 0.564357 | 0.564357 | 0.564357 | 9.059697e+08 | 3.194887e+08 | 25165824.0 | 156.0 | 4437343.0 | 700.0 | 48.0 | regionprofile | 0.572482 | 0.572482 | 0.572482 | 0.572482 | 9.059697e+08 | 3.194887e+08 | 25165824.0 | 156.0 | 4437343.0 | 700.0 | Apps | |
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} | 1048576.0 | 48.0 | regionprofile | 0.007075 | 0.007075 | 0.007075 | 0.007075 | 5.039722e+07 | 5.662310e+07 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 51 | regionprofile | 0.007010 | 0.007010 | 0.007010 | 0.007010 | 5.039722e+07 | 5.662310e+07 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 48.0 | regionprofile | 0.007162 | 0.007162 | 0.007162 | 0.007162 | 5.039722e+07 | 5.662310e+07 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 49.0 | regionprofile | 0.008157 | 0.008157 | 0.008157 | 0.008157 | 5.039722e+07 | 5.662310e+07 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | Apps_DEL_DOT_VEC_2D |
4194304.0 | 48.0 | regionprofile | 0.026371 | 0.026371 | 0.026371 | 0.026371 | 2.014577e+08 | 2.264924e+08 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 51 | regionprofile | 0.026163 | 0.026163 | 0.026163 | 0.026163 | 2.014577e+08 | 2.264924e+08 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 48.0 | regionprofile | 0.026415 | 0.026415 | 0.026415 | 0.026415 | 2.014577e+08 | 2.264924e+08 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 49.0 | regionprofile | 0.029889 | 0.029889 | 0.029889 | 0.029889 | 2.014577e+08 | 2.264924e+08 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | Apps_DEL_DOT_VEC_2D | |
{'name': 'Apps_ENERGY', 'type': 'function'} | 1048576.0 | 49.0 | regionprofile | 0.039007 | 0.039007 | 0.039007 | 0.039007 | 2.264924e+08 | 5.767168e+07 | 6291456.0 | 6.0 | 1048576.0 | 130.0 | 52 | regionprofile | 0.039096 | 0.039096 | 0.039096 | 0.039096 | 2.264924e+08 | 5.767168e+07 | 6291456.0 | 6.0 | 1048576.0 | 130.0 | 49.0 | regionprofile | 0.039065 | 0.039065 | 0.039065 | 0.039065 | 2.264924e+08 | 5.767168e+07 | 6291456.0 | 6.0 | 1048576.0 | 130.0 | 50.0 | regionprofile | 0.039312 | 0.039312 | 0.039312 | 0.039312 | 2.264924e+08 | 5.767168e+07 | 6291456.0 | 6.0 | 1048576.0 | 130.0 | Apps_ENERGY |
4194304.0 | 49.0 | regionprofile | 0.147485 | 0.147485 | 0.147485 | 0.147485 | 9.059697e+08 | 2.306867e+08 | 25165824.0 | 6.0 | 4194304.0 | 130.0 | 52 | regionprofile | 0.146777 | 0.146777 | 0.146777 | 0.146777 | 9.059697e+08 | 2.306867e+08 | 25165824.0 | 6.0 | 4194304.0 | 130.0 | 49.0 | regionprofile | 0.146717 | 0.146717 | 0.146717 | 0.146717 | 9.059697e+08 | 2.306867e+08 | 25165824.0 | 6.0 | 4194304.0 | 130.0 | 50.0 | regionprofile | 0.146882 | 0.146882 | 0.146882 | 0.146882 | 9.059697e+08 | 2.306867e+08 | 25165824.0 | 6.0 | 4194304.0 | 130.0 | Apps_ENERGY | |
{'name': 'Apps_FIR', 'type': 'function'} | 1048576.0 | 50.0 | regionprofile | 0.004151 | 0.004151 | 0.004151 | 0.004151 | 1.677709e+07 | 3.355392e+07 | 1048560.0 | 1.0 | 1048576.0 | 160.0 | 53 | regionprofile | 0.004203 | 0.004203 | 0.004203 | 0.004203 | 1.677709e+07 | 3.355392e+07 | 1048560.0 | 1.0 | 1048576.0 | 160.0 | 50.0 | regionprofile | 0.004278 | 0.004278 | 0.004278 | 0.004278 | 1.677709e+07 | 3.355392e+07 | 1048560.0 | 1.0 | 1048576.0 | 160.0 | 51.0 | regionprofile | 0.004374 | 0.004374 | 0.004374 | 0.004374 | 1.677709e+07 | 3.355392e+07 | 1048560.0 | 1.0 | 1048576.0 | 160.0 | Apps_FIR |
4194304.0 | 50.0 | regionprofile | 0.014257 | 0.014257 | 0.014257 | 0.014257 | 6.710874e+07 | 1.342172e+08 | 4194288.0 | 1.0 | 4194304.0 | 160.0 | 53 | regionprofile | 0.014324 | 0.014324 | 0.014324 | 0.014324 | 6.710874e+07 | 1.342172e+08 | 4194288.0 | 1.0 | 4194304.0 | 160.0 | 50.0 | regionprofile | 0.014437 | 0.014437 | 0.014437 | 0.014437 | 6.710874e+07 | 1.342172e+08 | 4194288.0 | 1.0 | 4194304.0 | 160.0 | 51.0 | regionprofile | 0.014574 | 0.014574 | 0.014574 | 0.014574 | 6.710874e+07 | 1.342172e+08 | 4194288.0 | 1.0 | 4194304.0 | 160.0 | Apps_FIR | |
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} | 1048576.0 | 51.0 | regionprofile | 0.031788 | 0.031788 | 0.031788 | 0.031788 | 7.491120e+06 | 0.000000e+00 | 187278.0 | 156.0 | 1030301.0 | 50.0 | 54 | regionprofile | 0.031574 | 0.031574 | 0.031574 | 0.031574 | 7.491120e+06 | 0.000000e+00 | 187278.0 | 156.0 | 1030301.0 | 50.0 | 51.0 | regionprofile | 0.031545 | 0.031545 | 0.031545 | 0.031545 | 7.491120e+06 | 0.000000e+00 | 187278.0 | 156.0 | 1030301.0 | 50.0 | 52.0 | regionprofile | 0.031682 | 0.031682 | 0.031682 | 0.031682 | 7.491120e+06 | 0.000000e+00 | 187278.0 | 156.0 | 1030301.0 | 50.0 | Apps_HALOEXCHANGE |
4194304.0 | 51.0 | regionprofile | 0.032575 | 0.032575 | 0.032575 | 0.032575 | 1.889592e+07 | 0.000000e+00 | 472398.0 | 156.0 | 4173281.0 | 50.0 | 54 | regionprofile | 0.032425 | 0.032425 | 0.032425 | 0.032425 | 1.889592e+07 | 0.000000e+00 | 472398.0 | 156.0 | 4173281.0 | 50.0 | 51.0 | regionprofile | 0.032377 | 0.032377 | 0.032377 | 0.032377 | 1.889592e+07 | 0.000000e+00 | 472398.0 | 156.0 | 4173281.0 | 50.0 | 52.0 | regionprofile | 0.032624 | 0.032624 | 0.032624 | 0.032624 | 1.889592e+07 | 0.000000e+00 | 472398.0 | 156.0 | 4173281.0 | 50.0 | Apps_HALOEXCHANGE | |
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} | 1048576.0 | 52.0 | regionprofile | 0.005385 | 0.005385 | 0.005385 | 0.005385 | 7.491120e+06 | 0.000000e+00 | 187278.0 | 2.0 | 1030301.0 | 50.0 | 55 | regionprofile | 0.003587 | 0.003587 | 0.003587 | 0.003587 | 7.491120e+06 | 0.000000e+00 | 187278.0 | 2.0 | 1030301.0 | 50.0 | 52.0 | regionprofile | 0.002755 | 0.002755 | 0.002755 | 0.002755 | 7.491120e+06 | 0.000000e+00 | 187278.0 | 2.0 | 1030301.0 | 50.0 | 53.0 | regionprofile | 0.002732 | 0.002732 | 0.002732 | 0.002732 | 7.491120e+06 | 0.000000e+00 | 187278.0 | 2.0 | 1030301.0 | 50.0 | Apps_HALOEXCHANGE_FUSED |
4194304.0 | 52.0 | regionprofile | 0.008215 | 0.008215 | 0.008215 | 0.008215 | 1.889592e+07 | 0.000000e+00 | 472398.0 | 2.0 | 4173281.0 | 50.0 | 55 | regionprofile | 0.008030 | 0.008030 | 0.008030 | 0.008030 | 1.889592e+07 | 0.000000e+00 | 472398.0 | 2.0 | 4173281.0 | 50.0 | 52.0 | regionprofile | 0.007012 | 0.007012 | 0.007012 | 0.007012 | 1.889592e+07 | 0.000000e+00 | 472398.0 | 2.0 | 4173281.0 | 50.0 | 53.0 | regionprofile | 0.006375 | 0.006375 | 0.006375 | 0.006375 | 1.889592e+07 | 0.000000e+00 | 472398.0 | 2.0 | 4173281.0 | 50.0 | Apps_HALOEXCHANGE_FUSED | |
{'name': 'Apps_LTIMES', 'type': 'function'} | 1048576.0 | 53.0 | regionprofile | 0.012112 | 0.012112 | 0.012112 | 0.012112 | 1.495501e+07 | 5.242880e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 56 | regionprofile | 0.012074 | 0.012074 | 0.012074 | 0.012074 | 1.495501e+07 | 5.242880e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 53.0 | regionprofile | 0.012070 | 0.012070 | 0.012070 | 0.012070 | 1.495501e+07 | 5.242880e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 54.0 | regionprofile | 0.012949 | 0.012949 | 0.012949 | 0.012949 | 1.495501e+07 | 5.242880e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | Apps_LTIMES |
4194304.0 | 53.0 | regionprofile | 0.047001 | 0.047001 | 0.047001 | 0.047001 | 5.978163e+07 | 2.097152e+08 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 56 | regionprofile | 0.055041 | 0.055041 | 0.055041 | 0.055041 | 5.978163e+07 | 2.097152e+08 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 53.0 | regionprofile | 0.055581 | 0.055581 | 0.055581 | 0.055581 | 5.978163e+07 | 2.097152e+08 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 54.0 | regionprofile | 0.055605 | 0.055605 | 0.055605 | 0.055605 | 5.978163e+07 | 2.097152e+08 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | Apps_LTIMES | |
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} | 1048576.0 | 54.0 | regionprofile | 0.012124 | 0.012124 | 0.012124 | 0.012124 | 1.495501e+07 | 5.242880e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 57 | regionprofile | 0.012075 | 0.012075 | 0.012075 | 0.012075 | 1.495501e+07 | 5.242880e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 54.0 | regionprofile | 0.012069 | 0.012069 | 0.012069 | 0.012069 | 1.495501e+07 | 5.242880e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 55.0 | regionprofile | 0.012946 | 0.012946 | 0.012946 | 0.012946 | 1.495501e+07 | 5.242880e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | Apps_LTIMES_NOVIEW |
4194304.0 | 54.0 | regionprofile | 0.047010 | 0.047010 | 0.047010 | 0.047010 | 5.978163e+07 | 2.097152e+08 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 57 | regionprofile | 0.055052 | 0.055052 | 0.055052 | 0.055052 | 5.978163e+07 | 2.097152e+08 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 54.0 | regionprofile | 0.055579 | 0.055579 | 0.055579 | 0.055579 | 5.978163e+07 | 2.097152e+08 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 55.0 | regionprofile | 0.055599 | 0.055599 | 0.055599 | 0.055599 | 5.978163e+07 | 2.097152e+08 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | Apps_LTIMES_NOVIEW | |
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} | 1048576.0 | 55.0 | regionprofile | 0.007383 | 0.007383 | 0.007383 | 0.007383 | 3.346414e+07 | 9.272709e+06 | 1030301.0 | 1.0 | 1030301.0 | 100.0 | 58 | regionprofile | 0.007599 | 0.007599 | 0.007599 | 0.007599 | 3.346414e+07 | 9.272709e+06 | 1030301.0 | 1.0 | 1030301.0 | 100.0 | 55.0 | regionprofile | 0.007695 | 0.007695 | 0.007695 | 0.007695 | 3.346414e+07 | 9.272709e+06 | 1030301.0 | 1.0 | 1030301.0 | 100.0 | 56.0 | regionprofile | 0.007855 | 0.007855 | 0.007855 | 0.007855 | 3.346414e+07 | 9.272709e+06 | 1030301.0 | 1.0 | 1030301.0 | 100.0 | Apps_NODAL_ACCUMULATION_3D |
4194304.0 | 55.0 | regionprofile | 0.027353 | 0.027353 | 0.027353 | 0.027353 | 1.347969e+08 | 3.755953e+07 | 4173281.0 | 1.0 | 4173281.0 | 100.0 | 58 | regionprofile | 0.029700 | 0.029700 | 0.029700 | 0.029700 | 1.347969e+08 | 3.755953e+07 | 4173281.0 | 1.0 | 4173281.0 | 100.0 | 55.0 | regionprofile | 0.029550 | 0.029550 | 0.029550 | 0.029550 | 1.347969e+08 | 3.755953e+07 | 4173281.0 | 1.0 | 4173281.0 | 100.0 | 56.0 | regionprofile | 0.030351 | 0.030351 | 0.030351 | 0.030351 | 1.347969e+08 | 3.755953e+07 | 4173281.0 | 1.0 | 4173281.0 | 100.0 | Apps_NODAL_ACCUMULATION_3D | |
{'name': 'Apps_PRESSURE', 'type': 'function'} | 1048576.0 | 56.0 | regionprofile | 0.048319 | 0.048319 | 0.048319 | 0.048319 | 4.194304e+07 | 3.145728e+06 | 2097152.0 | 2.0 | 1048576.0 | 700.0 | 59 | regionprofile | 0.048264 | 0.048264 | 0.048264 | 0.048264 | 4.194304e+07 | 3.145728e+06 | 2097152.0 | 2.0 | 1048576.0 | 700.0 | 56.0 | regionprofile | 0.048032 | 0.048032 | 0.048032 | 0.048032 | 4.194304e+07 | 3.145728e+06 | 2097152.0 | 2.0 | 1048576.0 | 700.0 | 57.0 | regionprofile | 0.048526 | 0.048526 | 0.048526 | 0.048526 | 4.194304e+07 | 3.145728e+06 | 2097152.0 | 2.0 | 1048576.0 | 700.0 | Apps_PRESSURE |
4194304.0 | 56.0 | regionprofile | 0.176850 | 0.176850 | 0.176850 | 0.176850 | 1.677722e+08 | 1.258291e+07 | 8388608.0 | 2.0 | 4194304.0 | 700.0 | 59 | regionprofile | 0.176168 | 0.176168 | 0.176168 | 0.176168 | 1.677722e+08 | 1.258291e+07 | 8388608.0 | 2.0 | 4194304.0 | 700.0 | 56.0 | regionprofile | 0.176042 | 0.176042 | 0.176042 | 0.176042 | 1.677722e+08 | 1.258291e+07 | 8388608.0 | 2.0 | 4194304.0 | 700.0 | 57.0 | regionprofile | 0.176699 | 0.176699 | 0.176699 | 0.176699 | 1.677722e+08 | 1.258291e+07 | 8388608.0 | 2.0 | 4194304.0 | 700.0 | Apps_PRESSURE | |
{'name': 'Apps_VOL3D', 'type': 'function'} | 1048576.0 | 57.0 | regionprofile | 0.005813 | 0.005813 | 0.005813 | 0.005813 | 3.659876e+07 | 8.174614e+07 | 1135363.0 | 1.0 | 1135363.0 | 100.0 | 60 | regionprofile | 0.005648 | 0.005648 | 0.005648 | 0.005648 | 3.659876e+07 | 8.174614e+07 | 1135363.0 | 1.0 | 1135363.0 | 100.0 | 57.0 | regionprofile | 0.005859 | 0.005859 | 0.005859 | 0.005859 | 3.659876e+07 | 8.174614e+07 | 1135363.0 | 1.0 | 1135363.0 | 100.0 | 58.0 | regionprofile | 0.006691 | 0.006691 | 0.006691 | 0.006691 | 3.659876e+07 | 8.174614e+07 | 1135363.0 | 1.0 | 1135363.0 | 100.0 | Apps_VOL3D |
4194304.0 | 57.0 | regionprofile | 0.021084 | 0.021084 | 0.021084 | 0.021084 | 1.426524e+08 | 3.194887e+08 | 4437343.0 | 1.0 | 4437343.0 | 100.0 | 60 | regionprofile | 0.020718 | 0.020718 | 0.020718 | 0.020718 | 1.426524e+08 | 3.194887e+08 | 4437343.0 | 1.0 | 4437343.0 | 100.0 | 57.0 | regionprofile | 0.020576 | 0.020576 | 0.020576 | 0.020576 | 1.426524e+08 | 3.194887e+08 | 4437343.0 | 1.0 | 4437343.0 | 100.0 | 58.0 | regionprofile | 0.023812 | 0.023812 | 0.023812 | 0.023812 | 1.426524e+08 | 3.194887e+08 | 4437343.0 | 1.0 | 4437343.0 | 100.0 | Apps_VOL3D | |
{'name': 'Basic', 'type': 'function'} | 1048576.0 | 1.0 | regionprofile | 0.347801 | 0.347801 | 0.347801 | 0.347801 | 4.194304e+07 | 1.153434e+07 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | 1 | regionprofile | 0.333475 | 0.333475 | 0.333475 | 0.333475 | 4.194304e+07 | 2.147484e+09 | 3145729.0 | 3.0 | 1048576.0 | 2500.0 | 1.0 | regionprofile | 0.310439 | 0.310439 | 0.310439 | 0.310439 | 4.194304e+07 | 1.153434e+07 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | 1.0 | regionprofile | 0.314681 | 0.314681 | 0.314681 | 0.314681 | 4.194304e+07 | 2.147484e+09 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | Basic |
4194304.0 | 1.0 | regionprofile | 1.181348 | 1.181348 | 1.181348 | 1.181348 | 1.677722e+08 | 4.613734e+07 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | 1 | regionprofile | 1.162457 | 1.162457 | 1.162457 | 1.162457 | 1.677722e+08 | 1.717987e+10 | 12582913.0 | 3.0 | 4194304.0 | 2500.0 | 1.0 | regionprofile | 1.070884 | 1.070884 | 1.070884 | 1.070884 | 1.677722e+08 | 4.613734e+07 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | 1.0 | regionprofile | 1.113558 | 1.113558 | 1.113558 | 1.113558 | 1.677722e+08 | 1.717987e+10 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | Basic | |
{'name': 'Basic_DAXPY', 'type': 'function'} | 1048576.0 | 2.0 | regionprofile | 0.016944 | 0.016944 | 0.016944 | 0.016944 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 2 | regionprofile | 0.016949 | 0.016949 | 0.016949 | 0.016949 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 2.0 | regionprofile | 0.016973 | 0.016973 | 0.016973 | 0.016973 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 2.0 | regionprofile | 0.017015 | 0.017015 | 0.017015 | 0.017015 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | Basic_DAXPY |
4194304.0 | 2.0 | regionprofile | 0.061445 | 0.061445 | 0.061445 | 0.061445 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 2 | regionprofile | 0.061474 | 0.061474 | 0.061474 | 0.061474 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 2.0 | regionprofile | 0.061560 | 0.061560 | 0.061560 | 0.061560 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 2.0 | regionprofile | 0.061673 | 0.061673 | 0.061673 | 0.061673 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | Basic_DAXPY | |
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} | 1048576.0 | 3.0 | regionprofile | 0.016943 | 0.016943 | 0.016943 | 0.016943 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 3 | regionprofile | 0.016936 | 0.016936 | 0.016936 | 0.016936 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 3.0 | regionprofile | 0.016874 | 0.016874 | 0.016874 | 0.016874 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 3.0 | regionprofile | 0.016940 | 0.016940 | 0.016940 | 0.016940 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | Basic_DAXPY_ATOMIC |
4194304.0 | 3.0 | regionprofile | 0.061938 | 0.061938 | 0.061938 | 0.061938 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 3 | regionprofile | 0.061883 | 0.061883 | 0.061883 | 0.061883 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 3.0 | regionprofile | 0.061990 | 0.061990 | 0.061990 | 0.061990 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 3.0 | regionprofile | 0.062022 | 0.062022 | 0.062022 | 0.062022 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | Basic_DAXPY_ATOMIC | |
{'name': 'Basic_IF_QUAD', 'type': 'function'} | 1048576.0 | 4.0 | regionprofile | 0.012530 | 0.012530 | 0.012530 | 0.012530 | 4.194304e+07 | 1.153434e+07 | 1048576.0 | 1.0 | 1048576.0 | 180.0 | 4 | regionprofile | 0.012603 | 0.012603 | 0.012603 | 0.012603 | 4.194304e+07 | 1.153434e+07 | 1048576.0 | 1.0 | 1048576.0 | 180.0 | 4.0 | regionprofile | 0.012496 | 0.012496 | 0.012496 | 0.012496 | 4.194304e+07 | 1.153434e+07 | 1048576.0 | 1.0 | 1048576.0 | 180.0 | 4.0 | regionprofile | 0.012786 | 0.012786 | 0.012786 | 0.012786 | 4.194304e+07 | 1.153434e+07 | 1048576.0 | 1.0 | 1048576.0 | 180.0 | Basic_IF_QUAD |
4194304.0 | 4.0 | regionprofile | 0.047675 | 0.047675 | 0.047675 | 0.047675 | 1.677722e+08 | 4.613734e+07 | 4194304.0 | 1.0 | 4194304.0 | 180.0 | 4 | regionprofile | 0.047554 | 0.047554 | 0.047554 | 0.047554 | 1.677722e+08 | 4.613734e+07 | 4194304.0 | 1.0 | 4194304.0 | 180.0 | 4.0 | regionprofile | 0.047973 | 0.047973 | 0.047973 | 0.047973 | 1.677722e+08 | 4.613734e+07 | 4194304.0 | 1.0 | 4194304.0 | 180.0 | 4.0 | regionprofile | 0.048466 | 0.048466 | 0.048466 | 0.048466 | 1.677722e+08 | 4.613734e+07 | 4194304.0 | 1.0 | 4194304.0 | 180.0 | Basic_IF_QUAD | |
{'name': 'Basic_INIT3', 'type': 'function'} | 1048576.0 | 5.0 | regionprofile | 0.027995 | 0.027995 | 0.027995 | 0.027995 | 4.194304e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 7 | regionprofile | 0.028340 | 0.028340 | 0.028340 | 0.028340 | 4.194304e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 5.0 | regionprofile | 0.027850 | 0.027850 | 0.027850 | 0.027850 | 4.194304e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 5.0 | regionprofile | 0.028308 | 0.028308 | 0.028308 | 0.028308 | 4.194304e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | Basic_INIT3 |
4194304.0 | 5.0 | regionprofile | 0.105995 | 0.105995 | 0.105995 | 0.105995 | 1.677722e+08 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 7 | regionprofile | 0.106222 | 0.106222 | 0.106222 | 0.106222 | 1.677722e+08 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 5.0 | regionprofile | 0.106498 | 0.106498 | 0.106498 | 0.106498 | 1.677722e+08 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 5.0 | regionprofile | 0.106129 | 0.106129 | 0.106129 | 0.106129 | 1.677722e+08 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | Basic_INIT3 | |
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} | 1048576.0 | 6.0 | regionprofile | 0.042143 | 0.042143 | 0.042143 | 0.042143 | 8.388608e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | 8 | regionprofile | 0.031319 | 0.031319 | 0.031319 | 0.031319 | 8.388608e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | 6.0 | regionprofile | 0.031419 | 0.031419 | 0.031419 | 0.031419 | 8.388608e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | 6.0 | regionprofile | 0.031429 | 0.031429 | 0.031429 | 0.031429 | 8.388608e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | Basic_INIT_VIEW1D |
4194304.0 | 6.0 | regionprofile | 0.128866 | 0.128866 | 0.128866 | 0.128866 | 3.355443e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | 8 | regionprofile | 0.101000 | 0.101000 | 0.101000 | 0.101000 | 3.355443e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | 6.0 | regionprofile | 0.101011 | 0.101011 | 0.101011 | 0.101011 | 3.355443e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | 6.0 | regionprofile | 0.101026 | 0.101026 | 0.101026 | 0.101026 | 3.355443e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | Basic_INIT_VIEW1D | |
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} | 1048576.0 | 7.0 | regionprofile | 0.042177 | 0.042177 | 0.042177 | 0.042177 | 8.388608e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | 9 | regionprofile | 0.031324 | 0.031324 | 0.031324 | 0.031324 | 8.388608e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | 7.0 | regionprofile | 0.031376 | 0.031376 | 0.031376 | 0.031376 | 8.388608e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | 7.0 | regionprofile | 0.031453 | 0.031453 | 0.031453 | 0.031453 | 8.388608e+06 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2500.0 | Basic_INIT_VIEW1D_OFFSET |
4194304.0 | 7.0 | regionprofile | 0.123683 | 0.123683 | 0.123683 | 0.123683 | 3.355443e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | 9 | regionprofile | 0.100943 | 0.100943 | 0.100943 | 0.100943 | 3.355443e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | 7.0 | regionprofile | 0.100950 | 0.100950 | 0.100950 | 0.100950 | 3.355443e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | 7.0 | regionprofile | 0.101024 | 0.101024 | 0.101024 | 0.101024 | 3.355443e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2500.0 | Basic_INIT_VIEW1D_OFFSET | |
{'name': 'Basic_MULADDSUB', 'type': 'function'} | 1048576.0 | 8.0 | regionprofile | 0.019797 | 0.019797 | 0.019797 | 0.019797 | 4.194304e+07 | 3.145728e+06 | 1048576.0 | 1.0 | 1048576.0 | 350.0 | 11 | regionprofile | 0.019799 | 0.019799 | 0.019799 | 0.019799 | 4.194304e+07 | 3.145728e+06 | 1048576.0 | 1.0 | 1048576.0 | 350.0 | 8.0 | regionprofile | 0.019521 | 0.019521 | 0.019521 | 0.019521 | 4.194304e+07 | 3.145728e+06 | 1048576.0 | 1.0 | 1048576.0 | 350.0 | 9.0 | regionprofile | 0.019806 | 0.019806 | 0.019806 | 0.019806 | 4.194304e+07 | 3.145728e+06 | 1048576.0 | 1.0 | 1048576.0 | 350.0 | Basic_MULADDSUB |
4194304.0 | 8.0 | regionprofile | 0.074822 | 0.074822 | 0.074822 | 0.074822 | 1.677722e+08 | 1.258291e+07 | 4194304.0 | 1.0 | 4194304.0 | 350.0 | 11 | regionprofile | 0.074568 | 0.074568 | 0.074568 | 0.074568 | 1.677722e+08 | 1.258291e+07 | 4194304.0 | 1.0 | 4194304.0 | 350.0 | 8.0 | regionprofile | 0.074743 | 0.074743 | 0.074743 | 0.074743 | 1.677722e+08 | 1.258291e+07 | 4194304.0 | 1.0 | 4194304.0 | 350.0 | 9.0 | regionprofile | 0.074239 | 0.074239 | 0.074239 | 0.074239 | 1.677722e+08 | 1.258291e+07 | 4194304.0 | 1.0 | 4194304.0 | 350.0 | Basic_MULADDSUB | |
{'name': 'Basic_NESTED_INIT', 'type': 'function'} | 1048576.0 | 9.0 | regionprofile | 0.020804 | 0.020804 | 0.020804 | 0.020804 | 8.242408e+06 | 3.090903e+06 | 1030301.0 | 1.0 | 1030301.0 | 1000.0 | 12 | regionprofile | 0.013998 | 0.013998 | 0.013998 | 0.013998 | 8.242408e+06 | 3.090903e+06 | 1030301.0 | 1.0 | 1030301.0 | 1000.0 | 9.0 | regionprofile | 0.014054 | 0.014054 | 0.014054 | 0.014054 | 8.242408e+06 | 3.090903e+06 | 1030301.0 | 1.0 | 1030301.0 | 1000.0 | 10.0 | regionprofile | 0.014137 | 0.014137 | 0.014137 | 0.014137 | 8.242408e+06 | 3.090903e+06 | 1030301.0 | 1.0 | 1030301.0 | 1000.0 | Basic_NESTED_INIT |
4194304.0 | 9.0 | regionprofile | 0.059309 | 0.059309 | 0.059309 | 0.059309 | 3.338625e+07 | 1.251984e+07 | 4173281.0 | 1.0 | 4173281.0 | 1000.0 | 12 | regionprofile | 0.053096 | 0.053096 | 0.053096 | 0.053096 | 3.338625e+07 | 1.251984e+07 | 4173281.0 | 1.0 | 4173281.0 | 1000.0 | 9.0 | regionprofile | 0.052983 | 0.052983 | 0.052983 | 0.052983 | 3.338625e+07 | 1.251984e+07 | 4173281.0 | 1.0 | 4173281.0 | 1000.0 | 10.0 | regionprofile | 0.053923 | 0.053923 | 0.053923 | 0.053923 | 3.338625e+07 | 1.251984e+07 | 4173281.0 | 1.0 | 4173281.0 | 1000.0 | Basic_NESTED_INIT | |
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} | 1048576.0 | 10.0 | regionprofile | 0.124602 | 0.124602 | 0.124602 | 0.124602 | 1.600000e+01 | 6.291457e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 13 | regionprofile | 0.127398 | 0.127398 | 0.127398 | 0.127398 | 1.600000e+01 | 6.291457e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 10.0 | regionprofile | 0.127383 | 0.127383 | 0.127383 | 0.127383 | 1.600000e+01 | 6.291457e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 11.0 | regionprofile | 0.124411 | 0.124411 | 0.124411 | 0.124411 | 1.600000e+01 | 6.291457e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | Basic_PI_ATOMIC |
4194304.0 | 10.0 | regionprofile | 0.436276 | 0.436276 | 0.436276 | 0.436276 | 1.600000e+01 | 2.516582e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 13 | regionprofile | 0.436276 | 0.436276 | 0.436276 | 0.436276 | 1.600000e+01 | 2.516582e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 10.0 | regionprofile | 0.436273 | 0.436273 | 0.436273 | 0.436273 | 1.600000e+01 | 2.516582e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 11.0 | regionprofile | 0.436265 | 0.436265 | 0.436265 | 0.436265 | 1.600000e+01 | 2.516582e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | Basic_PI_ATOMIC | |
{'name': 'Basic_PI_REDUCE', 'type': 'function'} | 1048576.0 | 11.0 | regionprofile | 0.002415 | 0.002415 | 0.002415 | 0.002415 | 1.600000e+01 | 6.291457e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 14 | regionprofile | 0.002491 | 0.002491 | 0.002491 | 0.002491 | 1.600000e+01 | 6.291457e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 11.0 | regionprofile | 0.002498 | 0.002498 | 0.002498 | 0.002498 | 1.600000e+01 | 6.291457e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 12.0 | regionprofile | 0.002483 | 0.002483 | 0.002483 | 0.002483 | 1.600000e+01 | 6.291457e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | Basic_PI_REDUCE |
4194304.0 | 11.0 | regionprofile | 0.004901 | 0.004901 | 0.004901 | 0.004901 | 1.600000e+01 | 2.516582e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 14 | regionprofile | 0.003644 | 0.003644 | 0.003644 | 0.003644 | 1.600000e+01 | 2.516582e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 11.0 | regionprofile | 0.004066 | 0.004066 | 0.004066 | 0.004066 | 1.600000e+01 | 2.516582e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 12.0 | regionprofile | 0.004452 | 0.004452 | 0.004452 | 0.004452 | 1.600000e+01 | 2.516582e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | Basic_PI_REDUCE | |
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} | 1048576.0 | 12.0 | regionprofile | 0.002241 | 0.002241 | 0.002241 | 0.002241 | 4.194328e+06 | 1.048577e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 15 | regionprofile | 0.002302 | 0.002302 | 0.002302 | 0.002302 | 4.194328e+06 | 1.048577e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 12.0 | regionprofile | 0.002328 | 0.002328 | 0.002328 | 0.002328 | 4.194328e+06 | 1.048577e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 13.0 | regionprofile | 0.002294 | 0.002294 | 0.002294 | 0.002294 | 4.194328e+06 | 1.048577e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | Basic_REDUCE3_INT |
4194304.0 | 12.0 | regionprofile | 0.005101 | 0.005101 | 0.005101 | 0.005101 | 1.677724e+07 | 4.194305e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 15 | regionprofile | 0.004159 | 0.004159 | 0.004159 | 0.004159 | 1.677724e+07 | 4.194305e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 12.0 | regionprofile | 0.004315 | 0.004315 | 0.004315 | 0.004315 | 1.677724e+07 | 4.194305e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 13.0 | regionprofile | 0.005068 | 0.005068 | 0.005068 | 0.005068 | 1.677724e+07 | 4.194305e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | Basic_REDUCE3_INT | |
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} | 1048576.0 | 13.0 | regionprofile | 0.016689 | 0.016689 | 0.016689 | 0.016689 | 1.677726e+07 | 2.097154e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 16 | regionprofile | 0.008741 | 0.008741 | 0.008741 | 0.008741 | 1.677726e+07 | 2.097154e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 13.0 | regionprofile | 0.005151 | 0.005151 | 0.005151 | 0.005151 | 1.677726e+07 | 2.097154e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 14.0 | regionprofile | 0.005093 | 0.005093 | 0.005093 | 0.005093 | 1.677726e+07 | 2.097154e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | Basic_REDUCE_STRUCT |
4194304.0 | 13.0 | regionprofile | 0.066340 | 0.066340 | 0.066340 | 0.066340 | 6.710891e+07 | 8.388610e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 16 | regionprofile | 0.030358 | 0.030358 | 0.030358 | 0.030358 | 6.710891e+07 | 8.388610e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 13.0 | regionprofile | 0.014375 | 0.014375 | 0.014375 | 0.014375 | 6.710891e+07 | 8.388610e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 14.0 | regionprofile | 0.013960 | 0.013960 | 0.013960 | 0.013960 | 6.710891e+07 | 8.388610e+06 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | Basic_REDUCE_STRUCT | |
{'name': 'Basic_TRAP_INT', 'type': 'function'} | 1048576.0 | 14.0 | regionprofile | 0.002439 | 0.002439 | 0.002439 | 0.002439 | 1.600000e+01 | 1.048576e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 17 | regionprofile | 0.002337 | 0.002337 | 0.002337 | 0.002337 | 1.600000e+01 | 1.048576e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 14.0 | regionprofile | 0.002438 | 0.002438 | 0.002438 | 0.002438 | 1.600000e+01 | 1.048576e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 15.0 | regionprofile | 0.002481 | 0.002481 | 0.002481 | 0.002481 | 1.600000e+01 | 1.048576e+07 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | Basic_TRAP_INT |
4194304.0 | 14.0 | regionprofile | 0.004905 | 0.004905 | 0.004905 | 0.004905 | 1.600000e+01 | 4.194304e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 17 | regionprofile | 0.003767 | 0.003767 | 0.003767 | 0.003767 | 1.600000e+01 | 4.194304e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 14.0 | regionprofile | 0.004063 | 0.004063 | 0.004063 | 0.004063 | 1.600000e+01 | 4.194304e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 15.0 | regionprofile | 0.004744 | 0.004744 | 0.004744 | 0.004744 | 1.600000e+01 | 4.194304e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | Basic_TRAP_INT | |
{'name': 'Lcals', 'type': 'function'} | 1048576.0 | 15.0 | regionprofile | 0.387646 | 0.387646 | 0.387646 | 0.387646 | 1.677722e+08 | 4.595730e+07 | 3145728.0 | 3.0 | 1048576.0 | 2000.0 | 18 | regionprofile | 0.385061 | 0.385061 | 0.385061 | 0.385061 | 1.677722e+08 | 4.595730e+07 | 3145728.0 | 3.0 | 1048576.0 | 2000.0 | 15.0 | regionprofile | 0.384342 | 0.384342 | 0.384342 | 0.384342 | 1.677722e+08 | 4.595730e+07 | 3145728.0 | 3.0 | 1048576.0 | 2000.0 | 16.0 | regionprofile | 0.384261 | 0.384261 | 0.384261 | 0.384261 | 1.677722e+08 | 4.595730e+07 | 3145728.0 | 3.0 | 1048576.0 | 2000.0 | Lcals |
4194304.0 | 15.0 | regionprofile | 1.448976 | 1.448976 | 1.448976 | 1.448976 | 6.710886e+08 | 1.841891e+08 | 12582912.0 | 3.0 | 4194304.0 | 2000.0 | 18 | regionprofile | 1.442976 | 1.442976 | 1.442976 | 1.442976 | 6.710886e+08 | 1.841891e+08 | 12582912.0 | 3.0 | 4194304.0 | 2000.0 | 15.0 | regionprofile | 1.434713 | 1.434713 | 1.434713 | 1.434713 | 6.710886e+08 | 1.841891e+08 | 12582912.0 | 3.0 | 4194304.0 | 2000.0 | 16.0 | regionprofile | 1.430823 | 1.430823 | 1.430823 | 1.430823 | 6.710886e+08 | 1.841891e+08 | 12582912.0 | 3.0 | 4194304.0 | 2000.0 | Lcals | |
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} | 1048576.0 | 16.0 | regionprofile | 0.062320 | 0.062320 | 0.062320 | 0.062320 | 1.677722e+08 | 9.437184e+06 | 1048576.0 | 1.0 | 1048576.0 | 200.0 | 19 | regionprofile | 0.060230 | 0.060230 | 0.060230 | 0.060230 | 1.677722e+08 | 9.437184e+06 | 1048576.0 | 1.0 | 1048576.0 | 200.0 | 16.0 | regionprofile | 0.058127 | 0.058127 | 0.058127 | 0.058127 | 1.677722e+08 | 9.437184e+06 | 1048576.0 | 1.0 | 1048576.0 | 200.0 | 17.0 | regionprofile | 0.056180 | 0.056180 | 0.056180 | 0.056180 | 1.677722e+08 | 9.437184e+06 | 1048576.0 | 1.0 | 1048576.0 | 200.0 | Lcals_DIFF_PREDICT |
4194304.0 | 16.0 | regionprofile | 0.235796 | 0.235796 | 0.235796 | 0.235796 | 6.710886e+08 | 3.774874e+07 | 4194304.0 | 1.0 | 4194304.0 | 200.0 | 19 | regionprofile | 0.231809 | 0.231809 | 0.231809 | 0.231809 | 6.710886e+08 | 3.774874e+07 | 4194304.0 | 1.0 | 4194304.0 | 200.0 | 16.0 | regionprofile | 0.224191 | 0.224191 | 0.224191 | 0.224191 | 6.710886e+08 | 3.774874e+07 | 4194304.0 | 1.0 | 4194304.0 | 200.0 | 17.0 | regionprofile | 0.213626 | 0.213626 | 0.213626 | 0.213626 | 6.710886e+08 | 3.774874e+07 | 4194304.0 | 1.0 | 4194304.0 | 200.0 | Lcals_DIFF_PREDICT | |
{'name': 'Lcals_EOS', 'type': 'function'} | 1048576.0 | 17.0 | regionprofile | 0.022697 | 0.022697 | 0.022697 | 0.022697 | 3.355449e+07 | 1.677722e+07 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 20 | regionprofile | 0.022655 | 0.022655 | 0.022655 | 0.022655 | 3.355449e+07 | 1.677722e+07 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 17.0 | regionprofile | 0.022638 | 0.022638 | 0.022638 | 0.022638 | 3.355449e+07 | 1.677722e+07 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | 18.0 | regionprofile | 0.022745 | 0.022745 | 0.022745 | 0.022745 | 3.355449e+07 | 1.677722e+07 | 1048576.0 | 1.0 | 1048576.0 | 500.0 | Lcals_EOS |
4194304.0 | 17.0 | regionprofile | 0.082976 | 0.082976 | 0.082976 | 0.082976 | 1.342178e+08 | 6.710886e+07 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 20 | regionprofile | 0.083900 | 0.083900 | 0.083900 | 0.083900 | 1.342178e+08 | 6.710886e+07 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 17.0 | regionprofile | 0.083853 | 0.083853 | 0.083853 | 0.083853 | 1.342178e+08 | 6.710886e+07 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | 18.0 | regionprofile | 0.083815 | 0.083815 | 0.083815 | 0.083815 | 1.342178e+08 | 6.710886e+07 | 4194304.0 | 1.0 | 4194304.0 | 500.0 | Lcals_EOS | |
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} | 1048576.0 | 18.0 | regionprofile | 0.048539 | 0.048539 | 0.048539 | 0.048539 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 21 | regionprofile | 0.048605 | 0.048605 | 0.048605 | 0.048605 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 18.0 | regionprofile | 0.049276 | 0.049276 | 0.049276 | 0.049276 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 19.0 | regionprofile | 0.049684 | 0.049684 | 0.049684 | 0.049684 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | Lcals_FIRST_DIFF |
4194304.0 | 18.0 | regionprofile | 0.173176 | 0.173176 | 0.173176 | 0.173176 | 6.710887e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 21 | regionprofile | 0.173146 | 0.173146 | 0.173146 | 0.173146 | 6.710887e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 18.0 | regionprofile | 0.173415 | 0.173415 | 0.173415 | 0.173415 | 6.710887e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 19.0 | regionprofile | 0.173615 | 0.173615 | 0.173615 | 0.173615 | 6.710887e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | Lcals_FIRST_DIFF | |
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} | 1048576.0 | 19.0 | regionprofile | 0.005432 | 0.005432 | 0.005432 | 0.005432 | 8.388640e+06 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 22 | regionprofile | 0.004740 | 0.004740 | 0.004740 | 0.004740 | 8.388640e+06 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 19.0 | regionprofile | 0.004543 | 0.004543 | 0.004543 | 0.004543 | 8.388640e+06 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | 20.0 | regionprofile | 0.004423 | 0.004423 | 0.004423 | 0.004423 | 8.388640e+06 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | Lcals_FIRST_MIN |
4194304.0 | 19.0 | regionprofile | 0.012204 | 0.012204 | 0.012204 | 0.012204 | 3.355446e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 22 | regionprofile | 0.009819 | 0.009819 | 0.009819 | 0.009819 | 3.355446e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 19.0 | regionprofile | 0.009199 | 0.009199 | 0.009199 | 0.009199 | 3.355446e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | 20.0 | regionprofile | 0.010418 | 0.010418 | 0.010418 | 0.010418 | 3.355446e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | Lcals_FIRST_MIN | |
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} | 1048576.0 | 20.0 | regionprofile | 0.048626 | 0.048626 | 0.048626 | 0.048626 | 1.677721e+07 | 1.048575e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 23 | regionprofile | 0.048618 | 0.048618 | 0.048618 | 0.048618 | 1.677721e+07 | 1.048575e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 20.0 | regionprofile | 0.049289 | 0.049289 | 0.049289 | 0.049289 | 1.677721e+07 | 1.048575e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 21.0 | regionprofile | 0.049992 | 0.049992 | 0.049992 | 0.049992 | 1.677721e+07 | 1.048575e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | Lcals_FIRST_SUM |
4194304.0 | 20.0 | regionprofile | 0.173164 | 0.173164 | 0.173164 | 0.173164 | 6.710886e+07 | 4.194303e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 23 | regionprofile | 0.173199 | 0.173199 | 0.173199 | 0.173199 | 6.710886e+07 | 4.194303e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 20.0 | regionprofile | 0.173526 | 0.173526 | 0.173526 | 0.173526 | 6.710886e+07 | 4.194303e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 21.0 | regionprofile | 0.174873 | 0.174873 | 0.174873 | 0.174873 | 6.710886e+07 | 4.194303e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | Lcals_FIRST_SUM | |
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} | 1048576.0 | 21.0 | regionprofile | 0.049047 | 0.049047 | 0.049047 | 0.049047 | 8.388608e+07 | 6.291456e+06 | 1048576.0 | 2.0 | 1048576.0 | 500.0 | 24 | regionprofile | 0.048864 | 0.048864 | 0.048864 | 0.048864 | 8.388608e+07 | 6.291456e+06 | 1048576.0 | 2.0 | 1048576.0 | 500.0 | 21.0 | regionprofile | 0.048966 | 0.048966 | 0.048966 | 0.048966 | 8.388608e+07 | 6.291456e+06 | 1048576.0 | 2.0 | 1048576.0 | 500.0 | 22.0 | regionprofile | 0.049001 | 0.049001 | 0.049001 | 0.049001 | 8.388608e+07 | 6.291456e+06 | 1048576.0 | 2.0 | 1048576.0 | 500.0 | Lcals_GEN_LIN_RECUR |
4194304.0 | 21.0 | regionprofile | 0.202498 | 0.202498 | 0.202498 | 0.202498 | 3.355443e+08 | 2.516582e+07 | 4194304.0 | 2.0 | 4194304.0 | 500.0 | 24 | regionprofile | 0.201866 | 0.201866 | 0.201866 | 0.201866 | 3.355443e+08 | 2.516582e+07 | 4194304.0 | 2.0 | 4194304.0 | 500.0 | 21.0 | regionprofile | 0.201679 | 0.201679 | 0.201679 | 0.201679 | 3.355443e+08 | 2.516582e+07 | 4194304.0 | 2.0 | 4194304.0 | 500.0 | 22.0 | regionprofile | 0.201866 | 0.201866 | 0.201866 | 0.201866 | 3.355443e+08 | 2.516582e+07 | 4194304.0 | 2.0 | 4194304.0 | 500.0 | Lcals_GEN_LIN_RECUR | |
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} | 1048576.0 | 22.0 | regionprofile | 0.033742 | 0.033742 | 0.033742 | 0.033742 | 2.516583e+07 | 5.242880e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 25 | regionprofile | 0.033975 | 0.033975 | 0.033975 | 0.033975 | 2.516583e+07 | 5.242880e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 22.0 | regionprofile | 0.033803 | 0.033803 | 0.033803 | 0.033803 | 2.516583e+07 | 5.242880e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 23.0 | regionprofile | 0.034674 | 0.034674 | 0.034674 | 0.034674 | 2.516583e+07 | 5.242880e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | Lcals_HYDRO_1D |
4194304.0 | 22.0 | regionprofile | 0.123291 | 0.123291 | 0.123291 | 0.123291 | 1.006633e+08 | 2.097152e+07 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 25 | regionprofile | 0.123211 | 0.123211 | 0.123211 | 0.123211 | 1.006633e+08 | 2.097152e+07 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 22.0 | regionprofile | 0.123233 | 0.123233 | 0.123233 | 0.123233 | 1.006633e+08 | 2.097152e+07 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 23.0 | regionprofile | 0.123642 | 0.123642 | 0.123642 | 0.123642 | 1.006633e+08 | 2.097152e+07 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | Lcals_HYDRO_1D | |
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} | 1048576.0 | 23.0 | regionprofile | 0.022563 | 0.022563 | 0.022563 | 0.022563 | 1.506676e+08 | 4.595730e+07 | 3145728.0 | 3.0 | 1048576.0 | 100.0 | 26 | regionprofile | 0.022560 | 0.022560 | 0.022560 | 0.022560 | 1.506676e+08 | 4.595730e+07 | 3145728.0 | 3.0 | 1048576.0 | 100.0 | 23.0 | regionprofile | 0.022730 | 0.022730 | 0.022730 | 0.022730 | 1.506676e+08 | 4.595730e+07 | 3145728.0 | 3.0 | 1048576.0 | 100.0 | 24.0 | regionprofile | 0.022833 | 0.022833 | 0.022833 | 0.022833 | 1.506676e+08 | 4.595730e+07 | 3145728.0 | 3.0 | 1048576.0 | 100.0 | Lcals_HYDRO_2D |
4194304.0 | 23.0 | regionprofile | 0.086440 | 0.086440 | 0.086440 | 0.086440 | 6.033247e+08 | 1.841891e+08 | 12582912.0 | 3.0 | 4194304.0 | 100.0 | 26 | regionprofile | 0.086084 | 0.086084 | 0.086084 | 0.086084 | 6.033247e+08 | 1.841891e+08 | 12582912.0 | 3.0 | 4194304.0 | 100.0 | 23.0 | regionprofile | 0.086260 | 0.086260 | 0.086260 | 0.086260 | 6.033247e+08 | 1.841891e+08 | 12582912.0 | 3.0 | 4194304.0 | 100.0 | 24.0 | regionprofile | 0.088949 | 0.088949 | 0.088949 | 0.088949 | 6.033247e+08 | 1.841891e+08 | 12582912.0 | 3.0 | 4194304.0 | 100.0 | Lcals_HYDRO_2D | |
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} | 1048576.0 | 24.0 | regionprofile | 0.046893 | 0.046893 | 0.046893 | 0.046893 | 9.227469e+07 | 1.782579e+07 | 1048576.0 | 1.0 | 1048576.0 | 400.0 | 27 | regionprofile | 0.046963 | 0.046963 | 0.046963 | 0.046963 | 9.227469e+07 | 1.782579e+07 | 1048576.0 | 1.0 | 1048576.0 | 400.0 | 24.0 | regionprofile | 0.046876 | 0.046876 | 0.046876 | 0.046876 | 9.227469e+07 | 1.782579e+07 | 1048576.0 | 1.0 | 1048576.0 | 400.0 | 25.0 | regionprofile | 0.046740 | 0.046740 | 0.046740 | 0.046740 | 9.227469e+07 | 1.782579e+07 | 1048576.0 | 1.0 | 1048576.0 | 400.0 | Lcals_INT_PREDICT |
4194304.0 | 24.0 | regionprofile | 0.182489 | 0.182489 | 0.182489 | 0.182489 | 3.690988e+08 | 7.130317e+07 | 4194304.0 | 1.0 | 4194304.0 | 400.0 | 27 | regionprofile | 0.182741 | 0.182741 | 0.182741 | 0.182741 | 3.690988e+08 | 7.130317e+07 | 4194304.0 | 1.0 | 4194304.0 | 400.0 | 24.0 | regionprofile | 0.182463 | 0.182463 | 0.182463 | 0.182463 | 3.690988e+08 | 7.130317e+07 | 4194304.0 | 1.0 | 4194304.0 | 400.0 | 25.0 | regionprofile | 0.182635 | 0.182635 | 0.182635 | 0.182635 | 3.690988e+08 | 7.130317e+07 | 4194304.0 | 1.0 | 4194304.0 | 400.0 | Lcals_INT_PREDICT | |
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} | 1048576.0 | 25.0 | regionprofile | 0.002752 | 0.002752 | 0.002752 | 0.002752 | 4.194304e+07 | 4.194304e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 28 | regionprofile | 0.002752 | 0.002752 | 0.002752 | 0.002752 | 4.194304e+07 | 4.194304e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 25.0 | regionprofile | 0.002794 | 0.002794 | 0.002794 | 0.002794 | 4.194304e+07 | 4.194304e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | 26.0 | regionprofile | 0.002788 | 0.002788 | 0.002788 | 0.002788 | 4.194304e+07 | 4.194304e+06 | 1048576.0 | 1.0 | 1048576.0 | 50.0 | Lcals_PLANCKIAN |
4194304.0 | 25.0 | regionprofile | 0.010289 | 0.010289 | 0.010289 | 0.010289 | 1.677722e+08 | 1.677722e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 28 | regionprofile | 0.010275 | 0.010275 | 0.010275 | 0.010275 | 1.677722e+08 | 1.677722e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 25.0 | regionprofile | 0.010296 | 0.010296 | 0.010296 | 0.010296 | 1.677722e+08 | 1.677722e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | 26.0 | regionprofile | 0.010319 | 0.010319 | 0.010319 | 0.010319 | 1.677722e+08 | 1.677722e+07 | 4194304.0 | 1.0 | 4194304.0 | 50.0 | Lcals_PLANCKIAN | |
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} | 1048576.0 | 26.0 | regionprofile | 0.044959 | 0.044959 | 0.044959 | 0.044959 | 3.355440e+07 | 2.097150e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 29 | regionprofile | 0.045027 | 0.045027 | 0.045027 | 0.045027 | 3.355440e+07 | 2.097150e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 26.0 | regionprofile | 0.045229 | 0.045229 | 0.045229 | 0.045229 | 3.355440e+07 | 2.097150e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 27.0 | regionprofile | 0.045130 | 0.045130 | 0.045130 | 0.045130 | 3.355440e+07 | 2.097150e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | Lcals_TRIDIAG_ELIM |
4194304.0 | 26.0 | regionprofile | 0.166569 | 0.166569 | 0.166569 | 0.166569 | 1.342177e+08 | 8.388606e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 29 | regionprofile | 0.166845 | 0.166845 | 0.166845 | 0.166845 | 1.342177e+08 | 8.388606e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 26.0 | regionprofile | 0.166518 | 0.166518 | 0.166518 | 0.166518 | 1.342177e+08 | 8.388606e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 27.0 | regionprofile | 0.166988 | 0.166988 | 0.166988 | 0.166988 | 1.342177e+08 | 8.388606e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | Lcals_TRIDIAG_ELIM | |
{'name': 'Polybench', 'type': 'function'} | 1048576.0 | 27.0 | regionprofile | 0.597587 | 0.597587 | 0.597587 | 0.597587 | 3.359049e+09 | 6.797544e+09 | 125952040.0 | 160.0 | 1050625.0 | 120.0 | 30 | regionprofile | 0.607722 | 0.607722 | 0.607722 | 0.607722 | 3.359049e+09 | 6.797544e+09 | 125952040.0 | 160.0 | 1050625.0 | 120.0 | 27.0 | regionprofile | 0.706357 | 0.706357 | 0.706357 | 0.706357 | 3.359049e+09 | 6.797544e+09 | 125952040.0 | 160.0 | 1050625.0 | 120.0 | 28.0 | regionprofile | 1.018105 | 1.018105 | 1.018105 | 1.018105 | 3.359049e+09 | 6.797544e+09 | 125952040.0 | 160.0 | 1050625.0 | 120.0 | Polybench |
4194304.0 | 27.0 | regionprofile | 2.745905 | 2.745905 | 2.745905 | 2.745905 | 1.342898e+10 | 3.576198e+10 | 503562280.0 | 160.0 | 4198401.0 | 120.0 | 30 | regionprofile | 2.767598 | 2.767598 | 2.767598 | 2.767598 | 1.342898e+10 | 3.576198e+10 | 503562280.0 | 160.0 | 4198401.0 | 120.0 | 27.0 | regionprofile | 2.944963 | 2.944963 | 2.944963 | 2.944963 | 1.342898e+10 | 3.576198e+10 | 503562280.0 | 160.0 | 4198401.0 | 120.0 | 28.0 | regionprofile | 3.518693 | 3.518693 | 3.518693 | 3.518693 | 1.342898e+10 | 3.576198e+10 | 503562280.0 | 160.0 | 4198401.0 | 120.0 | Polybench | |
{'name': 'Polybench_2MM', 'type': 'function'} | 1048576.0 | 28.0 | regionprofile | 0.006426 | 0.006426 | 0.006426 | 0.006426 | 5.198800e+07 | 5.683881e+09 | 2101250.0 | 2.0 | 1050625.0 | 2.0 | 31 | regionprofile | 0.005658 | 0.005658 | 0.005658 | 0.005658 | 5.198800e+07 | 5.683881e+09 | 2101250.0 | 2.0 | 1050625.0 | 2.0 | 28.0 | regionprofile | 0.006097 | 0.006097 | 0.006097 | 0.006097 | 5.198800e+07 | 5.683881e+09 | 2101250.0 | 2.0 | 1050625.0 | 2.0 | 29.0 | regionprofile | 0.006958 | 0.006958 | 0.006958 | 0.006958 | 5.198800e+07 | 5.683881e+09 | 2101250.0 | 2.0 | 1050625.0 | 2.0 | Polybench_2MM |
4194304.0 | 28.0 | regionprofile | 0.032479 | 0.032479 | 0.032479 | 0.032479 | 1.710669e+08 | 3.131167e+10 | 8396802.0 | 2.0 | 4198401.0 | 2.0 | 31 | regionprofile | 0.032284 | 0.032284 | 0.032284 | 0.032284 | 1.710669e+08 | 3.131167e+10 | 8396802.0 | 2.0 | 4198401.0 | 2.0 | 28.0 | regionprofile | 0.032710 | 0.032710 | 0.032710 | 0.032710 | 1.710669e+08 | 3.131167e+10 | 8396802.0 | 2.0 | 4198401.0 | 2.0 | 29.0 | regionprofile | 0.034111 | 0.034111 | 0.034111 | 0.034111 | 1.710669e+08 | 3.131167e+10 | 8396802.0 | 2.0 | 4198401.0 | 2.0 | Polybench_2MM | |
{'name': 'Polybench_3MM', 'type': 'function'} | 1048576.0 | 29.0 | regionprofile | 0.009667 | 0.009667 | 0.009667 | 0.009667 | 7.826900e+07 | 6.797544e+09 | 3151875.0 | 3.0 | 1050625.0 | 2.0 | 32 | regionprofile | 0.008535 | 0.008535 | 0.008535 | 0.008535 | 7.826900e+07 | 6.797544e+09 | 3151875.0 | 3.0 | 1050625.0 | 2.0 | 29.0 | regionprofile | 0.009183 | 0.009183 | 0.009183 | 0.009183 | 7.826900e+07 | 6.797544e+09 | 3151875.0 | 3.0 | 1050625.0 | 2.0 | 30.0 | regionprofile | 0.010579 | 0.010579 | 0.010579 | 0.010579 | 7.826900e+07 | 6.797544e+09 | 3151875.0 | 3.0 | 1050625.0 | 2.0 | Polybench_3MM |
4194304.0 | 29.0 | regionprofile | 0.044100 | 0.044100 | 0.044100 | 0.044100 | 2.403887e+08 | 3.576198e+10 | 12595203.0 | 3.0 | 4198401.0 | 2.0 | 32 | regionprofile | 0.043582 | 0.043582 | 0.043582 | 0.043582 | 2.403887e+08 | 3.576198e+10 | 12595203.0 | 3.0 | 4198401.0 | 2.0 | 29.0 | regionprofile | 0.044093 | 0.044093 | 0.044093 | 0.044093 | 2.403887e+08 | 3.576198e+10 | 12595203.0 | 3.0 | 4198401.0 | 2.0 | 30.0 | regionprofile | 0.046054 | 0.046054 | 0.046054 | 0.046054 | 2.403887e+08 | 3.576198e+10 | 12595203.0 | 3.0 | 4198401.0 | 2.0 | Polybench_3MM | |
{'name': 'Polybench_ADI', 'type': 'function'} | 1048576.0 | 30.0 | regionprofile | 0.040744 | 0.040744 | 0.040744 | 0.040744 | 4.026528e+08 | 1.423279e+08 | 8184.0 | 8.0 | 1046529.0 | 4.0 | 33 | regionprofile | 0.047805 | 0.047805 | 0.047805 | 0.047805 | 4.026528e+08 | 1.423279e+08 | 8184.0 | 8.0 | 1046529.0 | 4.0 | 30.0 | regionprofile | 0.100471 | 0.100471 | 0.100471 | 0.100471 | 4.026528e+08 | 1.423279e+08 | 8184.0 | 8.0 | 1046529.0 | 4.0 | 31.0 | regionprofile | 0.240500 | 0.240500 | 0.240500 | 0.240500 | 4.026528e+08 | 1.423279e+08 | 8184.0 | 8.0 | 1046529.0 | 4.0 | Polybench_ADI |
4194304.0 | 30.0 | regionprofile | 0.082385 | 0.082385 | 0.082385 | 0.082385 | 1.610612e+09 | 5.698684e+08 | 16376.0 | 8.0 | 4190209.0 | 4.0 | 33 | regionprofile | 0.103245 | 0.103245 | 0.103245 | 0.103245 | 1.610612e+09 | 5.698684e+08 | 16376.0 | 8.0 | 4190209.0 | 4.0 | 30.0 | regionprofile | 0.191407 | 0.191407 | 0.191407 | 0.191407 | 1.610612e+09 | 5.698684e+08 | 16376.0 | 8.0 | 4190209.0 | 4.0 | 31.0 | regionprofile | 0.430046 | 0.430046 | 0.430046 | 0.430046 | 1.610612e+09 | 5.698684e+08 | 16376.0 | 8.0 | 4190209.0 | 4.0 | Polybench_ADI | |
{'name': 'Polybench_ATAX', 'type': 'function'} | 1048576.0 | 31.0 | regionprofile | 0.027871 | 0.027871 | 0.027871 | 0.027871 | 1.685100e+07 | 4.202500e+06 | 2050.0 | 2.0 | 1050625.0 | 100.0 | 34 | regionprofile | 0.029419 | 0.029419 | 0.029419 | 0.029419 | 1.685100e+07 | 4.202500e+06 | 2050.0 | 2.0 | 1050625.0 | 100.0 | 31.0 | regionprofile | 0.036441 | 0.036441 | 0.036441 | 0.036441 | 1.685100e+07 | 4.202500e+06 | 2050.0 | 2.0 | 1050625.0 | 100.0 | 32.0 | regionprofile | 0.066951 | 0.066951 | 0.066951 | 0.066951 | 1.685100e+07 | 4.202500e+06 | 2050.0 | 2.0 | 1050625.0 | 100.0 | Polybench_ATAX |
4194304.0 | 31.0 | regionprofile | 0.052800 | 0.052800 | 0.052800 | 0.052800 | 6.725638e+07 | 1.679360e+07 | 4098.0 | 2.0 | 4198401.0 | 100.0 | 34 | regionprofile | 0.060357 | 0.060357 | 0.060357 | 0.060357 | 6.725638e+07 | 1.679360e+07 | 4098.0 | 2.0 | 4198401.0 | 100.0 | 31.0 | regionprofile | 0.076365 | 0.076365 | 0.076365 | 0.076365 | 6.725638e+07 | 1.679360e+07 | 4098.0 | 2.0 | 4198401.0 | 100.0 | 32.0 | regionprofile | 0.133815 | 0.133815 | 0.133815 | 0.133815 | 6.725638e+07 | 1.679360e+07 | 4098.0 | 2.0 | 4198401.0 | 100.0 | Polybench_ATAX | |
{'name': 'Polybench_FDTD_2D', 'type': 'function'} | 1048576.0 | 32.0 | regionprofile | 0.037338 | 0.037338 | 0.037338 | 0.037338 | 3.359049e+09 | 4.616192e+08 | 125952040.0 | 160.0 | 1049600.0 | 8.0 | 35 | regionprofile | 0.037041 | 0.037041 | 0.037041 | 0.037041 | 3.359049e+09 | 4.616192e+08 | 125952040.0 | 160.0 | 1049600.0 | 8.0 | 32.0 | regionprofile | 0.037118 | 0.037118 | 0.037118 | 0.037118 | 3.359049e+09 | 4.616192e+08 | 125952040.0 | 160.0 | 1049600.0 | 8.0 | 33.0 | regionprofile | 0.037303 | 0.037303 | 0.037303 | 0.037303 | 3.359049e+09 | 4.616192e+08 | 125952040.0 | 160.0 | 1049600.0 | 8.0 | Polybench_FDTD_2D |
4194304.0 | 32.0 | regionprofile | 0.133181 | 0.133181 | 0.133181 | 0.133181 | 1.342898e+10 | 1.845985e+09 | 503562280.0 | 160.0 | 4196352.0 | 8.0 | 35 | regionprofile | 0.133258 | 0.133258 | 0.133258 | 0.133258 | 1.342898e+10 | 1.845985e+09 | 503562280.0 | 160.0 | 4196352.0 | 8.0 | 32.0 | regionprofile | 0.133467 | 0.133467 | 0.133467 | 0.133467 | 1.342898e+10 | 1.845985e+09 | 503562280.0 | 160.0 | 4196352.0 | 8.0 | 33.0 | regionprofile | 0.134053 | 0.134053 | 0.134053 | 0.134053 | 1.342898e+10 | 1.845985e+09 | 503562280.0 | 160.0 | 4196352.0 | 8.0 | Polybench_FDTD_2D | |
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} | 1048576.0 | 33.0 | regionprofile | 0.211207 | 0.211207 | 0.211207 | 0.211207 | 1.681000e+07 | 1.076891e+09 | 1050625.0 | 1.0 | 1050625.0 | 8.0 | 36 | regionprofile | 0.209059 | 0.209059 | 0.209059 | 0.209059 | 1.681000e+07 | 1.076891e+09 | 1050625.0 | 1.0 | 1050625.0 | 8.0 | 33.0 | regionprofile | 0.210531 | 0.210531 | 0.210531 | 0.210531 | 1.681000e+07 | 1.076891e+09 | 1050625.0 | 1.0 | 1050625.0 | 8.0 | 34.0 | regionprofile | 0.211865 | 0.211865 | 0.211865 | 0.211865 | 1.681000e+07 | 1.076891e+09 | 1050625.0 | 1.0 | 1050625.0 | 8.0 | Polybench_FLOYD_WARSHALL |
4194304.0 | 33.0 | regionprofile | 1.516510 | 1.516510 | 1.516510 | 1.516510 | 6.717442e+07 | 8.602524e+09 | 4198401.0 | 1.0 | 4198401.0 | 8.0 | 36 | regionprofile | 1.516435 | 1.516435 | 1.516435 | 1.516435 | 6.717442e+07 | 8.602524e+09 | 4198401.0 | 1.0 | 4198401.0 | 8.0 | 33.0 | regionprofile | 1.520934 | 1.520934 | 1.520934 | 1.520934 | 6.717442e+07 | 8.602524e+09 | 4198401.0 | 1.0 | 4198401.0 | 8.0 | 34.0 | regionprofile | 1.517274 | 1.517274 | 1.517274 | 1.517274 | 6.717442e+07 | 8.602524e+09 | 4198401.0 | 1.0 | 4198401.0 | 8.0 | Polybench_FLOYD_WARSHALL | |
{'name': 'Polybench_GEMM', 'type': 'function'} | 1048576.0 | 34.0 | regionprofile | 0.006501 | 0.006501 | 0.006501 | 0.006501 | 2.808500e+07 | 3.783301e+09 | 1050625.0 | 1.0 | 1050625.0 | 4.0 | 37 | regionprofile | 0.006145 | 0.006145 | 0.006145 | 0.006145 | 2.808500e+07 | 3.783301e+09 | 1050625.0 | 1.0 | 1050625.0 | 4.0 | 34.0 | regionprofile | 0.006426 | 0.006426 | 0.006426 | 0.006426 | 2.808500e+07 | 3.783301e+09 | 1050625.0 | 1.0 | 1050625.0 | 4.0 | 35.0 | regionprofile | 0.006745 | 0.006745 | 0.006745 | 0.006745 | 2.808500e+07 | 3.783301e+09 | 1050625.0 | 1.0 | 1050625.0 | 4.0 | Polybench_GEMM |
4194304.0 | 34.0 | regionprofile | 0.041682 | 0.041682 | 0.041682 | 0.041682 | 7.292801e+07 | 1.511844e+10 | 4198401.0 | 1.0 | 4198401.0 | 4.0 | 37 | regionprofile | 0.025073 | 0.025073 | 0.025073 | 0.025073 | 7.292801e+07 | 1.511844e+10 | 4198401.0 | 1.0 | 4198401.0 | 4.0 | 34.0 | regionprofile | 0.024905 | 0.024905 | 0.024905 | 0.024905 | 7.292801e+07 | 1.511844e+10 | 4198401.0 | 1.0 | 4198401.0 | 4.0 | 35.0 | regionprofile | 0.025525 | 0.025525 | 0.025525 | 0.025525 | 7.292801e+07 | 1.511844e+10 | 4198401.0 | 1.0 | 4198401.0 | 4.0 | Polybench_GEMM | |
{'name': 'Polybench_GEMVER', 'type': 'function'} | 1048576.0 | 35.0 | regionprofile | 0.007053 | 0.007053 | 0.007053 | 0.007053 | 3.372660e+07 | 1.050728e+07 | 3152900.0 | 4.0 | 1050625.0 | 20.0 | 38 | regionprofile | 0.007155 | 0.007155 | 0.007155 | 0.007155 | 3.372660e+07 | 1.050728e+07 | 3152900.0 | 4.0 | 1050625.0 | 20.0 | 35.0 | regionprofile | 0.008014 | 0.008014 | 0.008014 | 0.008014 | 3.372660e+07 | 1.050728e+07 | 3152900.0 | 4.0 | 1050625.0 | 20.0 | 36.0 | regionprofile | 0.013533 | 0.013533 | 0.013533 | 0.013533 | 3.372660e+07 | 1.050728e+07 | 3152900.0 | 4.0 | 1050625.0 | 20.0 | Polybench_GEMVER |
4194304.0 | 35.0 | regionprofile | 0.014683 | 0.014683 | 0.014683 | 0.014683 | 1.345619e+08 | 4.198606e+07 | 12597252.0 | 4.0 | 4198401.0 | 20.0 | 38 | regionprofile | 0.015327 | 0.015327 | 0.015327 | 0.015327 | 1.345619e+08 | 4.198606e+07 | 12597252.0 | 4.0 | 4198401.0 | 20.0 | 35.0 | regionprofile | 0.017301 | 0.017301 | 0.017301 | 0.017301 | 1.345619e+08 | 4.198606e+07 | 12597252.0 | 4.0 | 4198401.0 | 20.0 | 36.0 | regionprofile | 0.027680 | 0.027680 | 0.027680 | 0.027680 | 1.345619e+08 | 4.198606e+07 | 12597252.0 | 4.0 | 4198401.0 | 20.0 | Polybench_GEMVER | |
{'name': 'Polybench_GESUMMV', 'type': 'function'} | 1048576.0 | 36.0 | regionprofile | 0.026386 | 0.026386 | 0.026386 | 0.026386 | 1.683460e+07 | 4.205575e+06 | 1025.0 | 1.0 | 1050625.0 | 120.0 | 39 | regionprofile | 0.028663 | 0.028663 | 0.028663 | 0.028663 | 1.683460e+07 | 4.205575e+06 | 1025.0 | 1.0 | 1050625.0 | 120.0 | 36.0 | regionprofile | 0.055810 | 0.055810 | 0.055810 | 0.055810 | 1.683460e+07 | 4.205575e+06 | 1025.0 | 1.0 | 1050625.0 | 120.0 | 37.0 | regionprofile | 0.151902 | 0.151902 | 0.151902 | 0.151902 | 1.683460e+07 | 4.205575e+06 | 1025.0 | 1.0 | 1050625.0 | 120.0 | Polybench_GESUMMV |
4194304.0 | 36.0 | regionprofile | 0.056391 | 0.056391 | 0.056391 | 0.056391 | 6.722359e+07 | 1.679975e+07 | 2049.0 | 1.0 | 4198401.0 | 120.0 | 39 | regionprofile | 0.059838 | 0.059838 | 0.059838 | 0.059838 | 6.722359e+07 | 1.679975e+07 | 2049.0 | 1.0 | 4198401.0 | 120.0 | 36.0 | regionprofile | 0.109365 | 0.109365 | 0.109365 | 0.109365 | 6.722359e+07 | 1.679975e+07 | 2049.0 | 1.0 | 4198401.0 | 120.0 | 37.0 | regionprofile | 0.308204 | 0.308204 | 0.308204 | 0.308204 | 6.722359e+07 | 1.679975e+07 | 2049.0 | 1.0 | 4198401.0 | 120.0 | Polybench_GESUMMV | |
{'name': 'Polybench_HEAT_3D', 'type': 'function'} | 1048576.0 | 37.0 | regionprofile | 0.021932 | 0.021932 | 0.021932 | 0.021932 | 6.592000e+08 | 6.000000e+08 | 40000000.0 | 40.0 | 1000000.0 | 20.0 | 40 | regionprofile | 0.021845 | 0.021845 | 0.021845 | 0.021845 | 6.592000e+08 | 6.000000e+08 | 40000000.0 | 40.0 | 1000000.0 | 20.0 | 37.0 | regionprofile | 0.022019 | 0.022019 | 0.022019 | 0.022019 | 6.592000e+08 | 6.000000e+08 | 40000000.0 | 40.0 | 1000000.0 | 20.0 | 38.0 | regionprofile | 0.022779 | 0.022779 | 0.022779 | 0.022779 | 6.592000e+08 | 6.000000e+08 | 40000000.0 | 40.0 | 1000000.0 | 20.0 | Polybench_HEAT_3D |
4194304.0 | 37.0 | regionprofile | 0.077673 | 0.077673 | 0.077673 | 0.077673 | 2.670592e+09 | 2.457600e+09 | 163840000.0 | 40.0 | 4096000.0 | 20.0 | 40 | regionprofile | 0.077731 | 0.077731 | 0.077731 | 0.077731 | 2.670592e+09 | 2.457600e+09 | 163840000.0 | 40.0 | 4096000.0 | 20.0 | 37.0 | regionprofile | 0.077833 | 0.077833 | 0.077833 | 0.077833 | 2.670592e+09 | 2.457600e+09 | 163840000.0 | 40.0 | 4096000.0 | 20.0 | 38.0 | regionprofile | 0.077977 | 0.077977 | 0.077977 | 0.077977 | 2.670592e+09 | 2.457600e+09 | 163840000.0 | 40.0 | 4096000.0 | 20.0 | Polybench_HEAT_3D | |
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} | 1048576.0 | 38.0 | regionprofile | 0.078164 | 0.078164 | 0.078164 | 0.078164 | 5.368704e+08 | 1.006631e+08 | 33554368.0 | 32.0 | 1048574.0 | 100.0 | 41 | regionprofile | 0.077995 | 0.077995 | 0.077995 | 0.077995 | 5.368704e+08 | 1.006631e+08 | 33554368.0 | 32.0 | 1048574.0 | 100.0 | 38.0 | regionprofile | 0.078657 | 0.078657 | 0.078657 | 0.078657 | 5.368704e+08 | 1.006631e+08 | 33554368.0 | 32.0 | 1048574.0 | 100.0 | 39.0 | regionprofile | 0.079618 | 0.079618 | 0.079618 | 0.079618 | 5.368704e+08 | 1.006631e+08 | 33554368.0 | 32.0 | 1048574.0 | 100.0 | Polybench_JACOBI_1D |
4194304.0 | 38.0 | regionprofile | 0.277418 | 0.277418 | 0.277418 | 0.277418 | 2.147483e+09 | 4.026530e+08 | 134217664.0 | 32.0 | 4194302.0 | 100.0 | 41 | regionprofile | 0.277588 | 0.277588 | 0.277588 | 0.277588 | 2.147483e+09 | 4.026530e+08 | 134217664.0 | 32.0 | 4194302.0 | 100.0 | 38.0 | regionprofile | 0.278394 | 0.278394 | 0.278394 | 0.278394 | 2.147483e+09 | 4.026530e+08 | 134217664.0 | 32.0 | 4194302.0 | 100.0 | 39.0 | regionprofile | 0.279469 | 0.279469 | 0.279469 | 0.279469 | 2.147483e+09 | 4.026530e+08 | 134217664.0 | 32.0 | 4194302.0 | 100.0 | Polybench_JACOBI_1D | |
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} | 1048576.0 | 39.0 | regionprofile | 0.102414 | 0.102414 | 0.102414 | 0.102414 | 1.342176e+09 | 4.186116e+08 | 83722320.0 | 2.0 | 1046529.0 | 50.0 | 42 | regionprofile | 0.103129 | 0.103129 | 0.103129 | 0.103129 | 1.342176e+09 | 4.186116e+08 | 83722320.0 | 2.0 | 1046529.0 | 50.0 | 39.0 | regionprofile | 0.102936 | 0.102936 | 0.102936 | 0.102936 | 1.342176e+09 | 4.186116e+08 | 83722320.0 | 2.0 | 1046529.0 | 50.0 | 40.0 | regionprofile | 0.103622 | 0.103622 | 0.103622 | 0.103622 | 1.342176e+09 | 4.186116e+08 | 83722320.0 | 2.0 | 1046529.0 | 50.0 | Polybench_JACOBI_2D |
4194304.0 | 39.0 | regionprofile | 0.370495 | 0.370495 | 0.370495 | 0.370495 | 5.368708e+09 | 1.676084e+09 | 335216720.0 | 2.0 | 4190209.0 | 50.0 | 42 | regionprofile | 0.370471 | 0.370471 | 0.370471 | 0.370471 | 5.368708e+09 | 1.676084e+09 | 335216720.0 | 2.0 | 4190209.0 | 50.0 | 39.0 | regionprofile | 0.370860 | 0.370860 | 0.370860 | 0.370860 | 5.368708e+09 | 1.676084e+09 | 335216720.0 | 2.0 | 4190209.0 | 50.0 | 40.0 | regionprofile | 0.373282 | 0.373282 | 0.373282 | 0.373282 | 5.368708e+09 | 1.676084e+09 | 335216720.0 | 2.0 | 4190209.0 | 50.0 | Polybench_JACOBI_2D | |
{'name': 'Polybench_MVT', 'type': 'function'} | 1048576.0 | 40.0 | regionprofile | 0.021796 | 0.021796 | 0.021796 | 0.021796 | 1.685920e+07 | 4.202500e+06 | 2050.0 | 2.0 | 1050625.0 | 100.0 | 43 | regionprofile | 0.025190 | 0.025190 | 0.025190 | 0.025190 | 1.685920e+07 | 4.202500e+06 | 2050.0 | 2.0 | 1050625.0 | 100.0 | 40.0 | regionprofile | 0.032569 | 0.032569 | 0.032569 | 0.032569 | 1.685920e+07 | 4.202500e+06 | 2050.0 | 2.0 | 1050625.0 | 100.0 | 41.0 | regionprofile | 0.065664 | 0.065664 | 0.065664 | 0.065664 | 1.685920e+07 | 4.202500e+06 | 2050.0 | 2.0 | 1050625.0 | 100.0 | Polybench_MVT |
4194304.0 | 40.0 | regionprofile | 0.046011 | 0.046011 | 0.046011 | 0.046011 | 6.727277e+07 | 1.679360e+07 | 4098.0 | 2.0 | 4198401.0 | 100.0 | 43 | regionprofile | 0.052310 | 0.052310 | 0.052310 | 0.052310 | 6.727277e+07 | 1.679360e+07 | 4098.0 | 2.0 | 4198401.0 | 100.0 | 40.0 | regionprofile | 0.067230 | 0.067230 | 0.067230 | 0.067230 | 6.727277e+07 | 1.679360e+07 | 4098.0 | 2.0 | 4198401.0 | 100.0 | 41.0 | regionprofile | 0.131107 | 0.131107 | 0.131107 | 0.131107 | 6.727277e+07 | 1.679360e+07 | 4098.0 | 2.0 | 4198401.0 | 100.0 | Polybench_MVT | |
{'name': 'Stream', 'type': 'function'} | 1048576.0 | 41.0 | regionprofile | 0.259377 | 0.259377 | 0.259377 | 0.259377 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 44 | regionprofile | 0.259331 | 0.259331 | 0.259331 | 0.259331 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 41.0 | regionprofile | 0.258056 | 0.258056 | 0.258056 | 0.258056 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 42.0 | regionprofile | 0.260428 | 0.260428 | 0.260428 | 0.260428 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | Stream |
4194304.0 | 41.0 | regionprofile | 0.772383 | 0.772383 | 0.772383 | 0.772383 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 44 | regionprofile | 0.772250 | 0.772250 | 0.772250 | 0.772250 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 41.0 | regionprofile | 0.775312 | 0.775312 | 0.775312 | 0.775312 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 42.0 | regionprofile | 0.770613 | 0.770613 | 0.770613 | 0.770613 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | Stream | |
{'name': 'Stream_ADD', 'type': 'function'} | 1048576.0 | 42.0 | regionprofile | 0.033683 | 0.033683 | 0.033683 | 0.033683 | 2.516582e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 45 | regionprofile | 0.033667 | 0.033667 | 0.033667 | 0.033667 | 2.516582e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 42.0 | regionprofile | 0.033741 | 0.033741 | 0.033741 | 0.033741 | 2.516582e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 43.0 | regionprofile | 0.034200 | 0.034200 | 0.034200 | 0.034200 | 2.516582e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | Stream_ADD |
4194304.0 | 42.0 | regionprofile | 0.122838 | 0.122838 | 0.122838 | 0.122838 | 1.006633e+08 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 45 | regionprofile | 0.122784 | 0.122784 | 0.122784 | 0.122784 | 1.006633e+08 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 42.0 | regionprofile | 0.123120 | 0.123120 | 0.123120 | 0.123120 | 1.006633e+08 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 43.0 | regionprofile | 0.123213 | 0.123213 | 0.123213 | 0.123213 | 1.006633e+08 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | Stream_ADD | |
{'name': 'Stream_COPY', 'type': 'function'} | 1048576.0 | 43.0 | regionprofile | 0.043248 | 0.043248 | 0.043248 | 0.043248 | 1.677722e+07 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 1800.0 | 46 | regionprofile | 0.043435 | 0.043435 | 0.043435 | 0.043435 | 1.677722e+07 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 1800.0 | 43.0 | regionprofile | 0.043811 | 0.043811 | 0.043811 | 0.043811 | 1.677722e+07 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 1800.0 | 44.0 | regionprofile | 0.044358 | 0.044358 | 0.044358 | 0.044358 | 1.677722e+07 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 1800.0 | Stream_COPY |
4194304.0 | 43.0 | regionprofile | 0.154253 | 0.154253 | 0.154253 | 0.154253 | 6.710886e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 1800.0 | 46 | regionprofile | 0.154664 | 0.154664 | 0.154664 | 0.154664 | 6.710886e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 1800.0 | 43.0 | regionprofile | 0.155116 | 0.155116 | 0.155116 | 0.155116 | 6.710886e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 1800.0 | 44.0 | regionprofile | 0.155421 | 0.155421 | 0.155421 | 0.155421 | 6.710886e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 1800.0 | Stream_COPY | |
{'name': 'Stream_DOT', 'type': 'function'} | 1048576.0 | 44.0 | regionprofile | 0.105313 | 0.105313 | 0.105313 | 0.105313 | 1.677723e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 47 | regionprofile | 0.104737 | 0.104737 | 0.104737 | 0.104737 | 1.677723e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 44.0 | regionprofile | 0.102990 | 0.102990 | 0.102990 | 0.102990 | 1.677723e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | 45.0 | regionprofile | 0.103427 | 0.103427 | 0.103427 | 0.103427 | 1.677723e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 2000.0 | Stream_DOT |
4194304.0 | 44.0 | regionprofile | 0.218152 | 0.218152 | 0.218152 | 0.218152 | 6.710888e+07 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 47 | regionprofile | 0.217043 | 0.217043 | 0.217043 | 0.217043 | 6.710888e+07 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 44.0 | regionprofile | 0.218356 | 0.218356 | 0.218356 | 0.218356 | 6.710888e+07 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | 45.0 | regionprofile | 0.213373 | 0.213373 | 0.213373 | 0.213373 | 6.710888e+07 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 2000.0 | Stream_DOT | |
{'name': 'Stream_MUL', 'type': 'function'} | 1048576.0 | 45.0 | regionprofile | 0.043266 | 0.043266 | 0.043266 | 0.043266 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 1800.0 | 48 | regionprofile | 0.043424 | 0.043424 | 0.043424 | 0.043424 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 1800.0 | 45.0 | regionprofile | 0.043797 | 0.043797 | 0.043797 | 0.043797 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 1800.0 | 46.0 | regionprofile | 0.044307 | 0.044307 | 0.044307 | 0.044307 | 1.677722e+07 | 1.048576e+06 | 1048576.0 | 1.0 | 1048576.0 | 1800.0 | Stream_MUL |
4194304.0 | 45.0 | regionprofile | 0.154268 | 0.154268 | 0.154268 | 0.154268 | 6.710886e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 1800.0 | 48 | regionprofile | 0.154717 | 0.154717 | 0.154717 | 0.154717 | 6.710886e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 1800.0 | 45.0 | regionprofile | 0.155135 | 0.155135 | 0.155135 | 0.155135 | 6.710886e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 1800.0 | 46.0 | regionprofile | 0.155466 | 0.155466 | 0.155466 | 0.155466 | 6.710886e+07 | 4.194304e+06 | 4194304.0 | 1.0 | 4194304.0 | 1800.0 | Stream_MUL | |
{'name': 'Stream_TRIAD', 'type': 'function'} | 1048576.0 | 46.0 | regionprofile | 0.033838 | 0.033838 | 0.033838 | 0.033838 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 49 | regionprofile | 0.034037 | 0.034037 | 0.034037 | 0.034037 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 46.0 | regionprofile | 0.033686 | 0.033686 | 0.033686 | 0.033686 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | 47.0 | regionprofile | 0.034100 | 0.034100 | 0.034100 | 0.034100 | 2.516582e+07 | 2.097152e+06 | 1048576.0 | 1.0 | 1048576.0 | 1000.0 | Stream_TRIAD |
4194304.0 | 46.0 | regionprofile | 0.122837 | 0.122837 | 0.122837 | 0.122837 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 49 | regionprofile | 0.123006 | 0.123006 | 0.123006 | 0.123006 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 46.0 | regionprofile | 0.123550 | 0.123550 | 0.123550 | 0.123550 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | 47.0 | regionprofile | 0.123103 | 0.123103 | 0.123103 | 0.123103 | 1.006633e+08 | 8.388608e+06 | 4194304.0 | 1.0 | 4194304.0 | 1000.0 | Stream_TRIAD | |
{'name': 'Basic_INDEXLIST', 'type': 'function'} | 1048576.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 5 | regionprofile | 0.004352 | 0.004352 | 0.004352 | 0.004352 | 1.048578e+07 | 0.000000e+00 | 1048576.0 | 1.0 | 1048576.0 | 100.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Basic_INDEXLIST |
4194304.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 5 | regionprofile | 0.010043 | 0.010043 | 0.010043 | 0.010043 | 4.194306e+07 | 0.000000e+00 | 4194304.0 | 1.0 | 4194304.0 | 100.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Basic_INDEXLIST | |
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} | 1048576.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 6 | regionprofile | 0.008052 | 0.008052 | 0.008052 | 0.008052 | 2.726300e+07 | 0.000000e+00 | 3145729.0 | 3.0 | 1048576.0 | 100.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Basic_INDEXLIST_3LOOP |
4194304.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 6 | regionprofile | 0.024382 | 0.024382 | 0.024382 | 0.024382 | 1.090519e+08 | 0.000000e+00 | 12582913.0 | 3.0 | 4194304.0 | 100.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Basic_INDEXLIST_3LOOP | |
{'name': 'Basic_MAT_MAT_SHARED', 'type': 'function'} | 1048576.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 10 | regionprofile | 0.006440 | 0.006440 | 0.006440 | 0.006440 | 1.677722e+07 | 2.147484e+09 | 1048576.0 | 1.0 | 1048576.0 | 5.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 8.0 | regionprofile | 0.005955 | 0.005955 | 0.005955 | 0.005955 | 1.677722e+07 | 2.147484e+09 | 1048576.0 | 1.0 | 1048576.0 | 5.0 | Basic_MAT_MAT_SHARED |
4194304.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 10 | regionprofile | 0.042978 | 0.042978 | 0.042978 | 0.042978 | 6.710886e+07 | 1.717987e+10 | 4194304.0 | 1.0 | 4194304.0 | 5.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 8.0 | regionprofile | 0.040467 | 0.040467 | 0.040467 | 0.040467 | 6.710886e+07 | 1.717987e+10 | 4194304.0 | 1.0 | 4194304.0 | 5.0 | Basic_MAT_MAT_SHARED |
Filter with respect to metadata
The metadata table of a thicket makes it possible to select a subset of profiles based on specific metadata values, for example a particular compiler. In this example, we filter the metadata to select the profiles generated with a block size of 128.
[11]:
# select the profiles generated with a GPU block size of 128
filter_metadata_func = lambda x: x["gpu_targets_block_sizes"] == "128"
th_example = th_obj.filter_metadata(filter_metadata_func)
display(HTML(th_example.metadata.to_html()))
spot.channels | spot.options | spot.format.version | spot.timeseries.metrics | spot.metrics | cali.caliper.version | variant | Algorithm_MEMCPY | Algorithm_MEMSET | Algorithm_REDUCE_SUM | Algorithm_SORTPAIRS | Algorithm_SORT | Algorithm_SCAN | Apps_VOL3D | Apps_PRESSURE | Apps_NODAL_ACCUMULATION_3D | Apps_MASS3DPA | Apps_LTIMES_NOVIEW | Apps_LTIMES | Apps_HALOEXCHANGE_FUSED | Apps_HALOEXCHANGE | Apps_FIR | Apps_ENERGY | Apps_DIFFUSION3DPA | Apps_DEL_DOT_VEC_2D | Apps_CONVECTION3DPA | Stream_TRIAD | Stream_MUL | Stream_DOT | Stream_COPY | Stream_ADD | Polybench_MVT | Polybench_JACOBI_2D | Polybench_JACOBI_1D | Polybench_HEAT_3D | Polybench_GESUMMV | Polybench_GEMVER | Polybench_GEMM | Polybench_FLOYD_WARSHALL | Polybench_FDTD_2D | Polybench_ATAX | Polybench_ADI | Polybench_3MM | Polybench_2MM | Lcals_TRIDIAG_ELIM | Lcals_PLANCKIAN | Lcals_INT_PREDICT | Lcals_HYDRO_2D | Lcals_HYDRO_1D | Lcals_GEN_LIN_RECUR | Lcals_FIRST_SUM | Lcals_FIRST_MIN | Lcals_FIRST_DIFF | Lcals_EOS | Lcals_DIFF_PREDICT | Basic_TRAP_INT | Basic_REDUCE_STRUCT | Basic_REDUCE3_INT | Basic_PI_REDUCE | Basic_PI_ATOMIC | Basic_NESTED_INIT | Basic_MULADDSUB | Basic_MAT_MAT_SHARED | Basic_INIT_VIEW1D_OFFSET | Basic_INIT_VIEW1D | Basic_INIT3 | Basic_INDEXLIST_3LOOP | Basic_INDEXLIST | Basic_IF_QUAD | Basic_DAXPY_ATOMIC | Basic_DAXPY | SizeMeaning | Tuning | machine_build | systype_build | gpu_targets_block_sizes | cuda_compiler_version | Compiler_path_version | compiler | compiler_version | rajaperf_compiler_options | rajaperf_compiler | cmake_exe_linker_flags | cmake_cxx_flags | cmake_build_type | raja_version | perfsuite_version | cluster | cmdline | libraries | launchdate | user | cali.channel | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
profile |
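The table above comes back without any profile rows. One possible explanation, offered only as an assumption, is a type mismatch: the grouping output in the next section prints the block size as the integer 128, while the filter compares against the string "128". The sketch below reproduces that pitfall with plain pandas on a hypothetical stand-in table. The profile labels, the 256 entry, and the helper name `select_profiles` are made up for illustration and are not part of thicket.

```python
import pandas as pd

# Hypothetical stand-in for a thicket metadata table: one row per profile.
metadata = pd.DataFrame(
    {
        "gpu_targets_block_sizes": [128, 128, 256],
        "cluster": ["lassen", "lassen", "lassen"],
    },
    index=pd.Index(["profile_a", "profile_b", "profile_c"], name="profile"),
)


def select_profiles(table, predicate):
    """Keep the rows for which predicate(row) is true (illustrative helper)."""
    mask = table.apply(predicate, axis=1)
    return table[mask]


# Comparing an integer column against a string matches nothing.
as_string = select_profiles(metadata, lambda x: x["gpu_targets_block_sizes"] == "128")

# Comparing against the integer keeps the block-size-128 profiles.
as_integer = select_profiles(metadata, lambda x: x["gpu_targets_block_sizes"] == 128)

print(len(as_string), len(as_integer))
```

If a filter unexpectedly returns nothing, checking the column's values and dtype in `th_obj.metadata` first is a cheap way to rule this out.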
Group with the metadata
The metadata table also supports grouping the thicket by the unique values present in the provided column(s). The following example groups the thicket according to every unique combination of values in the launchdate and gpu_targets_block_sizes columns.
[12]:
# create sub-thickets from unique value combinations in the launchdate and gpu_targets_block_sizes columns
grouping_metadata_cols = ["launchdate", "gpu_targets_block_sizes"]
sub_thickets = th_lassen.groupby(grouping_metadata_cols)
for th in sub_thickets:
    display(th.metadata)
3 thickets created...
[(1668559508, 128), (1668559517, 128), (1668559527, 128)]
spot.channels | spot.options | spot.format.version | spot.timeseries.metrics | spot.metrics | cali.caliper.version | variant | Algorithm_MEMCPY | Algorithm_MEMSET | Algorithm_REDUCE_SUM | ... | cmake_cxx_flags | cmake_build_type | raja_version | perfsuite_version | cluster | cmdline | libraries | launchdate | user | cali.channel | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
profile | |||||||||||||||||||||
2118176828577267326 | regionprofile | 2 | min#inclusive#sum#time.duration,max#inclusive#... | 2.9.0-dev | Base_CUDA | {'Problem size': 1048576,'Reps':100,'Iteration... | {'Problem size': 1048576,'Reps':100,'Iteration... | {'Problem size': 1048576,'Reps':50,'Iterations... | ... | -qthreaded | Release | 2022.03.0 | 2022.3.0 | lassen | [./my-xl-block-128-build/bin/raja-perf.exe, --... | [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/wo... | 1668559508 | burgess9 | spot |
1 rows × 92 columns
spot.channels | spot.options | spot.format.version | spot.timeseries.metrics | spot.metrics | cali.caliper.version | variant | Algorithm_MEMCPY | Algorithm_MEMSET | Algorithm_REDUCE_SUM | ... | cmake_cxx_flags | cmake_build_type | raja_version | perfsuite_version | cluster | cmdline | libraries | launchdate | user | cali.channel | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
profile | |||||||||||||||||||||
7599706706110163426 | regionprofile | 2 | min#inclusive#sum#time.duration,max#inclusive#... | 2.9.0-dev | Base_CUDA | {'Problem size': 1048576,'Reps':100,'Iteration... | {'Problem size': 1048576,'Reps':100,'Iteration... | {'Problem size': 1048576,'Reps':50,'Iterations... | ... | -qthreaded | Release | 2022.03.0 | 2022.3.0 | lassen | [./my-xl-block-128-build/bin/raja-perf.exe, --... | [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/wo... | 1668559517 | burgess9 | spot |
1 rows × 92 columns
spot.channels | spot.options | spot.format.version | spot.timeseries.metrics | spot.metrics | cali.caliper.version | variant | Algorithm_MEMCPY | Algorithm_MEMSET | Algorithm_REDUCE_SUM | ... | cmake_cxx_flags | cmake_build_type | raja_version | perfsuite_version | cluster | cmdline | libraries | launchdate | user | cali.channel | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
profile | |||||||||||||||||||||
5411856859942141273 | regionprofile | 2 | min#inclusive#sum#time.duration,max#inclusive#... | 2.9.0-dev | Base_CUDA | {'Problem size': 1048576,'Reps':100,'Iteration... | {'Problem size': 1048576,'Reps':100,'Iteration... | {'Problem size': 1048576,'Reps':50,'Iterations... | ... | -qthreaded | Release | 2022.03.0 | 2022.3.0 | lassen | [./my-xl-block-128-build/bin/raja-perf.exe, --... | [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/wo... | 1668559527 | burgess9 | spot |
1 rows × 92 columns
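The grouping idea can also be sketched with plain pandas, without claiming anything about how thicket implements it internally. The example below rebuilds a three-column excerpt of the metadata shown above (profile hashes, launchdate, and block size are copied from the output; every other column is omitted) and splits it by unique combinations of the two grouping columns.

```python
import pandas as pd

# Values copied from the metadata output above; only three columns are kept.
metadata = pd.DataFrame(
    {
        "launchdate": [1668559508, 1668559517, 1668559527],
        "gpu_targets_block_sizes": [128, 128, 128],
        "cluster": ["lassen", "lassen", "lassen"],
    },
    index=pd.Index(
        [2118176828577267326, 7599706706110163426, 5411856859942141273],
        name="profile",
    ),
)

grouping_metadata_cols = ["launchdate", "gpu_targets_block_sizes"]

# One sub-table per unique (launchdate, block size) combination.
groups = {key: sub for key, sub in metadata.groupby(grouping_metadata_cols)}

print(f"{len(groups)} groups created")
print([tuple(int(v) for v in key) for key in groups])
```

Because every profile here has a distinct launchdate, each group contains exactly one profile, which matches the three single-row tables above.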
The aggregated statistics table of a thicket is a GraphFrame, so it contains a graph and a corresponding dataframe component. The table supports an order-reduction mechanism and stores processed application performance data. Each row holds data aggregated across all profiles associated with a particular call tree node. Below is an example of an empty aggregated statistics table, which so far contains only the node names.
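To make the idea of one aggregated row per call tree node concrete, here is a minimal pandas sketch. It is not thicket's own implementation. The timings, profile labels, and the output column names `time_mean` and `time_median` are invented for illustration.

```python
import pandas as pd

# Hypothetical per-profile timings, indexed by (node, profile) like a
# performance data table. The numbers are invented for illustration.
index = pd.MultiIndex.from_product(
    [["Stream_ADD", "Stream_COPY"], ["profile_a", "profile_b", "profile_c"]],
    names=["node", "profile"],
)
perf = pd.DataFrame({"time": [0.030, 0.034, 0.032, 0.040, 0.044, 0.048]}, index=index)

# Collapse the profile level: one row per node, one column per statistic.
stats = perf.groupby(level="node")["time"].agg(["mean", "median"])
stats.columns = ["time_mean", "time_median"]

print(stats)
```

The result has one row per node and no profile level, which is the shape of the aggregated statistics table displayed in the next cell.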
View aggregated statistics table
[13]:
display(HTML(th_lassen.statsframe.dataframe.to_html()))
name | |
---|---|
node | |
{'name': 'Base_CUDA', 'type': 'function'} | Base_CUDA |
{'name': 'Algorithm', 'type': 'function'} | Algorithm |
{'name': 'Algorithm_MEMCPY', 'type': 'function'} | Algorithm_MEMCPY |
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} | Algorithm_MEMCPY.block_128 |
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} | Algorithm_MEMCPY.library |
{'name': 'Algorithm_MEMSET', 'type': 'function'} | Algorithm_MEMSET |
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} | Algorithm_MEMSET.block_128 |
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} | Algorithm_MEMSET.library |
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} | Algorithm_REDUCE_SUM |
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} | Algorithm_REDUCE_SUM.block_128 |
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} | Algorithm_REDUCE_SUM.cub |
{'name': 'Algorithm_SCAN', 'type': 'function'} | Algorithm_SCAN |
{'name': 'Algorithm_SCAN.default', 'type': 'function'} | Algorithm_SCAN.default |
{'name': 'Apps', 'type': 'function'} | Apps |
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} | Apps_CONVECTION3DPA |
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} | Apps_CONVECTION3DPA.block_64 |
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} | Apps_DEL_DOT_VEC_2D |
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} | Apps_DEL_DOT_VEC_2D.block_128 |
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} | Apps_DIFFUSION3DPA |
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} | Apps_DIFFUSION3DPA.block_64 |
{'name': 'Apps_ENERGY', 'type': 'function'} | Apps_ENERGY |
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} | Apps_ENERGY.block_128 |
{'name': 'Apps_FIR', 'type': 'function'} | Apps_FIR |
{'name': 'Apps_FIR.block_128', 'type': 'function'} | Apps_FIR.block_128 |
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} | Apps_HALOEXCHANGE |
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} | Apps_HALOEXCHANGE.block_128 |
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} | Apps_HALOEXCHANGE_FUSED |
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} | Apps_HALOEXCHANGE_FUSED.block_128 |
{'name': 'Apps_LTIMES', 'type': 'function'} | Apps_LTIMES |
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} | Apps_LTIMES.block_128 |
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} | Apps_LTIMES_NOVIEW |
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} | Apps_LTIMES_NOVIEW.block_128 |
{'name': 'Apps_MASS3DPA', 'type': 'function'} | Apps_MASS3DPA |
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} | Apps_MASS3DPA.block_25 |
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} | Apps_NODAL_ACCUMULATION_3D |
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} | Apps_NODAL_ACCUMULATION_3D.block_128 |
{'name': 'Apps_PRESSURE', 'type': 'function'} | Apps_PRESSURE |
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} | Apps_PRESSURE.block_128 |
{'name': 'Apps_VOL3D', 'type': 'function'} | Apps_VOL3D |
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} | Apps_VOL3D.block_128 |
{'name': 'Basic', 'type': 'function'} | Basic |
{'name': 'Basic_DAXPY', 'type': 'function'} | Basic_DAXPY |
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} | Basic_DAXPY.block_128 |
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} | Basic_DAXPY_ATOMIC |
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} | Basic_DAXPY_ATOMIC.block_128 |
{'name': 'Basic_IF_QUAD', 'type': 'function'} | Basic_IF_QUAD |
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} | Basic_IF_QUAD.block_128 |
{'name': 'Basic_INDEXLIST', 'type': 'function'} | Basic_INDEXLIST |
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} | Basic_INDEXLIST.block_256 |
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} | Basic_INDEXLIST_3LOOP |
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} | Basic_INDEXLIST_3LOOP.block_256 |
{'name': 'Basic_INIT3', 'type': 'function'} | Basic_INIT3 |
{'name': 'Basic_INIT3.block_128', 'type': 'function'} | Basic_INIT3.block_128 |
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} | Basic_INIT_VIEW1D |
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} | Basic_INIT_VIEW1D.block_128 |
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} | Basic_INIT_VIEW1D_OFFSET |
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} | Basic_INIT_VIEW1D_OFFSET.block_128 |
{'name': 'Basic_MULADDSUB', 'type': 'function'} | Basic_MULADDSUB |
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} | Basic_MULADDSUB.block_128 |
{'name': 'Basic_NESTED_INIT', 'type': 'function'} | Basic_NESTED_INIT |
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} | Basic_NESTED_INIT.block_128 |
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} | Basic_PI_ATOMIC |
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} | Basic_PI_ATOMIC.block_128 |
{'name': 'Basic_PI_REDUCE', 'type': 'function'} | Basic_PI_REDUCE |
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} | Basic_PI_REDUCE.block_128 |
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} | Basic_REDUCE3_INT |
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} | Basic_REDUCE3_INT.block_128 |
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} | Basic_REDUCE_STRUCT |
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} | Basic_REDUCE_STRUCT.block_128 |
{'name': 'Basic_TRAP_INT', 'type': 'function'} | Basic_TRAP_INT |
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} | Basic_TRAP_INT.block_128 |
{'name': 'Lcals', 'type': 'function'} | Lcals |
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} | Lcals_DIFF_PREDICT |
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} | Lcals_DIFF_PREDICT.block_128 |
{'name': 'Lcals_EOS', 'type': 'function'} | Lcals_EOS |
{'name': 'Lcals_EOS.block_128', 'type': 'function'} | Lcals_EOS.block_128 |
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} | Lcals_FIRST_DIFF |
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} | Lcals_FIRST_DIFF.block_128 |
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} | Lcals_FIRST_MIN |
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} | Lcals_FIRST_MIN.block_128 |
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} | Lcals_FIRST_SUM |
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} | Lcals_FIRST_SUM.block_128 |
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} | Lcals_GEN_LIN_RECUR |
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} | Lcals_GEN_LIN_RECUR.block_128 |
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} | Lcals_HYDRO_1D |
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} | Lcals_HYDRO_1D.block_128 |
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} | Lcals_HYDRO_2D |
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} | Lcals_HYDRO_2D.block_128 |
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} | Lcals_INT_PREDICT |
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} | Lcals_INT_PREDICT.block_128 |
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} | Lcals_PLANCKIAN |
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} | Lcals_PLANCKIAN.block_128 |
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} | Lcals_TRIDIAG_ELIM |
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} | Lcals_TRIDIAG_ELIM.block_128 |
{'name': 'Polybench', 'type': 'function'} | Polybench |
{'name': 'Polybench_2MM', 'type': 'function'} | Polybench_2MM |
{'name': 'Polybench_2MM.block_128', 'type': 'function'} | Polybench_2MM.block_128 |
{'name': 'Polybench_3MM', 'type': 'function'} | Polybench_3MM |
{'name': 'Polybench_3MM.block_128', 'type': 'function'} | Polybench_3MM.block_128 |
{'name': 'Polybench_ADI', 'type': 'function'} | Polybench_ADI |
{'name': 'Polybench_ADI.block_128', 'type': 'function'} | Polybench_ADI.block_128 |
{'name': 'Polybench_ATAX', 'type': 'function'} | Polybench_ATAX |
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} | Polybench_ATAX.block_128 |
{'name': 'Polybench_FDTD_2D', 'type': 'function'} | Polybench_FDTD_2D |
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} | Polybench_FDTD_2D.block_128 |
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} | Polybench_FLOYD_WARSHALL |
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} | Polybench_FLOYD_WARSHALL.block_128 |
{'name': 'Polybench_GEMM', 'type': 'function'} | Polybench_GEMM |
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} | Polybench_GEMM.block_128 |
{'name': 'Polybench_GEMVER', 'type': 'function'} | Polybench_GEMVER |
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} | Polybench_GEMVER.block_128 |
{'name': 'Polybench_GESUMMV', 'type': 'function'} | Polybench_GESUMMV |
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} | Polybench_GESUMMV.block_128 |
{'name': 'Polybench_HEAT_3D', 'type': 'function'} | Polybench_HEAT_3D |
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} | Polybench_HEAT_3D.block_128 |
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} | Polybench_JACOBI_1D |
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} | Polybench_JACOBI_1D.block_128 |
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} | Polybench_JACOBI_2D |
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} | Polybench_JACOBI_2D.block_128 |
{'name': 'Polybench_MVT', 'type': 'function'} | Polybench_MVT |
{'name': 'Polybench_MVT.block_128', 'type': 'function'} | Polybench_MVT.block_128 |
{'name': 'Stream', 'type': 'function'} | Stream |
{'name': 'Stream_ADD', 'type': 'function'} | Stream_ADD |
{'name': 'Stream_ADD.block_128', 'type': 'function'} | Stream_ADD.block_128 |
{'name': 'Stream_COPY', 'type': 'function'} | Stream_COPY |
{'name': 'Stream_COPY.block_128', 'type': 'function'} | Stream_COPY.block_128 |
{'name': 'Stream_DOT', 'type': 'function'} | Stream_DOT |
{'name': 'Stream_DOT.block_128', 'type': 'function'} | Stream_DOT.block_128 |
{'name': 'Stream_MUL', 'type': 'function'} | Stream_MUL |
{'name': 'Stream_MUL.block_128', 'type': 'function'} | Stream_MUL.block_128 |
{'name': 'Stream_TRIAD', 'type': 'function'} | Stream_TRIAD |
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} | Stream_TRIAD.block_128 |
Filter with respect to aggregated statistics
The aggregated statistics table also supports a filter function. In the example below, we filter the table to select the nodes with the names Base_CUDA, Algorithm, and Stream_MUL.
[14]:
stats_nodes = ["Base_CUDA", "Algorithm", "Stream_MUL"]
th_stats_name = th_obj.filter_stats(lambda x: x["name"] in stats_nodes)
display(HTML(th_stats_name.statsframe.dataframe.to_html()))
node | name |
---|---|
{'name': 'Base_CUDA', 'type': 'function'} | Base_CUDA |
{'name': 'Algorithm', 'type': 'function'} | Algorithm |
{'name': 'Stream_MUL', 'type': 'function'} | Stream_MUL |
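The row-wise predicate used in the filter above can be illustrated with a minimal plain-Python sketch. It does not call Thicket: the rows list and the filter_rows helper are hypothetical stand-ins for the aggregated statistics table and the filter function, under the assumption that the predicate sees one row at a time and a row is kept when the predicate returns True.

```python
# Hypothetical stand-in for a few rows of the aggregated statistics table.
rows = [
    {"name": "Base_CUDA"},
    {"name": "Algorithm"},
    {"name": "Algorithm_MEMCPY"},
    {"name": "Stream_MUL"},
    {"name": "Stream_MUL.block_128"},
]


def filter_rows(table, predicate):
    """Keep only the rows for which predicate(row) is truthy (illustrative helper)."""
    return [row for row in table if predicate(row)]


stats_nodes = ["Base_CUDA", "Algorithm", "Stream_MUL"]

# Same lambda shape as in the notebook cell: exact name match against the list.
kept = filter_rows(rows, lambda x: x["name"] in stats_nodes)
kept_names = [row["name"] for row in kept]
print(kept_names)
```

Because the membership test compares whole names, a child node such as Stream_MUL.block_128 is not selected unless it is listed explicitly, which is consistent with the three-row result shown above.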
Calculate the median and mean of a performance data column and append them to the aggregated statistics table
The aggregated statistics table lets users select a column from the performance data and compute summary statistics over its values. After the median() and mean() operations are applied to that column, two new columns are appended to the statistics table, holding the median and mean values for each column provided by the user.
Below is an example where we calculate the median and mean of the values in the Total time (exc) column, which is the total exclusive time corresponding to the respective nodes.
[15]:
metrics = ["Total time (exc)"]
tt.median(th_lassen, columns=metrics)
display(HTML(th_lassen.statsframe.dataframe.to_html()))
node | name | Total time (exc)_median |
---|---|---|
{'name': 'Base_CUDA', 'type': 'function'} | Base_CUDA | 0.000636 |
{'name': 'Algorithm', 'type': 'function'} | Algorithm | 0.000048 |
{'name': 'Algorithm_MEMCPY', 'type': 'function'} | Algorithm_MEMCPY | 0.000016 |
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} | Algorithm_MEMCPY.block_128 | 0.002440 |
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} | Algorithm_MEMCPY.library | 0.002609 |
{'name': 'Algorithm_MEMSET', 'type': 'function'} | Algorithm_MEMSET | 0.000016 |
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} | Algorithm_MEMSET.block_128 | 0.001488 |
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} | Algorithm_MEMSET.library | 0.001281 |
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} | Algorithm_REDUCE_SUM | 0.000019 |
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} | Algorithm_REDUCE_SUM.block_128 | 0.002683 |
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} | Algorithm_REDUCE_SUM.cub | 0.001527 |
{'name': 'Algorithm_SCAN', 'type': 'function'} | Algorithm_SCAN | 0.000008 |
{'name': 'Algorithm_SCAN.default', 'type': 'function'} | Algorithm_SCAN.default | 0.002917 |
{'name': 'Apps', 'type': 'function'} | Apps | 0.000086 |
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} | Apps_CONVECTION3DPA | 0.000008 |
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} | Apps_CONVECTION3DPA.block_64 | 0.003161 |
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} | Apps_DEL_DOT_VEC_2D | 0.000009 |
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} | Apps_DEL_DOT_VEC_2D.block_128 | 0.007006 |
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} | Apps_DIFFUSION3DPA | 0.000010 |
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} | Apps_DIFFUSION3DPA.block_64 | 0.004926 |
{'name': 'Apps_ENERGY', 'type': 'function'} | Apps_ENERGY | 0.000009 |
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} | Apps_ENERGY.block_128 | 0.039303 |
{'name': 'Apps_FIR', 'type': 'function'} | Apps_FIR | 0.000010 |
{'name': 'Apps_FIR.block_128', 'type': 'function'} | Apps_FIR.block_128 | 0.004157 |
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} | Apps_HALOEXCHANGE | 0.000009 |
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} | Apps_HALOEXCHANGE.block_128 | 0.037478 |
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} | Apps_HALOEXCHANGE_FUSED | 0.000010 |
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} | Apps_HALOEXCHANGE_FUSED.block_128 | 0.009430 |
{'name': 'Apps_LTIMES', 'type': 'function'} | Apps_LTIMES | 0.000008 |
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} | Apps_LTIMES.block_128 | 0.012088 |
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} | Apps_LTIMES_NOVIEW | 0.000011 |
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} | Apps_LTIMES_NOVIEW.block_128 | 0.012085 |
{'name': 'Apps_MASS3DPA', 'type': 'function'} | Apps_MASS3DPA | 0.000008 |
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} | Apps_MASS3DPA.block_25 | 0.001904 |
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} | Apps_NODAL_ACCUMULATION_3D | 0.000009 |
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} | Apps_NODAL_ACCUMULATION_3D.block_128 | 0.007408 |
{'name': 'Apps_PRESSURE', 'type': 'function'} | Apps_PRESSURE | 0.000009 |
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} | Apps_PRESSURE.block_128 | 0.048251 |
{'name': 'Apps_VOL3D', 'type': 'function'} | Apps_VOL3D | 0.000009 |
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} | Apps_VOL3D.block_128 | 0.005903 |
{'name': 'Basic', 'type': 'function'} | Basic | 0.000097 |
{'name': 'Basic_DAXPY', 'type': 'function'} | Basic_DAXPY | 0.000011 |
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} | Basic_DAXPY.block_128 | 0.016924 |
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} | Basic_DAXPY_ATOMIC | 0.000009 |
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} | Basic_DAXPY_ATOMIC.block_128 | 0.016947 |
{'name': 'Basic_IF_QUAD', 'type': 'function'} | Basic_IF_QUAD | 0.000008 |
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} | Basic_IF_QUAD.block_128 | 0.012622 |
{'name': 'Basic_INDEXLIST', 'type': 'function'} | Basic_INDEXLIST | 0.000008 |
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} | Basic_INDEXLIST.block_256 | 0.004506 |
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} | Basic_INDEXLIST_3LOOP | 0.000009 |
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} | Basic_INDEXLIST_3LOOP.block_256 | 0.008175 |
{'name': 'Basic_INIT3', 'type': 'function'} | Basic_INIT3 | 0.000009 |
{'name': 'Basic_INIT3.block_128', 'type': 'function'} | Basic_INIT3.block_128 | 0.028197 |
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} | Basic_INIT_VIEW1D | 0.000009 |
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} | Basic_INIT_VIEW1D.block_128 | 0.041780 |
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} | Basic_INIT_VIEW1D_OFFSET | 0.000009 |
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} | Basic_INIT_VIEW1D_OFFSET.block_128 | 0.041795 |
{'name': 'Basic_MULADDSUB', 'type': 'function'} | Basic_MULADDSUB | 0.000011 |
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} | Basic_MULADDSUB.block_128 | 0.019746 |
{'name': 'Basic_NESTED_INIT', 'type': 'function'} | Basic_NESTED_INIT | 0.000009 |
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} | Basic_NESTED_INIT.block_128 | 0.020670 |
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} | Basic_PI_ATOMIC | 0.000009 |
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} | Basic_PI_ATOMIC.block_128 | 0.124113 |
{'name': 'Basic_PI_REDUCE', 'type': 'function'} | Basic_PI_REDUCE | 0.000008 |
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} | Basic_PI_REDUCE.block_128 | 0.002602 |
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} | Basic_REDUCE3_INT | 0.000008 |
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} | Basic_REDUCE3_INT.block_128 | 0.002380 |
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} | Basic_REDUCE_STRUCT | 0.000008 |
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} | Basic_REDUCE_STRUCT.block_128 | 0.016917 |
{'name': 'Basic_TRAP_INT', 'type': 'function'} | Basic_TRAP_INT | 0.000007 |
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} | Basic_TRAP_INT.block_128 | 0.002596 |
{'name': 'Lcals', 'type': 'function'} | Lcals | 0.000074 |
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} | Lcals_DIFF_PREDICT | 0.000009 |
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} | Lcals_DIFF_PREDICT.block_128 | 0.061523 |
{'name': 'Lcals_EOS', 'type': 'function'} | Lcals_EOS | 0.000010 |
{'name': 'Lcals_EOS.block_128', 'type': 'function'} | Lcals_EOS.block_128 | 0.022685 |
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} | Lcals_FIRST_DIFF | 0.000009 |
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} | Lcals_FIRST_DIFF.block_128 | 0.048455 |
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} | Lcals_FIRST_MIN | 0.000013 |
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} | Lcals_FIRST_MIN.block_128 | 0.048713 |
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} | Lcals_FIRST_SUM | 0.000009 |
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} | Lcals_FIRST_SUM.block_128 | 0.048516 |
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} | Lcals_GEN_LIN_RECUR | 0.000009 |
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} | Lcals_GEN_LIN_RECUR.block_128 | 0.048865 |
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} | Lcals_HYDRO_1D | 0.000009 |
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} | Lcals_HYDRO_1D.block_128 | 0.033971 |
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} | Lcals_HYDRO_2D | 0.000010 |
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} | Lcals_HYDRO_2D.block_128 | 0.022995 |
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} | Lcals_INT_PREDICT | 0.000010 |
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} | Lcals_INT_PREDICT.block_128 | 0.046876 |
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} | Lcals_PLANCKIAN | 0.000010 |
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} | Lcals_PLANCKIAN.block_128 | 0.002759 |
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} | Lcals_TRIDIAG_ELIM | 0.000010 |
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} | Lcals_TRIDIAG_ELIM.block_128 | 0.044681 |
{'name': 'Polybench', 'type': 'function'} | Polybench | 0.000086 |
{'name': 'Polybench_2MM', 'type': 'function'} | Polybench_2MM | 0.000008 |
{'name': 'Polybench_2MM.block_128', 'type': 'function'} | Polybench_2MM.block_128 | 0.005949 |
{'name': 'Polybench_3MM', 'type': 'function'} | Polybench_3MM | 0.000008 |
{'name': 'Polybench_3MM.block_128', 'type': 'function'} | Polybench_3MM.block_128 | 0.008885 |
{'name': 'Polybench_ADI', 'type': 'function'} | Polybench_ADI | 0.000009 |
{'name': 'Polybench_ADI.block_128', 'type': 'function'} | Polybench_ADI.block_128 | 0.033592 |
{'name': 'Polybench_ATAX', 'type': 'function'} | Polybench_ATAX | 0.000009 |
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} | Polybench_ATAX.block_128 | 0.037403 |
{'name': 'Polybench_FDTD_2D', 'type': 'function'} | Polybench_FDTD_2D | 0.000009 |
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} | Polybench_FDTD_2D.block_128 | 0.037109 |
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} | Polybench_FLOYD_WARSHALL | 0.000008 |
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} | Polybench_FLOYD_WARSHALL.block_128 | 0.209053 |
{'name': 'Polybench_GEMM', 'type': 'function'} | Polybench_GEMM | 0.000010 |
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} | Polybench_GEMM.block_128 | 0.006741 |
{'name': 'Polybench_GEMVER', 'type': 'function'} | Polybench_GEMVER | 0.000008 |
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} | Polybench_GEMVER.block_128 | 0.008432 |
{'name': 'Polybench_GESUMMV', 'type': 'function'} | Polybench_GESUMMV | 0.000009 |
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} | Polybench_GESUMMV.block_128 | 0.037410 |
{'name': 'Polybench_HEAT_3D', 'type': 'function'} | Polybench_HEAT_3D | 0.000009 |
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} | Polybench_HEAT_3D.block_128 | 0.021826 |
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} | Polybench_JACOBI_1D | 0.000011 |
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} | Polybench_JACOBI_1D.block_128 | 0.077718 |
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} | Polybench_JACOBI_2D | 0.000010 |
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} | Polybench_JACOBI_2D.block_128 | 0.102580 |
{'name': 'Polybench_MVT', 'type': 'function'} | Polybench_MVT | 0.000009 |
{'name': 'Polybench_MVT.block_128', 'type': 'function'} | Polybench_MVT.block_128 | 0.035797 |
{'name': 'Stream', 'type': 'function'} | Stream | 0.000033 |
{'name': 'Stream_ADD', 'type': 'function'} | Stream_ADD | 0.000009 |
{'name': 'Stream_ADD.block_128', 'type': 'function'} | Stream_ADD.block_128 | 0.033798 |
{'name': 'Stream_COPY', 'type': 'function'} | Stream_COPY | 0.000009 |
{'name': 'Stream_COPY.block_128', 'type': 'function'} | Stream_COPY.block_128 | 0.043261 |
{'name': 'Stream_DOT', 'type': 'function'} | Stream_DOT | 0.000009 |
{'name': 'Stream_DOT.block_128', 'type': 'function'} | Stream_DOT.block_128 | 0.113655 |
{'name': 'Stream_MUL', 'type': 'function'} | Stream_MUL | 0.000011 |
{'name': 'Stream_MUL.block_128', 'type': 'function'} | Stream_MUL.block_128 | 0.043271 |
{'name': 'Stream_TRIAD', 'type': 'function'} | Stream_TRIAD | 0.000008 |
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} | Stream_TRIAD.block_128 | 0.033730 |
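As a rough illustration of what a per-node median and mean across profiles represent, the following sketch uses only the Python standard library. It is not Thicket's implementation: the node names and timing values are hypothetical, and the assumption is simply that each node has one exclusive-time value per profile.

```python
from statistics import mean, median

# Hypothetical exclusive times (seconds) for two nodes across three profiles.
# These values are illustrative only and are not taken from the lassen data.
times_by_node = {
    "kernel_a": [0.010, 0.012, 0.011],
    "kernel_b": [0.040, 0.041, 0.090],  # one noticeably slower run
}

# One row per node, with one new column per statistic, mirroring the
# "<column>_median" and "<column>_mean" naming seen in the notebook output.
stats = {
    node: {
        "Total time (exc)_median": median(values),
        "Total time (exc)_mean": mean(values),
    }
    for node, values in times_by_node.items()
}

for node, row in stats.items():
    print(node, row)
```

In this toy data the single slow run of kernel_b pulls its mean well above its median, while the two statistics nearly coincide for kernel_a. Comparing the two columns per node is one simple way to spot run-to-run variability.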
[16]:
tt.mean(th_lassen, columns=metrics)
display(HTML(th_lassen.statsframe.dataframe.to_html()))
node | name | Total time (exc)_median | Total time (exc)_mean |
---|---|---|---|
{'name': 'Base_CUDA', 'type': 'function'} | Base_CUDA | 0.000636 | 0.000632 |
{'name': 'Algorithm', 'type': 'function'} | Algorithm | 0.000048 | 0.000048 |
{'name': 'Algorithm_MEMCPY', 'type': 'function'} | Algorithm_MEMCPY | 0.000016 | 0.000016 |
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} | Algorithm_MEMCPY.block_128 | 0.002440 | 0.002442 |
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} | Algorithm_MEMCPY.library | 0.002609 | 0.002609 |
{'name': 'Algorithm_MEMSET', 'type': 'function'} | Algorithm_MEMSET | 0.000016 | 0.000015 |
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} | Algorithm_MEMSET.block_128 | 0.001488 | 0.001487 |
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} | Algorithm_MEMSET.library | 0.001281 | 0.001279 |
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} | Algorithm_REDUCE_SUM | 0.000019 | 0.000019 |
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} | Algorithm_REDUCE_SUM.block_128 | 0.002683 | 0.002686 |
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} | Algorithm_REDUCE_SUM.cub | 0.001527 | 0.001526 |
{'name': 'Algorithm_SCAN', 'type': 'function'} | Algorithm_SCAN | 0.000008 | 0.000008 |
{'name': 'Algorithm_SCAN.default', 'type': 'function'} | Algorithm_SCAN.default | 0.002917 | 0.002917 |
{'name': 'Apps', 'type': 'function'} | Apps | 0.000086 | 0.000088 |
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} | Apps_CONVECTION3DPA | 0.000008 | 0.000008 |
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} | Apps_CONVECTION3DPA.block_64 | 0.003161 | 0.003161 |
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} | Apps_DEL_DOT_VEC_2D | 0.000009 | 0.000009 |
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} | Apps_DEL_DOT_VEC_2D.block_128 | 0.007006 | 0.007019 |
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} | Apps_DIFFUSION3DPA | 0.000010 | 0.000009 |
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} | Apps_DIFFUSION3DPA.block_64 | 0.004926 | 0.004932 |
{'name': 'Apps_ENERGY', 'type': 'function'} | Apps_ENERGY | 0.000009 | 0.000009 |
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} | Apps_ENERGY.block_128 | 0.039303 | 0.039285 |
{'name': 'Apps_FIR', 'type': 'function'} | Apps_FIR | 0.000010 | 0.000010 |
{'name': 'Apps_FIR.block_128', 'type': 'function'} | Apps_FIR.block_128 | 0.004157 | 0.004165 |
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} | Apps_HALOEXCHANGE | 0.000009 | 0.000009 |
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} | Apps_HALOEXCHANGE.block_128 | 0.037478 | 0.037577 |
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} | Apps_HALOEXCHANGE_FUSED | 0.000010 | 0.000010 |
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} | Apps_HALOEXCHANGE_FUSED.block_128 | 0.009430 | 0.009341 |
{'name': 'Apps_LTIMES', 'type': 'function'} | Apps_LTIMES | 0.000008 | 0.000008 |
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} | Apps_LTIMES.block_128 | 0.012088 | 0.012090 |
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} | Apps_LTIMES_NOVIEW | 0.000011 | 0.000011 |
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} | Apps_LTIMES_NOVIEW.block_128 | 0.012085 | 0.012087 |
{'name': 'Apps_MASS3DPA', 'type': 'function'} | Apps_MASS3DPA | 0.000008 | 0.000008 |
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} | Apps_MASS3DPA.block_25 | 0.001904 | 0.001906 |
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} | Apps_NODAL_ACCUMULATION_3D | 0.000009 | 0.000008 |
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} | Apps_NODAL_ACCUMULATION_3D.block_128 | 0.007408 | 0.007401 |
{'name': 'Apps_PRESSURE', 'type': 'function'} | Apps_PRESSURE | 0.000009 | 0.000009 |
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} | Apps_PRESSURE.block_128 | 0.048251 | 0.048255 |
{'name': 'Apps_VOL3D', 'type': 'function'} | Apps_VOL3D | 0.000009 | 0.000009 |
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} | Apps_VOL3D.block_128 | 0.005903 | 0.005877 |
{'name': 'Basic', 'type': 'function'} | Basic | 0.000097 | 0.000098 |
{'name': 'Basic_DAXPY', 'type': 'function'} | Basic_DAXPY | 0.000011 | 0.000011 |
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} | Basic_DAXPY.block_128 | 0.016924 | 0.016901 |
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} | Basic_DAXPY_ATOMIC | 0.000009 | 0.000009 |
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} | Basic_DAXPY_ATOMIC.block_128 | 0.016947 | 0.016938 |
{'name': 'Basic_IF_QUAD', 'type': 'function'} | Basic_IF_QUAD | 0.000008 | 0.000008 |
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} | Basic_IF_QUAD.block_128 | 0.012622 | 0.012651 |
{'name': 'Basic_INDEXLIST', 'type': 'function'} | Basic_INDEXLIST | 0.000008 | 0.000009 |
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} | Basic_INDEXLIST.block_256 | 0.004506 | 0.004434 |
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} | Basic_INDEXLIST_3LOOP | 0.000009 | 0.000009 |
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} | Basic_INDEXLIST_3LOOP.block_256 | 0.008175 | 0.008132 |
{'name': 'Basic_INIT3', 'type': 'function'} | Basic_INIT3 | 0.000009 | 0.000009 |
{'name': 'Basic_INIT3.block_128', 'type': 'function'} | Basic_INIT3.block_128 | 0.028197 | 0.028248 |
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} | Basic_INIT_VIEW1D | 0.000009 | 0.000008 |
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} | Basic_INIT_VIEW1D.block_128 | 0.041780 | 0.040132 |
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} | Basic_INIT_VIEW1D_OFFSET | 0.000009 | 0.000009 |
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} | Basic_INIT_VIEW1D_OFFSET.block_128 | 0.041795 | 0.040145 |
{'name': 'Basic_MULADDSUB', 'type': 'function'} | Basic_MULADDSUB | 0.000011 | 0.000011 |
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} | Basic_MULADDSUB.block_128 | 0.019746 | 0.019738 |
{'name': 'Basic_NESTED_INIT', 'type': 'function'} | Basic_NESTED_INIT | 0.000009 | 0.000009 |
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} | Basic_NESTED_INIT.block_128 | 0.020670 | 0.019845 |
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} | Basic_PI_ATOMIC | 0.000009 | 0.000009 |
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} | Basic_PI_ATOMIC.block_128 | 0.124113 | 0.120364 |
{'name': 'Basic_PI_REDUCE', 'type': 'function'} | Basic_PI_REDUCE | 0.000008 | 0.000008 |
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} | Basic_PI_REDUCE.block_128 | 0.002602 | 0.002619 |
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} | Basic_REDUCE3_INT | 0.000008 | 0.000008 |
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} | Basic_REDUCE3_INT.block_128 | 0.002380 | 0.002404 |
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} | Basic_REDUCE_STRUCT | 0.000008 | 0.000008 |
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} | Basic_REDUCE_STRUCT.block_128 | 0.016917 | 0.017113 |
{'name': 'Basic_TRAP_INT', 'type': 'function'} | Basic_TRAP_INT | 0.000007 | 0.000007 |
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} | Basic_TRAP_INT.block_128 | 0.002596 | 0.002601 |
{'name': 'Lcals', 'type': 'function'} | Lcals | 0.000074 | 0.000075 |
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} | Lcals_DIFF_PREDICT | 0.000009 | 0.000009 |
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} | Lcals_DIFF_PREDICT.block_128 | 0.061523 | 0.061192 |
{'name': 'Lcals_EOS', 'type': 'function'} | Lcals_EOS | 0.000010 | 0.000010 |
{'name': 'Lcals_EOS.block_128', 'type': 'function'} | Lcals_EOS.block_128 | 0.022685 | 0.022687 |
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} | Lcals_FIRST_DIFF | 0.000009 | 0.000009 |
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} | Lcals_FIRST_DIFF.block_128 | 0.048455 | 0.048426 |
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} | Lcals_FIRST_MIN | 0.000013 | 0.000015 |
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} | Lcals_FIRST_MIN.block_128 | 0.048713 | 0.058216 |
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} | Lcals_FIRST_SUM | 0.000009 | 0.000009 |
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} | Lcals_FIRST_SUM.block_128 | 0.048516 | 0.048493 |
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} | Lcals_GEN_LIN_RECUR | 0.000009 | 0.000009 |
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} | Lcals_GEN_LIN_RECUR.block_128 | 0.048865 | 0.048918 |
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} | Lcals_HYDRO_1D | 0.000009 | 0.000009 |
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} | Lcals_HYDRO_1D.block_128 | 0.033971 | 0.033970 |
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} | Lcals_HYDRO_2D | 0.000010 | 0.000010 |
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} | Lcals_HYDRO_2D.block_128 | 0.022995 | 0.023016 |
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} | Lcals_INT_PREDICT | 0.000010 | 0.000010 |
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} | Lcals_INT_PREDICT.block_128 | 0.046876 | 0.046896 |
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} | Lcals_PLANCKIAN | 0.000010 | 0.000010 |
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} | Lcals_PLANCKIAN.block_128 | 0.002759 | 0.002759 |
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} | Lcals_TRIDIAG_ELIM | 0.000010 | 0.000010 |
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} | Lcals_TRIDIAG_ELIM.block_128 | 0.044681 | 0.044794 |
{'name': 'Polybench', 'type': 'function'} | Polybench | 0.000086 | 0.000086 |
{'name': 'Polybench_2MM', 'type': 'function'} | Polybench_2MM | 0.000008 | 0.000008 |
{'name': 'Polybench_2MM.block_128', 'type': 'function'} | Polybench_2MM.block_128 | 0.005949 | 0.005949 |
{'name': 'Polybench_3MM', 'type': 'function'} | Polybench_3MM | 0.000008 | 0.000008 |
{'name': 'Polybench_3MM.block_128', 'type': 'function'} | Polybench_3MM.block_128 | 0.008885 | 0.008885 |
{'name': 'Polybench_ADI', 'type': 'function'} | Polybench_ADI | 0.000009 | 0.000009 |
{'name': 'Polybench_ADI.block_128', 'type': 'function'} | Polybench_ADI.block_128 | 0.033592 | 0.033582 |
{'name': 'Polybench_ATAX', 'type': 'function'} | Polybench_ATAX | 0.000009 | 0.000009 |
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} | Polybench_ATAX.block_128 | 0.037403 | 0.037419 |
{'name': 'Polybench_FDTD_2D', 'type': 'function'} | Polybench_FDTD_2D | 0.000009 | 0.000009 |
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} | Polybench_FDTD_2D.block_128 | 0.037109 | 0.037101 |
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} | Polybench_FLOYD_WARSHALL | 0.000008 | 0.000008 |
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} | Polybench_FLOYD_WARSHALL.block_128 | 0.209053 | 0.209057 |
{'name': 'Polybench_GEMM', 'type': 'function'} | Polybench_GEMM | 0.000010 | 0.000010 |
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} | Polybench_GEMM.block_128 | 0.006741 | 0.006742 |
{'name': 'Polybench_GEMVER', 'type': 'function'} | Polybench_GEMVER | 0.000008 | 0.000008 |
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} | Polybench_GEMVER.block_128 | 0.008432 | 0.008436 |
{'name': 'Polybench_GESUMMV', 'type': 'function'} | Polybench_GESUMMV | 0.000009 | 0.000009 |
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} | Polybench_GESUMMV.block_128 | 0.037410 | 0.037365 |
{'name': 'Polybench_HEAT_3D', 'type': 'function'} | Polybench_HEAT_3D | 0.000009 | 0.000009 |
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} | Polybench_HEAT_3D.block_128 | 0.021826 | 0.021828 |
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} | Polybench_JACOBI_1D | 0.000011 | 0.000011 |
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} | Polybench_JACOBI_1D.block_128 | 0.077718 | 0.077743 |
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} | Polybench_JACOBI_2D | 0.000010 | 0.000010 |
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} | Polybench_JACOBI_2D.block_128 | 0.102580 | 0.102596 |
{'name': 'Polybench_MVT', 'type': 'function'} | Polybench_MVT | 0.000009 | 0.000010 |
{'name': 'Polybench_MVT.block_128', 'type': 'function'} | Polybench_MVT.block_128 | 0.035797 | 0.035768 |
{'name': 'Stream', 'type': 'function'} | Stream | 0.000033 | 0.000034 |
{'name': 'Stream_ADD', 'type': 'function'} | Stream_ADD | 0.000009 | 0.000009 |
{'name': 'Stream_ADD.block_128', 'type': 'function'} | Stream_ADD.block_128 | 0.033798 | 0.033840 |
{'name': 'Stream_COPY', 'type': 'function'} | Stream_COPY | 0.000009 | 0.000009 |
{'name': 'Stream_COPY.block_128', 'type': 'function'} | Stream_COPY.block_128 | 0.043261 | 0.043184 |
{'name': 'Stream_DOT', 'type': 'function'} | Stream_DOT | 0.000009 | 0.000009 |
{'name': 'Stream_DOT.block_128', 'type': 'function'} | Stream_DOT.block_128 | 0.113655 | 0.112893 |
{'name': 'Stream_MUL', 'type': 'function'} | Stream_MUL | 0.000011 | 0.000011 |
{'name': 'Stream_MUL.block_128', 'type': 'function'} | Stream_MUL.block_128 | 0.043271 | 0.043180 |
{'name': 'Stream_TRIAD', 'type': 'function'} | Stream_TRIAD | 0.000008 | 0.000008 |
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} | Stream_TRIAD.block_128 | 0.033730 | 0.033717 |
Calculate the percentile of performance data column, append to aggregated statistics table
The percentiles() operation takes one or more columns from the performance data and appends a new column to the aggregated statistics table. Each entry in the new column is a list containing the 25th, 50th, and 75th percentiles of the node's values in the provided column.
Below is an example where we calculate the percentiles of the values in the same Total time (exc) column.
[17]:
tt.percentiles(th_lassen, columns=metrics)
display(HTML(th_lassen.statsframe.dataframe.to_html()))
node | name | Total time (exc)_median | Total time (exc)_mean | Total time (exc)_percentiles |
---|---|---|---|---|
{'name': 'Base_CUDA', 'type': 'function'} | Base_CUDA | 0.000636 | 0.000632 | [0.0006265, 0.000636, 0.000639] |
{'name': 'Algorithm', 'type': 'function'} | Algorithm | 0.000048 | 0.000048 | [4.6500000000000005e-05, 4.8e-05, 4.9e-05] |
{'name': 'Algorithm_MEMCPY', 'type': 'function'} | Algorithm_MEMCPY | 0.000016 | 0.000016 | [1.6e-05, 1.6e-05, 1.6e-05] |
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} | Algorithm_MEMCPY.block_128 | 0.002440 | 0.002442 | [0.0024395, 0.00244, 0.0024435] |
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} | Algorithm_MEMCPY.library | 0.002609 | 0.002609 | [0.0026085, 0.002609, 0.002609] |
{'name': 'Algorithm_MEMSET', 'type': 'function'} | Algorithm_MEMSET | 0.000016 | 0.000015 | [1.4999999999999999e-05, 1.6e-05, 1.6e-05] |
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} | Algorithm_MEMSET.block_128 | 0.001488 | 0.001487 | [0.001486, 0.001488, 0.0014889999999999999] |
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} | Algorithm_MEMSET.library | 0.001281 | 0.001279 | [0.001278, 0.001281, 0.0012815] |
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} | Algorithm_REDUCE_SUM | 0.000019 | 0.000019 | [1.8500000000000002e-05, 1.9e-05, 1.95e-05] |
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} | Algorithm_REDUCE_SUM.block_128 | 0.002683 | 0.002686 | [0.0026804999999999997, 0.002683, 0.0026895] |
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} | Algorithm_REDUCE_SUM.cub | 0.001527 | 0.001526 | [0.0015225, 0.001527, 0.0015305] |
{'name': 'Algorithm_SCAN', 'type': 'function'} | Algorithm_SCAN | 0.000008 | 0.000008 | [7.499999999999999e-06, 8e-06, 8.5e-06] |
{'name': 'Algorithm_SCAN.default', 'type': 'function'} | Algorithm_SCAN.default | 0.002917 | 0.002917 | [0.002909, 0.002917, 0.0029245] |
{'name': 'Apps', 'type': 'function'} | Apps | 0.000086 | 0.000088 | [8.55e-05, 8.6e-05, 8.950000000000001e-05] |
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} | Apps_CONVECTION3DPA | 0.000008 | 0.000008 | [8e-06, 8e-06, 8e-06] |
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} | Apps_CONVECTION3DPA.block_64 | 0.003161 | 0.003161 | [0.0031590000000000003, 0.003161, 0.0031635] |
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} | Apps_DEL_DOT_VEC_2D | 0.000009 | 0.000009 | [9e-06, 9e-06, 9e-06] |
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} | Apps_DEL_DOT_VEC_2D.block_128 | 0.007006 | 0.007019 | [0.0069984999999999995, 0.007006, 0.007033] |
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} | Apps_DIFFUSION3DPA | 0.000010 | 0.000009 | [9e-06, 1e-05, 1e-05] |
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} | Apps_DIFFUSION3DPA.block_64 | 0.004926 | 0.004932 | [0.0049250000000000006, 0.004926, 0.004935999999999999] |
{'name': 'Apps_ENERGY', 'type': 'function'} | Apps_ENERGY | 0.000009 | 0.000009 | [8.5e-06, 9e-06, 9.5e-06] |
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} | Apps_ENERGY.block_128 | 0.039303 | 0.039285 | [0.039274, 0.039303, 0.039305] |
{'name': 'Apps_FIR', 'type': 'function'} | Apps_FIR | 0.000010 | 0.000010 | [9.5e-06, 1e-05, 1e-05] |
{'name': 'Apps_FIR.block_128', 'type': 'function'} | Apps_FIR.block_128 | 0.004157 | 0.004165 | [0.0041554999999999995, 0.004157, 0.004171] |
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} | Apps_HALOEXCHANGE | 0.000009 | 0.000009 | [9e-06, 9e-06, 9e-06] |
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} | Apps_HALOEXCHANGE.block_128 | 0.037478 | 0.037577 | [0.037467, 0.037478, 0.037637000000000004] |
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} | Apps_HALOEXCHANGE_FUSED | 0.000010 | 0.000010 | [1e-05, 1e-05, 1e-05] |
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} | Apps_HALOEXCHANGE_FUSED.block_128 | 0.009430 | 0.009341 | [0.0092775, 0.00943, 0.009448499999999999] |
{'name': 'Apps_LTIMES', 'type': 'function'} | Apps_LTIMES | 0.000008 | 0.000008 | [8e-06, 8e-06, 8.5e-06] |
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} | Apps_LTIMES.block_128 | 0.012088 | 0.012090 | [0.0120835, 0.012088, 0.012095999999999999] |
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} | Apps_LTIMES_NOVIEW | 0.000011 | 0.000011 | [1.0500000000000001e-05, 1.1e-05, 1.1e-05] |
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} | Apps_LTIMES_NOVIEW.block_128 | 0.012085 | 0.012087 | [0.0120825, 0.012085, 0.01209] |
{'name': 'Apps_MASS3DPA', 'type': 'function'} | Apps_MASS3DPA | 0.000008 | 0.000008 | [8e-06, 8e-06, 8e-06] |
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} | Apps_MASS3DPA.block_25 | 0.001904 | 0.001906 | [0.0019030000000000002, 0.001904, 0.001908] |
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} | Apps_NODAL_ACCUMULATION_3D | 0.000009 | 0.000008 | [8e-06, 9e-06, 9e-06] |
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} | Apps_NODAL_ACCUMULATION_3D.block_128 | 0.007408 | 0.007401 | [0.0073785, 0.007408, 0.0074265] |
{'name': 'Apps_PRESSURE', 'type': 'function'} | Apps_PRESSURE | 0.000009 | 0.000009 | [9e-06, 9e-06, 9.5e-06] |
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} | Apps_PRESSURE.block_128 | 0.048251 | 0.048255 | [0.0482315, 0.048251, 0.048277] |
{'name': 'Apps_VOL3D', 'type': 'function'} | Apps_VOL3D | 0.000009 | 0.000009 | [8.5e-06, 9e-06, 9.5e-06] |
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} | Apps_VOL3D.block_128 | 0.005903 | 0.005877 | [0.005823, 0.005903, 0.005944] |
{'name': 'Basic', 'type': 'function'} | Basic | 0.000097 | 0.000098 | [9.65e-05, 9.7e-05, 9.9e-05] |
{'name': 'Basic_DAXPY', 'type': 'function'} | Basic_DAXPY | 0.000011 | 0.000011 | [1.0500000000000001e-05, 1.1e-05, 1.1e-05] |
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} | Basic_DAXPY.block_128 | 0.016924 | 0.016901 | [0.016877000000000003, 0.016924, 0.016936] |
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} | Basic_DAXPY_ATOMIC | 0.000009 | 0.000009 | [9e-06, 9e-06, 9e-06] |
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} | Basic_DAXPY_ATOMIC.block_128 | 0.016947 | 0.016938 | [0.0169195, 0.016947, 0.0169605] |
{'name': 'Basic_IF_QUAD', 'type': 'function'} | Basic_IF_QUAD | 0.000008 | 0.000008 | [8e-06, 8e-06, 8.5e-06] |
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} | Basic_IF_QUAD.block_128 | 0.012622 | 0.012651 | [0.012606, 0.012622, 0.012681] |
{'name': 'Basic_INDEXLIST', 'type': 'function'} | Basic_INDEXLIST | 0.000008 | 0.000009 | [8e-06, 8e-06, 9e-06] |
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} | Basic_INDEXLIST.block_256 | 0.004506 | 0.004434 | [0.004391, 0.004506, 0.004512499999999999] |
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} | Basic_INDEXLIST_3LOOP | 0.000009 | 0.000009 | [9e-06, 9e-06, 9e-06] |
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} | Basic_INDEXLIST_3LOOP.block_256 | 0.008175 | 0.008132 | [0.008101500000000001, 0.008175, 0.0081845] |
{'name': 'Basic_INIT3', 'type': 'function'} | Basic_INIT3 | 0.000009 | 0.000009 | [8.5e-06, 9e-06, 9e-06] |
{'name': 'Basic_INIT3.block_128', 'type': 'function'} | Basic_INIT3.block_128 | 0.028197 | 0.028248 | [0.0281965, 0.028197, 0.0282745] |
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} | Basic_INIT_VIEW1D | 0.000009 | 0.000008 | [8e-06, 9e-06, 9e-06] |
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} | Basic_INIT_VIEW1D.block_128 | 0.041780 | 0.040132 | [0.0391845, 0.04178, 0.041902999999999996] |
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} | Basic_INIT_VIEW1D_OFFSET | 0.000009 | 0.000009 | [9e-06, 9e-06, 9e-06] |
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} | Basic_INIT_VIEW1D_OFFSET.block_128 | 0.041795 | 0.040145 | [0.039199, 0.041795, 0.041916499999999995] |
{'name': 'Basic_MULADDSUB', 'type': 'function'} | Basic_MULADDSUB | 0.000011 | 0.000011 | [1.0500000000000001e-05, 1.1e-05, 1.1e-05] |
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} | Basic_MULADDSUB.block_128 | 0.019746 | 0.019738 | [0.019731, 0.019746, 0.019748500000000002] |
{'name': 'Basic_NESTED_INIT', 'type': 'function'} | Basic_NESTED_INIT | 0.000009 | 0.000009 | [8.5e-06, 9e-06, 9e-06] |
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} | Basic_NESTED_INIT.block_128 | 0.020670 | 0.019845 | [0.019377, 0.02067, 0.020726] |
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} | Basic_PI_ATOMIC | 0.000009 | 0.000009 | [8.5e-06, 9e-06, 9e-06] |
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} | Basic_PI_ATOMIC.block_128 | 0.124113 | 0.120364 | [0.11722250000000001, 0.124113, 0.1253795] |
{'name': 'Basic_PI_REDUCE', 'type': 'function'} | Basic_PI_REDUCE | 0.000008 | 0.000008 | [7.499999999999999e-06, 8e-06, 8e-06] |
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} | Basic_PI_REDUCE.block_128 | 0.002602 | 0.002619 | [0.0025905, 0.002602, 0.0026395000000000004] |
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} | Basic_REDUCE3_INT | 0.000008 | 0.000008 | [8e-06, 8e-06, 8e-06] |
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} | Basic_REDUCE3_INT.block_128 | 0.002380 | 0.002404 | [0.002376, 0.00238, 0.0024200000000000003] |
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} | Basic_REDUCE_STRUCT | 0.000008 | 0.000008 | [8e-06, 8e-06, 8.5e-06] |
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} | Basic_REDUCE_STRUCT.block_128 | 0.016917 | 0.017113 | [0.016814000000000003, 0.016917, 0.0173145] |
{'name': 'Basic_TRAP_INT', 'type': 'function'} | Basic_TRAP_INT | 0.000007 | 0.000007 | [7e-06, 7e-06, 7.499999999999999e-06] |
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} | Basic_TRAP_INT.block_128 | 0.002596 | 0.002601 | [0.0025944999999999996, 0.002596, 0.0026055] |
{'name': 'Lcals', 'type': 'function'} | Lcals | 0.000074 | 0.000075 | [7.25e-05, 7.4e-05, 7.7e-05] |
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} | Lcals_DIFF_PREDICT | 0.000009 | 0.000009 | [9e-06, 9e-06, 9.5e-06] |
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} | Lcals_DIFF_PREDICT.block_128 | 0.061523 | 0.061192 | [0.0610085, 0.061523, 0.0615415] |
{'name': 'Lcals_EOS', 'type': 'function'} | Lcals_EOS | 0.000010 | 0.000010 | [9.5e-06, 1e-05, 1e-05] |
{'name': 'Lcals_EOS.block_128', 'type': 'function'} | Lcals_EOS.block_128 | 0.022685 | 0.022687 | [0.0226795, 0.022685, 0.022692999999999998] |
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} | Lcals_FIRST_DIFF | 0.000009 | 0.000009 | [8.5e-06, 9e-06, 9e-06] |
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} | Lcals_FIRST_DIFF.block_128 | 0.048455 | 0.048426 | [0.0483995, 0.048455, 0.0484675] |
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} | Lcals_FIRST_MIN | 0.000013 | 0.000015 | [1.2499999999999999e-05, 1.3e-05, 1.6e-05] |
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} | Lcals_FIRST_MIN.block_128 | 0.048713 | 0.058216 | [0.0474975, 0.048713, 0.0641825] |
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} | Lcals_FIRST_SUM | 0.000009 | 0.000009 | [9e-06, 9e-06, 9.5e-06] |
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} | Lcals_FIRST_SUM.block_128 | 0.048516 | 0.048493 | [0.048479999999999995, 0.048516, 0.0485175] |
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} | Lcals_GEN_LIN_RECUR | 0.000009 | 0.000009 | [9e-06, 9e-06, 9e-06] |
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} | Lcals_GEN_LIN_RECUR.block_128 | 0.048865 | 0.048918 | [0.0488645, 0.048865, 0.0489445] |
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} | Lcals_HYDRO_1D | 0.000009 | 0.000009 | [9e-06, 9e-06, 9.5e-06] |
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} | Lcals_HYDRO_1D.block_128 | 0.033971 | 0.033970 | [0.033904500000000004, 0.033971, 0.034036] |
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} | Lcals_HYDRO_2D | 0.000010 | 0.000010 | [1e-05, 1e-05, 1.0500000000000001e-05] |
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} | Lcals_HYDRO_2D.block_128 | 0.022995 | 0.023016 | [0.0229875, 0.022995, 0.0230345] |
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} | Lcals_INT_PREDICT | 0.000010 | 0.000010 | [9.5e-06, 1e-05, 1e-05] |
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} | Lcals_INT_PREDICT.block_128 | 0.046876 | 0.046896 | [0.046875, 0.046876, 0.046907500000000005] |
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} | Lcals_PLANCKIAN | 0.000010 | 0.000010 | [1e-05, 1e-05, 1e-05] |
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} | Lcals_PLANCKIAN.block_128 | 0.002759 | 0.002759 | [0.0027565000000000003, 0.002759, 0.002762] |
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} | Lcals_TRIDIAG_ELIM | 0.000010 | 0.000010 | [1e-05, 1e-05, 1e-05] |
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} | Lcals_TRIDIAG_ELIM.block_128 | 0.044681 | 0.044794 | [0.0446265, 0.044681, 0.044905] |
{'name': 'Polybench', 'type': 'function'} | Polybench | 0.000086 | 0.000086 | [8.55e-05, 8.6e-05, 8.7e-05] |
{'name': 'Polybench_2MM', 'type': 'function'} | Polybench_2MM | 0.000008 | 0.000008 | [8e-06, 8e-06, 8.5e-06] |
{'name': 'Polybench_2MM.block_128', 'type': 'function'} | Polybench_2MM.block_128 | 0.005949 | 0.005949 | [0.005937, 0.005949, 0.0059615] |
{'name': 'Polybench_3MM', 'type': 'function'} | Polybench_3MM | 0.000008 | 0.000008 | [8e-06, 8e-06, 8e-06] |
{'name': 'Polybench_3MM.block_128', 'type': 'function'} | Polybench_3MM.block_128 | 0.008885 | 0.008885 | [0.008884, 0.008885, 0.008885500000000001] |
{'name': 'Polybench_ADI', 'type': 'function'} | Polybench_ADI | 0.000009 | 0.000009 | [9e-06, 9e-06, 9.5e-06] |
{'name': 'Polybench_ADI.block_128', 'type': 'function'} | Polybench_ADI.block_128 | 0.033592 | 0.033582 | [0.0335465, 0.033592, 0.033622] |
{'name': 'Polybench_ATAX', 'type': 'function'} | Polybench_ATAX | 0.000009 | 0.000009 | [9e-06, 9e-06, 9e-06] |
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} | Polybench_ATAX.block_128 | 0.037403 | 0.037419 | [0.037385, 0.037403, 0.037445] |
{'name': 'Polybench_FDTD_2D', 'type': 'function'} | Polybench_FDTD_2D | 0.000009 | 0.000009 | [8.5e-06, 9e-06, 9.5e-06] |
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} | Polybench_FDTD_2D.block_128 | 0.037109 | 0.037101 | [0.037089, 0.037109, 0.0371165] |
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} | Polybench_FLOYD_WARSHALL | 0.000008 | 0.000008 | [8e-06, 8e-06, 8.5e-06] |
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} | Polybench_FLOYD_WARSHALL.block_128 | 0.209053 | 0.209057 | [0.208826, 0.209053, 0.2092865] |
{'name': 'Polybench_GEMM', 'type': 'function'} | Polybench_GEMM | 0.000010 | 0.000010 | [1e-05, 1e-05, 1.0500000000000001e-05] |
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} | Polybench_GEMM.block_128 | 0.006741 | 0.006742 | [0.0067405, 0.006741, 0.0067434999999999995] |
{'name': 'Polybench_GEMVER', 'type': 'function'} | Polybench_GEMVER | 0.000008 | 0.000008 | [8e-06, 8e-06, 8.5e-06] |
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} | Polybench_GEMVER.block_128 | 0.008432 | 0.008436 | [0.0084285, 0.008432, 0.008441500000000001] |
{'name': 'Polybench_GESUMMV', 'type': 'function'} | Polybench_GESUMMV | 0.000009 | 0.000009 | [8.5e-06, 9e-06, 9e-06] |
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} | Polybench_GESUMMV.block_128 | 0.037410 | 0.037365 | [0.0372855, 0.03741, 0.037467] |
{'name': 'Polybench_HEAT_3D', 'type': 'function'} | Polybench_HEAT_3D | 0.000009 | 0.000009 | [9e-06, 9e-06, 9.5e-06] |
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} | Polybench_HEAT_3D.block_128 | 0.021826 | 0.021828 | [0.0218135, 0.021826, 0.021842] |
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} | Polybench_JACOBI_1D | 0.000011 | 0.000011 | [9.999999999999999e-06, 1.1e-05, 1.15e-05] |
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} | Polybench_JACOBI_1D.block_128 | 0.077718 | 0.077743 | [0.07769899999999999, 0.077718, 0.077775] |
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} | Polybench_JACOBI_2D | 0.000010 | 0.000010 | [9.5e-06, 1e-05, 1e-05] |
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} | Polybench_JACOBI_2D.block_128 | 0.102580 | 0.102596 | [0.1025345, 0.10258, 0.10265] |
{'name': 'Polybench_MVT', 'type': 'function'} | Polybench_MVT | 0.000009 | 0.000010 | [9e-06, 9e-06, 1.0500000000000001e-05] |
{'name': 'Polybench_MVT.block_128', 'type': 'function'} | Polybench_MVT.block_128 | 0.035797 | 0.035768 | [0.0357435, 0.035797, 0.035806500000000005] |
{'name': 'Stream', 'type': 'function'} | Stream | 0.000033 | 0.000034 | [3.3e-05, 3.3e-05, 3.45e-05] |
{'name': 'Stream_ADD', 'type': 'function'} | Stream_ADD | 0.000009 | 0.000009 | [9e-06, 9e-06, 9e-06] |
{'name': 'Stream_ADD.block_128', 'type': 'function'} | Stream_ADD.block_128 | 0.033798 | 0.033840 | [0.0337565, 0.033798, 0.033902] |
{'name': 'Stream_COPY', 'type': 'function'} | Stream_COPY | 0.000009 | 0.000009 | [8.5e-06, 9e-06, 9e-06] |
{'name': 'Stream_COPY.block_128', 'type': 'function'} | Stream_COPY.block_128 | 0.043261 | 0.043184 | [0.043116, 0.043261, 0.043290999999999996] |
{'name': 'Stream_DOT', 'type': 'function'} | Stream_DOT | 0.000009 | 0.000009 | [9e-06, 9e-06, 9e-06] |
{'name': 'Stream_DOT.block_128', 'type': 'function'} | Stream_DOT.block_128 | 0.113655 | 0.112893 | [0.11216000000000001, 0.113655, 0.11400650000000001] |
{'name': 'Stream_MUL', 'type': 'function'} | Stream_MUL | 0.000011 | 0.000011 | [1.0500000000000001e-05, 1.1e-05, 1.15e-05] |
{'name': 'Stream_MUL.block_128', 'type': 'function'} | Stream_MUL.block_128 | 0.043271 | 0.043180 | [0.043106, 0.043271, 0.0432995] |
{'name': 'Stream_TRIAD', 'type': 'function'} | Stream_TRIAD | 0.000008 | 0.000008 | [8e-06, 8e-06, 8.5e-06] |
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} | Stream_TRIAD.block_128 | 0.033730 | 0.033717 | [0.033696000000000004, 0.03373, 0.0337445] |
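To make the percentiles column easier to interpret, the sketch below reproduces one row by hand with NumPy. The three per-profile values are illustrative: they are not read from the .cali files but chosen to be consistent with the Base_CUDA row above. Using np.percentile with its default linear interpolation is likewise an assumption about how the column is computed, not a statement about Thicket's implementation.

```python
import numpy as np

# Hypothetical exclusive times (seconds) for one node across the three
# profiles in th_lassen. Illustrative values, not read from the profiles.
base_cuda_times = np.array([0.000617, 0.000636, 0.000642])

# 25th, 50th, and 75th percentiles with NumPy's default linear interpolation.
quartiles = np.percentile(base_cuda_times, [25, 50, 75])

print("percentiles:", quartiles)  # approximately [0.0006265, 0.000636, 0.000639]
print("median:", np.median(base_cuda_times))
print("mean:", float(np.mean(base_cuda_times)))
```

Under that assumption, with only three profiles per node the 25th and 75th percentiles fall halfway between the median and the smallest and largest value, respectively. A 75th percentile far above the median (as in the Lcals_FIRST_MIN.block_128 row) therefore suggests one run that was noticeably slower than the other two.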
View aggregated statistics call tree
[18]:
print(th_lassen.statsframe.tree(metric_column="Total time (exc)_median"))
__ __ __ __
/ /_ ____ _/ /______/ /_ ___ / /_
/ __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
/ / / / /_/ / /_/ /__/ / / / __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/ v2023.1.0
0.001 Base_CUDA
├─ 0.000 Algorithm
│ ├─ 0.000 Algorithm_MEMCPY
│ │ ├─ 0.002 Algorithm_MEMCPY.block_128
│ │ └─ 0.003 Algorithm_MEMCPY.library
│ ├─ 0.000 Algorithm_MEMSET
│ │ ├─ 0.001 Algorithm_MEMSET.block_128
│ │ └─ 0.001 Algorithm_MEMSET.library
│ ├─ 0.000 Algorithm_REDUCE_SUM
│ │ ├─ 0.003 Algorithm_REDUCE_SUM.block_128
│ │ └─ 0.002 Algorithm_REDUCE_SUM.cub
│ └─ 0.000 Algorithm_SCAN
│ └─ 0.003 Algorithm_SCAN.default
├─ 0.000 Apps
│ ├─ 0.000 Apps_CONVECTION3DPA
│ │ └─ 0.003 Apps_CONVECTION3DPA.block_64
│ ├─ 0.000 Apps_DEL_DOT_VEC_2D
│ │ └─ 0.007 Apps_DEL_DOT_VEC_2D.block_128
│ ├─ 0.000 Apps_DIFFUSION3DPA
│ │ └─ 0.005 Apps_DIFFUSION3DPA.block_64
│ ├─ 0.000 Apps_ENERGY
│ │ └─ 0.039 Apps_ENERGY.block_128
│ ├─ 0.000 Apps_FIR
│ │ └─ 0.004 Apps_FIR.block_128
│ ├─ 0.000 Apps_HALOEXCHANGE
│ │ └─ 0.037 Apps_HALOEXCHANGE.block_128
│ ├─ 0.000 Apps_HALOEXCHANGE_FUSED
│ │ └─ 0.009 Apps_HALOEXCHANGE_FUSED.block_128
│ ├─ 0.000 Apps_LTIMES
│ │ └─ 0.012 Apps_LTIMES.block_128
│ ├─ 0.000 Apps_LTIMES_NOVIEW
│ │ └─ 0.012 Apps_LTIMES_NOVIEW.block_128
│ ├─ 0.000 Apps_MASS3DPA
│ │ └─ 0.002 Apps_MASS3DPA.block_25
│ ├─ 0.000 Apps_NODAL_ACCUMULATION_3D
│ │ └─ 0.007 Apps_NODAL_ACCUMULATION_3D.block_128
│ ├─ 0.000 Apps_PRESSURE
│ │ └─ 0.048 Apps_PRESSURE.block_128
│ └─ 0.000 Apps_VOL3D
│ └─ 0.006 Apps_VOL3D.block_128
├─ 0.000 Basic
│ ├─ 0.000 Basic_DAXPY
│ │ └─ 0.017 Basic_DAXPY.block_128
│ ├─ 0.000 Basic_DAXPY_ATOMIC
│ │ └─ 0.017 Basic_DAXPY_ATOMIC.block_128
│ ├─ 0.000 Basic_IF_QUAD
│ │ └─ 0.013 Basic_IF_QUAD.block_128
│ ├─ 0.000 Basic_INDEXLIST
│ │ └─ 0.005 Basic_INDEXLIST.block_256
│ ├─ 0.000 Basic_INDEXLIST_3LOOP
│ │ └─ 0.008 Basic_INDEXLIST_3LOOP.block_256
│ ├─ 0.000 Basic_INIT3
│ │ └─ 0.028 Basic_INIT3.block_128
│ ├─ 0.000 Basic_INIT_VIEW1D
│ │ └─ 0.042 Basic_INIT_VIEW1D.block_128
│ ├─ 0.000 Basic_INIT_VIEW1D_OFFSET
│ │ └─ 0.042 Basic_INIT_VIEW1D_OFFSET.block_128
│ ├─ 0.000 Basic_MULADDSUB
│ │ └─ 0.020 Basic_MULADDSUB.block_128
│ ├─ 0.000 Basic_NESTED_INIT
│ │ └─ 0.021 Basic_NESTED_INIT.block_128
│ ├─ 0.000 Basic_PI_ATOMIC
│ │ └─ 0.124 Basic_PI_ATOMIC.block_128
│ ├─ 0.000 Basic_PI_REDUCE
│ │ └─ 0.003 Basic_PI_REDUCE.block_128
│ ├─ 0.000 Basic_REDUCE3_INT
│ │ └─ 0.002 Basic_REDUCE3_INT.block_128
│ ├─ 0.000 Basic_REDUCE_STRUCT
│ │ └─ 0.017 Basic_REDUCE_STRUCT.block_128
│ └─ 0.000 Basic_TRAP_INT
│ └─ 0.003 Basic_TRAP_INT.block_128
├─ 0.000 Lcals
│ ├─ 0.000 Lcals_DIFF_PREDICT
│ │ └─ 0.062 Lcals_DIFF_PREDICT.block_128
│ ├─ 0.000 Lcals_EOS
│ │ └─ 0.023 Lcals_EOS.block_128
│ ├─ 0.000 Lcals_FIRST_DIFF
│ │ └─ 0.048 Lcals_FIRST_DIFF.block_128
│ ├─ 0.000 Lcals_FIRST_MIN
│ │ └─ 0.049 Lcals_FIRST_MIN.block_128
│ ├─ 0.000 Lcals_FIRST_SUM
│ │ └─ 0.049 Lcals_FIRST_SUM.block_128
│ ├─ 0.000 Lcals_GEN_LIN_RECUR
│ │ └─ 0.049 Lcals_GEN_LIN_RECUR.block_128
│ ├─ 0.000 Lcals_HYDRO_1D
│ │ └─ 0.034 Lcals_HYDRO_1D.block_128
│ ├─ 0.000 Lcals_HYDRO_2D
│ │ └─ 0.023 Lcals_HYDRO_2D.block_128
│ ├─ 0.000 Lcals_INT_PREDICT
│ │ └─ 0.047 Lcals_INT_PREDICT.block_128
│ ├─ 0.000 Lcals_PLANCKIAN
│ │ └─ 0.003 Lcals_PLANCKIAN.block_128
│ └─ 0.000 Lcals_TRIDIAG_ELIM
│ └─ 0.045 Lcals_TRIDIAG_ELIM.block_128
├─ 0.000 Polybench
│ ├─ 0.000 Polybench_2MM
│ │ └─ 0.006 Polybench_2MM.block_128
│ ├─ 0.000 Polybench_3MM
│ │ └─ 0.009 Polybench_3MM.block_128
│ ├─ 0.000 Polybench_ADI
│ │ └─ 0.034 Polybench_ADI.block_128
│ ├─ 0.000 Polybench_ATAX
│ │ └─ 0.037 Polybench_ATAX.block_128
│ ├─ 0.000 Polybench_FDTD_2D
│ │ └─ 0.037 Polybench_FDTD_2D.block_128
│ ├─ 0.000 Polybench_FLOYD_WARSHALL
│ │ └─ 0.209 Polybench_FLOYD_WARSHALL.block_128
│ ├─ 0.000 Polybench_GEMM
│ │ └─ 0.007 Polybench_GEMM.block_128
│ ├─ 0.000 Polybench_GEMVER
│ │ └─ 0.008 Polybench_GEMVER.block_128
│ ├─ 0.000 Polybench_GESUMMV
│ │ └─ 0.037 Polybench_GESUMMV.block_128
│ ├─ 0.000 Polybench_HEAT_3D
│ │ └─ 0.022 Polybench_HEAT_3D.block_128
│ ├─ 0.000 Polybench_JACOBI_1D
│ │ └─ 0.078 Polybench_JACOBI_1D.block_128
│ ├─ 0.000 Polybench_JACOBI_2D
│ │ └─ 0.103 Polybench_JACOBI_2D.block_128
│ └─ 0.000 Polybench_MVT
│ └─ 0.036 Polybench_MVT.block_128
└─ 0.000 Stream
├─ 0.000 Stream_ADD
│ └─ 0.034 Stream_ADD.block_128
├─ 0.000 Stream_COPY
│ └─ 0.043 Stream_COPY.block_128
├─ 0.000 Stream_DOT
│ └─ 0.114 Stream_DOT.block_128
├─ 0.000 Stream_MUL
│ └─ 0.043 Stream_MUL.block_128
└─ 0.000 Stream_TRIAD
└─ 0.034 Stream_TRIAD.block_128
Legend (Metric: Total time (exc)_median Min: 0.00 Max: 0.21)
█ 0.19 - 0.21
█ 0.15 - 0.19
█ 0.10 - 0.15
█ 0.06 - 0.10
█ 0.02 - 0.06
█ 0.00 - 0.02
name User code ◀ Only in left graph ▶ Only in right graph
Use the Query Language
Thicket’s query language lets users select specific nodes based on the call tree of the thicket. Applying a query returns a thicket whose performance data is updated to reflect the selected nodes.
Initial call tree:
[19]:
print(th_lassen.statsframe.tree("Total time (exc)_median"))
__ __ __ __
/ /_ ____ _/ /______/ /_ ___ / /_
/ __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
/ / / / /_/ / /_/ /__/ / / / __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/ v2023.1.0
0.001 Base_CUDA
├─ 0.000 Algorithm
│ ├─ 0.000 Algorithm_MEMCPY
│ │ ├─ 0.002 Algorithm_MEMCPY.block_128
│ │ └─ 0.003 Algorithm_MEMCPY.library
│ ├─ 0.000 Algorithm_MEMSET
│ │ ├─ 0.001 Algorithm_MEMSET.block_128
│ │ └─ 0.001 Algorithm_MEMSET.library
│ ├─ 0.000 Algorithm_REDUCE_SUM
│ │ ├─ 0.003 Algorithm_REDUCE_SUM.block_128
│ │ └─ 0.002 Algorithm_REDUCE_SUM.cub
│ └─ 0.000 Algorithm_SCAN
│ └─ 0.003 Algorithm_SCAN.default
├─ 0.000 Apps
│ ├─ 0.000 Apps_CONVECTION3DPA
│ │ └─ 0.003 Apps_CONVECTION3DPA.block_64
│ ├─ 0.000 Apps_DEL_DOT_VEC_2D
│ │ └─ 0.007 Apps_DEL_DOT_VEC_2D.block_128
│ ├─ 0.000 Apps_DIFFUSION3DPA
│ │ └─ 0.005 Apps_DIFFUSION3DPA.block_64
│ ├─ 0.000 Apps_ENERGY
│ │ └─ 0.039 Apps_ENERGY.block_128
│ ├─ 0.000 Apps_FIR
│ │ └─ 0.004 Apps_FIR.block_128
│ ├─ 0.000 Apps_HALOEXCHANGE
│ │ └─ 0.037 Apps_HALOEXCHANGE.block_128
│ ├─ 0.000 Apps_HALOEXCHANGE_FUSED
│ │ └─ 0.009 Apps_HALOEXCHANGE_FUSED.block_128
│ ├─ 0.000 Apps_LTIMES
│ │ └─ 0.012 Apps_LTIMES.block_128
│ ├─ 0.000 Apps_LTIMES_NOVIEW
│ │ └─ 0.012 Apps_LTIMES_NOVIEW.block_128
│ ├─ 0.000 Apps_MASS3DPA
│ │ └─ 0.002 Apps_MASS3DPA.block_25
│ ├─ 0.000 Apps_NODAL_ACCUMULATION_3D
│ │ └─ 0.007 Apps_NODAL_ACCUMULATION_3D.block_128
│ ├─ 0.000 Apps_PRESSURE
│ │ └─ 0.048 Apps_PRESSURE.block_128
│ └─ 0.000 Apps_VOL3D
│ └─ 0.006 Apps_VOL3D.block_128
├─ 0.000 Basic
│ ├─ 0.000 Basic_DAXPY
│ │ └─ 0.017 Basic_DAXPY.block_128
│ ├─ 0.000 Basic_DAXPY_ATOMIC
│ │ └─ 0.017 Basic_DAXPY_ATOMIC.block_128
│ ├─ 0.000 Basic_IF_QUAD
│ │ └─ 0.013 Basic_IF_QUAD.block_128
│ ├─ 0.000 Basic_INDEXLIST
│ │ └─ 0.005 Basic_INDEXLIST.block_256
│ ├─ 0.000 Basic_INDEXLIST_3LOOP
│ │ └─ 0.008 Basic_INDEXLIST_3LOOP.block_256
│ ├─ 0.000 Basic_INIT3
│ │ └─ 0.028 Basic_INIT3.block_128
│ ├─ 0.000 Basic_INIT_VIEW1D
│ │ └─ 0.042 Basic_INIT_VIEW1D.block_128
│ ├─ 0.000 Basic_INIT_VIEW1D_OFFSET
│ │ └─ 0.042 Basic_INIT_VIEW1D_OFFSET.block_128
│ ├─ 0.000 Basic_MULADDSUB
│ │ └─ 0.020 Basic_MULADDSUB.block_128
│ ├─ 0.000 Basic_NESTED_INIT
│ │ └─ 0.021 Basic_NESTED_INIT.block_128
│ ├─ 0.000 Basic_PI_ATOMIC
│ │ └─ 0.124 Basic_PI_ATOMIC.block_128
│ ├─ 0.000 Basic_PI_REDUCE
│ │ └─ 0.003 Basic_PI_REDUCE.block_128
│ ├─ 0.000 Basic_REDUCE3_INT
│ │ └─ 0.002 Basic_REDUCE3_INT.block_128
│ ├─ 0.000 Basic_REDUCE_STRUCT
│ │ └─ 0.017 Basic_REDUCE_STRUCT.block_128
│ └─ 0.000 Basic_TRAP_INT
│ └─ 0.003 Basic_TRAP_INT.block_128
├─ 0.000 Lcals
│ ├─ 0.000 Lcals_DIFF_PREDICT
│ │ └─ 0.062 Lcals_DIFF_PREDICT.block_128
│ ├─ 0.000 Lcals_EOS
│ │ └─ 0.023 Lcals_EOS.block_128
│ ├─ 0.000 Lcals_FIRST_DIFF
│ │ └─ 0.048 Lcals_FIRST_DIFF.block_128
│ ├─ 0.000 Lcals_FIRST_MIN
│ │ └─ 0.049 Lcals_FIRST_MIN.block_128
│ ├─ 0.000 Lcals_FIRST_SUM
│ │ └─ 0.049 Lcals_FIRST_SUM.block_128
│ ├─ 0.000 Lcals_GEN_LIN_RECUR
│ │ └─ 0.049 Lcals_GEN_LIN_RECUR.block_128
│ ├─ 0.000 Lcals_HYDRO_1D
│ │ └─ 0.034 Lcals_HYDRO_1D.block_128
│ ├─ 0.000 Lcals_HYDRO_2D
│ │ └─ 0.023 Lcals_HYDRO_2D.block_128
│ ├─ 0.000 Lcals_INT_PREDICT
│ │ └─ 0.047 Lcals_INT_PREDICT.block_128
│ ├─ 0.000 Lcals_PLANCKIAN
│ │ └─ 0.003 Lcals_PLANCKIAN.block_128
│ └─ 0.000 Lcals_TRIDIAG_ELIM
│ └─ 0.045 Lcals_TRIDIAG_ELIM.block_128
├─ 0.000 Polybench
│ ├─ 0.000 Polybench_2MM
│ │ └─ 0.006 Polybench_2MM.block_128
│ ├─ 0.000 Polybench_3MM
│ │ └─ 0.009 Polybench_3MM.block_128
│ ├─ 0.000 Polybench_ADI
│ │ └─ 0.034 Polybench_ADI.block_128
│ ├─ 0.000 Polybench_ATAX
│ │ └─ 0.037 Polybench_ATAX.block_128
│ ├─ 0.000 Polybench_FDTD_2D
│ │ └─ 0.037 Polybench_FDTD_2D.block_128
│ ├─ 0.000 Polybench_FLOYD_WARSHALL
│ │ └─ 0.209 Polybench_FLOYD_WARSHALL.block_128
│ ├─ 0.000 Polybench_GEMM
│ │ └─ 0.007 Polybench_GEMM.block_128
│ ├─ 0.000 Polybench_GEMVER
│ │ └─ 0.008 Polybench_GEMVER.block_128
│ ├─ 0.000 Polybench_GESUMMV
│ │ └─ 0.037 Polybench_GESUMMV.block_128
│ ├─ 0.000 Polybench_HEAT_3D
│ │ └─ 0.022 Polybench_HEAT_3D.block_128
│ ├─ 0.000 Polybench_JACOBI_1D
│ │ └─ 0.078 Polybench_JACOBI_1D.block_128
│ ├─ 0.000 Polybench_JACOBI_2D
│ │ └─ 0.103 Polybench_JACOBI_2D.block_128
│ └─ 0.000 Polybench_MVT
│ └─ 0.036 Polybench_MVT.block_128
└─ 0.000 Stream
├─ 0.000 Stream_ADD
│ └─ 0.034 Stream_ADD.block_128
├─ 0.000 Stream_COPY
│ └─ 0.043 Stream_COPY.block_128
├─ 0.000 Stream_DOT
│ └─ 0.114 Stream_DOT.block_128
├─ 0.000 Stream_MUL
│ └─ 0.043 Stream_MUL.block_128
└─ 0.000 Stream_TRIAD
└─ 0.034 Stream_TRIAD.block_128
Legend (Metric: Total time (exc)_median Min: 0.00 Max: 0.21)
█ 0.19 - 0.21
█ 0.15 - 0.19
█ 0.10 - 0.15
█ 0.06 - 0.10
█ 0.02 - 0.06
█ 0.00 - 0.02
name User code ◀ Only in left graph ▶ Only in right graph
Example 1
In the example below, we use a thicket query that only select the parents and children nodes of Algorithm
, maintaining the structure of the call tree.
NOTE: With the newest version of Hatchet, “old-style” queries (i.e., queries built with QueryMatcher) generate a DeprecationWarning. In the future, queries will be defined with Query.
[20]:
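alg_query_ex1 = (
    ht.QueryMatcher()
    # "*": zero or more nodes of any kind
    .match("*")
    # ".": exactly one node whose name starts with "Algorithm" in every profile
    .rel(
        ".",
        lambda row: row["name"].apply(
            lambda x: re.match(r"Algorithm.*", x) is not None
        ).all()
    )
)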
# applying the query on the lassen thicket
th_algorithm_ex1 = th_lassen.query(alg_query_ex1)
tt.median(th_algorithm_ex1, columns=["Total time (exc)"])
print(th_algorithm_ex1.statsframe.tree("Total time (exc)_median"))
/var/folders/g4/9_n9kks14t17ldr01p_rgsw00000gn/T/ipykernel_29938/3034768233.py:2: DeprecationWarning: Old-style queries are deprecated and will be removed in the future. Please use new-style queries instead. For QueryMatcher, the equivalent new-style queries are hatchet.query.Query for base-syntax queries and hatchet.query.ObjectQuery for the object-dialect.
ht.QueryMatcher()
    __          __       __         __
   / /_  ____ _/ /______/ /_  ___  / /_
  / __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
 / / / / /_/ / /_/ /__/ / / /  __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/  v2023.1.0
0.001 Base_CUDA
└─ 0.000 Algorithm
   ├─ 0.000 Algorithm_MEMCPY
   │  ├─ 0.002 Algorithm_MEMCPY.block_128
   │  └─ 0.003 Algorithm_MEMCPY.library
   ├─ 0.000 Algorithm_MEMSET
   │  ├─ 0.001 Algorithm_MEMSET.block_128
   │  └─ 0.001 Algorithm_MEMSET.library
   ├─ 0.000 Algorithm_REDUCE_SUM
   │  ├─ 0.003 Algorithm_REDUCE_SUM.block_128
   │  └─ 0.002 Algorithm_REDUCE_SUM.cub
   └─ 0.000 Algorithm_SCAN
      └─ 0.003 Algorithm_SCAN.default
Legend (Metric: Total time (exc)_median Min: 0.00 Max: 0.00)
█ 0.00 - 0.00
█ 0.00 - 0.00
█ 0.00 - 0.00
█ 0.00 - 0.00
█ 0.00 - 0.00
█ 0.00 - 0.00
name User code ◀ Only in left graph ▶ Only in right graph
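The predicate passed to the query can be read in isolation. The sketch below is a plain-Python analogue, not Thicket or Hatchet code: it assumes that the predicate sees one name per profile for a given node, and it uses hypothetical helper names (matches_algorithm, node_passes) and hypothetical three-profile inputs. It shows that re.match anchors at the start of the string and that .all() requires the match to hold in every profile.

```python
import re


def matches_algorithm(name):
    """Return True if the name starts with 'Algorithm' (re.match anchors at the start)."""
    return re.match(r"Algorithm.*", name) is not None


def node_passes(names_across_profiles):
    """Plain-Python analogue of row["name"].apply(...).all() for one node."""
    return all(matches_algorithm(name) for name in names_across_profiles)


# Hypothetical inputs: the same node as it might appear in three profiles.
algorithm_node = ["Algorithm_MEMCPY"] * 3
stream_node = ["Stream_ADD"] * 3

print(node_passes(algorithm_node))  # the name matches in every profile
print(node_passes(stream_node))     # the name never matches
```

A name such as "Basic_Algorithm" (hypothetical) would not pass, because the pattern must match from the first character.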
Example 2
In the example below, we use a thicket query that selects only the Algorithm node and its descendants, maintaining the structure of the call tree.
NOTE: With the newest version of Hatchet, “old-style” queries (i.e., queries built with QueryMatcher) generate a DeprecationWarning. In the future, queries will be defined with Query.
[21]:
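alg_query_ex2 = (
    ht.QueryMatcher()
    # ".": exactly one node whose name starts with "Algorithm" in every profile
    .match(
        ".",
        lambda row: row["name"].apply(
            lambda x: re.match(r"Algorithm.*", x) is not None
        ).all()
    )
    # "*": zero or more nodes of any kind below it
    .rel("*")
)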
# applying the second query on the lassen thicket
th_algorithm_ex2 = th_lassen.query(alg_query_ex2)
tt.median(th_algorithm_ex2, columns=["Total time (exc)"])
print(th_algorithm_ex2.statsframe.tree("Total time (exc)_median"))
    __          __       __         __
   / /_  ____ _/ /______/ /_  ___  / /_
  / __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
 / / / / /_/ / /_/ /__/ / / /  __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/  v2023.1.0
0.000 Algorithm
├─ 0.000 Algorithm_MEMCPY
│  ├─ 0.002 Algorithm_MEMCPY.block_128
│  └─ 0.003 Algorithm_MEMCPY.library
├─ 0.000 Algorithm_MEMSET
│  ├─ 0.001 Algorithm_MEMSET.block_128
│  └─ 0.001 Algorithm_MEMSET.library
├─ 0.000 Algorithm_REDUCE_SUM
│  ├─ 0.003 Algorithm_REDUCE_SUM.block_128
│  └─ 0.002 Algorithm_REDUCE_SUM.cub
└─ 0.000 Algorithm_SCAN
   └─ 0.003 Algorithm_SCAN.default
Legend (Metric: Total time (exc)_median Min: 0.00 Max: 0.00)
█ 0.00 - 0.00
█ 0.00 - 0.00
█ 0.00 - 0.00
█ 0.00 - 0.00
█ 0.00 - 0.00
█ 0.00 - 0.00
name User code ◀ Only in left graph ▶ Only in right graph
/var/folders/g4/9_n9kks14t17ldr01p_rgsw00000gn/T/ipykernel_29938/3679505811.py:2: DeprecationWarning: Old-style queries are deprecated and will be removed in the future. Please use new-style queries instead. For QueryMatcher, the equivalent new-style queries are hatchet.query.Query for base-syntax queries and hatchet.query.ObjectQuery for the object-dialect.
ht.QueryMatcher()
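Examples 1 and 2 differ only in the order of the "*" and "." steps, yet Example 1 keeps Base_CUDA and Example 2 does not. The sketch below is a toy model of path matching, not Hatchet's implementation: the tree is a hand-written subset of the call tree, and the helpers (downward_paths, path_matches, select) are hypothetical. It assumes that "." matches exactly one node, "*" matches zero or more nodes, and every node on a matched path is kept.

```python
import re

# Toy call tree (parent -> children); a hand-picked subset of the names shown above.
TREE = {
    "Base_CUDA": ["Algorithm", "Stream"],
    "Algorithm": ["Algorithm_MEMCPY", "Algorithm_SCAN"],
    "Algorithm_MEMCPY": ["Algorithm_MEMCPY.block_128"],
    "Algorithm_MEMCPY.block_128": [],
    "Algorithm_SCAN": [],
    "Stream": ["Stream_ADD"],
    "Stream_ADD": [],
}


def is_algorithm(name):
    return re.match(r"Algorithm.*", name) is not None


def any_node(name):
    return True


def downward_paths(tree):
    """Yield every contiguous parent-to-descendant path, starting from any node."""
    def walk(path):
        yield path
        for child in tree[path[-1]]:
            yield from walk(path + [child])

    for start in tree:
        yield from walk([start])


def path_matches(path, pattern):
    """Return True if the whole path matches the list of (quantifier, predicate) steps."""
    if not pattern:
        return not path
    quantifier, predicate = pattern[0]
    if quantifier == ".":
        # Exactly one node must satisfy the predicate.
        return bool(path) and predicate(path[0]) and path_matches(path[1:], pattern[1:])
    # "*": either consume no node, or consume one matching node and stay on this step.
    if path_matches(path, pattern[1:]):
        return True
    return bool(path) and predicate(path[0]) and path_matches(path[1:], pattern)


def select(tree, pattern):
    """Collect every node that lies on at least one matching path."""
    kept = set()
    for path in downward_paths(tree):
        if path_matches(path, pattern):
            kept.update(path)
    return kept


ex1 = select(TREE, [("*", any_node), (".", is_algorithm)])  # shape of Example 1
ex2 = select(TREE, [(".", is_algorithm), ("*", any_node)])  # shape of Example 2
print(sorted(ex1 - ex2))
```

In this toy model the two selections differ only by Base_CUDA: a path may pass through it on the way to an Algorithm node (Example 1), but no path that starts at an Algorithm node can contain it (Example 2).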
Display histogram
The display_histogram() function allows users to select a node and metric value (a column in the performance data table) for which a histogram is generated.
Some available keyword arguments are the following:
* height: height (in inches) of each facet.
* aspect: aspect ratio of each facet; aspect * height gives the width of each facet.
* bins: generic bin parameter; it can set the number of bins.
* binwidth: width of each bin; overrides bins but can be used with binrange.
* binrange: lowest and highest value for bin edges; can be used with either bins or binwidth. Defaults to the data extremes.
* color: color of the bars.
An exhaustive list of available arguments can be found in the seaborn documentation for displot.
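The interplay between bins, binwidth, and binrange described above can be sketched in plain Python. This is a simplified stand-in, not the plotting library's code: bin_edges and facet_width are hypothetical helpers and times holds hypothetical metric values. It only illustrates that binwidth takes precedence over bins, that binrange defaults to the data extremes, and that the facet width is aspect * height.

```python
import math


def bin_edges(values, bins=10, binwidth=None, binrange=None):
    """Return histogram bin edges under the rules described in the list above."""
    # binrange defaults to the extremes of the data.
    start, stop = binrange if binrange is not None else (min(values), max(values))
    if binwidth is not None:
        # binwidth overrides bins: use as many fixed-width bins as needed.
        count = max(1, math.ceil((stop - start) / binwidth))
        step = binwidth
    else:
        count = bins
        step = (stop - start) / bins
    return [start + i * step for i in range(count + 1)]


def facet_width(height, aspect):
    """Width of a facet in inches."""
    return aspect * height


times = [0.1, 0.2, 0.4, 0.9]  # hypothetical metric values

print(bin_edges(times, bins=4, binrange=(0.0, 1.0)))
print(bin_edges(times, bins=10, binwidth=0.5, binrange=(0.0, 1.0)))
print(facet_width(height=5, aspect=1.5))
```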
[22]:
n = pd.unique(th_algorithm_ex1.dataframe.reset_index()["node"])[0]
[23]:
tt.display_histogram(th_algorithm_ex1, node=n, column="Total time (exc)")
[23]:
<seaborn.axisgrid.FacetGrid at 0x7f932b0ab190>

Display heatmap
The display_heatmap() function allows users to select column(s) from the performance data table, for which a heatmap is generated based on the values of the column.
Some available keyword arguments are the following:
* vmax: maximum value to anchor the color map.
* vmin: minimum value to anchor the color map.
* linecolor: color of the lines that will divide each cell.
* linewidths: width of the lines that will divide each cell.
An exhaustive list of available arguments can be found in the seaborn documentation for heatmap.
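To make "anchoring" the color map concrete, the sketch below maps values onto a 0 to 1 color scale with a simple linear normalization. It is a simplified model under stated assumptions, not the plotting library's code: normalize is a hypothetical helper, the chosen vmin and vmax are arbitrary, and the three medians are copied from the Example 1 tree output. Values at or beyond vmax saturate at the top of the scale, and values at or below vmin at the bottom.

```python
def normalize(value, vmin, vmax):
    """Map value linearly onto [0, 1], clamping anything outside [vmin, vmax]."""
    if vmax <= vmin:
        raise ValueError("vmax must be greater than vmin")
    fraction = (value - vmin) / (vmax - vmin)
    return min(1.0, max(0.0, fraction))


# Median values taken from the Example 1 tree output.
medians = {
    "Algorithm_MEMSET.block_128": 0.001,
    "Algorithm_MEMCPY.block_128": 0.002,
    "Algorithm_MEMCPY.library": 0.003,
}

# Arbitrary anchors chosen for illustration.
vmin, vmax = 0.0, 0.002
positions = {name: normalize(value, vmin, vmax) for name, value in medians.items()}
print(positions)
```

Lowering vmax below the largest value, as here, trades detail at the top of the range for more contrast among the smaller values.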
[24]:
th_algorithm_ex1.dataframe.columns
[24]:
Index(['nid', 'spot.channel', 'Min time/rank', 'Max time/rank',
'Avg time/rank', 'Total time', 'Min time/rank (exc)',
'Max time/rank (exc)', 'Avg time/rank (exc)', 'Total time (exc)',
'name', 'Avg time/rank (inc)', 'Total time (inc)',
'Min time/rank (inc)', 'Max time/rank (inc)'],
dtype='object')
[25]:
plt.figure(figsize=(30,30))
metrics = ["Total time (exc)_median"]
tt.display_heatmap(th_algorithm_ex1, columns=metrics)
[25]:
<AxesSubplot: ylabel='node'>
