Basic Thicket Tutorial: Thicket 101

Thicket is a Python-based toolkit for Exploratory Data Analysis (EDA) of parallel performance data. It supports performance optimization and helps users understand how applications perform on supercomputers. It bridges the gap between tools that can consider only a single instance of a simulation run (e.g., single platform, single measurement tool, or single scale) and the need to find actionable insights in multi-dimensional, multi-scale, multi-architecture, and multi-tool performance datasets.

NOTE: An interactive version of this notebook is available in the Binder environment.


1. Import Necessary Packages

To explore the structure and capabilities of Thicket’s components, we begin by importing the necessary packages. These include Python extensions and Thicket’s statistical functions.

[1]:
import re

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from IPython.display import display
from IPython.display import HTML
import hatchet as ht

import thicket as tt

display(HTML("<style>.container { width:80% !important; }</style>"))

2. Read in Performance Profiles

For this notebook, we select profiles generated on Lassen, a Lawrence Livermore National Laboratory (LLNL) machine. We create two thicket objects: th_lassen, built from three profiles that share a problem size of 1048576, and th_obj, built from those three profiles plus one profile with a problem size of 4194304.

[2]:
# three profiles with the same problem size (1048576)
lassen1 = [f"../data/lassen/XL_BaseCuda_01048576_0{x}.cali" for x in range(1, 4)]
# one profile with a larger problem size (4194304)
lassen2 = ["../data/lassen/XL_BaseCuda_04194304_01.cali"]

# generate thicket(s)
th_lassen = tt.Thicket.from_caliperreader(lassen1)
th_obj = tt.Thicket.from_caliperreader(lassen1 + lassen2)
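
The path lists above are ordinary Python strings, so they can be inspected before being passed to a reader. The following is a minimal sketch using only the standard library; missing_profiles is a hypothetical helper introduced for illustration and is not part of Thicket.

```python
from pathlib import Path

# Same path patterns as in the cell above.
lassen1 = [f"../data/lassen/XL_BaseCuda_01048576_0{x}.cali" for x in range(1, 4)]
lassen2 = ["../data/lassen/XL_BaseCuda_04194304_01.cali"]


def missing_profiles(paths):
    """Return the paths that do not point to an existing file (hypothetical helper)."""
    return [p for p in paths if not Path(p).is_file()]


# Show the expanded file names and any that are absent on this machine.
print(lassen1)
print(missing_profiles(lassen1 + lassen2))
```

An empty list from missing_profiles suggests that every profile is present relative to the current working directory.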

3. More Information on a Function

You can use Python’s built-in help() function to view the documentation for a given object by calling help(object). The output shows the function’s arguments and, where documented, what it returns. An example is below.

[3]:
help(tt.median)
Help on function median in module thicket.stats.median:

median(thicket, columns=None)
    Calculate the median for each node in the performance data table.

    Designed to take in a thicket, and append one or more columns to the
    aggregated statistics table for the median calculation for each node.

    Arguments:
        thicket (thicket): Thicket object
        columns (list): List of hardware/timing metrics to perform median calculation
            on. Note, if using a columnar joined thicket a list of tuples must be passed
            in with the format (column index, column name).
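
To make "the median for each node" concrete, here is a plain-pandas sketch of the same idea on a toy table. It is not Thicket's implementation: the node labels and profile numbers are simplified stand-ins (assumptions for illustration), and the timing values are copied from the performance data table in Section 4.1.

```python
import pandas as pd

# Toy performance data: two nodes, three profiles each.
# Node names and profile numbers are simplified placeholders.
index = pd.MultiIndex.from_product(
    [["Base_CUDA", "Algorithm"], [1, 2, 3]], names=["node", "profile"]
)
perf = pd.DataFrame(
    {"Min time/rank": [1.892446, 1.893661, 1.885701, 0.015058, 0.015072, 0.015026]},
    index=index,
)

# One median per node, taken across that node's profiles.
medians = perf.groupby(level="node")["Min time/rank"].median()
print(medians)
```

The result has one row per node, which mirrors the shape of an aggregated statistics table as described in the docstring above.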

4. Thicket Components

4.1 Performance Data

The performance data table is a multi-dimensional, multi-indexed component of thicket. Each row is identified by a call tree node and a profile index, so every node has one row per execution (i.e., profile) in which it appears.
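
The two-level row index can be illustrated with a small pandas table. This is a sketch under stated assumptions: the node labels are plain strings and the profile IDs (101 to 103) are made up, whereas the real table uses call tree node objects and profile hashes.

```python
import pandas as pd

# Toy table with the same two index levels; profile IDs are invented.
index = pd.MultiIndex.from_product(
    [["Apps", "Basic"], [101, 102, 103]], names=["node", "profile"]
)
perf = pd.DataFrame(
    {"Total time": [0.193195, 0.193273, 0.193437, 0.361976, 0.332863, 0.362643]},
    index=index,
)

# All executions of one call tree node (rows are indexed by profile).
apps_rows = perf.loc["Apps"]

# Every node's measurement from a single profile (rows are indexed by node).
one_profile = perf.xs(102, level="profile")

print(apps_rows)
print(one_profile)
```

Selecting on the first level answers "how did this node behave across runs?", while a cross-section on the second level answers "what did this one run look like?".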

View performance data table:

[4]:
display(HTML(th_lassen.dataframe.to_html()))
node | profile | nid | spot.channel | Min time/rank | Max time/rank | Avg time/rank | Total time | Min time/rank (exc) | Max time/rank (exc) | Avg time/rank (exc) | Total time (exc) | name
{'name': 'Base_CUDA', 'type': 'function'} 2118176828577267326 0 regionprofile 1.892446 1.892446 1.892446 1.892446 0.000642 0.000642 0.000642 0.000642 Base_CUDA
5411856859942141273 0 regionprofile 1.893661 1.893661 1.893661 1.893661 0.000636 0.000636 0.000636 0.000636 Base_CUDA
7599706706110163426 0 regionprofile 1.885701 1.885701 1.885701 1.885701 0.000617 0.000617 0.000617 0.000617 Base_CUDA
{'name': 'Algorithm', 'type': 'function'} 2118176828577267326 8 regionprofile 0.015058 0.015058 0.015058 0.015058 0.000050 0.000050 0.000050 0.000050 Algorithm
5411856859942141273 8 regionprofile 0.015072 0.015072 0.015072 0.015072 0.000045 0.000045 0.000045 0.000045 Algorithm
7599706706110163426 8 regionprofile 0.015026 0.015026 0.015026 0.015026 0.000048 0.000048 0.000048 0.000048 Algorithm
{'name': 'Algorithm_MEMCPY', 'type': 'function'} 2118176828577267326 19 regionprofile 0.005065 0.005065 0.005065 0.005065 0.000016 0.000016 0.000016 0.000016 Algorithm_MEMCPY
5411856859942141273 19 regionprofile 0.005063 0.005063 0.005063 0.005063 0.000016 0.000016 0.000016 0.000016 Algorithm_MEMCPY
7599706706110163426 19 regionprofile 0.005072 0.005072 0.005072 0.005072 0.000016 0.000016 0.000016 0.000016 Algorithm_MEMCPY
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} 2118176828577267326 23 regionprofile 0.002440 0.002440 0.002440 0.002440 0.002440 0.002440 0.002440 0.002440 Algorithm_MEMCPY.block_128
5411856859942141273 23 regionprofile 0.002439 0.002439 0.002439 0.002439 0.002439 0.002439 0.002439 0.002439 Algorithm_MEMCPY.block_128
7599706706110163426 23 regionprofile 0.002447 0.002447 0.002447 0.002447 0.002447 0.002447 0.002447 0.002447 Algorithm_MEMCPY.block_128
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} 2118176828577267326 21 regionprofile 0.002609 0.002609 0.002609 0.002609 0.002609 0.002609 0.002609 0.002609 Algorithm_MEMCPY.library
5411856859942141273 21 regionprofile 0.002608 0.002608 0.002608 0.002608 0.002608 0.002608 0.002608 0.002608 Algorithm_MEMCPY.library
7599706706110163426 21 regionprofile 0.002609 0.002609 0.002609 0.002609 0.002609 0.002609 0.002609 0.002609 Algorithm_MEMCPY.library
{'name': 'Algorithm_MEMSET', 'type': 'function'} 2118176828577267326 14 regionprofile 0.002783 0.002783 0.002783 0.002783 0.000014 0.000014 0.000014 0.000014 Algorithm_MEMSET
5411856859942141273 14 regionprofile 0.002788 0.002788 0.002788 0.002788 0.000016 0.000016 0.000016 0.000016 Algorithm_MEMSET
7599706706110163426 14 regionprofile 0.002775 0.002775 0.002775 0.002775 0.000016 0.000016 0.000016 0.000016 Algorithm_MEMSET
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} 2118176828577267326 17 regionprofile 0.001488 0.001488 0.001488 0.001488 0.001488 0.001488 0.001488 0.001488 Algorithm_MEMSET.block_128
5411856859942141273 17 regionprofile 0.001490 0.001490 0.001490 0.001490 0.001490 0.001490 0.001490 0.001490 Algorithm_MEMSET.block_128
7599706706110163426 17 regionprofile 0.001484 0.001484 0.001484 0.001484 0.001484 0.001484 0.001484 0.001484 Algorithm_MEMSET.block_128
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} 2118176828577267326 15 regionprofile 0.001281 0.001281 0.001281 0.001281 0.001281 0.001281 0.001281 0.001281 Algorithm_MEMSET.library
5411856859942141273 15 regionprofile 0.001282 0.001282 0.001282 0.001282 0.001282 0.001282 0.001282 0.001282 Algorithm_MEMSET.library
7599706706110163426 15 regionprofile 0.001275 0.001275 0.001275 0.001275 0.001275 0.001275 0.001275 0.001275 Algorithm_MEMSET.library
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} 2118176828577267326 11 regionprofile 0.004220 0.004220 0.004220 0.004220 0.000019 0.000019 0.000019 0.000019 Algorithm_REDUCE_SUM
5411856859942141273 11 regionprofile 0.004250 0.004250 0.004250 0.004250 0.000020 0.000020 0.000020 0.000020 Algorithm_REDUCE_SUM
7599706706110163426 11 regionprofile 0.004223 0.004223 0.004223 0.004223 0.000018 0.000018 0.000018 0.000018 Algorithm_REDUCE_SUM
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} 2118176828577267326 13 regionprofile 0.002683 0.002683 0.002683 0.002683 0.002683 0.002683 0.002683 0.002683 Algorithm_REDUCE_SUM.block_128
5411856859942141273 13 regionprofile 0.002696 0.002696 0.002696 0.002696 0.002696 0.002696 0.002696 0.002696 Algorithm_REDUCE_SUM.block_128
7599706706110163426 13 regionprofile 0.002678 0.002678 0.002678 0.002678 0.002678 0.002678 0.002678 0.002678 Algorithm_REDUCE_SUM.block_128
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} 2118176828577267326 12 regionprofile 0.001518 0.001518 0.001518 0.001518 0.001518 0.001518 0.001518 0.001518 Algorithm_REDUCE_SUM.cub
5411856859942141273 12 regionprofile 0.001534 0.001534 0.001534 0.001534 0.001534 0.001534 0.001534 0.001534 Algorithm_REDUCE_SUM.cub
7599706706110163426 12 regionprofile 0.001527 0.001527 0.001527 0.001527 0.001527 0.001527 0.001527 0.001527 Algorithm_REDUCE_SUM.cub
{'name': 'Algorithm_SCAN', 'type': 'function'} 2118176828577267326 9 regionprofile 0.002940 0.002940 0.002940 0.002940 0.000008 0.000008 0.000008 0.000008 Algorithm_SCAN
5411856859942141273 9 regionprofile 0.002926 0.002926 0.002926 0.002926 0.000009 0.000009 0.000009 0.000009 Algorithm_SCAN
7599706706110163426 9 regionprofile 0.002908 0.002908 0.002908 0.002908 0.000007 0.000007 0.000007 0.000007 Algorithm_SCAN
{'name': 'Algorithm_SCAN.default', 'type': 'function'} 2118176828577267326 10 regionprofile 0.002932 0.002932 0.002932 0.002932 0.002932 0.002932 0.002932 0.002932 Algorithm_SCAN.default
5411856859942141273 10 regionprofile 0.002917 0.002917 0.002917 0.002917 0.002917 0.002917 0.002917 0.002917 Algorithm_SCAN.default
7599706706110163426 10 regionprofile 0.002901 0.002901 0.002901 0.002901 0.002901 0.002901 0.002901 0.002901 Algorithm_SCAN.default
{'name': 'Apps', 'type': 'function'} 2118176828577267326 1 regionprofile 0.193195 0.193195 0.193195 0.193195 0.000093 0.000093 0.000093 0.000093 Apps
5411856859942141273 1 regionprofile 0.193273 0.193273 0.193273 0.193273 0.000085 0.000085 0.000085 0.000085 Apps
7599706706110163426 1 regionprofile 0.193437 0.193437 0.193437 0.193437 0.000086 0.000086 0.000086 0.000086 Apps
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} 2118176828577267326 112 regionprofile 0.003169 0.003169 0.003169 0.003169 0.000008 0.000008 0.000008 0.000008 Apps_CONVECTION3DPA
5411856859942141273 112 regionprofile 0.003174 0.003174 0.003174 0.003174 0.000008 0.000008 0.000008 0.000008 Apps_CONVECTION3DPA
7599706706110163426 112 regionprofile 0.003165 0.003165 0.003165 0.003165 0.000008 0.000008 0.000008 0.000008 Apps_CONVECTION3DPA
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} 2118176828577267326 113 regionprofile 0.003161 0.003161 0.003161 0.003161 0.003161 0.003161 0.003161 0.003161 Apps_CONVECTION3DPA.block_64
5411856859942141273 113 regionprofile 0.003166 0.003166 0.003166 0.003166 0.003166 0.003166 0.003166 0.003166 Apps_CONVECTION3DPA.block_64
7599706706110163426 113 regionprofile 0.003157 0.003157 0.003157 0.003157 0.003157 0.003157 0.003157 0.003157 Apps_CONVECTION3DPA.block_64
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} 2118176828577267326 114 regionprofile 0.007015 0.007015 0.007015 0.007015 0.000009 0.000009 0.000009 0.000009 Apps_DEL_DOT_VEC_2D
5411856859942141273 114 regionprofile 0.007069 0.007069 0.007069 0.007069 0.000009 0.000009 0.000009 0.000009 Apps_DEL_DOT_VEC_2D
7599706706110163426 114 regionprofile 0.007000 0.007000 0.007000 0.007000 0.000009 0.000009 0.000009 0.000009 Apps_DEL_DOT_VEC_2D
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} 2118176828577267326 115 regionprofile 0.007006 0.007006 0.007006 0.007006 0.007006 0.007006 0.007006 0.007006 Apps_DEL_DOT_VEC_2D.block_128
5411856859942141273 115 regionprofile 0.007060 0.007060 0.007060 0.007060 0.007060 0.007060 0.007060 0.007060 Apps_DEL_DOT_VEC_2D.block_128
7599706706110163426 115 regionprofile 0.006991 0.006991 0.006991 0.006991 0.006991 0.006991 0.006991 0.006991 Apps_DEL_DOT_VEC_2D.block_128
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} 2118176828577267326 116 regionprofile 0.004936 0.004936 0.004936 0.004936 0.000010 0.000010 0.000010 0.000010 Apps_DIFFUSION3DPA
5411856859942141273 116 regionprofile 0.004932 0.004932 0.004932 0.004932 0.000008 0.000008 0.000008 0.000008 Apps_DIFFUSION3DPA
7599706706110163426 116 regionprofile 0.004956 0.004956 0.004956 0.004956 0.000010 0.000010 0.000010 0.000010 Apps_DIFFUSION3DPA
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} 2118176828577267326 117 regionprofile 0.004926 0.004926 0.004926 0.004926 0.004926 0.004926 0.004926 0.004926 Apps_DIFFUSION3DPA.block_64
5411856859942141273 117 regionprofile 0.004924 0.004924 0.004924 0.004924 0.004924 0.004924 0.004924 0.004924 Apps_DIFFUSION3DPA.block_64
7599706706110163426 117 regionprofile 0.004946 0.004946 0.004946 0.004946 0.004946 0.004946 0.004946 0.004946 Apps_DIFFUSION3DPA.block_64
{'name': 'Apps_ENERGY', 'type': 'function'} 2118176828577267326 118 regionprofile 0.039311 0.039311 0.039311 0.039311 0.000008 0.000008 0.000008 0.000008 Apps_ENERGY
5411856859942141273 118 regionprofile 0.039254 0.039254 0.039254 0.039254 0.000009 0.000009 0.000009 0.000009 Apps_ENERGY
7599706706110163426 118 regionprofile 0.039317 0.039317 0.039317 0.039317 0.000010 0.000010 0.000010 0.000010 Apps_ENERGY
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} 2118176828577267326 119 regionprofile 0.039303 0.039303 0.039303 0.039303 0.039303 0.039303 0.039303 0.039303 Apps_ENERGY.block_128
5411856859942141273 119 regionprofile 0.039245 0.039245 0.039245 0.039245 0.039245 0.039245 0.039245 0.039245 Apps_ENERGY.block_128
7599706706110163426 119 regionprofile 0.039307 0.039307 0.039307 0.039307 0.039307 0.039307 0.039307 0.039307 Apps_ENERGY.block_128
{'name': 'Apps_FIR', 'type': 'function'} 2118176828577267326 120 regionprofile 0.004163 0.004163 0.004163 0.004163 0.000009 0.000009 0.000009 0.000009 Apps_FIR
5411856859942141273 120 regionprofile 0.004167 0.004167 0.004167 0.004167 0.000010 0.000010 0.000010 0.000010 Apps_FIR
7599706706110163426 120 regionprofile 0.004195 0.004195 0.004195 0.004195 0.000010 0.000010 0.000010 0.000010 Apps_FIR
{'name': 'Apps_FIR.block_128', 'type': 'function'} 2118176828577267326 121 regionprofile 0.004154 0.004154 0.004154 0.004154 0.004154 0.004154 0.004154 0.004154 Apps_FIR.block_128
5411856859942141273 121 regionprofile 0.004157 0.004157 0.004157 0.004157 0.004157 0.004157 0.004157 0.004157 Apps_FIR.block_128
7599706706110163426 121 regionprofile 0.004185 0.004185 0.004185 0.004185 0.004185 0.004185 0.004185 0.004185 Apps_FIR.block_128
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} 2118176828577267326 122 regionprofile 0.037487 0.037487 0.037487 0.037487 0.000009 0.000009 0.000009 0.000009 Apps_HALOEXCHANGE
5411856859942141273 122 regionprofile 0.037465 0.037465 0.037465 0.037465 0.000009 0.000009 0.000009 0.000009 Apps_HALOEXCHANGE
7599706706110163426 122 regionprofile 0.037805 0.037805 0.037805 0.037805 0.000009 0.000009 0.000009 0.000009 Apps_HALOEXCHANGE
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} 2118176828577267326 123 regionprofile 0.037478 0.037478 0.037478 0.037478 0.037478 0.037478 0.037478 0.037478 Apps_HALOEXCHANGE.block_128
5411856859942141273 123 regionprofile 0.037456 0.037456 0.037456 0.037456 0.037456 0.037456 0.037456 0.037456 Apps_HALOEXCHANGE.block_128
7599706706110163426 123 regionprofile 0.037796 0.037796 0.037796 0.037796 0.037796 0.037796 0.037796 0.037796 Apps_HALOEXCHANGE.block_128
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} 2118176828577267326 124 regionprofile 0.009477 0.009477 0.009477 0.009477 0.000010 0.000010 0.000010 0.000010 Apps_HALOEXCHANGE_FUSED
5411856859942141273 124 regionprofile 0.009440 0.009440 0.009440 0.009440 0.000010 0.000010 0.000010 0.000010 Apps_HALOEXCHANGE_FUSED
7599706706110163426 124 regionprofile 0.009135 0.009135 0.009135 0.009135 0.000010 0.000010 0.000010 0.000010 Apps_HALOEXCHANGE_FUSED
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} 2118176828577267326 125 regionprofile 0.009467 0.009467 0.009467 0.009467 0.009467 0.009467 0.009467 0.009467 Apps_HALOEXCHANGE_FUSED.block_128
5411856859942141273 125 regionprofile 0.009430 0.009430 0.009430 0.009430 0.009430 0.009430 0.009430 0.009430 Apps_HALOEXCHANGE_FUSED.block_128
7599706706110163426 125 regionprofile 0.009125 0.009125 0.009125 0.009125 0.009125 0.009125 0.009125 0.009125 Apps_HALOEXCHANGE_FUSED.block_128
{'name': 'Apps_LTIMES', 'type': 'function'} 2118176828577267326 126 regionprofile 0.012112 0.012112 0.012112 0.012112 0.000008 0.000008 0.000008 0.000008 Apps_LTIMES
5411856859942141273 126 regionprofile 0.012087 0.012087 0.012087 0.012087 0.000008 0.000008 0.000008 0.000008 Apps_LTIMES
7599706706110163426 126 regionprofile 0.012097 0.012097 0.012097 0.012097 0.000009 0.000009 0.000009 0.000009 Apps_LTIMES
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} 2118176828577267326 127 regionprofile 0.012104 0.012104 0.012104 0.012104 0.012104 0.012104 0.012104 0.012104 Apps_LTIMES.block_128
5411856859942141273 127 regionprofile 0.012079 0.012079 0.012079 0.012079 0.012079 0.012079 0.012079 0.012079 Apps_LTIMES.block_128
7599706706110163426 127 regionprofile 0.012088 0.012088 0.012088 0.012088 0.012088 0.012088 0.012088 0.012088 Apps_LTIMES.block_128
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} 2118176828577267326 128 regionprofile 0.012095 0.012095 0.012095 0.012095 0.000010 0.000010 0.000010 0.000010 Apps_LTIMES_NOVIEW
5411856859942141273 128 regionprofile 0.012091 0.012091 0.012091 0.012091 0.000011 0.000011 0.000011 0.000011 Apps_LTIMES_NOVIEW
7599706706110163426 128 regionprofile 0.012106 0.012106 0.012106 0.012106 0.000011 0.000011 0.000011 0.000011 Apps_LTIMES_NOVIEW
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} 2118176828577267326 129 regionprofile 0.012085 0.012085 0.012085 0.012085 0.012085 0.012085 0.012085 0.012085 Apps_LTIMES_NOVIEW.block_128
5411856859942141273 129 regionprofile 0.012080 0.012080 0.012080 0.012080 0.012080 0.012080 0.012080 0.012080 Apps_LTIMES_NOVIEW.block_128
7599706706110163426 129 regionprofile 0.012095 0.012095 0.012095 0.012095 0.012095 0.012095 0.012095 0.012095 Apps_LTIMES_NOVIEW.block_128
{'name': 'Apps_MASS3DPA', 'type': 'function'} 2118176828577267326 130 regionprofile 0.001910 0.001910 0.001910 0.001910 0.000008 0.000008 0.000008 0.000008 Apps_MASS3DPA
5411856859942141273 130 regionprofile 0.001920 0.001920 0.001920 0.001920 0.000008 0.000008 0.000008 0.000008 Apps_MASS3DPA
7599706706110163426 130 regionprofile 0.001912 0.001912 0.001912 0.001912 0.000008 0.000008 0.000008 0.000008 Apps_MASS3DPA
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} 2118176828577267326 131 regionprofile 0.001902 0.001902 0.001902 0.001902 0.001902 0.001902 0.001902 0.001902 Apps_MASS3DPA.block_25
5411856859942141273 131 regionprofile 0.001912 0.001912 0.001912 0.001912 0.001912 0.001912 0.001912 0.001912 Apps_MASS3DPA.block_25
7599706706110163426 131 regionprofile 0.001904 0.001904 0.001904 0.001904 0.001904 0.001904 0.001904 0.001904 Apps_MASS3DPA.block_25
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} 2118176828577267326 2 regionprofile 0.007415 0.007415 0.007415 0.007415 0.000007 0.000007 0.000007 0.000007 Apps_NODAL_ACCUMULATION_3D
5411856859942141273 2 regionprofile 0.007454 0.007454 0.007454 0.007454 0.000009 0.000009 0.000009 0.000009 Apps_NODAL_ACCUMULATION_3D
7599706706110163426 2 regionprofile 0.007358 0.007358 0.007358 0.007358 0.000009 0.000009 0.000009 0.000009 Apps_NODAL_ACCUMULATION_3D
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} 2118176828577267326 3 regionprofile 0.007408 0.007408 0.007408 0.007408 0.007408 0.007408 0.007408 0.007408 Apps_NODAL_ACCUMULATION_3D.block_128
5411856859942141273 3 regionprofile 0.007445 0.007445 0.007445 0.007445 0.007445 0.007445 0.007445 0.007445 Apps_NODAL_ACCUMULATION_3D.block_128
7599706706110163426 3 regionprofile 0.007349 0.007349 0.007349 0.007349 0.007349 0.007349 0.007349 0.007349 Apps_NODAL_ACCUMULATION_3D.block_128
{'name': 'Apps_PRESSURE', 'type': 'function'} 2118176828577267326 4 regionprofile 0.048260 0.048260 0.048260 0.048260 0.000009 0.000009 0.000009 0.000009 Apps_PRESSURE
5411856859942141273 4 regionprofile 0.048222 0.048222 0.048222 0.048222 0.000010 0.000010 0.000010 0.000010 Apps_PRESSURE
7599706706110163426 4 regionprofile 0.048312 0.048312 0.048312 0.048312 0.000009 0.000009 0.000009 0.000009 Apps_PRESSURE
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} 2118176828577267326 5 regionprofile 0.048251 0.048251 0.048251 0.048251 0.048251 0.048251 0.048251 0.048251 Apps_PRESSURE.block_128
5411856859942141273 5 regionprofile 0.048212 0.048212 0.048212 0.048212 0.048212 0.048212 0.048212 0.048212 Apps_PRESSURE.block_128
7599706706110163426 5 regionprofile 0.048303 0.048303 0.048303 0.048303 0.048303 0.048303 0.048303 0.048303 Apps_PRESSURE.block_128
{'name': 'Apps_VOL3D', 'type': 'function'} 2118176828577267326 6 regionprofile 0.005752 0.005752 0.005752 0.005752 0.000009 0.000009 0.000009 0.000009 Apps_VOL3D
5411856859942141273 6 regionprofile 0.005913 0.005913 0.005913 0.005913 0.000010 0.000010 0.000010 0.000010 Apps_VOL3D
7599706706110163426 6 regionprofile 0.005993 0.005993 0.005993 0.005993 0.000008 0.000008 0.000008 0.000008 Apps_VOL3D
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} 2118176828577267326 7 regionprofile 0.005743 0.005743 0.005743 0.005743 0.005743 0.005743 0.005743 0.005743 Apps_VOL3D.block_128
5411856859942141273 7 regionprofile 0.005903 0.005903 0.005903 0.005903 0.005903 0.005903 0.005903 0.005903 Apps_VOL3D.block_128
7599706706110163426 7 regionprofile 0.005985 0.005985 0.005985 0.005985 0.005985 0.005985 0.005985 0.005985 Apps_VOL3D.block_128
{'name': 'Basic', 'type': 'function'} 2118176828577267326 16 regionprofile 0.361976 0.361976 0.361976 0.361976 0.000096 0.000096 0.000096 0.000096 Basic
5411856859942141273 16 regionprofile 0.332863 0.332863 0.332863 0.332863 0.000101 0.000101 0.000101 0.000101 Basic
7599706706110163426 16 regionprofile 0.362643 0.362643 0.362643 0.362643 0.000097 0.000097 0.000097 0.000097 Basic
{'name': 'Basic_DAXPY', 'type': 'function'} 2118176828577267326 18 regionprofile 0.016935 0.016935 0.016935 0.016935 0.000011 0.000011 0.000011 0.000011 Basic_DAXPY
5411856859942141273 18 regionprofile 0.016840 0.016840 0.016840 0.016840 0.000010 0.000010 0.000010 0.000010 Basic_DAXPY
7599706706110163426 18 regionprofile 0.016959 0.016959 0.016959 0.016959 0.000011 0.000011 0.000011 0.000011 Basic_DAXPY
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} 2118176828577267326 20 regionprofile 0.016924 0.016924 0.016924 0.016924 0.016924 0.016924 0.016924 0.016924 Basic_DAXPY.block_128
5411856859942141273 20 regionprofile 0.016830 0.016830 0.016830 0.016830 0.016830 0.016830 0.016830 0.016830 Basic_DAXPY.block_128
7599706706110163426 20 regionprofile 0.016948 0.016948 0.016948 0.016948 0.016948 0.016948 0.016948 0.016948 Basic_DAXPY.block_128
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} 2118176828577267326 22 regionprofile 0.016983 0.016983 0.016983 0.016983 0.000009 0.000009 0.000009 0.000009 Basic_DAXPY_ATOMIC
5411856859942141273 22 regionprofile 0.016901 0.016901 0.016901 0.016901 0.000009 0.000009 0.000009 0.000009 Basic_DAXPY_ATOMIC
7599706706110163426 22 regionprofile 0.016956 0.016956 0.016956 0.016956 0.000009 0.000009 0.000009 0.000009 Basic_DAXPY_ATOMIC
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} 2118176828577267326 24 regionprofile 0.016974 0.016974 0.016974 0.016974 0.016974 0.016974 0.016974 0.016974 Basic_DAXPY_ATOMIC.block_128
5411856859942141273 24 regionprofile 0.016892 0.016892 0.016892 0.016892 0.016892 0.016892 0.016892 0.016892 Basic_DAXPY_ATOMIC.block_128
7599706706110163426 24 regionprofile 0.016947 0.016947 0.016947 0.016947 0.016947 0.016947 0.016947 0.016947 Basic_DAXPY_ATOMIC.block_128
{'name': 'Basic_IF_QUAD', 'type': 'function'} 2118176828577267326 25 regionprofile 0.012748 0.012748 0.012748 0.012748 0.000008 0.000008 0.000008 0.000008 Basic_IF_QUAD
5411856859942141273 25 regionprofile 0.012599 0.012599 0.012599 0.012599 0.000009 0.000009 0.000009 0.000009 Basic_IF_QUAD
7599706706110163426 25 regionprofile 0.012630 0.012630 0.012630 0.012630 0.000008 0.000008 0.000008 0.000008 Basic_IF_QUAD
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} 2118176828577267326 26 regionprofile 0.012740 0.012740 0.012740 0.012740 0.012740 0.012740 0.012740 0.012740 Basic_IF_QUAD.block_128
5411856859942141273 26 regionprofile 0.012590 0.012590 0.012590 0.012590 0.012590 0.012590 0.012590 0.012590 Basic_IF_QUAD.block_128
7599706706110163426 26 regionprofile 0.012622 0.012622 0.012622 0.012622 0.012622 0.012622 0.012622 0.012622 Basic_IF_QUAD.block_128
{'name': 'Basic_INDEXLIST', 'type': 'function'} 2118176828577267326 27 regionprofile 0.004529 0.004529 0.004529 0.004529 0.000010 0.000010 0.000010 0.000010 Basic_INDEXLIST
5411856859942141273 27 regionprofile 0.004284 0.004284 0.004284 0.004284 0.000008 0.000008 0.000008 0.000008 Basic_INDEXLIST
7599706706110163426 27 regionprofile 0.004514 0.004514 0.004514 0.004514 0.000008 0.000008 0.000008 0.000008 Basic_INDEXLIST
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} 2118176828577267326 28 regionprofile 0.004519 0.004519 0.004519 0.004519 0.004519 0.004519 0.004519 0.004519 Basic_INDEXLIST.block_256
5411856859942141273 28 regionprofile 0.004276 0.004276 0.004276 0.004276 0.004276 0.004276 0.004276 0.004276 Basic_INDEXLIST.block_256
7599706706110163426 28 regionprofile 0.004506 0.004506 0.004506 0.004506 0.004506 0.004506 0.004506 0.004506 Basic_INDEXLIST.block_256
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} 2118176828577267326 29 regionprofile 0.008203 0.008203 0.008203 0.008203 0.000009 0.000009 0.000009 0.000009 Basic_INDEXLIST_3LOOP
5411856859942141273 29 regionprofile 0.008037 0.008037 0.008037 0.008037 0.000009 0.000009 0.000009 0.000009 Basic_INDEXLIST_3LOOP
7599706706110163426 29 regionprofile 0.008184 0.008184 0.008184 0.008184 0.000009 0.000009 0.000009 0.000009 Basic_INDEXLIST_3LOOP
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} 2118176828577267326 30 regionprofile 0.008194 0.008194 0.008194 0.008194 0.008194 0.008194 0.008194 0.008194 Basic_INDEXLIST_3LOOP.block_256
5411856859942141273 30 regionprofile 0.008028 0.008028 0.008028 0.008028 0.008028 0.008028 0.008028 0.008028 Basic_INDEXLIST_3LOOP.block_256
7599706706110163426 30 regionprofile 0.008175 0.008175 0.008175 0.008175 0.008175 0.008175 0.008175 0.008175 Basic_INDEXLIST_3LOOP.block_256
{'name': 'Basic_INIT3', 'type': 'function'} 2118176828577267326 31 regionprofile 0.028361 0.028361 0.028361 0.028361 0.000009 0.000009 0.000009 0.000009 Basic_INIT3
5411856859942141273 31 regionprofile 0.028205 0.028205 0.028205 0.028205 0.000008 0.000008 0.000008 0.000008 Basic_INIT3
7599706706110163426 31 regionprofile 0.028205 0.028205 0.028205 0.028205 0.000009 0.000009 0.000009 0.000009 Basic_INIT3
{'name': 'Basic_INIT3.block_128', 'type': 'function'} 2118176828577267326 32 regionprofile 0.028352 0.028352 0.028352 0.028352 0.028352 0.028352 0.028352 0.028352 Basic_INIT3.block_128
5411856859942141273 32 regionprofile 0.028197 0.028197 0.028197 0.028197 0.028197 0.028197 0.028197 0.028197 Basic_INIT3.block_128
7599706706110163426 32 regionprofile 0.028196 0.028196 0.028196 0.028196 0.028196 0.028196 0.028196 0.028196 Basic_INIT3.block_128
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} 2118176828577267326 33 regionprofile 0.042035 0.042035 0.042035 0.042035 0.000009 0.000009 0.000009 0.000009 Basic_INIT_VIEW1D
5411856859942141273 33 regionprofile 0.036596 0.036596 0.036596 0.036596 0.000007 0.000007 0.000007 0.000007 Basic_INIT_VIEW1D
7599706706110163426 33 regionprofile 0.041789 0.041789 0.041789 0.041789 0.000009 0.000009 0.000009 0.000009 Basic_INIT_VIEW1D
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} 2118176828577267326 34 regionprofile 0.042026 0.042026 0.042026 0.042026 0.042026 0.042026 0.042026 0.042026 Basic_INIT_VIEW1D.block_128
5411856859942141273 34 regionprofile 0.036589 0.036589 0.036589 0.036589 0.036589 0.036589 0.036589 0.036589 Basic_INIT_VIEW1D.block_128
7599706706110163426 34 regionprofile 0.041780 0.041780 0.041780 0.041780 0.041780 0.041780 0.041780 0.041780 Basic_INIT_VIEW1D.block_128
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} 2118176828577267326 35 regionprofile 0.042047 0.042047 0.042047 0.042047 0.000009 0.000009 0.000009 0.000009 Basic_INIT_VIEW1D_OFFSET
5411856859942141273 35 regionprofile 0.036612 0.036612 0.036612 0.036612 0.000009 0.000009 0.000009 0.000009 Basic_INIT_VIEW1D_OFFSET
7599706706110163426 35 regionprofile 0.041804 0.041804 0.041804 0.041804 0.000009 0.000009 0.000009 0.000009 Basic_INIT_VIEW1D_OFFSET
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} 2118176828577267326 36 regionprofile 0.042038 0.042038 0.042038 0.042038 0.042038 0.042038 0.042038 0.042038 Basic_INIT_VIEW1D_OFFSET.block_128
5411856859942141273 36 regionprofile 0.036603 0.036603 0.036603 0.036603 0.036603 0.036603 0.036603 0.036603 Basic_INIT_VIEW1D_OFFSET.block_128
7599706706110163426 36 regionprofile 0.041795 0.041795 0.041795 0.041795 0.041795 0.041795 0.041795 0.041795 Basic_INIT_VIEW1D_OFFSET.block_128
{'name': 'Basic_MULADDSUB', 'type': 'function'} 2118176828577267326 37 regionprofile 0.019727 0.019727 0.019727 0.019727 0.000011 0.000011 0.000011 0.000011 Basic_MULADDSUB
5411856859942141273 37 regionprofile 0.019762 0.019762 0.019762 0.019762 0.000011 0.000011 0.000011 0.000011 Basic_MULADDSUB
7599706706110163426 37 regionprofile 0.019756 0.019756 0.019756 0.019756 0.000010 0.000010 0.000010 0.000010 Basic_MULADDSUB
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} 2118176828577267326 38 regionprofile 0.019716 0.019716 0.019716 0.019716 0.019716 0.019716 0.019716 0.019716 Basic_MULADDSUB.block_128
5411856859942141273 38 regionprofile 0.019751 0.019751 0.019751 0.019751 0.019751 0.019751 0.019751 0.019751 Basic_MULADDSUB.block_128
7599706706110163426 38 regionprofile 0.019746 0.019746 0.019746 0.019746 0.019746 0.019746 0.019746 0.019746 Basic_MULADDSUB.block_128
{'name': 'Basic_NESTED_INIT', 'type': 'function'} 2118176828577267326 39 regionprofile 0.020790 0.020790 0.020790 0.020790 0.000008 0.000008 0.000008 0.000008 Basic_NESTED_INIT
5411856859942141273 39 regionprofile 0.018093 0.018093 0.018093 0.018093 0.000009 0.000009 0.000009 0.000009 Basic_NESTED_INIT
7599706706110163426 39 regionprofile 0.020679 0.020679 0.020679 0.020679 0.000009 0.000009 0.000009 0.000009 Basic_NESTED_INIT
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} 2118176828577267326 40 regionprofile 0.020782 0.020782 0.020782 0.020782 0.020782 0.020782 0.020782 0.020782 Basic_NESTED_INIT.block_128
5411856859942141273 40 regionprofile 0.018084 0.018084 0.018084 0.018084 0.018084 0.018084 0.018084 0.018084 Basic_NESTED_INIT.block_128
7599706706110163426 40 regionprofile 0.020670 0.020670 0.020670 0.020670 0.020670 0.020670 0.020670 0.020670 Basic_NESTED_INIT.block_128
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} 2118176828577267326 41 regionprofile 0.124121 0.124121 0.124121 0.124121 0.000008 0.000008 0.000008 0.000008 Basic_PI_ATOMIC
5411856859942141273 41 regionprofile 0.110341 0.110341 0.110341 0.110341 0.000009 0.000009 0.000009 0.000009 Basic_PI_ATOMIC
7599706706110163426 41 regionprofile 0.126655 0.126655 0.126655 0.126655 0.000009 0.000009 0.000009 0.000009 Basic_PI_ATOMIC
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} 2118176828577267326 42 regionprofile 0.124113 0.124113 0.124113 0.124113 0.124113 0.124113 0.124113 0.124113 Basic_PI_ATOMIC.block_128
5411856859942141273 42 regionprofile 0.110332 0.110332 0.110332 0.110332 0.110332 0.110332 0.110332 0.110332 Basic_PI_ATOMIC.block_128
7599706706110163426 42 regionprofile 0.126646 0.126646 0.126646 0.126646 0.126646 0.126646 0.126646 0.126646 Basic_PI_ATOMIC.block_128
{'name': 'Basic_PI_REDUCE', 'type': 'function'} 2118176828577267326 43 regionprofile 0.002610 0.002610 0.002610 0.002610 0.000008 0.000008 0.000008 0.000008 Basic_PI_REDUCE
5411856859942141273 43 regionprofile 0.002586 0.002586 0.002586 0.002586 0.000007 0.000007 0.000007 0.000007 Basic_PI_REDUCE
7599706706110163426 43 regionprofile 0.002685 0.002685 0.002685 0.002685 0.000008 0.000008 0.000008 0.000008 Basic_PI_REDUCE
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} 2118176828577267326 44 regionprofile 0.002602 0.002602 0.002602 0.002602 0.002602 0.002602 0.002602 0.002602 Basic_PI_REDUCE.block_128
5411856859942141273 44 regionprofile 0.002579 0.002579 0.002579 0.002579 0.002579 0.002579 0.002579 0.002579 Basic_PI_REDUCE.block_128
7599706706110163426 44 regionprofile 0.002677 0.002677 0.002677 0.002677 0.002677 0.002677 0.002677 0.002677 Basic_PI_REDUCE.block_128
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} 2118176828577267326 45 regionprofile 0.002468 0.002468 0.002468 0.002468 0.000008 0.000008 0.000008 0.000008 Basic_REDUCE3_INT
5411856859942141273 45 regionprofile 0.002380 0.002380 0.002380 0.002380 0.000008 0.000008 0.000008 0.000008 Basic_REDUCE3_INT
7599706706110163426 45 regionprofile 0.002388 0.002388 0.002388 0.002388 0.000008 0.000008 0.000008 0.000008 Basic_REDUCE3_INT
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} 2118176828577267326 46 regionprofile 0.002460 0.002460 0.002460 0.002460 0.002460 0.002460 0.002460 0.002460 Basic_REDUCE3_INT.block_128
5411856859942141273 46 regionprofile 0.002372 0.002372 0.002372 0.002372 0.002372 0.002372 0.002372 0.002372 Basic_REDUCE3_INT.block_128
7599706706110163426 46 regionprofile 0.002380 0.002380 0.002380 0.002380 0.002380 0.002380 0.002380 0.002380 Basic_REDUCE3_INT.block_128
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} 2118176828577267326 47 regionprofile 0.017720 0.017720 0.017720 0.017720 0.000008 0.000008 0.000008 0.000008 Basic_REDUCE_STRUCT
5411856859942141273 47 regionprofile 0.016925 0.016925 0.016925 0.016925 0.000008 0.000008 0.000008 0.000008 Basic_REDUCE_STRUCT
7599706706110163426 47 regionprofile 0.016720 0.016720 0.016720 0.016720 0.000009 0.000009 0.000009 0.000009 Basic_REDUCE_STRUCT
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} 2118176828577267326 48 regionprofile 0.017712 0.017712 0.017712 0.017712 0.017712 0.017712 0.017712 0.017712 Basic_REDUCE_STRUCT.block_128
5411856859942141273 48 regionprofile 0.016917 0.016917 0.016917 0.016917 0.016917 0.016917 0.016917 0.016917 Basic_REDUCE_STRUCT.block_128
7599706706110163426 48 regionprofile 0.016711 0.016711 0.016711 0.016711 0.016711 0.016711 0.016711 0.016711 Basic_REDUCE_STRUCT.block_128
{'name': 'Basic_TRAP_INT', 'type': 'function'} 2118176828577267326 49 regionprofile 0.002603 0.002603 0.002603 0.002603 0.000007 0.000007 0.000007 0.000007 Basic_TRAP_INT
5411856859942141273 49 regionprofile 0.002601 0.002601 0.002601 0.002601 0.000008 0.000008 0.000008 0.000008 Basic_TRAP_INT
7599706706110163426 49 regionprofile 0.002622 0.002622 0.002622 0.002622 0.000007 0.000007 0.000007 0.000007 Basic_TRAP_INT
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} 2118176828577267326 50 regionprofile 0.002596 0.002596 0.002596 0.002596 0.002596 0.002596 0.002596 0.002596 Basic_TRAP_INT.block_128
5411856859942141273 50 regionprofile 0.002593 0.002593 0.002593 0.002593 0.002593 0.002593 0.002593 0.002593 Basic_TRAP_INT.block_128
7599706706110163426 50 regionprofile 0.002615 0.002615 0.002615 0.002615 0.002615 0.002615 0.002615 0.002615 Basic_TRAP_INT.block_128
{'name': 'Lcals', 'type': 'function'} 2118176828577267326 51 regionprofile 0.430549 0.430549 0.430549 0.430549 0.000071 0.000071 0.000071 0.000071 Lcals
5411856859942141273 51 regionprofile 0.460418 0.460418 0.460418 0.460418 0.000080 0.000080 0.000080 0.000080 Lcals
7599706706110163426 51 regionprofile 0.427691 0.427691 0.427691 0.427691 0.000074 0.000074 0.000074 0.000074 Lcals
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} 2118176828577267326 52 regionprofile 0.061532 0.061532 0.061532 0.061532 0.000009 0.000009 0.000009 0.000009 Lcals_DIFF_PREDICT
5411856859942141273 52 regionprofile 0.060504 0.060504 0.060504 0.060504 0.000010 0.000010 0.000010 0.000010 Lcals_DIFF_PREDICT
7599706706110163426 52 regionprofile 0.061569 0.061569 0.061569 0.061569 0.000009 0.000009 0.000009 0.000009 Lcals_DIFF_PREDICT
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} 2118176828577267326 53 regionprofile 0.061523 0.061523 0.061523 0.061523 0.061523 0.061523 0.061523 0.061523 Lcals_DIFF_PREDICT.block_128
5411856859942141273 53 regionprofile 0.060494 0.060494 0.060494 0.060494 0.060494 0.060494 0.060494 0.060494 Lcals_DIFF_PREDICT.block_128
7599706706110163426 53 regionprofile 0.061560 0.061560 0.061560 0.061560 0.061560 0.061560 0.061560 0.061560 Lcals_DIFF_PREDICT.block_128
{'name': 'Lcals_EOS', 'type': 'function'} 2118176828577267326 54 regionprofile 0.022710 0.022710 0.022710 0.022710 0.000009 0.000009 0.000009 0.000009 Lcals_EOS
5411856859942141273 54 regionprofile 0.022695 0.022695 0.022695 0.022695 0.000010 0.000010 0.000010 0.000010 Lcals_EOS
7599706706110163426 54 regionprofile 0.022684 0.022684 0.022684 0.022684 0.000010 0.000010 0.000010 0.000010 Lcals_EOS
{'name': 'Lcals_EOS.block_128', 'type': 'function'} 2118176828577267326 55 regionprofile 0.022701 0.022701 0.022701 0.022701 0.022701 0.022701 0.022701 0.022701 Lcals_EOS.block_128
5411856859942141273 55 regionprofile 0.022685 0.022685 0.022685 0.022685 0.022685 0.022685 0.022685 0.022685 Lcals_EOS.block_128
7599706706110163426 55 regionprofile 0.022674 0.022674 0.022674 0.022674 0.022674 0.022674 0.022674 0.022674 Lcals_EOS.block_128
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} 2118176828577267326 56 regionprofile 0.048464 0.048464 0.048464 0.048464 0.000009 0.000009 0.000009 0.000009 Lcals_FIRST_DIFF
5411856859942141273 56 regionprofile 0.048488 0.048488 0.048488 0.048488 0.000008 0.000008 0.000008 0.000008 Lcals_FIRST_DIFF
7599706706110163426 56 regionprofile 0.048353 0.048353 0.048353 0.048353 0.000009 0.000009 0.000009 0.000009 Lcals_FIRST_DIFF
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} 2118176828577267326 57 regionprofile 0.048455 0.048455 0.048455 0.048455 0.048455 0.048455 0.048455 0.048455 Lcals_FIRST_DIFF.block_128
5411856859942141273 57 regionprofile 0.048480 0.048480 0.048480 0.048480 0.048480 0.048480 0.048480 0.048480 Lcals_FIRST_DIFF.block_128
7599706706110163426 57 regionprofile 0.048344 0.048344 0.048344 0.048344 0.048344 0.048344 0.048344 0.048344 Lcals_FIRST_DIFF.block_128
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} 2118176828577267326 58 regionprofile 0.048732 0.048732 0.048732 0.048732 0.000019 0.000019 0.000019 0.000019 Lcals_FIRST_MIN
5411856859942141273 58 regionprofile 0.079664 0.079664 0.079664 0.079664 0.000012 0.000012 0.000012 0.000012 Lcals_FIRST_MIN
7599706706110163426 58 regionprofile 0.046295 0.046295 0.046295 0.046295 0.000013 0.000013 0.000013 0.000013 Lcals_FIRST_MIN
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} 2118176828577267326 59 regionprofile 0.048713 0.048713 0.048713 0.048713 0.048713 0.048713 0.048713 0.048713 Lcals_FIRST_MIN.block_128
5411856859942141273 59 regionprofile 0.079652 0.079652 0.079652 0.079652 0.079652 0.079652 0.079652 0.079652 Lcals_FIRST_MIN.block_128
7599706706110163426 59 regionprofile 0.046282 0.046282 0.046282 0.046282 0.046282 0.046282 0.046282 0.046282 Lcals_FIRST_MIN.block_128
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} 2118176828577267326 60 regionprofile 0.048529 0.048529 0.048529 0.048529 0.000010 0.000010 0.000010 0.000010 Lcals_FIRST_SUM
5411856859942141273 60 regionprofile 0.048525 0.048525 0.048525 0.048525 0.000009 0.000009 0.000009 0.000009 Lcals_FIRST_SUM
7599706706110163426 60 regionprofile 0.048453 0.048453 0.048453 0.048453 0.000009 0.000009 0.000009 0.000009 Lcals_FIRST_SUM
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} 2118176828577267326 61 regionprofile 0.048519 0.048519 0.048519 0.048519 0.048519 0.048519 0.048519 0.048519 Lcals_FIRST_SUM.block_128
5411856859942141273 61 regionprofile 0.048516 0.048516 0.048516 0.048516 0.048516 0.048516 0.048516 0.048516 Lcals_FIRST_SUM.block_128
7599706706110163426 61 regionprofile 0.048444 0.048444 0.048444 0.048444 0.048444 0.048444 0.048444 0.048444 Lcals_FIRST_SUM.block_128
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} 2118176828577267326 62 regionprofile 0.048873 0.048873 0.048873 0.048873 0.000009 0.000009 0.000009 0.000009 Lcals_GEN_LIN_RECUR
5411856859942141273 62 regionprofile 0.049033 0.049033 0.049033 0.049033 0.000009 0.000009 0.000009 0.000009 Lcals_GEN_LIN_RECUR
7599706706110163426 62 regionprofile 0.048874 0.048874 0.048874 0.048874 0.000009 0.000009 0.000009 0.000009 Lcals_GEN_LIN_RECUR
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} 2118176828577267326 63 regionprofile 0.048864 0.048864 0.048864 0.048864 0.048864 0.048864 0.048864 0.048864 Lcals_GEN_LIN_RECUR.block_128
5411856859942141273 63 regionprofile 0.049024 0.049024 0.049024 0.049024 0.049024 0.049024 0.049024 0.049024 Lcals_GEN_LIN_RECUR.block_128
7599706706110163426 63 regionprofile 0.048865 0.048865 0.048865 0.048865 0.048865 0.048865 0.048865 0.048865 Lcals_GEN_LIN_RECUR.block_128
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} 2118176828577267326 64 regionprofile 0.033847 0.033847 0.033847 0.033847 0.000009 0.000009 0.000009 0.000009 Lcals_HYDRO_1D
5411856859942141273 64 regionprofile 0.034110 0.034110 0.034110 0.034110 0.000009 0.000009 0.000009 0.000009 Lcals_HYDRO_1D
7599706706110163426 64 regionprofile 0.033981 0.033981 0.033981 0.033981 0.000010 0.000010 0.000010 0.000010 Lcals_HYDRO_1D
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} 2118176828577267326 65 regionprofile 0.033838 0.033838 0.033838 0.033838 0.033838 0.033838 0.033838 0.033838 Lcals_HYDRO_1D.block_128
5411856859942141273 65 regionprofile 0.034101 0.034101 0.034101 0.034101 0.034101 0.034101 0.034101 0.034101 Lcals_HYDRO_1D.block_128
7599706706110163426 65 regionprofile 0.033971 0.033971 0.033971 0.033971 0.033971 0.033971 0.033971 0.033971 Lcals_HYDRO_1D.block_128
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} 2118176828577267326 66 regionprofile 0.022991 0.022991 0.022991 0.022991 0.000011 0.000011 0.000011 0.000011 Lcals_HYDRO_2D
5411856859942141273 66 regionprofile 0.023084 0.023084 0.023084 0.023084 0.000010 0.000010 0.000010 0.000010 Lcals_HYDRO_2D
7599706706110163426 66 regionprofile 0.023005 0.023005 0.023005 0.023005 0.000010 0.000010 0.000010 0.000010 Lcals_HYDRO_2D
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} 2118176828577267326 67 regionprofile 0.022980 0.022980 0.022980 0.022980 0.022980 0.022980 0.022980 0.022980 Lcals_HYDRO_2D.block_128
5411856859942141273 67 regionprofile 0.023074 0.023074 0.023074 0.023074 0.023074 0.023074 0.023074 0.023074 Lcals_HYDRO_2D.block_128
7599706706110163426 67 regionprofile 0.022995 0.022995 0.022995 0.022995 0.022995 0.022995 0.022995 0.022995 Lcals_HYDRO_2D.block_128
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} 2118176828577267326 68 regionprofile 0.046886 0.046886 0.046886 0.046886 0.000010 0.000010 0.000010 0.000010 Lcals_INT_PREDICT
5411856859942141273 68 regionprofile 0.046884 0.046884 0.046884 0.046884 0.000010 0.000010 0.000010 0.000010 Lcals_INT_PREDICT
7599706706110163426 68 regionprofile 0.046948 0.046948 0.046948 0.046948 0.000009 0.000009 0.000009 0.000009 Lcals_INT_PREDICT
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} 2118176828577267326 69 regionprofile 0.046876 0.046876 0.046876 0.046876 0.046876 0.046876 0.046876 0.046876 Lcals_INT_PREDICT.block_128
5411856859942141273 69 regionprofile 0.046874 0.046874 0.046874 0.046874 0.046874 0.046874 0.046874 0.046874 Lcals_INT_PREDICT.block_128
7599706706110163426 69 regionprofile 0.046939 0.046939 0.046939 0.046939 0.046939 0.046939 0.046939 0.046939 Lcals_INT_PREDICT.block_128
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} 2118176828577267326 70 regionprofile 0.002775 0.002775 0.002775 0.002775 0.000010 0.000010 0.000010 0.000010 Lcals_PLANCKIAN
5411856859942141273 70 regionprofile 0.002769 0.002769 0.002769 0.002769 0.000010 0.000010 0.000010 0.000010 Lcals_PLANCKIAN
7599706706110163426 70 regionprofile 0.002764 0.002764 0.002764 0.002764 0.000010 0.000010 0.000010 0.000010 Lcals_PLANCKIAN
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} 2118176828577267326 71 regionprofile 0.002765 0.002765 0.002765 0.002765 0.002765 0.002765 0.002765 0.002765 Lcals_PLANCKIAN.block_128
5411856859942141273 71 regionprofile 0.002759 0.002759 0.002759 0.002759 0.002759 0.002759 0.002759 0.002759 Lcals_PLANCKIAN.block_128
7599706706110163426 71 regionprofile 0.002754 0.002754 0.002754 0.002754 0.002754 0.002754 0.002754 0.002754 Lcals_PLANCKIAN.block_128
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} 2118176828577267326 72 regionprofile 0.045139 0.045139 0.045139 0.045139 0.000010 0.000010 0.000010 0.000010 Lcals_TRIDIAG_ELIM
5411856859942141273 72 regionprofile 0.044582 0.044582 0.044582 0.044582 0.000010 0.000010 0.000010 0.000010 Lcals_TRIDIAG_ELIM
7599706706110163426 72 regionprofile 0.044691 0.044691 0.044691 0.044691 0.000010 0.000010 0.000010 0.000010 Lcals_TRIDIAG_ELIM
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} 2118176828577267326 73 regionprofile 0.045129 0.045129 0.045129 0.045129 0.045129 0.045129 0.045129 0.045129 Lcals_TRIDIAG_ELIM.block_128
5411856859942141273 73 regionprofile 0.044572 0.044572 0.044572 0.044572 0.044572 0.044572 0.044572 0.044572 Lcals_TRIDIAG_ELIM.block_128
7599706706110163426 73 regionprofile 0.044681 0.044681 0.044681 0.044681 0.044681 0.044681 0.044681 0.044681 Lcals_TRIDIAG_ELIM.block_128
{'name': 'Polybench', 'type': 'function'} 2118176828577267326 74 regionprofile 0.623023 0.623023 0.623023 0.623023 0.000085 0.000085 0.000085 0.000085 Polybench
5411856859942141273 74 regionprofile 0.622754 0.622754 0.622754 0.622754 0.000086 0.000086 0.000086 0.000086 Polybench
7599706706110163426 74 regionprofile 0.622254 0.622254 0.622254 0.622254 0.000088 0.000088 0.000088 0.000088 Polybench
{'name': 'Polybench_2MM', 'type': 'function'} 2118176828577267326 75 regionprofile 0.005957 0.005957 0.005957 0.005957 0.000008 0.000008 0.000008 0.000008 Polybench_2MM
5411856859942141273 75 regionprofile 0.005934 0.005934 0.005934 0.005934 0.000009 0.000009 0.000009 0.000009 Polybench_2MM
7599706706110163426 75 regionprofile 0.005982 0.005982 0.005982 0.005982 0.000008 0.000008 0.000008 0.000008 Polybench_2MM
{'name': 'Polybench_2MM.block_128', 'type': 'function'} 2118176828577267326 76 regionprofile 0.005949 0.005949 0.005949 0.005949 0.005949 0.005949 0.005949 0.005949 Polybench_2MM.block_128
5411856859942141273 76 regionprofile 0.005925 0.005925 0.005925 0.005925 0.005925 0.005925 0.005925 0.005925 Polybench_2MM.block_128
7599706706110163426 76 regionprofile 0.005974 0.005974 0.005974 0.005974 0.005974 0.005974 0.005974 0.005974 Polybench_2MM.block_128
{'name': 'Polybench_3MM', 'type': 'function'} 2118176828577267326 77 regionprofile 0.008891 0.008891 0.008891 0.008891 0.000008 0.000008 0.000008 0.000008 Polybench_3MM
5411856859942141273 77 regionprofile 0.008893 0.008893 0.008893 0.008893 0.000008 0.000008 0.000008 0.000008 Polybench_3MM
7599706706110163426 77 regionprofile 0.008894 0.008894 0.008894 0.008894 0.000008 0.000008 0.000008 0.000008 Polybench_3MM
{'name': 'Polybench_3MM.block_128', 'type': 'function'} 2118176828577267326 78 regionprofile 0.008883 0.008883 0.008883 0.008883 0.008883 0.008883 0.008883 0.008883 Polybench_3MM.block_128
5411856859942141273 78 regionprofile 0.008885 0.008885 0.008885 0.008885 0.008885 0.008885 0.008885 0.008885 Polybench_3MM.block_128
7599706706110163426 78 regionprofile 0.008886 0.008886 0.008886 0.008886 0.008886 0.008886 0.008886 0.008886 Polybench_3MM.block_128
{'name': 'Polybench_ADI', 'type': 'function'} 2118176828577267326 79 regionprofile 0.033510 0.033510 0.033510 0.033510 0.000009 0.000009 0.000009 0.000009 Polybench_ADI
5411856859942141273 79 regionprofile 0.033662 0.033662 0.033662 0.033662 0.000010 0.000010 0.000010 0.000010 Polybench_ADI
7599706706110163426 79 regionprofile 0.033601 0.033601 0.033601 0.033601 0.000009 0.000009 0.000009 0.000009 Polybench_ADI
{'name': 'Polybench_ADI.block_128', 'type': 'function'} 2118176828577267326 80 regionprofile 0.033501 0.033501 0.033501 0.033501 0.033501 0.033501 0.033501 0.033501 Polybench_ADI.block_128
5411856859942141273 80 regionprofile 0.033652 0.033652 0.033652 0.033652 0.033652 0.033652 0.033652 0.033652 Polybench_ADI.block_128
7599706706110163426 80 regionprofile 0.033592 0.033592 0.033592 0.033592 0.033592 0.033592 0.033592 0.033592 Polybench_ADI.block_128
{'name': 'Polybench_ATAX', 'type': 'function'} 2118176828577267326 81 regionprofile 0.037496 0.037496 0.037496 0.037496 0.000009 0.000009 0.000009 0.000009 Polybench_ATAX
5411856859942141273 81 regionprofile 0.037376 0.037376 0.037376 0.037376 0.000009 0.000009 0.000009 0.000009 Polybench_ATAX
7599706706110163426 81 regionprofile 0.037412 0.037412 0.037412 0.037412 0.000009 0.000009 0.000009 0.000009 Polybench_ATAX
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} 2118176828577267326 82 regionprofile 0.037487 0.037487 0.037487 0.037487 0.037487 0.037487 0.037487 0.037487 Polybench_ATAX.block_128
5411856859942141273 82 regionprofile 0.037367 0.037367 0.037367 0.037367 0.037367 0.037367 0.037367 0.037367 Polybench_ATAX.block_128
7599706706110163426 82 regionprofile 0.037403 0.037403 0.037403 0.037403 0.037403 0.037403 0.037403 0.037403 Polybench_ATAX.block_128
{'name': 'Polybench_FDTD_2D', 'type': 'function'} 2118176828577267326 83 regionprofile 0.037077 0.037077 0.037077 0.037077 0.000008 0.000008 0.000008 0.000008 Polybench_FDTD_2D
5411856859942141273 83 regionprofile 0.037133 0.037133 0.037133 0.037133 0.000009 0.000009 0.000009 0.000009 Polybench_FDTD_2D
7599706706110163426 83 regionprofile 0.037119 0.037119 0.037119 0.037119 0.000010 0.000010 0.000010 0.000010 Polybench_FDTD_2D
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} 2118176828577267326 84 regionprofile 0.037069 0.037069 0.037069 0.037069 0.037069 0.037069 0.037069 0.037069 Polybench_FDTD_2D.block_128
5411856859942141273 84 regionprofile 0.037124 0.037124 0.037124 0.037124 0.037124 0.037124 0.037124 0.037124 Polybench_FDTD_2D.block_128
7599706706110163426 84 regionprofile 0.037109 0.037109 0.037109 0.037109 0.037109 0.037109 0.037109 0.037109 Polybench_FDTD_2D.block_128
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} 2118176828577267326 85 regionprofile 0.209528 0.209528 0.209528 0.209528 0.000008 0.000008 0.000008 0.000008 Polybench_FLOYD_WARSHALL
5411856859942141273 85 regionprofile 0.209062 0.209062 0.209062 0.209062 0.000009 0.000009 0.000009 0.000009 Polybench_FLOYD_WARSHALL
7599706706110163426 85 regionprofile 0.208607 0.208607 0.208607 0.208607 0.000008 0.000008 0.000008 0.000008 Polybench_FLOYD_WARSHALL
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} 2118176828577267326 86 regionprofile 0.209520 0.209520 0.209520 0.209520 0.209520 0.209520 0.209520 0.209520 Polybench_FLOYD_WARSHALL.block_128
5411856859942141273 86 regionprofile 0.209053 0.209053 0.209053 0.209053 0.209053 0.209053 0.209053 0.209053 Polybench_FLOYD_WARSHALL.block_128
7599706706110163426 86 regionprofile 0.208599 0.208599 0.208599 0.208599 0.208599 0.208599 0.208599 0.208599 Polybench_FLOYD_WARSHALL.block_128
{'name': 'Polybench_GEMM', 'type': 'function'} 2118176828577267326 87 regionprofile 0.006751 0.006751 0.006751 0.006751 0.000010 0.000010 0.000010 0.000010 Polybench_GEMM
5411856859942141273 87 regionprofile 0.006751 0.006751 0.006751 0.006751 0.000011 0.000011 0.000011 0.000011 Polybench_GEMM
7599706706110163426 87 regionprofile 0.006756 0.006756 0.006756 0.006756 0.000010 0.000010 0.000010 0.000010 Polybench_GEMM
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} 2118176828577267326 88 regionprofile 0.006741 0.006741 0.006741 0.006741 0.006741 0.006741 0.006741 0.006741 Polybench_GEMM.block_128
5411856859942141273 88 regionprofile 0.006740 0.006740 0.006740 0.006740 0.006740 0.006740 0.006740 0.006740 Polybench_GEMM.block_128
7599706706110163426 88 regionprofile 0.006746 0.006746 0.006746 0.006746 0.006746 0.006746 0.006746 0.006746 Polybench_GEMM.block_128
{'name': 'Polybench_GEMVER', 'type': 'function'} 2118176828577267326 89 regionprofile 0.008459 0.008459 0.008459 0.008459 0.000008 0.000008 0.000008 0.000008 Polybench_GEMVER
5411856859942141273 89 regionprofile 0.008441 0.008441 0.008441 0.008441 0.000009 0.000009 0.000009 0.000009 Polybench_GEMVER
7599706706110163426 89 regionprofile 0.008433 0.008433 0.008433 0.008433 0.000008 0.000008 0.000008 0.000008 Polybench_GEMVER
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} 2118176828577267326 90 regionprofile 0.008451 0.008451 0.008451 0.008451 0.008451 0.008451 0.008451 0.008451 Polybench_GEMVER.block_128
5411856859942141273 90 regionprofile 0.008432 0.008432 0.008432 0.008432 0.008432 0.008432 0.008432 0.008432 Polybench_GEMVER.block_128
7599706706110163426 90 regionprofile 0.008425 0.008425 0.008425 0.008425 0.008425 0.008425 0.008425 0.008425 Polybench_GEMVER.block_128
{'name': 'Polybench_GESUMMV', 'type': 'function'} 2118176828577267326 91 regionprofile 0.037419 0.037419 0.037419 0.037419 0.000009 0.000009 0.000009 0.000009 Polybench_GESUMMV
5411856859942141273 91 regionprofile 0.037170 0.037170 0.037170 0.037170 0.000009 0.000009 0.000009 0.000009 Polybench_GESUMMV
7599706706110163426 91 regionprofile 0.037532 0.037532 0.037532 0.037532 0.000008 0.000008 0.000008 0.000008 Polybench_GESUMMV
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} 2118176828577267326 92 regionprofile 0.037410 0.037410 0.037410 0.037410 0.037410 0.037410 0.037410 0.037410 Polybench_GESUMMV.block_128
5411856859942141273 92 regionprofile 0.037161 0.037161 0.037161 0.037161 0.037161 0.037161 0.037161 0.037161 Polybench_GESUMMV.block_128
7599706706110163426 92 regionprofile 0.037524 0.037524 0.037524 0.037524 0.037524 0.037524 0.037524 0.037524 Polybench_GESUMMV.block_128
{'name': 'Polybench_HEAT_3D', 'type': 'function'} 2118176828577267326 93 regionprofile 0.021835 0.021835 0.021835 0.021835 0.000009 0.000009 0.000009 0.000009 Polybench_HEAT_3D
5411856859942141273 93 regionprofile 0.021867 0.021867 0.021867 0.021867 0.000009 0.000009 0.000009 0.000009 Polybench_HEAT_3D
7599706706110163426 93 regionprofile 0.021811 0.021811 0.021811 0.021811 0.000010 0.000010 0.000010 0.000010 Polybench_HEAT_3D
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} 2118176828577267326 94 regionprofile 0.021826 0.021826 0.021826 0.021826 0.021826 0.021826 0.021826 0.021826 Polybench_HEAT_3D.block_128
5411856859942141273 94 regionprofile 0.021858 0.021858 0.021858 0.021858 0.021858 0.021858 0.021858 0.021858 Polybench_HEAT_3D.block_128
7599706706110163426 94 regionprofile 0.021801 0.021801 0.021801 0.021801 0.021801 0.021801 0.021801 0.021801 Polybench_HEAT_3D.block_128
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} 2118176828577267326 95 regionprofile 0.077689 0.077689 0.077689 0.077689 0.000009 0.000009 0.000009 0.000009 Polybench_JACOBI_1D
5411856859942141273 95 regionprofile 0.077843 0.077843 0.077843 0.077843 0.000011 0.000011 0.000011 0.000011 Polybench_JACOBI_1D
7599706706110163426 95 regionprofile 0.077730 0.077730 0.077730 0.077730 0.000012 0.000012 0.000012 0.000012 Polybench_JACOBI_1D
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} 2118176828577267326 96 regionprofile 0.077680 0.077680 0.077680 0.077680 0.077680 0.077680 0.077680 0.077680 Polybench_JACOBI_1D.block_128
5411856859942141273 96 regionprofile 0.077832 0.077832 0.077832 0.077832 0.077832 0.077832 0.077832 0.077832 Polybench_JACOBI_1D.block_128
7599706706110163426 96 regionprofile 0.077718 0.077718 0.077718 0.077718 0.077718 0.077718 0.077718 0.077718 Polybench_JACOBI_1D.block_128
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} 2118176828577267326 97 regionprofile 0.102498 0.102498 0.102498 0.102498 0.000009 0.000009 0.000009 0.000009 Polybench_JACOBI_2D
5411856859942141273 97 regionprofile 0.102730 0.102730 0.102730 0.102730 0.000010 0.000010 0.000010 0.000010 Polybench_JACOBI_2D
7599706706110163426 97 regionprofile 0.102590 0.102590 0.102590 0.102590 0.000010 0.000010 0.000010 0.000010 Polybench_JACOBI_2D
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} 2118176828577267326 98 regionprofile 0.102489 0.102489 0.102489 0.102489 0.102489 0.102489 0.102489 0.102489 Polybench_JACOBI_2D.block_128
5411856859942141273 98 regionprofile 0.102720 0.102720 0.102720 0.102720 0.102720 0.102720 0.102720 0.102720 Polybench_JACOBI_2D.block_128
7599706706110163426 98 regionprofile 0.102580 0.102580 0.102580 0.102580 0.102580 0.102580 0.102580 0.102580 Polybench_JACOBI_2D.block_128
{'name': 'Polybench_MVT', 'type': 'function'} 2118176828577267326 99 regionprofile 0.035828 0.035828 0.035828 0.035828 0.000012 0.000012 0.000012 0.000012 Polybench_MVT
5411856859942141273 99 regionprofile 0.035806 0.035806 0.035806 0.035806 0.000009 0.000009 0.000009 0.000009 Polybench_MVT
7599706706110163426 99 regionprofile 0.035699 0.035699 0.035699 0.035699 0.000009 0.000009 0.000009 0.000009 Polybench_MVT
{'name': 'Polybench_MVT.block_128', 'type': 'function'} 2118176828577267326 100 regionprofile 0.035816 0.035816 0.035816 0.035816 0.035816 0.035816 0.035816 0.035816 Polybench_MVT.block_128
5411856859942141273 100 regionprofile 0.035797 0.035797 0.035797 0.035797 0.035797 0.035797 0.035797 0.035797 Polybench_MVT.block_128
7599706706110163426 100 regionprofile 0.035690 0.035690 0.035690 0.035690 0.035690 0.035690 0.035690 0.035690 Polybench_MVT.block_128
{'name': 'Stream', 'type': 'function'} 2118176828577267326 101 regionprofile 0.268003 0.268003 0.268003 0.268003 0.000033 0.000033 0.000033 0.000033 Stream
5411856859942141273 101 regionprofile 0.268645 0.268645 0.268645 0.268645 0.000036 0.000036 0.000036 0.000036 Stream
7599706706110163426 101 regionprofile 0.264033 0.264033 0.264033 0.264033 0.000033 0.000033 0.000033 0.000033 Stream
{'name': 'Stream_ADD', 'type': 'function'} 2118176828577267326 102 regionprofile 0.034015 0.034015 0.034015 0.034015 0.000009 0.000009 0.000009 0.000009 Stream_ADD
5411856859942141273 102 regionprofile 0.033807 0.033807 0.033807 0.033807 0.000009 0.000009 0.000009 0.000009 Stream_ADD
7599706706110163426 102 regionprofile 0.033724 0.033724 0.033724 0.033724 0.000009 0.000009 0.000009 0.000009 Stream_ADD
{'name': 'Stream_ADD.block_128', 'type': 'function'} 2118176828577267326 103 regionprofile 0.034006 0.034006 0.034006 0.034006 0.034006 0.034006 0.034006 0.034006 Stream_ADD.block_128
5411856859942141273 103 regionprofile 0.033798 0.033798 0.033798 0.033798 0.033798 0.033798 0.033798 0.033798 Stream_ADD.block_128
7599706706110163426 103 regionprofile 0.033715 0.033715 0.033715 0.033715 0.033715 0.033715 0.033715 0.033715 Stream_ADD.block_128
{'name': 'Stream_COPY', 'type': 'function'} 2118176828577267326 104 regionprofile 0.043270 0.043270 0.043270 0.043270 0.000009 0.000009 0.000009 0.000009 Stream_COPY
5411856859942141273 104 regionprofile 0.043329 0.043329 0.043329 0.043329 0.000008 0.000008 0.000008 0.000008 Stream_COPY
7599706706110163426 104 regionprofile 0.042980 0.042980 0.042980 0.042980 0.000009 0.000009 0.000009 0.000009 Stream_COPY
{'name': 'Stream_COPY.block_128', 'type': 'function'} 2118176828577267326 105 regionprofile 0.043261 0.043261 0.043261 0.043261 0.043261 0.043261 0.043261 0.043261 Stream_COPY.block_128
5411856859942141273 105 regionprofile 0.043321 0.043321 0.043321 0.043321 0.043321 0.043321 0.043321 0.043321 Stream_COPY.block_128
7599706706110163426 105 regionprofile 0.042971 0.042971 0.042971 0.042971 0.042971 0.042971 0.042971 0.042971 Stream_COPY.block_128
{'name': 'Stream_DOT', 'type': 'function'} 2118176828577267326 106 regionprofile 0.113664 0.113664 0.113664 0.113664 0.000009 0.000009 0.000009 0.000009 Stream_DOT
5411856859942141273 106 regionprofile 0.114367 0.114367 0.114367 0.114367 0.000009 0.000009 0.000009 0.000009 Stream_DOT
7599706706110163426 106 regionprofile 0.110674 0.110674 0.110674 0.110674 0.000009 0.000009 0.000009 0.000009 Stream_DOT
{'name': 'Stream_DOT.block_128', 'type': 'function'} 2118176828577267326 107 regionprofile 0.113655 0.113655 0.113655 0.113655 0.113655 0.113655 0.113655 0.113655 Stream_DOT.block_128
5411856859942141273 107 regionprofile 0.114358 0.114358 0.114358 0.114358 0.114358 0.114358 0.114358 0.114358 Stream_DOT.block_128
7599706706110163426 107 regionprofile 0.110665 0.110665 0.110665 0.110665 0.110665 0.110665 0.110665 0.110665 Stream_DOT.block_128
{'name': 'Stream_MUL', 'type': 'function'} 2118176828577267326 108 regionprofile 0.043283 0.043283 0.043283 0.043283 0.000012 0.000012 0.000012 0.000012 Stream_MUL
5411856859942141273 108 regionprofile 0.043339 0.043339 0.043339 0.043339 0.000011 0.000011 0.000011 0.000011 Stream_MUL
7599706706110163426 108 regionprofile 0.042951 0.042951 0.042951 0.042951 0.000010 0.000010 0.000010 0.000010 Stream_MUL
{'name': 'Stream_MUL.block_128', 'type': 'function'} 2118176828577267326 109 regionprofile 0.043271 0.043271 0.043271 0.043271 0.043271 0.043271 0.043271 0.043271 Stream_MUL.block_128
5411856859942141273 109 regionprofile 0.043328 0.043328 0.043328 0.043328 0.043328 0.043328 0.043328 0.043328 Stream_MUL.block_128
7599706706110163426 109 regionprofile 0.042941 0.042941 0.042941 0.042941 0.042941 0.042941 0.042941 0.042941 Stream_MUL.block_128
{'name': 'Stream_TRIAD', 'type': 'function'} 2118176828577267326 110 regionprofile 0.033738 0.033738 0.033738 0.033738 0.000008 0.000008 0.000008 0.000008 Stream_TRIAD
5411856859942141273 110 regionprofile 0.033767 0.033767 0.033767 0.033767 0.000008 0.000008 0.000008 0.000008 Stream_TRIAD
7599706706110163426 110 regionprofile 0.033671 0.033671 0.033671 0.033671 0.000009 0.000009 0.000009 0.000009 Stream_TRIAD
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} 2118176828577267326 111 regionprofile 0.033730 0.033730 0.033730 0.033730 0.033730 0.033730 0.033730 0.033730 Stream_TRIAD.block_128
5411856859942141273 111 regionprofile 0.033759 0.033759 0.033759 0.033759 0.033759 0.033759 0.033759 0.033759 Stream_TRIAD.block_128
7599706706110163426 111 regionprofile 0.033662 0.033662 0.033662 0.033662 0.033662 0.033662 0.033662 0.033662 Stream_TRIAD.block_128

The metadata table stores HPC simulation information such as an application’s build settings and execution context. Each row corresponds to a single execution of the application and is identified by a unique profile index, the same profile value that labels the rows of the performance data table.
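To make the "one row per execution, keyed by profile" layout concrete, here is a minimal sketch in plain pandas. It does not call Thicket. The three profile IDs are copied from the output in this notebook, while the column names `cluster` and `problem_size` and their values are illustrative stand-ins based on the tutorial's description of the runs, not the actual metadata schema.

```python
import pandas as pd

# Toy stand-in for a metadata table: one row per profile (one run of the application).
# Profile IDs come from the notebook output; the columns are assumed for illustration.
metadata = pd.DataFrame(
    {
        "cluster": ["lassen", "lassen", "lassen"],
        "problem_size": [1048576, 1048576, 1048576],
    },
    index=pd.Index(
        [2118176828577267326, 5411856859942141273, 7599706706110163426],
        name="profile",
    ),
)

# A unique index means a profile ID selects exactly one run's metadata.
one_run = metadata.loc[2118176828577267326]

print(metadata.index.is_unique)
print(one_run["cluster"])
```

Because the index is unique, looking up a profile ID returns a single row (a pandas Series) rather than a sub-table, which is what lets the profile value act as a key between the metadata and the performance data.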

View the metadata table:

[5]:
display(HTML(th_lassen.metadata.to_html()))
spot.channels spot.options spot.format.version spot.timeseries.metrics spot.metrics cali.caliper.version variant Algorithm_MEMCPY Algorithm_MEMSET Algorithm_REDUCE_SUM Algorithm_SORTPAIRS Algorithm_SORT Algorithm_SCAN Apps_VOL3D Apps_PRESSURE Apps_NODAL_ACCUMULATION_3D Apps_MASS3DPA Apps_LTIMES_NOVIEW Apps_LTIMES Apps_HALOEXCHANGE_FUSED Apps_HALOEXCHANGE Apps_FIR Apps_ENERGY Apps_DIFFUSION3DPA Apps_DEL_DOT_VEC_2D Apps_CONVECTION3DPA Stream_TRIAD Stream_MUL Stream_DOT Stream_COPY Stream_ADD Polybench_MVT Polybench_JACOBI_2D Polybench_JACOBI_1D Polybench_HEAT_3D Polybench_GESUMMV Polybench_GEMVER Polybench_GEMM Polybench_FLOYD_WARSHALL Polybench_FDTD_2D Polybench_ATAX Polybench_ADI Polybench_3MM Polybench_2MM Lcals_TRIDIAG_ELIM Lcals_PLANCKIAN Lcals_INT_PREDICT Lcals_HYDRO_2D Lcals_HYDRO_1D Lcals_GEN_LIN_RECUR Lcals_FIRST_SUM Lcals_FIRST_MIN Lcals_FIRST_DIFF Lcals_EOS Lcals_DIFF_PREDICT Basic_TRAP_INT Basic_REDUCE_STRUCT Basic_REDUCE3_INT Basic_PI_REDUCE Basic_PI_ATOMIC Basic_NESTED_INIT Basic_MULADDSUB Basic_MAT_MAT_SHARED Basic_INIT_VIEW1D_OFFSET Basic_INIT_VIEW1D Basic_INIT3 Basic_INDEXLIST_3LOOP Basic_INDEXLIST Basic_IF_QUAD Basic_DAXPY_ATOMIC Basic_DAXPY SizeMeaning Tuning systype_build gpu_targets_block_sizes cuda_compiler_version Compiler_path_version compiler compiler_version rajaperf_compiler_options rajaperf_compiler cmake_exe_linker_flags cmake_cxx_flags cmake_build_type raja_version perfsuite_version cluster cmdline libraries launchdate user cali.channel
profile
2118176828577267326 regionprofile 2 min#inclusive#sum#time.duration,max#inclusive#sum#time.duration,avg#inclusive#sum#time.duration,sum#inclusive#sum#time.duration,min#sum#sum#time.duration,max#sum#sum#time.duration,avg#sum#sum#time.duration,sum#sum#sum#time.duration 2.9.0-dev Base_CUDA {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554432,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} {'Problem size': 1135363,'Reps':100,'Iterations/rep': 1135363,'Kernels/rep': 1,'Bytes/rep': 36598760,'FLOPS/rep': 81746136} {'Problem size': 1048576,'Reps':700,'Iterations/rep': 2097152,'Kernels/rep': 2,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} {'Problem size': 1030301,'Reps':100,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 33464144,'FLOPS/rep': 9272709} {'Problem size': 1048500,'Reps':50,'Iterations/rep': 1048500,'Kernels/rep': 1,'Bytes/rep': 16977632,'FLOPS/rep': 42518772} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 2,'Bytes/rep': 7491120,'FLOPS/rep': 0} {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 156,'Bytes/rep': 7491120,'FLOPS/rep': 0} {'Problem size': 
1048576,'Reps':160,'Iterations/rep': 1048560,'Kernels/rep': 1,'Bytes/rep': 16777088,'FLOPS/rep': 33553920} {'Problem size': 1048576,'Reps':130,'Iterations/rep': 6291456,'Kernels/rep': 6,'Bytes/rep': 226492416,'FLOPS/rep': 57671680} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 57409728,'FLOPS/rep': 115752960} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 50397216,'FLOPS/rep': 56623104} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 32244000,'FLOPS/rep': 60342272} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777232,'FLOPS/rep': 2097152} {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 1048576} {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16859200,'FLOPS/rep': 4202500} {'Problem size': 1046529,'Reps':50,'Iterations/rep': 83722320,'Kernels/rep': 2,'Bytes/rep': 1342176000,'FLOPS/rep': 418611600} {'Problem size': 1048574,'Reps':100,'Iterations/rep': 33554368,'Kernels/rep': 32,'Bytes/rep': 536870400,'FLOPS/rep': 100663104} {'Problem size': 1000000,'Reps':20,'Iterations/rep': 40000000,'Kernels/rep': 40,'Bytes/rep': 659200000,'FLOPS/rep': 600000000} {'Problem size': 1050625,'Reps':120,'Iterations/rep': 1025,'Kernels/rep': 1,'Bytes/rep': 16834600,'FLOPS/rep': 4205575} {'Problem size': 1050625,'Reps':20,'Iterations/rep': 3152900,'Kernels/rep': 4,'Bytes/rep': 33726600,'FLOPS/rep': 10507275} {'Problem size': 
1050625,'Reps':4,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 28085000,'FLOPS/rep': 3783300625} {'Problem size': 1050625,'Reps':8,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 16810000,'FLOPS/rep': 1076890625} {'Problem size': 1049600,'Reps':8,'Iterations/rep': 125952040,'Kernels/rep': 160,'Bytes/rep': 3359048960,'FLOPS/rep': 461619200} {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16851000,'FLOPS/rep': 4202500} {'Problem size': 1046529,'Reps':4,'Iterations/rep': 8184,'Kernels/rep': 8,'Bytes/rep': 402652800,'FLOPS/rep': 142327944} {'Problem size': 1050625,'Reps':2,'Iterations/rep': 3151875,'Kernels/rep': 3,'Bytes/rep': 78269000,'FLOPS/rep': 6797543750} {'Problem size': 1050625,'Reps':2,'Iterations/rep': 2101250,'Kernels/rep': 2,'Bytes/rep': 51988000,'FLOPS/rep': 5683881250} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554400,'FLOPS/rep': 2097150} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 4194304} {'Problem size': 1048576,'Reps':400,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 92274688,'FLOPS/rep': 17825792} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145728,'Kernels/rep': 3,'Bytes/rep': 150667584,'FLOPS/rep': 45957296} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165832,'FLOPS/rep': 5242880} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 2,'Bytes/rep': 83886080,'FLOPS/rep': 6291456} {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777208,'FLOPS/rep': 1048575} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388640,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777224,'FLOPS/rep': 1048576} {'Problem size': 
1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554488,'FLOPS/rep': 16777216} {'Problem size': 1048576,'Reps':200,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 167772160,'FLOPS/rep': 9437184} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 10485760} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777264,'FLOPS/rep': 2097154} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 4194328,'FLOPS/rep': 1048577} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} {'Problem size': 1030301,'Reps':1000,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 8242408,'FLOPS/rep': 3090903} {'Problem size': 1048576,'Reps':350,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} {'Problem size': 1048576,'Reps':5,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 2147483648} {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145729,'Kernels/rep': 3,'Bytes/rep': 27263004,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 10485776,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':180,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 11534336} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 
1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} Direct default blueos_3_ppc64le_ib_p9 128 11.2.152 xl-2022.08.19 xlc++_r-16.1.1.12 16.1.1.12 /usr/tce/packages/cuda/cuda-11.2.0/bin/nvcc;/usr/tce/packages/xl/xl-2022.08.19/bin/xlc++_r -qthreaded Release 2022.03.0 2022.3.0 lassen [./my-xl-block-128-build/bin/raja-perf.exe, --variants, Base_CUDA, --size, 1048576] [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/workspace/asde/caliper-lassen/lib64/libcaliper.so.2, /usr/workspace/wsb/asde/adiak-lassen/lib/libadiak.so, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpiprofilesupport.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpi_ibm.so.3, /usr/tce/packages/xl/xl-2022.08.19/lib/libibmc++.so.1, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libstdc++.so.6, /lib64/libm.so.6, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libgcc_s.so.1, /lib64/libpthread.so.0, /lib64/libc.so.6, /lib64/ld64.so.2, /usr/tce/packages/cuda/cuda-10.1.243/nvidia/lib64/libnvToolsExt.so.1, /usr/tce/packages/cuda/cuda-10.1.243/bin/../extras/CUPTI/lib64/libcupti.so.10.1, /lib64/libpapi.so.5.2.0.0, /lib64/libdw.so.1, /lib64/libunwind.so.8, /lib64/libutil.so.1, /lib64/libz.so.1, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libhwloc_ompi.so.15, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent_pthreads-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-rte.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-pal.so.3, /lib64/libpfm.so.4, /lib64/libsensors.so.4, /lib64/libelf.so.1, /lib64/liblzma.so.5, /lib64/libbz2.so.1, /lib64/libnss_files.so.2] 1668559508 burgess9 spot
5411856859942141273 regionprofile 2 min#inclusive#sum#time.duration,max#inclusive#sum#time.duration,avg#inclusive#sum#time.duration,sum#inclusive#sum#time.duration,min#sum#sum#time.duration,max#sum#sum#time.duration,avg#sum#sum#time.duration,sum#sum#sum#time.duration 2.9.0-dev Base_CUDA {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554432,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} {'Problem size': 1135363,'Reps':100,'Iterations/rep': 1135363,'Kernels/rep': 1,'Bytes/rep': 36598760,'FLOPS/rep': 81746136} {'Problem size': 1048576,'Reps':700,'Iterations/rep': 2097152,'Kernels/rep': 2,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} {'Problem size': 1030301,'Reps':100,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 33464144,'FLOPS/rep': 9272709} {'Problem size': 1048500,'Reps':50,'Iterations/rep': 1048500,'Kernels/rep': 1,'Bytes/rep': 16977632,'FLOPS/rep': 42518772} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 2,'Bytes/rep': 7491120,'FLOPS/rep': 0} {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 156,'Bytes/rep': 7491120,'FLOPS/rep': 0} {'Problem size': 
1048576,'Reps':160,'Iterations/rep': 1048560,'Kernels/rep': 1,'Bytes/rep': 16777088,'FLOPS/rep': 33553920} {'Problem size': 1048576,'Reps':130,'Iterations/rep': 6291456,'Kernels/rep': 6,'Bytes/rep': 226492416,'FLOPS/rep': 57671680} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 57409728,'FLOPS/rep': 115752960} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 50397216,'FLOPS/rep': 56623104} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 32244000,'FLOPS/rep': 60342272} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777232,'FLOPS/rep': 2097152} {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 1048576} {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16859200,'FLOPS/rep': 4202500} {'Problem size': 1046529,'Reps':50,'Iterations/rep': 83722320,'Kernels/rep': 2,'Bytes/rep': 1342176000,'FLOPS/rep': 418611600} {'Problem size': 1048574,'Reps':100,'Iterations/rep': 33554368,'Kernels/rep': 32,'Bytes/rep': 536870400,'FLOPS/rep': 100663104} {'Problem size': 1000000,'Reps':20,'Iterations/rep': 40000000,'Kernels/rep': 40,'Bytes/rep': 659200000,'FLOPS/rep': 600000000} {'Problem size': 1050625,'Reps':120,'Iterations/rep': 1025,'Kernels/rep': 1,'Bytes/rep': 16834600,'FLOPS/rep': 4205575} {'Problem size': 1050625,'Reps':20,'Iterations/rep': 3152900,'Kernels/rep': 4,'Bytes/rep': 33726600,'FLOPS/rep': 10507275} {'Problem size': 
1050625,'Reps':4,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 28085000,'FLOPS/rep': 3783300625} {'Problem size': 1050625,'Reps':8,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 16810000,'FLOPS/rep': 1076890625} {'Problem size': 1049600,'Reps':8,'Iterations/rep': 125952040,'Kernels/rep': 160,'Bytes/rep': 3359048960,'FLOPS/rep': 461619200} {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16851000,'FLOPS/rep': 4202500} {'Problem size': 1046529,'Reps':4,'Iterations/rep': 8184,'Kernels/rep': 8,'Bytes/rep': 402652800,'FLOPS/rep': 142327944} {'Problem size': 1050625,'Reps':2,'Iterations/rep': 3151875,'Kernels/rep': 3,'Bytes/rep': 78269000,'FLOPS/rep': 6797543750} {'Problem size': 1050625,'Reps':2,'Iterations/rep': 2101250,'Kernels/rep': 2,'Bytes/rep': 51988000,'FLOPS/rep': 5683881250} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554400,'FLOPS/rep': 2097150} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 4194304} {'Problem size': 1048576,'Reps':400,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 92274688,'FLOPS/rep': 17825792} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145728,'Kernels/rep': 3,'Bytes/rep': 150667584,'FLOPS/rep': 45957296} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165832,'FLOPS/rep': 5242880} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 2,'Bytes/rep': 83886080,'FLOPS/rep': 6291456} {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777208,'FLOPS/rep': 1048575} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388640,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777224,'FLOPS/rep': 1048576} {'Problem size': 
1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554488,'FLOPS/rep': 16777216} {'Problem size': 1048576,'Reps':200,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 167772160,'FLOPS/rep': 9437184} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 10485760} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777264,'FLOPS/rep': 2097154} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 4194328,'FLOPS/rep': 1048577} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} {'Problem size': 1030301,'Reps':1000,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 8242408,'FLOPS/rep': 3090903} {'Problem size': 1048576,'Reps':350,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} {'Problem size': 1048576,'Reps':5,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 2147483648} {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145729,'Kernels/rep': 3,'Bytes/rep': 27263004,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 10485776,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':180,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 11534336} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 
1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} Direct default blueos_3_ppc64le_ib_p9 128 11.2.152 xl-2022.08.19 xlc++_r-16.1.1.12 16.1.1.12 /usr/tce/packages/cuda/cuda-11.2.0/bin/nvcc;/usr/tce/packages/xl/xl-2022.08.19/bin/xlc++_r -qthreaded Release 2022.03.0 2022.3.0 lassen [./my-xl-block-128-build/bin/raja-perf.exe, --variants, Base_CUDA, --size, 1048576] [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/workspace/asde/caliper-lassen/lib64/libcaliper.so.2, /usr/workspace/wsb/asde/adiak-lassen/lib/libadiak.so, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpiprofilesupport.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpi_ibm.so.3, /usr/tce/packages/xl/xl-2022.08.19/lib/libibmc++.so.1, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libstdc++.so.6, /lib64/libm.so.6, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libgcc_s.so.1, /lib64/libpthread.so.0, /lib64/libc.so.6, /lib64/ld64.so.2, /usr/tce/packages/cuda/cuda-10.1.243/nvidia/lib64/libnvToolsExt.so.1, /usr/tce/packages/cuda/cuda-10.1.243/bin/../extras/CUPTI/lib64/libcupti.so.10.1, /lib64/libpapi.so.5.2.0.0, /lib64/libdw.so.1, /lib64/libunwind.so.8, /lib64/libutil.so.1, /lib64/libz.so.1, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libhwloc_ompi.so.15, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent_pthreads-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-rte.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-pal.so.3, /lib64/libpfm.so.4, /lib64/libsensors.so.4, /lib64/libelf.so.1, /lib64/liblzma.so.5, /lib64/libbz2.so.1, /lib64/libnss_files.so.2] 1668559527 burgess9 spot
7599706706110163426 regionprofile 2 min#inclusive#sum#time.duration,max#inclusive#sum#time.duration,avg#inclusive#sum#time.duration,sum#inclusive#sum#time.duration,min#sum#sum#time.duration,max#sum#sum#time.duration,avg#sum#sum#time.duration,sum#sum#sum#time.duration 2.9.0-dev Base_CUDA {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388616,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554432,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':20,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} {'Problem size': 1135363,'Reps':100,'Iterations/rep': 1135363,'Kernels/rep': 1,'Bytes/rep': 36598760,'FLOPS/rep': 81746136} {'Problem size': 1048576,'Reps':700,'Iterations/rep': 2097152,'Kernels/rep': 2,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} {'Problem size': 1030301,'Reps':100,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 33464144,'FLOPS/rep': 9272709} {'Problem size': 1048500,'Reps':50,'Iterations/rep': 1048500,'Kernels/rep': 1,'Bytes/rep': 16977632,'FLOPS/rep': 42518772} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 14955008,'FLOPS/rep': 52428800} {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 2,'Bytes/rep': 7491120,'FLOPS/rep': 0} {'Problem size': 1030301,'Reps':50,'Iterations/rep': 187278,'Kernels/rep': 156,'Bytes/rep': 7491120,'FLOPS/rep': 0} {'Problem size': 
1048576,'Reps':160,'Iterations/rep': 1048560,'Kernels/rep': 1,'Bytes/rep': 16777088,'FLOPS/rep': 33553920} {'Problem size': 1048576,'Reps':130,'Iterations/rep': 6291456,'Kernels/rep': 6,'Bytes/rep': 226492416,'FLOPS/rep': 57671680} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 57409728,'FLOPS/rep': 115752960} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 50397216,'FLOPS/rep': 56623104} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 32244000,'FLOPS/rep': 60342272} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777232,'FLOPS/rep': 2097152} {'Problem size': 1048576,'Reps':1800,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 1048576} {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16859200,'FLOPS/rep': 4202500} {'Problem size': 1046529,'Reps':50,'Iterations/rep': 83722320,'Kernels/rep': 2,'Bytes/rep': 1342176000,'FLOPS/rep': 418611600} {'Problem size': 1048574,'Reps':100,'Iterations/rep': 33554368,'Kernels/rep': 32,'Bytes/rep': 536870400,'FLOPS/rep': 100663104} {'Problem size': 1000000,'Reps':20,'Iterations/rep': 40000000,'Kernels/rep': 40,'Bytes/rep': 659200000,'FLOPS/rep': 600000000} {'Problem size': 1050625,'Reps':120,'Iterations/rep': 1025,'Kernels/rep': 1,'Bytes/rep': 16834600,'FLOPS/rep': 4205575} {'Problem size': 1050625,'Reps':20,'Iterations/rep': 3152900,'Kernels/rep': 4,'Bytes/rep': 33726600,'FLOPS/rep': 10507275} {'Problem size': 
1050625,'Reps':4,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 28085000,'FLOPS/rep': 3783300625} {'Problem size': 1050625,'Reps':8,'Iterations/rep': 1050625,'Kernels/rep': 1,'Bytes/rep': 16810000,'FLOPS/rep': 1076890625} {'Problem size': 1049600,'Reps':8,'Iterations/rep': 125952040,'Kernels/rep': 160,'Bytes/rep': 3359048960,'FLOPS/rep': 461619200} {'Problem size': 1050625,'Reps':100,'Iterations/rep': 2050,'Kernels/rep': 2,'Bytes/rep': 16851000,'FLOPS/rep': 4202500} {'Problem size': 1046529,'Reps':4,'Iterations/rep': 8184,'Kernels/rep': 8,'Bytes/rep': 402652800,'FLOPS/rep': 142327944} {'Problem size': 1050625,'Reps':2,'Iterations/rep': 3151875,'Kernels/rep': 3,'Bytes/rep': 78269000,'FLOPS/rep': 6797543750} {'Problem size': 1050625,'Reps':2,'Iterations/rep': 2101250,'Kernels/rep': 2,'Bytes/rep': 51988000,'FLOPS/rep': 5683881250} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554400,'FLOPS/rep': 2097150} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 4194304} {'Problem size': 1048576,'Reps':400,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 92274688,'FLOPS/rep': 17825792} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145728,'Kernels/rep': 3,'Bytes/rep': 150667584,'FLOPS/rep': 45957296} {'Problem size': 1048576,'Reps':1000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165832,'FLOPS/rep': 5242880} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 2,'Bytes/rep': 83886080,'FLOPS/rep': 6291456} {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777208,'FLOPS/rep': 1048575} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388640,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':2000,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777224,'FLOPS/rep': 1048576} {'Problem size': 
1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 33554488,'FLOPS/rep': 16777216} {'Problem size': 1048576,'Reps':200,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 167772160,'FLOPS/rep': 9437184} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 10485760} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777264,'FLOPS/rep': 2097154} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 4194328,'FLOPS/rep': 1048577} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} {'Problem size': 1048576,'Reps':50,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16,'FLOPS/rep': 6291457} {'Problem size': 1030301,'Reps':1000,'Iterations/rep': 1030301,'Kernels/rep': 1,'Bytes/rep': 8242408,'FLOPS/rep': 3090903} {'Problem size': 1048576,'Reps':350,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 3145728} {'Problem size': 1048576,'Reps':5,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 16777216,'FLOPS/rep': 2147483648} {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':2500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 8388608,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 1048576} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 3145729,'Kernels/rep': 3,'Bytes/rep': 27263004,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':100,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 10485776,'FLOPS/rep': 0} {'Problem size': 1048576,'Reps':180,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 41943040,'FLOPS/rep': 11534336} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 
1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} {'Problem size': 1048576,'Reps':500,'Iterations/rep': 1048576,'Kernels/rep': 1,'Bytes/rep': 25165824,'FLOPS/rep': 2097152} Direct default blueos_3_ppc64le_ib_p9 128 11.2.152 xl-2022.08.19 xlc++_r-16.1.1.12 16.1.1.12 /usr/tce/packages/cuda/cuda-11.2.0/bin/nvcc;/usr/tce/packages/xl/xl-2022.08.19/bin/xlc++_r -qthreaded Release 2022.03.0 2022.3.0 lassen [./my-xl-block-128-build/bin/raja-perf.exe, --variants, Base_CUDA, --size, 1048576] [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/workspace/asde/caliper-lassen/lib64/libcaliper.so.2, /usr/workspace/wsb/asde/adiak-lassen/lib/libadiak.so, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpiprofilesupport.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libmpi_ibm.so.3, /usr/tce/packages/xl/xl-2022.08.19/lib/libibmc++.so.1, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libstdc++.so.6, /lib64/libm.so.6, /usr/tce/packages/gcc/gcc-4.9.3/gnu/lib64/libgcc_s.so.1, /lib64/libpthread.so.0, /lib64/libc.so.6, /lib64/ld64.so.2, /usr/tce/packages/cuda/cuda-10.1.243/nvidia/lib64/libnvToolsExt.so.1, /usr/tce/packages/cuda/cuda-10.1.243/bin/../extras/CUPTI/lib64/libcupti.so.10.1, /lib64/libpapi.so.5.2.0.0, /lib64/libdw.so.1, /lib64/libunwind.so.8, /lib64/libutil.so.1, /lib64/libz.so.1, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libhwloc_ompi.so.15, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libevent_pthreads-2.1.so.6, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-rte.so.3, /usr/tce/packages/spectrum-mpi/ibm/spectrum-mpi-rolling-release/lib/libopen-pal.so.3, /lib64/libpfm.so.4, /lib64/libsensors.so.4, /lib64/libelf.so.1, /lib64/liblzma.so.5, /lib64/libbz2.so.1, /lib64/libnss_files.so.2] 1668559517 burgess9 spot

Composing multiple Thickets:

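Thicket’s columnar_join function composes several thickets side by side: each input thicket’s metrics are grouped under its own top-level column header, and rows are indexed by call tree node and by the metadata column passed as column_name. In this example, we compose profiles from two problem sizes (1M and 4M) and four block sizes (128, 256, 512, and 1024).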
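
As a rough illustration of what a horizontal, hierarchical composition looks like, the sketch below uses plain pandas rather than thicket. It is not how columnar_join is implemented; the two toy tables and the use of pd.concat are assumptions made for illustration, and the timing values are copied from the RAJAPerf rows of the joined DataFrame shown later in this section.

```python
import pandas as pd

# Toy stand-ins (not thicket objects) for two per-block-size tables,
# indexed by (node, problem size) like the joined DataFrame in this section.
index = pd.MultiIndex.from_product(
    [["RAJAPerf"], [1048576, 4194304]],
    names=["node", "ProblemSizeRunParam"],
)
block_128 = pd.DataFrame({"Avg time/rank": [1.772533, 6.716152]}, index=index)
block_256 = pd.DataFrame({"Avg time/rank": [1.763211, 6.727046]}, index=index)

# Concatenate side by side; `keys` becomes the top level of the column index,
# so every metric is addressed as (header, metric name).
joined = pd.concat(
    [block_128, block_256],
    axis=1,
    keys=["Block 128", "Block 256"],
)
print(joined)
```

Each metric in the result is addressed by a pair such as ("Block 256", "Avg time/rank"), which is the same two-level column layout that the joined thicket DataFrame displays.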

[6]:
problem_sizes = ["1M", "4M"]
data = {
    "block_128": [f"../data/lassen/new-cali/Base_CUDA-block_128-{i}.cali" for i in problem_sizes],
    "block_256": [f"../data/lassen/new-cali/Base_CUDA-block_256-{i}.cali" for i in problem_sizes],
    "block_512": [f"../data/lassen/new-cali/Base_CUDA-block_512-{i}.cali" for i in problem_sizes],
    "block_1024": [f"../data/lassen/new-cali/Base_CUDA-block_1024-{i}.cali" for i in problem_sizes],
}
[7]:
block_128 = tt.Thicket.from_caliperreader(data["block_128"])
block_256 = tt.Thicket.from_caliperreader(data["block_256"])
block_512 = tt.Thicket.from_caliperreader(data["block_512"])
block_1024 = tt.Thicket.from_caliperreader(data["block_1024"])
[8]:
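# Join the four thickets side by side.
th_cj = tt.Thicket.columnar_join(
    thicket_list=[block_128, block_256, block_512, block_1024],
    # One top-level column header per input thicket, in the same order.
    header_list=["Block 128", "Block 256", "Block 512", "Block 1024"],
    # Metadata column used as the second index level of the joined DataFrame.
    column_name="ProblemSizeRunParam"
)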
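
Once the metrics sit under per-block headers, a common next step is to compare one metric across all headers. The sketch below shows that pattern with plain pandas on a toy table; the table is a hypothetical stand-in for th_cj.dataframe (the real index holds node objects rather than strings), and its values are the RAJAPerf "Avg time/rank" entries from the output in this section.

```python
import pandas as pd

# Toy stand-in for the joined DataFrame: rows are (node, problem size),
# columns are (block-size header, metric name).
index = pd.MultiIndex.from_product(
    [["RAJAPerf"], [1048576, 4194304]],
    names=["node", "ProblemSizeRunParam"],
)
columns = pd.MultiIndex.from_product(
    [["Block 128", "Block 256", "Block 512", "Block 1024"], ["Avg time/rank"]]
)
toy = pd.DataFrame(
    [
        [1.772533, 1.763211, 1.836441, 2.159525],
        [6.716152, 6.727046, 6.807905, 7.424382],
    ],
    index=index,
    columns=columns,
)

# Take one metric across every header (a cross-section on column level 1),
# leaving one column per block size.
avg_time = toy.xs("Avg time/rank", axis=1, level=1)

# For each row, report the header with the smallest average time.
fastest = avg_time.idxmin(axis=1)
print(avg_time)
print(fastest)
```

On these values the smallest average time falls under Block 256 for the 1048576 problem size and under Block 128 for 4194304.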
[9]:
print(th_cj.tree())
    __          __       __         __
   / /_  ____ _/ /______/ /_  ___  / /_
  / __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
 / / / / /_/ / /_/ /__/ / / /  __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/  v2023.1.0

-1.000 RAJAPerf
├─ -1.000 Algorithm
│  ├─ -1.000 Algorithm_MEMCPY
│  ├─ -1.000 Algorithm_MEMSET
│  └─ -1.000 Algorithm_REDUCE_SUM
├─ -1.000 Apps
│  ├─ -1.000 Apps_DEL_DOT_VEC_2D
│  ├─ -1.000 Apps_ENERGY
│  ├─ -1.000 Apps_FIR
│  ├─ -1.000 Apps_HALOEXCHANGE
│  ├─ -1.000 Apps_HALOEXCHANGE_FUSED
│  ├─ -1.000 Apps_LTIMES
│  ├─ -1.000 Apps_LTIMES_NOVIEW
│  ├─ -1.000 Apps_NODAL_ACCUMULATION_3D
│  ├─ -1.000 Apps_PRESSURE
│  └─ -1.000 Apps_VOL3D
├─ -1.000 Basic
│  ├─ -1.000 Basic_DAXPY
│  ├─ -1.000 Basic_DAXPY_ATOMIC
│  ├─ -1.000 Basic_IF_QUAD
│  ├─ -1.000 Basic_INDEXLIST
│  ├─ -1.000 Basic_INDEXLIST_3LOOP
│  ├─ -1.000 Basic_INIT3
│  ├─ -1.000 Basic_INIT_VIEW1D
│  ├─ -1.000 Basic_INIT_VIEW1D_OFFSET
│  ├─ -1.000 Basic_MAT_MAT_SHARED
│  ├─ -1.000 Basic_MULADDSUB
│  ├─ -1.000 Basic_NESTED_INIT
│  ├─ -1.000 Basic_PI_ATOMIC
│  ├─ -1.000 Basic_PI_REDUCE
│  ├─ -1.000 Basic_REDUCE3_INT
│  ├─ -1.000 Basic_REDUCE_STRUCT
│  └─ -1.000 Basic_TRAP_INT
├─ -1.000 Lcals
│  ├─ -1.000 Lcals_DIFF_PREDICT
│  ├─ -1.000 Lcals_EOS
│  ├─ -1.000 Lcals_FIRST_DIFF
│  ├─ -1.000 Lcals_FIRST_MIN
│  ├─ -1.000 Lcals_FIRST_SUM
│  ├─ -1.000 Lcals_GEN_LIN_RECUR
│  ├─ -1.000 Lcals_HYDRO_1D
│  ├─ -1.000 Lcals_HYDRO_2D
│  ├─ -1.000 Lcals_INT_PREDICT
│  ├─ -1.000 Lcals_PLANCKIAN
│  └─ -1.000 Lcals_TRIDIAG_ELIM
├─ -1.000 Polybench
│  ├─ -1.000 Polybench_2MM
│  ├─ -1.000 Polybench_3MM
│  ├─ -1.000 Polybench_ADI
│  ├─ -1.000 Polybench_ATAX
│  ├─ -1.000 Polybench_FDTD_2D
│  ├─ -1.000 Polybench_FLOYD_WARSHALL
│  ├─ -1.000 Polybench_GEMM
│  ├─ -1.000 Polybench_GEMVER
│  ├─ -1.000 Polybench_GESUMMV
│  ├─ -1.000 Polybench_HEAT_3D
│  ├─ -1.000 Polybench_JACOBI_1D
│  ├─ -1.000 Polybench_JACOBI_2D
│  └─ -1.000 Polybench_MVT
└─ -1.000 Stream
   ├─ -1.000 Stream_ADD
   ├─ -1.000 Stream_COPY
   ├─ -1.000 Stream_DOT
   ├─ -1.000 Stream_MUL
   └─ -1.000 Stream_TRIAD

Legend (Metric: thicket_tree Min: -1.00 Max: -1.00)
-1.00 - -1.00
-1.00 - -1.00
-1.00 - -1.00
-1.00 - -1.00
-1.00 - -1.00
-1.00 - -1.00

name User code     Only in left graph     Only in right graph

[10]:
display(HTML(th_cj.dataframe.to_html()))
Block 128 Block 256 Block 512 Block 1024 name
nid spot.channel Min time/rank Max time/rank Avg time/rank Total time Bytes/Rep Flops/Rep Iterations/Rep Kernels/Rep ProblemSize Reps nid spot.channel Min time/rank Max time/rank Avg time/rank Total time Bytes/Rep Flops/Rep Iterations/Rep Kernels/Rep ProblemSize Reps nid spot.channel Min time/rank Max time/rank Avg time/rank Total time Bytes/Rep Flops/Rep Iterations/Rep Kernels/Rep ProblemSize Reps nid spot.channel Min time/rank Max time/rank Avg time/rank Total time Bytes/Rep Flops/Rep Iterations/Rep Kernels/Rep ProblemSize Reps
node ProblemSizeRunParam
{'name': 'RAJAPerf', 'type': 'function'} 1048576.0 0.0 regionprofile 1.772533 1.772533 1.772533 1.772533 3.359049e+09 6.797544e+09 125952040.0 160.0 1135363.0 2500.0 0 regionprofile 1.763211 1.763211 1.763211 1.763211 3.359049e+09 6.797544e+09 125952040.0 160.0 1135363.0 2500.0 0.0 regionprofile 1.836441 1.836441 1.836441 1.836441 3.359049e+09 6.797544e+09 125952040.0 160.0 1135363.0 2500.0 0.0 regionprofile 2.159525 2.159525 2.159525 2.159525 3.359049e+09 6.797544e+09 125952040.0 160.0 1135363.0 2500.0 RAJAPerf
4194304.0 0.0 regionprofile 6.716152 6.716152 6.716152 6.716152 1.342898e+10 3.576198e+10 503562280.0 160.0 4437343.0 2500.0 0 regionprofile 6.727046 6.727046 6.727046 6.727046 1.342898e+10 3.576198e+10 503562280.0 160.0 4437343.0 2500.0 0.0 regionprofile 6.807905 6.807905 6.807905 6.807905 1.342898e+10 3.576198e+10 503562280.0 160.0 4437343.0 2500.0 0.0 regionprofile 7.424382 7.424382 7.424382 7.424382 1.342898e+10 3.576198e+10 503562280.0 160.0 4437343.0 2500.0 RAJAPerf
{'name': 'Algorithm', 'type': 'function'} 1048576.0 58.0 regionprofile 0.006471 0.006471 0.006471 0.006471 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 100.0 61 regionprofile 0.005986 0.005986 0.005986 0.005986 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 100.0 58.0 regionprofile 0.006234 0.006234 0.006234 0.006234 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 100.0 59.0 regionprofile 0.006306 0.006306 0.006306 0.006306 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 100.0 Algorithm
4194304.0 58.0 regionprofile 0.018733 0.018733 0.018733 0.018733 6.710886e+07 4.194304e+06 4194304.0 1.0 4194304.0 100.0 61 regionprofile 0.016742 0.016742 0.016742 0.016742 6.710886e+07 4.194304e+06 4194304.0 1.0 4194304.0 100.0 58.0 regionprofile 0.017144 0.017144 0.017144 0.017144 6.710886e+07 4.194304e+06 4194304.0 1.0 4194304.0 100.0 59.0 regionprofile 0.017673 0.017673 0.017673 0.017673 6.710886e+07 4.194304e+06 4194304.0 1.0 4194304.0 100.0 Algorithm
{'name': 'Algorithm_MEMCPY', 'type': 'function'} 1048576.0 61.0 regionprofile 0.002441 0.002441 0.002441 0.002441 1.677722e+07 0.000000e+00 1048576.0 1.0 1048576.0 100.0 64 regionprofile 0.002444 0.002444 0.002444 0.002444 1.677722e+07 0.000000e+00 1048576.0 1.0 1048576.0 100.0 61.0 regionprofile 0.002502 0.002502 0.002502 0.002502 1.677722e+07 0.000000e+00 1048576.0 1.0 1048576.0 100.0 62.0 regionprofile 0.002497 0.002497 0.002497 0.002497 1.677722e+07 0.000000e+00 1048576.0 1.0 1048576.0 100.0 Algorithm_MEMCPY
4194304.0 61.0 regionprofile 0.008628 0.008628 0.008628 0.008628 6.710886e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 64 regionprofile 0.008647 0.008647 0.008647 0.008647 6.710886e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 61.0 regionprofile 0.008661 0.008661 0.008661 0.008661 6.710886e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 62.0 regionprofile 0.008700 0.008700 0.008700 0.008700 6.710886e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 Algorithm_MEMCPY
{'name': 'Algorithm_MEMSET', 'type': 'function'} 1048576.0 60.0 regionprofile 0.001485 0.001485 0.001485 0.001485 8.388616e+06 0.000000e+00 1048576.0 1.0 1048576.0 100.0 63 regionprofile 0.001248 0.001248 0.001248 0.001248 8.388616e+06 0.000000e+00 1048576.0 1.0 1048576.0 100.0 60.0 regionprofile 0.001248 0.001248 0.001248 0.001248 8.388616e+06 0.000000e+00 1048576.0 1.0 1048576.0 100.0 61.0 regionprofile 0.001248 0.001248 0.001248 0.001248 8.388616e+06 0.000000e+00 1048576.0 1.0 1048576.0 100.0 Algorithm_MEMSET
4194304.0 60.0 regionprofile 0.004979 0.004979 0.004979 0.004979 3.355444e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 63 regionprofile 0.004082 0.004082 0.004082 0.004082 3.355444e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 60.0 regionprofile 0.004070 0.004070 0.004070 0.004070 3.355444e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 61.0 regionprofile 0.004083 0.004083 0.004083 0.004083 3.355444e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 Algorithm_MEMSET
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} 1048576.0 59.0 regionprofile 0.002526 0.002526 0.002526 0.002526 8.388616e+06 1.048576e+06 1048576.0 1.0 1048576.0 50.0 62 regionprofile 0.002276 0.002276 0.002276 0.002276 8.388616e+06 1.048576e+06 1048576.0 1.0 1048576.0 50.0 59.0 regionprofile 0.002465 0.002465 0.002465 0.002465 8.388616e+06 1.048576e+06 1048576.0 1.0 1048576.0 50.0 60.0 regionprofile 0.002543 0.002543 0.002543 0.002543 8.388616e+06 1.048576e+06 1048576.0 1.0 1048576.0 50.0 Algorithm_REDUCE_SUM
4194304.0 59.0 regionprofile 0.005103 0.005103 0.005103 0.005103 3.355444e+07 4.194304e+06 4194304.0 1.0 4194304.0 50.0 62 regionprofile 0.003991 0.003991 0.003991 0.003991 3.355444e+07 4.194304e+06 4194304.0 1.0 4194304.0 50.0 59.0 regionprofile 0.004393 0.004393 0.004393 0.004393 3.355444e+07 4.194304e+06 4194304.0 1.0 4194304.0 50.0 60.0 regionprofile 0.004869 0.004869 0.004869 0.004869 3.355444e+07 4.194304e+06 4194304.0 1.0 4194304.0 50.0 Algorithm_REDUCE_SUM
{'name': 'Apps', 'type': 'function'} 1048576.0 47.0 regionprofile 0.173218 0.173218 0.173218 0.173218 2.264924e+08 8.174614e+07 6291456.0 156.0 1135363.0 700.0 50 regionprofile 0.171198 0.171198 0.171198 0.171198 2.264924e+08 8.174614e+07 6291456.0 156.0 1135363.0 700.0 47.0 regionprofile 0.170593 0.170593 0.170593 0.170593 2.264924e+08 8.174614e+07 6291456.0 156.0 1135363.0 700.0 48.0 regionprofile 0.175296 0.175296 0.175296 0.175296 2.264924e+08 8.174614e+07 6291456.0 156.0 1135363.0 700.0 Apps
4194304.0 47.0 regionprofile 0.548273 0.548273 0.548273 0.548273 9.059697e+08 3.194887e+08 25165824.0 156.0 4437343.0 700.0 50 regionprofile 0.564474 0.564474 0.564474 0.564474 9.059697e+08 3.194887e+08 25165824.0 156.0 4437343.0 700.0 47.0 regionprofile 0.564357 0.564357 0.564357 0.564357 9.059697e+08 3.194887e+08 25165824.0 156.0 4437343.0 700.0 48.0 regionprofile 0.572482 0.572482 0.572482 0.572482 9.059697e+08 3.194887e+08 25165824.0 156.0 4437343.0 700.0 Apps
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} 1048576.0 48.0 regionprofile 0.007075 0.007075 0.007075 0.007075 5.039722e+07 5.662310e+07 1048576.0 1.0 1048576.0 100.0 51 regionprofile 0.007010 0.007010 0.007010 0.007010 5.039722e+07 5.662310e+07 1048576.0 1.0 1048576.0 100.0 48.0 regionprofile 0.007162 0.007162 0.007162 0.007162 5.039722e+07 5.662310e+07 1048576.0 1.0 1048576.0 100.0 49.0 regionprofile 0.008157 0.008157 0.008157 0.008157 5.039722e+07 5.662310e+07 1048576.0 1.0 1048576.0 100.0 Apps_DEL_DOT_VEC_2D
4194304.0 48.0 regionprofile 0.026371 0.026371 0.026371 0.026371 2.014577e+08 2.264924e+08 4194304.0 1.0 4194304.0 100.0 51 regionprofile 0.026163 0.026163 0.026163 0.026163 2.014577e+08 2.264924e+08 4194304.0 1.0 4194304.0 100.0 48.0 regionprofile 0.026415 0.026415 0.026415 0.026415 2.014577e+08 2.264924e+08 4194304.0 1.0 4194304.0 100.0 49.0 regionprofile 0.029889 0.029889 0.029889 0.029889 2.014577e+08 2.264924e+08 4194304.0 1.0 4194304.0 100.0 Apps_DEL_DOT_VEC_2D
{'name': 'Apps_ENERGY', 'type': 'function'} 1048576.0 49.0 regionprofile 0.039007 0.039007 0.039007 0.039007 2.264924e+08 5.767168e+07 6291456.0 6.0 1048576.0 130.0 52 regionprofile 0.039096 0.039096 0.039096 0.039096 2.264924e+08 5.767168e+07 6291456.0 6.0 1048576.0 130.0 49.0 regionprofile 0.039065 0.039065 0.039065 0.039065 2.264924e+08 5.767168e+07 6291456.0 6.0 1048576.0 130.0 50.0 regionprofile 0.039312 0.039312 0.039312 0.039312 2.264924e+08 5.767168e+07 6291456.0 6.0 1048576.0 130.0 Apps_ENERGY
4194304.0 49.0 regionprofile 0.147485 0.147485 0.147485 0.147485 9.059697e+08 2.306867e+08 25165824.0 6.0 4194304.0 130.0 52 regionprofile 0.146777 0.146777 0.146777 0.146777 9.059697e+08 2.306867e+08 25165824.0 6.0 4194304.0 130.0 49.0 regionprofile 0.146717 0.146717 0.146717 0.146717 9.059697e+08 2.306867e+08 25165824.0 6.0 4194304.0 130.0 50.0 regionprofile 0.146882 0.146882 0.146882 0.146882 9.059697e+08 2.306867e+08 25165824.0 6.0 4194304.0 130.0 Apps_ENERGY
{'name': 'Apps_FIR', 'type': 'function'} 1048576.0 50.0 regionprofile 0.004151 0.004151 0.004151 0.004151 1.677709e+07 3.355392e+07 1048560.0 1.0 1048576.0 160.0 53 regionprofile 0.004203 0.004203 0.004203 0.004203 1.677709e+07 3.355392e+07 1048560.0 1.0 1048576.0 160.0 50.0 regionprofile 0.004278 0.004278 0.004278 0.004278 1.677709e+07 3.355392e+07 1048560.0 1.0 1048576.0 160.0 51.0 regionprofile 0.004374 0.004374 0.004374 0.004374 1.677709e+07 3.355392e+07 1048560.0 1.0 1048576.0 160.0 Apps_FIR
4194304.0 50.0 regionprofile 0.014257 0.014257 0.014257 0.014257 6.710874e+07 1.342172e+08 4194288.0 1.0 4194304.0 160.0 53 regionprofile 0.014324 0.014324 0.014324 0.014324 6.710874e+07 1.342172e+08 4194288.0 1.0 4194304.0 160.0 50.0 regionprofile 0.014437 0.014437 0.014437 0.014437 6.710874e+07 1.342172e+08 4194288.0 1.0 4194304.0 160.0 51.0 regionprofile 0.014574 0.014574 0.014574 0.014574 6.710874e+07 1.342172e+08 4194288.0 1.0 4194304.0 160.0 Apps_FIR
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} 1048576.0 51.0 regionprofile 0.031788 0.031788 0.031788 0.031788 7.491120e+06 0.000000e+00 187278.0 156.0 1030301.0 50.0 54 regionprofile 0.031574 0.031574 0.031574 0.031574 7.491120e+06 0.000000e+00 187278.0 156.0 1030301.0 50.0 51.0 regionprofile 0.031545 0.031545 0.031545 0.031545 7.491120e+06 0.000000e+00 187278.0 156.0 1030301.0 50.0 52.0 regionprofile 0.031682 0.031682 0.031682 0.031682 7.491120e+06 0.000000e+00 187278.0 156.0 1030301.0 50.0 Apps_HALOEXCHANGE
4194304.0 51.0 regionprofile 0.032575 0.032575 0.032575 0.032575 1.889592e+07 0.000000e+00 472398.0 156.0 4173281.0 50.0 54 regionprofile 0.032425 0.032425 0.032425 0.032425 1.889592e+07 0.000000e+00 472398.0 156.0 4173281.0 50.0 51.0 regionprofile 0.032377 0.032377 0.032377 0.032377 1.889592e+07 0.000000e+00 472398.0 156.0 4173281.0 50.0 52.0 regionprofile 0.032624 0.032624 0.032624 0.032624 1.889592e+07 0.000000e+00 472398.0 156.0 4173281.0 50.0 Apps_HALOEXCHANGE
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} 1048576.0 52.0 regionprofile 0.005385 0.005385 0.005385 0.005385 7.491120e+06 0.000000e+00 187278.0 2.0 1030301.0 50.0 55 regionprofile 0.003587 0.003587 0.003587 0.003587 7.491120e+06 0.000000e+00 187278.0 2.0 1030301.0 50.0 52.0 regionprofile 0.002755 0.002755 0.002755 0.002755 7.491120e+06 0.000000e+00 187278.0 2.0 1030301.0 50.0 53.0 regionprofile 0.002732 0.002732 0.002732 0.002732 7.491120e+06 0.000000e+00 187278.0 2.0 1030301.0 50.0 Apps_HALOEXCHANGE_FUSED
4194304.0 52.0 regionprofile 0.008215 0.008215 0.008215 0.008215 1.889592e+07 0.000000e+00 472398.0 2.0 4173281.0 50.0 55 regionprofile 0.008030 0.008030 0.008030 0.008030 1.889592e+07 0.000000e+00 472398.0 2.0 4173281.0 50.0 52.0 regionprofile 0.007012 0.007012 0.007012 0.007012 1.889592e+07 0.000000e+00 472398.0 2.0 4173281.0 50.0 53.0 regionprofile 0.006375 0.006375 0.006375 0.006375 1.889592e+07 0.000000e+00 472398.0 2.0 4173281.0 50.0 Apps_HALOEXCHANGE_FUSED
{'name': 'Apps_LTIMES', 'type': 'function'} 1048576.0 53.0 regionprofile 0.012112 0.012112 0.012112 0.012112 1.495501e+07 5.242880e+07 1048576.0 1.0 1048576.0 50.0 56 regionprofile 0.012074 0.012074 0.012074 0.012074 1.495501e+07 5.242880e+07 1048576.0 1.0 1048576.0 50.0 53.0 regionprofile 0.012070 0.012070 0.012070 0.012070 1.495501e+07 5.242880e+07 1048576.0 1.0 1048576.0 50.0 54.0 regionprofile 0.012949 0.012949 0.012949 0.012949 1.495501e+07 5.242880e+07 1048576.0 1.0 1048576.0 50.0 Apps_LTIMES
4194304.0 53.0 regionprofile 0.047001 0.047001 0.047001 0.047001 5.978163e+07 2.097152e+08 4194304.0 1.0 4194304.0 50.0 56 regionprofile 0.055041 0.055041 0.055041 0.055041 5.978163e+07 2.097152e+08 4194304.0 1.0 4194304.0 50.0 53.0 regionprofile 0.055581 0.055581 0.055581 0.055581 5.978163e+07 2.097152e+08 4194304.0 1.0 4194304.0 50.0 54.0 regionprofile 0.055605 0.055605 0.055605 0.055605 5.978163e+07 2.097152e+08 4194304.0 1.0 4194304.0 50.0 Apps_LTIMES
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} 1048576.0 54.0 regionprofile 0.012124 0.012124 0.012124 0.012124 1.495501e+07 5.242880e+07 1048576.0 1.0 1048576.0 50.0 57 regionprofile 0.012075 0.012075 0.012075 0.012075 1.495501e+07 5.242880e+07 1048576.0 1.0 1048576.0 50.0 54.0 regionprofile 0.012069 0.012069 0.012069 0.012069 1.495501e+07 5.242880e+07 1048576.0 1.0 1048576.0 50.0 55.0 regionprofile 0.012946 0.012946 0.012946 0.012946 1.495501e+07 5.242880e+07 1048576.0 1.0 1048576.0 50.0 Apps_LTIMES_NOVIEW
4194304.0 54.0 regionprofile 0.047010 0.047010 0.047010 0.047010 5.978163e+07 2.097152e+08 4194304.0 1.0 4194304.0 50.0 57 regionprofile 0.055052 0.055052 0.055052 0.055052 5.978163e+07 2.097152e+08 4194304.0 1.0 4194304.0 50.0 54.0 regionprofile 0.055579 0.055579 0.055579 0.055579 5.978163e+07 2.097152e+08 4194304.0 1.0 4194304.0 50.0 55.0 regionprofile 0.055599 0.055599 0.055599 0.055599 5.978163e+07 2.097152e+08 4194304.0 1.0 4194304.0 50.0 Apps_LTIMES_NOVIEW
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} 1048576.0 55.0 regionprofile 0.007383 0.007383 0.007383 0.007383 3.346414e+07 9.272709e+06 1030301.0 1.0 1030301.0 100.0 58 regionprofile 0.007599 0.007599 0.007599 0.007599 3.346414e+07 9.272709e+06 1030301.0 1.0 1030301.0 100.0 55.0 regionprofile 0.007695 0.007695 0.007695 0.007695 3.346414e+07 9.272709e+06 1030301.0 1.0 1030301.0 100.0 56.0 regionprofile 0.007855 0.007855 0.007855 0.007855 3.346414e+07 9.272709e+06 1030301.0 1.0 1030301.0 100.0 Apps_NODAL_ACCUMULATION_3D
4194304.0 55.0 regionprofile 0.027353 0.027353 0.027353 0.027353 1.347969e+08 3.755953e+07 4173281.0 1.0 4173281.0 100.0 58 regionprofile 0.029700 0.029700 0.029700 0.029700 1.347969e+08 3.755953e+07 4173281.0 1.0 4173281.0 100.0 55.0 regionprofile 0.029550 0.029550 0.029550 0.029550 1.347969e+08 3.755953e+07 4173281.0 1.0 4173281.0 100.0 56.0 regionprofile 0.030351 0.030351 0.030351 0.030351 1.347969e+08 3.755953e+07 4173281.0 1.0 4173281.0 100.0 Apps_NODAL_ACCUMULATION_3D
{'name': 'Apps_PRESSURE', 'type': 'function'} 1048576.0 56.0 regionprofile 0.048319 0.048319 0.048319 0.048319 4.194304e+07 3.145728e+06 2097152.0 2.0 1048576.0 700.0 59 regionprofile 0.048264 0.048264 0.048264 0.048264 4.194304e+07 3.145728e+06 2097152.0 2.0 1048576.0 700.0 56.0 regionprofile 0.048032 0.048032 0.048032 0.048032 4.194304e+07 3.145728e+06 2097152.0 2.0 1048576.0 700.0 57.0 regionprofile 0.048526 0.048526 0.048526 0.048526 4.194304e+07 3.145728e+06 2097152.0 2.0 1048576.0 700.0 Apps_PRESSURE
4194304.0 56.0 regionprofile 0.176850 0.176850 0.176850 0.176850 1.677722e+08 1.258291e+07 8388608.0 2.0 4194304.0 700.0 59 regionprofile 0.176168 0.176168 0.176168 0.176168 1.677722e+08 1.258291e+07 8388608.0 2.0 4194304.0 700.0 56.0 regionprofile 0.176042 0.176042 0.176042 0.176042 1.677722e+08 1.258291e+07 8388608.0 2.0 4194304.0 700.0 57.0 regionprofile 0.176699 0.176699 0.176699 0.176699 1.677722e+08 1.258291e+07 8388608.0 2.0 4194304.0 700.0 Apps_PRESSURE
{'name': 'Apps_VOL3D', 'type': 'function'} 1048576.0 57.0 regionprofile 0.005813 0.005813 0.005813 0.005813 3.659876e+07 8.174614e+07 1135363.0 1.0 1135363.0 100.0 60 regionprofile 0.005648 0.005648 0.005648 0.005648 3.659876e+07 8.174614e+07 1135363.0 1.0 1135363.0 100.0 57.0 regionprofile 0.005859 0.005859 0.005859 0.005859 3.659876e+07 8.174614e+07 1135363.0 1.0 1135363.0 100.0 58.0 regionprofile 0.006691 0.006691 0.006691 0.006691 3.659876e+07 8.174614e+07 1135363.0 1.0 1135363.0 100.0 Apps_VOL3D
4194304.0 57.0 regionprofile 0.021084 0.021084 0.021084 0.021084 1.426524e+08 3.194887e+08 4437343.0 1.0 4437343.0 100.0 60 regionprofile 0.020718 0.020718 0.020718 0.020718 1.426524e+08 3.194887e+08 4437343.0 1.0 4437343.0 100.0 57.0 regionprofile 0.020576 0.020576 0.020576 0.020576 1.426524e+08 3.194887e+08 4437343.0 1.0 4437343.0 100.0 58.0 regionprofile 0.023812 0.023812 0.023812 0.023812 1.426524e+08 3.194887e+08 4437343.0 1.0 4437343.0 100.0 Apps_VOL3D
{'name': 'Basic', 'type': 'function'} 1048576.0 1.0 regionprofile 0.347801 0.347801 0.347801 0.347801 4.194304e+07 1.153434e+07 1048576.0 1.0 1048576.0 2500.0 1 regionprofile 0.333475 0.333475 0.333475 0.333475 4.194304e+07 2.147484e+09 3145729.0 3.0 1048576.0 2500.0 1.0 regionprofile 0.310439 0.310439 0.310439 0.310439 4.194304e+07 1.153434e+07 1048576.0 1.0 1048576.0 2500.0 1.0 regionprofile 0.314681 0.314681 0.314681 0.314681 4.194304e+07 2.147484e+09 1048576.0 1.0 1048576.0 2500.0 Basic
4194304.0 1.0 regionprofile 1.181348 1.181348 1.181348 1.181348 1.677722e+08 4.613734e+07 4194304.0 1.0 4194304.0 2500.0 1 regionprofile 1.162457 1.162457 1.162457 1.162457 1.677722e+08 1.717987e+10 12582913.0 3.0 4194304.0 2500.0 1.0 regionprofile 1.070884 1.070884 1.070884 1.070884 1.677722e+08 4.613734e+07 4194304.0 1.0 4194304.0 2500.0 1.0 regionprofile 1.113558 1.113558 1.113558 1.113558 1.677722e+08 1.717987e+10 4194304.0 1.0 4194304.0 2500.0 Basic
{'name': 'Basic_DAXPY', 'type': 'function'} 1048576.0 2.0 regionprofile 0.016944 0.016944 0.016944 0.016944 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 500.0 2 regionprofile 0.016949 0.016949 0.016949 0.016949 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 500.0 2.0 regionprofile 0.016973 0.016973 0.016973 0.016973 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 500.0 2.0 regionprofile 0.017015 0.017015 0.017015 0.017015 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 500.0 Basic_DAXPY
4194304.0 2.0 regionprofile 0.061445 0.061445 0.061445 0.061445 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 500.0 2 regionprofile 0.061474 0.061474 0.061474 0.061474 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 500.0 2.0 regionprofile 0.061560 0.061560 0.061560 0.061560 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 500.0 2.0 regionprofile 0.061673 0.061673 0.061673 0.061673 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 500.0 Basic_DAXPY
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} 1048576.0 3.0 regionprofile 0.016943 0.016943 0.016943 0.016943 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 500.0 3 regionprofile 0.016936 0.016936 0.016936 0.016936 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 500.0 3.0 regionprofile 0.016874 0.016874 0.016874 0.016874 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 500.0 3.0 regionprofile 0.016940 0.016940 0.016940 0.016940 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 500.0 Basic_DAXPY_ATOMIC
4194304.0 3.0 regionprofile 0.061938 0.061938 0.061938 0.061938 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 500.0 3 regionprofile 0.061883 0.061883 0.061883 0.061883 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 500.0 3.0 regionprofile 0.061990 0.061990 0.061990 0.061990 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 500.0 3.0 regionprofile 0.062022 0.062022 0.062022 0.062022 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 500.0 Basic_DAXPY_ATOMIC
{'name': 'Basic_IF_QUAD', 'type': 'function'} 1048576.0 4.0 regionprofile 0.012530 0.012530 0.012530 0.012530 4.194304e+07 1.153434e+07 1048576.0 1.0 1048576.0 180.0 4 regionprofile 0.012603 0.012603 0.012603 0.012603 4.194304e+07 1.153434e+07 1048576.0 1.0 1048576.0 180.0 4.0 regionprofile 0.012496 0.012496 0.012496 0.012496 4.194304e+07 1.153434e+07 1048576.0 1.0 1048576.0 180.0 4.0 regionprofile 0.012786 0.012786 0.012786 0.012786 4.194304e+07 1.153434e+07 1048576.0 1.0 1048576.0 180.0 Basic_IF_QUAD
4194304.0 4.0 regionprofile 0.047675 0.047675 0.047675 0.047675 1.677722e+08 4.613734e+07 4194304.0 1.0 4194304.0 180.0 4 regionprofile 0.047554 0.047554 0.047554 0.047554 1.677722e+08 4.613734e+07 4194304.0 1.0 4194304.0 180.0 4.0 regionprofile 0.047973 0.047973 0.047973 0.047973 1.677722e+08 4.613734e+07 4194304.0 1.0 4194304.0 180.0 4.0 regionprofile 0.048466 0.048466 0.048466 0.048466 1.677722e+08 4.613734e+07 4194304.0 1.0 4194304.0 180.0 Basic_IF_QUAD
{'name': 'Basic_INIT3', 'type': 'function'} 1048576.0 5.0 regionprofile 0.027995 0.027995 0.027995 0.027995 4.194304e+07 1.048576e+06 1048576.0 1.0 1048576.0 500.0 7 regionprofile 0.028340 0.028340 0.028340 0.028340 4.194304e+07 1.048576e+06 1048576.0 1.0 1048576.0 500.0 5.0 regionprofile 0.027850 0.027850 0.027850 0.027850 4.194304e+07 1.048576e+06 1048576.0 1.0 1048576.0 500.0 5.0 regionprofile 0.028308 0.028308 0.028308 0.028308 4.194304e+07 1.048576e+06 1048576.0 1.0 1048576.0 500.0 Basic_INIT3
4194304.0 5.0 regionprofile 0.105995 0.105995 0.105995 0.105995 1.677722e+08 4.194304e+06 4194304.0 1.0 4194304.0 500.0 7 regionprofile 0.106222 0.106222 0.106222 0.106222 1.677722e+08 4.194304e+06 4194304.0 1.0 4194304.0 500.0 5.0 regionprofile 0.106498 0.106498 0.106498 0.106498 1.677722e+08 4.194304e+06 4194304.0 1.0 4194304.0 500.0 5.0 regionprofile 0.106129 0.106129 0.106129 0.106129 1.677722e+08 4.194304e+06 4194304.0 1.0 4194304.0 500.0 Basic_INIT3
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} 1048576.0 6.0 regionprofile 0.042143 0.042143 0.042143 0.042143 8.388608e+06 1.048576e+06 1048576.0 1.0 1048576.0 2500.0 8 regionprofile 0.031319 0.031319 0.031319 0.031319 8.388608e+06 1.048576e+06 1048576.0 1.0 1048576.0 2500.0 6.0 regionprofile 0.031419 0.031419 0.031419 0.031419 8.388608e+06 1.048576e+06 1048576.0 1.0 1048576.0 2500.0 6.0 regionprofile 0.031429 0.031429 0.031429 0.031429 8.388608e+06 1.048576e+06 1048576.0 1.0 1048576.0 2500.0 Basic_INIT_VIEW1D
4194304.0 6.0 regionprofile 0.128866 0.128866 0.128866 0.128866 3.355443e+07 4.194304e+06 4194304.0 1.0 4194304.0 2500.0 8 regionprofile 0.101000 0.101000 0.101000 0.101000 3.355443e+07 4.194304e+06 4194304.0 1.0 4194304.0 2500.0 6.0 regionprofile 0.101011 0.101011 0.101011 0.101011 3.355443e+07 4.194304e+06 4194304.0 1.0 4194304.0 2500.0 6.0 regionprofile 0.101026 0.101026 0.101026 0.101026 3.355443e+07 4.194304e+06 4194304.0 1.0 4194304.0 2500.0 Basic_INIT_VIEW1D
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} 1048576.0 7.0 regionprofile 0.042177 0.042177 0.042177 0.042177 8.388608e+06 1.048576e+06 1048576.0 1.0 1048576.0 2500.0 9 regionprofile 0.031324 0.031324 0.031324 0.031324 8.388608e+06 1.048576e+06 1048576.0 1.0 1048576.0 2500.0 7.0 regionprofile 0.031376 0.031376 0.031376 0.031376 8.388608e+06 1.048576e+06 1048576.0 1.0 1048576.0 2500.0 7.0 regionprofile 0.031453 0.031453 0.031453 0.031453 8.388608e+06 1.048576e+06 1048576.0 1.0 1048576.0 2500.0 Basic_INIT_VIEW1D_OFFSET
4194304.0 7.0 regionprofile 0.123683 0.123683 0.123683 0.123683 3.355443e+07 4.194304e+06 4194304.0 1.0 4194304.0 2500.0 9 regionprofile 0.100943 0.100943 0.100943 0.100943 3.355443e+07 4.194304e+06 4194304.0 1.0 4194304.0 2500.0 7.0 regionprofile 0.100950 0.100950 0.100950 0.100950 3.355443e+07 4.194304e+06 4194304.0 1.0 4194304.0 2500.0 7.0 regionprofile 0.101024 0.101024 0.101024 0.101024 3.355443e+07 4.194304e+06 4194304.0 1.0 4194304.0 2500.0 Basic_INIT_VIEW1D_OFFSET
{'name': 'Basic_MULADDSUB', 'type': 'function'} 1048576.0 8.0 regionprofile 0.019797 0.019797 0.019797 0.019797 4.194304e+07 3.145728e+06 1048576.0 1.0 1048576.0 350.0 11 regionprofile 0.019799 0.019799 0.019799 0.019799 4.194304e+07 3.145728e+06 1048576.0 1.0 1048576.0 350.0 8.0 regionprofile 0.019521 0.019521 0.019521 0.019521 4.194304e+07 3.145728e+06 1048576.0 1.0 1048576.0 350.0 9.0 regionprofile 0.019806 0.019806 0.019806 0.019806 4.194304e+07 3.145728e+06 1048576.0 1.0 1048576.0 350.0 Basic_MULADDSUB
4194304.0 8.0 regionprofile 0.074822 0.074822 0.074822 0.074822 1.677722e+08 1.258291e+07 4194304.0 1.0 4194304.0 350.0 11 regionprofile 0.074568 0.074568 0.074568 0.074568 1.677722e+08 1.258291e+07 4194304.0 1.0 4194304.0 350.0 8.0 regionprofile 0.074743 0.074743 0.074743 0.074743 1.677722e+08 1.258291e+07 4194304.0 1.0 4194304.0 350.0 9.0 regionprofile 0.074239 0.074239 0.074239 0.074239 1.677722e+08 1.258291e+07 4194304.0 1.0 4194304.0 350.0 Basic_MULADDSUB
{'name': 'Basic_NESTED_INIT', 'type': 'function'} 1048576.0 9.0 regionprofile 0.020804 0.020804 0.020804 0.020804 8.242408e+06 3.090903e+06 1030301.0 1.0 1030301.0 1000.0 12 regionprofile 0.013998 0.013998 0.013998 0.013998 8.242408e+06 3.090903e+06 1030301.0 1.0 1030301.0 1000.0 9.0 regionprofile 0.014054 0.014054 0.014054 0.014054 8.242408e+06 3.090903e+06 1030301.0 1.0 1030301.0 1000.0 10.0 regionprofile 0.014137 0.014137 0.014137 0.014137 8.242408e+06 3.090903e+06 1030301.0 1.0 1030301.0 1000.0 Basic_NESTED_INIT
4194304.0 9.0 regionprofile 0.059309 0.059309 0.059309 0.059309 3.338625e+07 1.251984e+07 4173281.0 1.0 4173281.0 1000.0 12 regionprofile 0.053096 0.053096 0.053096 0.053096 3.338625e+07 1.251984e+07 4173281.0 1.0 4173281.0 1000.0 9.0 regionprofile 0.052983 0.052983 0.052983 0.052983 3.338625e+07 1.251984e+07 4173281.0 1.0 4173281.0 1000.0 10.0 regionprofile 0.053923 0.053923 0.053923 0.053923 3.338625e+07 1.251984e+07 4173281.0 1.0 4173281.0 1000.0 Basic_NESTED_INIT
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} 1048576.0 10.0 regionprofile 0.124602 0.124602 0.124602 0.124602 1.600000e+01 6.291457e+06 1048576.0 1.0 1048576.0 50.0 13 regionprofile 0.127398 0.127398 0.127398 0.127398 1.600000e+01 6.291457e+06 1048576.0 1.0 1048576.0 50.0 10.0 regionprofile 0.127383 0.127383 0.127383 0.127383 1.600000e+01 6.291457e+06 1048576.0 1.0 1048576.0 50.0 11.0 regionprofile 0.124411 0.124411 0.124411 0.124411 1.600000e+01 6.291457e+06 1048576.0 1.0 1048576.0 50.0 Basic_PI_ATOMIC
4194304.0 10.0 regionprofile 0.436276 0.436276 0.436276 0.436276 1.600000e+01 2.516582e+07 4194304.0 1.0 4194304.0 50.0 13 regionprofile 0.436276 0.436276 0.436276 0.436276 1.600000e+01 2.516582e+07 4194304.0 1.0 4194304.0 50.0 10.0 regionprofile 0.436273 0.436273 0.436273 0.436273 1.600000e+01 2.516582e+07 4194304.0 1.0 4194304.0 50.0 11.0 regionprofile 0.436265 0.436265 0.436265 0.436265 1.600000e+01 2.516582e+07 4194304.0 1.0 4194304.0 50.0 Basic_PI_ATOMIC
{'name': 'Basic_PI_REDUCE', 'type': 'function'} 1048576.0 11.0 regionprofile 0.002415 0.002415 0.002415 0.002415 1.600000e+01 6.291457e+06 1048576.0 1.0 1048576.0 50.0 14 regionprofile 0.002491 0.002491 0.002491 0.002491 1.600000e+01 6.291457e+06 1048576.0 1.0 1048576.0 50.0 11.0 regionprofile 0.002498 0.002498 0.002498 0.002498 1.600000e+01 6.291457e+06 1048576.0 1.0 1048576.0 50.0 12.0 regionprofile 0.002483 0.002483 0.002483 0.002483 1.600000e+01 6.291457e+06 1048576.0 1.0 1048576.0 50.0 Basic_PI_REDUCE
4194304.0 11.0 regionprofile 0.004901 0.004901 0.004901 0.004901 1.600000e+01 2.516582e+07 4194304.0 1.0 4194304.0 50.0 14 regionprofile 0.003644 0.003644 0.003644 0.003644 1.600000e+01 2.516582e+07 4194304.0 1.0 4194304.0 50.0 11.0 regionprofile 0.004066 0.004066 0.004066 0.004066 1.600000e+01 2.516582e+07 4194304.0 1.0 4194304.0 50.0 12.0 regionprofile 0.004452 0.004452 0.004452 0.004452 1.600000e+01 2.516582e+07 4194304.0 1.0 4194304.0 50.0 Basic_PI_REDUCE
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} 1048576.0 12.0 regionprofile 0.002241 0.002241 0.002241 0.002241 4.194328e+06 1.048577e+06 1048576.0 1.0 1048576.0 50.0 15 regionprofile 0.002302 0.002302 0.002302 0.002302 4.194328e+06 1.048577e+06 1048576.0 1.0 1048576.0 50.0 12.0 regionprofile 0.002328 0.002328 0.002328 0.002328 4.194328e+06 1.048577e+06 1048576.0 1.0 1048576.0 50.0 13.0 regionprofile 0.002294 0.002294 0.002294 0.002294 4.194328e+06 1.048577e+06 1048576.0 1.0 1048576.0 50.0 Basic_REDUCE3_INT
4194304.0 12.0 regionprofile 0.005101 0.005101 0.005101 0.005101 1.677724e+07 4.194305e+06 4194304.0 1.0 4194304.0 50.0 15 regionprofile 0.004159 0.004159 0.004159 0.004159 1.677724e+07 4.194305e+06 4194304.0 1.0 4194304.0 50.0 12.0 regionprofile 0.004315 0.004315 0.004315 0.004315 1.677724e+07 4.194305e+06 4194304.0 1.0 4194304.0 50.0 13.0 regionprofile 0.005068 0.005068 0.005068 0.005068 1.677724e+07 4.194305e+06 4194304.0 1.0 4194304.0 50.0 Basic_REDUCE3_INT
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} 1048576.0 13.0 regionprofile 0.016689 0.016689 0.016689 0.016689 1.677726e+07 2.097154e+06 1048576.0 1.0 1048576.0 50.0 16 regionprofile 0.008741 0.008741 0.008741 0.008741 1.677726e+07 2.097154e+06 1048576.0 1.0 1048576.0 50.0 13.0 regionprofile 0.005151 0.005151 0.005151 0.005151 1.677726e+07 2.097154e+06 1048576.0 1.0 1048576.0 50.0 14.0 regionprofile 0.005093 0.005093 0.005093 0.005093 1.677726e+07 2.097154e+06 1048576.0 1.0 1048576.0 50.0 Basic_REDUCE_STRUCT
4194304.0 13.0 regionprofile 0.066340 0.066340 0.066340 0.066340 6.710891e+07 8.388610e+06 4194304.0 1.0 4194304.0 50.0 16 regionprofile 0.030358 0.030358 0.030358 0.030358 6.710891e+07 8.388610e+06 4194304.0 1.0 4194304.0 50.0 13.0 regionprofile 0.014375 0.014375 0.014375 0.014375 6.710891e+07 8.388610e+06 4194304.0 1.0 4194304.0 50.0 14.0 regionprofile 0.013960 0.013960 0.013960 0.013960 6.710891e+07 8.388610e+06 4194304.0 1.0 4194304.0 50.0 Basic_REDUCE_STRUCT
{'name': 'Basic_TRAP_INT', 'type': 'function'} 1048576.0 14.0 regionprofile 0.002439 0.002439 0.002439 0.002439 1.600000e+01 1.048576e+07 1048576.0 1.0 1048576.0 50.0 17 regionprofile 0.002337 0.002337 0.002337 0.002337 1.600000e+01 1.048576e+07 1048576.0 1.0 1048576.0 50.0 14.0 regionprofile 0.002438 0.002438 0.002438 0.002438 1.600000e+01 1.048576e+07 1048576.0 1.0 1048576.0 50.0 15.0 regionprofile 0.002481 0.002481 0.002481 0.002481 1.600000e+01 1.048576e+07 1048576.0 1.0 1048576.0 50.0 Basic_TRAP_INT
4194304.0 14.0 regionprofile 0.004905 0.004905 0.004905 0.004905 1.600000e+01 4.194304e+07 4194304.0 1.0 4194304.0 50.0 17 regionprofile 0.003767 0.003767 0.003767 0.003767 1.600000e+01 4.194304e+07 4194304.0 1.0 4194304.0 50.0 14.0 regionprofile 0.004063 0.004063 0.004063 0.004063 1.600000e+01 4.194304e+07 4194304.0 1.0 4194304.0 50.0 15.0 regionprofile 0.004744 0.004744 0.004744 0.004744 1.600000e+01 4.194304e+07 4194304.0 1.0 4194304.0 50.0 Basic_TRAP_INT
{'name': 'Lcals', 'type': 'function'} 1048576.0 15.0 regionprofile 0.387646 0.387646 0.387646 0.387646 1.677722e+08 4.595730e+07 3145728.0 3.0 1048576.0 2000.0 18 regionprofile 0.385061 0.385061 0.385061 0.385061 1.677722e+08 4.595730e+07 3145728.0 3.0 1048576.0 2000.0 15.0 regionprofile 0.384342 0.384342 0.384342 0.384342 1.677722e+08 4.595730e+07 3145728.0 3.0 1048576.0 2000.0 16.0 regionprofile 0.384261 0.384261 0.384261 0.384261 1.677722e+08 4.595730e+07 3145728.0 3.0 1048576.0 2000.0 Lcals
4194304.0 15.0 regionprofile 1.448976 1.448976 1.448976 1.448976 6.710886e+08 1.841891e+08 12582912.0 3.0 4194304.0 2000.0 18 regionprofile 1.442976 1.442976 1.442976 1.442976 6.710886e+08 1.841891e+08 12582912.0 3.0 4194304.0 2000.0 15.0 regionprofile 1.434713 1.434713 1.434713 1.434713 6.710886e+08 1.841891e+08 12582912.0 3.0 4194304.0 2000.0 16.0 regionprofile 1.430823 1.430823 1.430823 1.430823 6.710886e+08 1.841891e+08 12582912.0 3.0 4194304.0 2000.0 Lcals
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} 1048576.0 16.0 regionprofile 0.062320 0.062320 0.062320 0.062320 1.677722e+08 9.437184e+06 1048576.0 1.0 1048576.0 200.0 19 regionprofile 0.060230 0.060230 0.060230 0.060230 1.677722e+08 9.437184e+06 1048576.0 1.0 1048576.0 200.0 16.0 regionprofile 0.058127 0.058127 0.058127 0.058127 1.677722e+08 9.437184e+06 1048576.0 1.0 1048576.0 200.0 17.0 regionprofile 0.056180 0.056180 0.056180 0.056180 1.677722e+08 9.437184e+06 1048576.0 1.0 1048576.0 200.0 Lcals_DIFF_PREDICT
4194304.0 16.0 regionprofile 0.235796 0.235796 0.235796 0.235796 6.710886e+08 3.774874e+07 4194304.0 1.0 4194304.0 200.0 19 regionprofile 0.231809 0.231809 0.231809 0.231809 6.710886e+08 3.774874e+07 4194304.0 1.0 4194304.0 200.0 16.0 regionprofile 0.224191 0.224191 0.224191 0.224191 6.710886e+08 3.774874e+07 4194304.0 1.0 4194304.0 200.0 17.0 regionprofile 0.213626 0.213626 0.213626 0.213626 6.710886e+08 3.774874e+07 4194304.0 1.0 4194304.0 200.0 Lcals_DIFF_PREDICT
{'name': 'Lcals_EOS', 'type': 'function'} 1048576.0 17.0 regionprofile 0.022697 0.022697 0.022697 0.022697 3.355449e+07 1.677722e+07 1048576.0 1.0 1048576.0 500.0 20 regionprofile 0.022655 0.022655 0.022655 0.022655 3.355449e+07 1.677722e+07 1048576.0 1.0 1048576.0 500.0 17.0 regionprofile 0.022638 0.022638 0.022638 0.022638 3.355449e+07 1.677722e+07 1048576.0 1.0 1048576.0 500.0 18.0 regionprofile 0.022745 0.022745 0.022745 0.022745 3.355449e+07 1.677722e+07 1048576.0 1.0 1048576.0 500.0 Lcals_EOS
4194304.0 17.0 regionprofile 0.082976 0.082976 0.082976 0.082976 1.342178e+08 6.710886e+07 4194304.0 1.0 4194304.0 500.0 20 regionprofile 0.083900 0.083900 0.083900 0.083900 1.342178e+08 6.710886e+07 4194304.0 1.0 4194304.0 500.0 17.0 regionprofile 0.083853 0.083853 0.083853 0.083853 1.342178e+08 6.710886e+07 4194304.0 1.0 4194304.0 500.0 18.0 regionprofile 0.083815 0.083815 0.083815 0.083815 1.342178e+08 6.710886e+07 4194304.0 1.0 4194304.0 500.0 Lcals_EOS
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} 1048576.0 18.0 regionprofile 0.048539 0.048539 0.048539 0.048539 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 2000.0 21 regionprofile 0.048605 0.048605 0.048605 0.048605 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 2000.0 18.0 regionprofile 0.049276 0.049276 0.049276 0.049276 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 2000.0 19.0 regionprofile 0.049684 0.049684 0.049684 0.049684 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 2000.0 Lcals_FIRST_DIFF
4194304.0 18.0 regionprofile 0.173176 0.173176 0.173176 0.173176 6.710887e+07 4.194304e+06 4194304.0 1.0 4194304.0 2000.0 21 regionprofile 0.173146 0.173146 0.173146 0.173146 6.710887e+07 4.194304e+06 4194304.0 1.0 4194304.0 2000.0 18.0 regionprofile 0.173415 0.173415 0.173415 0.173415 6.710887e+07 4.194304e+06 4194304.0 1.0 4194304.0 2000.0 19.0 regionprofile 0.173615 0.173615 0.173615 0.173615 6.710887e+07 4.194304e+06 4194304.0 1.0 4194304.0 2000.0 Lcals_FIRST_DIFF
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} 1048576.0 19.0 regionprofile 0.005432 0.005432 0.005432 0.005432 8.388640e+06 0.000000e+00 1048576.0 1.0 1048576.0 100.0 22 regionprofile 0.004740 0.004740 0.004740 0.004740 8.388640e+06 0.000000e+00 1048576.0 1.0 1048576.0 100.0 19.0 regionprofile 0.004543 0.004543 0.004543 0.004543 8.388640e+06 0.000000e+00 1048576.0 1.0 1048576.0 100.0 20.0 regionprofile 0.004423 0.004423 0.004423 0.004423 8.388640e+06 0.000000e+00 1048576.0 1.0 1048576.0 100.0 Lcals_FIRST_MIN
4194304.0 19.0 regionprofile 0.012204 0.012204 0.012204 0.012204 3.355446e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 22 regionprofile 0.009819 0.009819 0.009819 0.009819 3.355446e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 19.0 regionprofile 0.009199 0.009199 0.009199 0.009199 3.355446e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 20.0 regionprofile 0.010418 0.010418 0.010418 0.010418 3.355446e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 Lcals_FIRST_MIN
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} 1048576.0 20.0 regionprofile 0.048626 0.048626 0.048626 0.048626 1.677721e+07 1.048575e+06 1048576.0 1.0 1048576.0 2000.0 23 regionprofile 0.048618 0.048618 0.048618 0.048618 1.677721e+07 1.048575e+06 1048576.0 1.0 1048576.0 2000.0 20.0 regionprofile 0.049289 0.049289 0.049289 0.049289 1.677721e+07 1.048575e+06 1048576.0 1.0 1048576.0 2000.0 21.0 regionprofile 0.049992 0.049992 0.049992 0.049992 1.677721e+07 1.048575e+06 1048576.0 1.0 1048576.0 2000.0 Lcals_FIRST_SUM
4194304.0 20.0 regionprofile 0.173164 0.173164 0.173164 0.173164 6.710886e+07 4.194303e+06 4194304.0 1.0 4194304.0 2000.0 23 regionprofile 0.173199 0.173199 0.173199 0.173199 6.710886e+07 4.194303e+06 4194304.0 1.0 4194304.0 2000.0 20.0 regionprofile 0.173526 0.173526 0.173526 0.173526 6.710886e+07 4.194303e+06 4194304.0 1.0 4194304.0 2000.0 21.0 regionprofile 0.174873 0.174873 0.174873 0.174873 6.710886e+07 4.194303e+06 4194304.0 1.0 4194304.0 2000.0 Lcals_FIRST_SUM
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} 1048576.0 21.0 regionprofile 0.049047 0.049047 0.049047 0.049047 8.388608e+07 6.291456e+06 1048576.0 2.0 1048576.0 500.0 24 regionprofile 0.048864 0.048864 0.048864 0.048864 8.388608e+07 6.291456e+06 1048576.0 2.0 1048576.0 500.0 21.0 regionprofile 0.048966 0.048966 0.048966 0.048966 8.388608e+07 6.291456e+06 1048576.0 2.0 1048576.0 500.0 22.0 regionprofile 0.049001 0.049001 0.049001 0.049001 8.388608e+07 6.291456e+06 1048576.0 2.0 1048576.0 500.0 Lcals_GEN_LIN_RECUR
4194304.0 21.0 regionprofile 0.202498 0.202498 0.202498 0.202498 3.355443e+08 2.516582e+07 4194304.0 2.0 4194304.0 500.0 24 regionprofile 0.201866 0.201866 0.201866 0.201866 3.355443e+08 2.516582e+07 4194304.0 2.0 4194304.0 500.0 21.0 regionprofile 0.201679 0.201679 0.201679 0.201679 3.355443e+08 2.516582e+07 4194304.0 2.0 4194304.0 500.0 22.0 regionprofile 0.201866 0.201866 0.201866 0.201866 3.355443e+08 2.516582e+07 4194304.0 2.0 4194304.0 500.0 Lcals_GEN_LIN_RECUR
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} 1048576.0 22.0 regionprofile 0.033742 0.033742 0.033742 0.033742 2.516583e+07 5.242880e+06 1048576.0 1.0 1048576.0 1000.0 25 regionprofile 0.033975 0.033975 0.033975 0.033975 2.516583e+07 5.242880e+06 1048576.0 1.0 1048576.0 1000.0 22.0 regionprofile 0.033803 0.033803 0.033803 0.033803 2.516583e+07 5.242880e+06 1048576.0 1.0 1048576.0 1000.0 23.0 regionprofile 0.034674 0.034674 0.034674 0.034674 2.516583e+07 5.242880e+06 1048576.0 1.0 1048576.0 1000.0 Lcals_HYDRO_1D
4194304.0 22.0 regionprofile 0.123291 0.123291 0.123291 0.123291 1.006633e+08 2.097152e+07 4194304.0 1.0 4194304.0 1000.0 25 regionprofile 0.123211 0.123211 0.123211 0.123211 1.006633e+08 2.097152e+07 4194304.0 1.0 4194304.0 1000.0 22.0 regionprofile 0.123233 0.123233 0.123233 0.123233 1.006633e+08 2.097152e+07 4194304.0 1.0 4194304.0 1000.0 23.0 regionprofile 0.123642 0.123642 0.123642 0.123642 1.006633e+08 2.097152e+07 4194304.0 1.0 4194304.0 1000.0 Lcals_HYDRO_1D
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} 1048576.0 23.0 regionprofile 0.022563 0.022563 0.022563 0.022563 1.506676e+08 4.595730e+07 3145728.0 3.0 1048576.0 100.0 26 regionprofile 0.022560 0.022560 0.022560 0.022560 1.506676e+08 4.595730e+07 3145728.0 3.0 1048576.0 100.0 23.0 regionprofile 0.022730 0.022730 0.022730 0.022730 1.506676e+08 4.595730e+07 3145728.0 3.0 1048576.0 100.0 24.0 regionprofile 0.022833 0.022833 0.022833 0.022833 1.506676e+08 4.595730e+07 3145728.0 3.0 1048576.0 100.0 Lcals_HYDRO_2D
4194304.0 23.0 regionprofile 0.086440 0.086440 0.086440 0.086440 6.033247e+08 1.841891e+08 12582912.0 3.0 4194304.0 100.0 26 regionprofile 0.086084 0.086084 0.086084 0.086084 6.033247e+08 1.841891e+08 12582912.0 3.0 4194304.0 100.0 23.0 regionprofile 0.086260 0.086260 0.086260 0.086260 6.033247e+08 1.841891e+08 12582912.0 3.0 4194304.0 100.0 24.0 regionprofile 0.088949 0.088949 0.088949 0.088949 6.033247e+08 1.841891e+08 12582912.0 3.0 4194304.0 100.0 Lcals_HYDRO_2D
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} 1048576.0 24.0 regionprofile 0.046893 0.046893 0.046893 0.046893 9.227469e+07 1.782579e+07 1048576.0 1.0 1048576.0 400.0 27 regionprofile 0.046963 0.046963 0.046963 0.046963 9.227469e+07 1.782579e+07 1048576.0 1.0 1048576.0 400.0 24.0 regionprofile 0.046876 0.046876 0.046876 0.046876 9.227469e+07 1.782579e+07 1048576.0 1.0 1048576.0 400.0 25.0 regionprofile 0.046740 0.046740 0.046740 0.046740 9.227469e+07 1.782579e+07 1048576.0 1.0 1048576.0 400.0 Lcals_INT_PREDICT
4194304.0 24.0 regionprofile 0.182489 0.182489 0.182489 0.182489 3.690988e+08 7.130317e+07 4194304.0 1.0 4194304.0 400.0 27 regionprofile 0.182741 0.182741 0.182741 0.182741 3.690988e+08 7.130317e+07 4194304.0 1.0 4194304.0 400.0 24.0 regionprofile 0.182463 0.182463 0.182463 0.182463 3.690988e+08 7.130317e+07 4194304.0 1.0 4194304.0 400.0 25.0 regionprofile 0.182635 0.182635 0.182635 0.182635 3.690988e+08 7.130317e+07 4194304.0 1.0 4194304.0 400.0 Lcals_INT_PREDICT
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} 1048576.0 25.0 regionprofile 0.002752 0.002752 0.002752 0.002752 4.194304e+07 4.194304e+06 1048576.0 1.0 1048576.0 50.0 28 regionprofile 0.002752 0.002752 0.002752 0.002752 4.194304e+07 4.194304e+06 1048576.0 1.0 1048576.0 50.0 25.0 regionprofile 0.002794 0.002794 0.002794 0.002794 4.194304e+07 4.194304e+06 1048576.0 1.0 1048576.0 50.0 26.0 regionprofile 0.002788 0.002788 0.002788 0.002788 4.194304e+07 4.194304e+06 1048576.0 1.0 1048576.0 50.0 Lcals_PLANCKIAN
4194304.0 25.0 regionprofile 0.010289 0.010289 0.010289 0.010289 1.677722e+08 1.677722e+07 4194304.0 1.0 4194304.0 50.0 28 regionprofile 0.010275 0.010275 0.010275 0.010275 1.677722e+08 1.677722e+07 4194304.0 1.0 4194304.0 50.0 25.0 regionprofile 0.010296 0.010296 0.010296 0.010296 1.677722e+08 1.677722e+07 4194304.0 1.0 4194304.0 50.0 26.0 regionprofile 0.010319 0.010319 0.010319 0.010319 1.677722e+08 1.677722e+07 4194304.0 1.0 4194304.0 50.0 Lcals_PLANCKIAN
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} 1048576.0 26.0 regionprofile 0.044959 0.044959 0.044959 0.044959 3.355440e+07 2.097150e+06 1048576.0 1.0 1048576.0 1000.0 29 regionprofile 0.045027 0.045027 0.045027 0.045027 3.355440e+07 2.097150e+06 1048576.0 1.0 1048576.0 1000.0 26.0 regionprofile 0.045229 0.045229 0.045229 0.045229 3.355440e+07 2.097150e+06 1048576.0 1.0 1048576.0 1000.0 27.0 regionprofile 0.045130 0.045130 0.045130 0.045130 3.355440e+07 2.097150e+06 1048576.0 1.0 1048576.0 1000.0 Lcals_TRIDIAG_ELIM
4194304.0 26.0 regionprofile 0.166569 0.166569 0.166569 0.166569 1.342177e+08 8.388606e+06 4194304.0 1.0 4194304.0 1000.0 29 regionprofile 0.166845 0.166845 0.166845 0.166845 1.342177e+08 8.388606e+06 4194304.0 1.0 4194304.0 1000.0 26.0 regionprofile 0.166518 0.166518 0.166518 0.166518 1.342177e+08 8.388606e+06 4194304.0 1.0 4194304.0 1000.0 27.0 regionprofile 0.166988 0.166988 0.166988 0.166988 1.342177e+08 8.388606e+06 4194304.0 1.0 4194304.0 1000.0 Lcals_TRIDIAG_ELIM
{'name': 'Polybench', 'type': 'function'} 1048576.0 27.0 regionprofile 0.597587 0.597587 0.597587 0.597587 3.359049e+09 6.797544e+09 125952040.0 160.0 1050625.0 120.0 30 regionprofile 0.607722 0.607722 0.607722 0.607722 3.359049e+09 6.797544e+09 125952040.0 160.0 1050625.0 120.0 27.0 regionprofile 0.706357 0.706357 0.706357 0.706357 3.359049e+09 6.797544e+09 125952040.0 160.0 1050625.0 120.0 28.0 regionprofile 1.018105 1.018105 1.018105 1.018105 3.359049e+09 6.797544e+09 125952040.0 160.0 1050625.0 120.0 Polybench
4194304.0 27.0 regionprofile 2.745905 2.745905 2.745905 2.745905 1.342898e+10 3.576198e+10 503562280.0 160.0 4198401.0 120.0 30 regionprofile 2.767598 2.767598 2.767598 2.767598 1.342898e+10 3.576198e+10 503562280.0 160.0 4198401.0 120.0 27.0 regionprofile 2.944963 2.944963 2.944963 2.944963 1.342898e+10 3.576198e+10 503562280.0 160.0 4198401.0 120.0 28.0 regionprofile 3.518693 3.518693 3.518693 3.518693 1.342898e+10 3.576198e+10 503562280.0 160.0 4198401.0 120.0 Polybench
{'name': 'Polybench_2MM', 'type': 'function'} 1048576.0 28.0 regionprofile 0.006426 0.006426 0.006426 0.006426 5.198800e+07 5.683881e+09 2101250.0 2.0 1050625.0 2.0 31 regionprofile 0.005658 0.005658 0.005658 0.005658 5.198800e+07 5.683881e+09 2101250.0 2.0 1050625.0 2.0 28.0 regionprofile 0.006097 0.006097 0.006097 0.006097 5.198800e+07 5.683881e+09 2101250.0 2.0 1050625.0 2.0 29.0 regionprofile 0.006958 0.006958 0.006958 0.006958 5.198800e+07 5.683881e+09 2101250.0 2.0 1050625.0 2.0 Polybench_2MM
4194304.0 28.0 regionprofile 0.032479 0.032479 0.032479 0.032479 1.710669e+08 3.131167e+10 8396802.0 2.0 4198401.0 2.0 31 regionprofile 0.032284 0.032284 0.032284 0.032284 1.710669e+08 3.131167e+10 8396802.0 2.0 4198401.0 2.0 28.0 regionprofile 0.032710 0.032710 0.032710 0.032710 1.710669e+08 3.131167e+10 8396802.0 2.0 4198401.0 2.0 29.0 regionprofile 0.034111 0.034111 0.034111 0.034111 1.710669e+08 3.131167e+10 8396802.0 2.0 4198401.0 2.0 Polybench_2MM
{'name': 'Polybench_3MM', 'type': 'function'} 1048576.0 29.0 regionprofile 0.009667 0.009667 0.009667 0.009667 7.826900e+07 6.797544e+09 3151875.0 3.0 1050625.0 2.0 32 regionprofile 0.008535 0.008535 0.008535 0.008535 7.826900e+07 6.797544e+09 3151875.0 3.0 1050625.0 2.0 29.0 regionprofile 0.009183 0.009183 0.009183 0.009183 7.826900e+07 6.797544e+09 3151875.0 3.0 1050625.0 2.0 30.0 regionprofile 0.010579 0.010579 0.010579 0.010579 7.826900e+07 6.797544e+09 3151875.0 3.0 1050625.0 2.0 Polybench_3MM
4194304.0 29.0 regionprofile 0.044100 0.044100 0.044100 0.044100 2.403887e+08 3.576198e+10 12595203.0 3.0 4198401.0 2.0 32 regionprofile 0.043582 0.043582 0.043582 0.043582 2.403887e+08 3.576198e+10 12595203.0 3.0 4198401.0 2.0 29.0 regionprofile 0.044093 0.044093 0.044093 0.044093 2.403887e+08 3.576198e+10 12595203.0 3.0 4198401.0 2.0 30.0 regionprofile 0.046054 0.046054 0.046054 0.046054 2.403887e+08 3.576198e+10 12595203.0 3.0 4198401.0 2.0 Polybench_3MM
{'name': 'Polybench_ADI', 'type': 'function'} 1048576.0 30.0 regionprofile 0.040744 0.040744 0.040744 0.040744 4.026528e+08 1.423279e+08 8184.0 8.0 1046529.0 4.0 33 regionprofile 0.047805 0.047805 0.047805 0.047805 4.026528e+08 1.423279e+08 8184.0 8.0 1046529.0 4.0 30.0 regionprofile 0.100471 0.100471 0.100471 0.100471 4.026528e+08 1.423279e+08 8184.0 8.0 1046529.0 4.0 31.0 regionprofile 0.240500 0.240500 0.240500 0.240500 4.026528e+08 1.423279e+08 8184.0 8.0 1046529.0 4.0 Polybench_ADI
4194304.0 30.0 regionprofile 0.082385 0.082385 0.082385 0.082385 1.610612e+09 5.698684e+08 16376.0 8.0 4190209.0 4.0 33 regionprofile 0.103245 0.103245 0.103245 0.103245 1.610612e+09 5.698684e+08 16376.0 8.0 4190209.0 4.0 30.0 regionprofile 0.191407 0.191407 0.191407 0.191407 1.610612e+09 5.698684e+08 16376.0 8.0 4190209.0 4.0 31.0 regionprofile 0.430046 0.430046 0.430046 0.430046 1.610612e+09 5.698684e+08 16376.0 8.0 4190209.0 4.0 Polybench_ADI
{'name': 'Polybench_ATAX', 'type': 'function'} 1048576.0 31.0 regionprofile 0.027871 0.027871 0.027871 0.027871 1.685100e+07 4.202500e+06 2050.0 2.0 1050625.0 100.0 34 regionprofile 0.029419 0.029419 0.029419 0.029419 1.685100e+07 4.202500e+06 2050.0 2.0 1050625.0 100.0 31.0 regionprofile 0.036441 0.036441 0.036441 0.036441 1.685100e+07 4.202500e+06 2050.0 2.0 1050625.0 100.0 32.0 regionprofile 0.066951 0.066951 0.066951 0.066951 1.685100e+07 4.202500e+06 2050.0 2.0 1050625.0 100.0 Polybench_ATAX
4194304.0 31.0 regionprofile 0.052800 0.052800 0.052800 0.052800 6.725638e+07 1.679360e+07 4098.0 2.0 4198401.0 100.0 34 regionprofile 0.060357 0.060357 0.060357 0.060357 6.725638e+07 1.679360e+07 4098.0 2.0 4198401.0 100.0 31.0 regionprofile 0.076365 0.076365 0.076365 0.076365 6.725638e+07 1.679360e+07 4098.0 2.0 4198401.0 100.0 32.0 regionprofile 0.133815 0.133815 0.133815 0.133815 6.725638e+07 1.679360e+07 4098.0 2.0 4198401.0 100.0 Polybench_ATAX
{'name': 'Polybench_FDTD_2D', 'type': 'function'} 1048576.0 32.0 regionprofile 0.037338 0.037338 0.037338 0.037338 3.359049e+09 4.616192e+08 125952040.0 160.0 1049600.0 8.0 35 regionprofile 0.037041 0.037041 0.037041 0.037041 3.359049e+09 4.616192e+08 125952040.0 160.0 1049600.0 8.0 32.0 regionprofile 0.037118 0.037118 0.037118 0.037118 3.359049e+09 4.616192e+08 125952040.0 160.0 1049600.0 8.0 33.0 regionprofile 0.037303 0.037303 0.037303 0.037303 3.359049e+09 4.616192e+08 125952040.0 160.0 1049600.0 8.0 Polybench_FDTD_2D
4194304.0 32.0 regionprofile 0.133181 0.133181 0.133181 0.133181 1.342898e+10 1.845985e+09 503562280.0 160.0 4196352.0 8.0 35 regionprofile 0.133258 0.133258 0.133258 0.133258 1.342898e+10 1.845985e+09 503562280.0 160.0 4196352.0 8.0 32.0 regionprofile 0.133467 0.133467 0.133467 0.133467 1.342898e+10 1.845985e+09 503562280.0 160.0 4196352.0 8.0 33.0 regionprofile 0.134053 0.134053 0.134053 0.134053 1.342898e+10 1.845985e+09 503562280.0 160.0 4196352.0 8.0 Polybench_FDTD_2D
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} 1048576.0 33.0 regionprofile 0.211207 0.211207 0.211207 0.211207 1.681000e+07 1.076891e+09 1050625.0 1.0 1050625.0 8.0 36 regionprofile 0.209059 0.209059 0.209059 0.209059 1.681000e+07 1.076891e+09 1050625.0 1.0 1050625.0 8.0 33.0 regionprofile 0.210531 0.210531 0.210531 0.210531 1.681000e+07 1.076891e+09 1050625.0 1.0 1050625.0 8.0 34.0 regionprofile 0.211865 0.211865 0.211865 0.211865 1.681000e+07 1.076891e+09 1050625.0 1.0 1050625.0 8.0 Polybench_FLOYD_WARSHALL
4194304.0 33.0 regionprofile 1.516510 1.516510 1.516510 1.516510 6.717442e+07 8.602524e+09 4198401.0 1.0 4198401.0 8.0 36 regionprofile 1.516435 1.516435 1.516435 1.516435 6.717442e+07 8.602524e+09 4198401.0 1.0 4198401.0 8.0 33.0 regionprofile 1.520934 1.520934 1.520934 1.520934 6.717442e+07 8.602524e+09 4198401.0 1.0 4198401.0 8.0 34.0 regionprofile 1.517274 1.517274 1.517274 1.517274 6.717442e+07 8.602524e+09 4198401.0 1.0 4198401.0 8.0 Polybench_FLOYD_WARSHALL
{'name': 'Polybench_GEMM', 'type': 'function'} 1048576.0 34.0 regionprofile 0.006501 0.006501 0.006501 0.006501 2.808500e+07 3.783301e+09 1050625.0 1.0 1050625.0 4.0 37 regionprofile 0.006145 0.006145 0.006145 0.006145 2.808500e+07 3.783301e+09 1050625.0 1.0 1050625.0 4.0 34.0 regionprofile 0.006426 0.006426 0.006426 0.006426 2.808500e+07 3.783301e+09 1050625.0 1.0 1050625.0 4.0 35.0 regionprofile 0.006745 0.006745 0.006745 0.006745 2.808500e+07 3.783301e+09 1050625.0 1.0 1050625.0 4.0 Polybench_GEMM
4194304.0 34.0 regionprofile 0.041682 0.041682 0.041682 0.041682 7.292801e+07 1.511844e+10 4198401.0 1.0 4198401.0 4.0 37 regionprofile 0.025073 0.025073 0.025073 0.025073 7.292801e+07 1.511844e+10 4198401.0 1.0 4198401.0 4.0 34.0 regionprofile 0.024905 0.024905 0.024905 0.024905 7.292801e+07 1.511844e+10 4198401.0 1.0 4198401.0 4.0 35.0 regionprofile 0.025525 0.025525 0.025525 0.025525 7.292801e+07 1.511844e+10 4198401.0 1.0 4198401.0 4.0 Polybench_GEMM
{'name': 'Polybench_GEMVER', 'type': 'function'} 1048576.0 35.0 regionprofile 0.007053 0.007053 0.007053 0.007053 3.372660e+07 1.050728e+07 3152900.0 4.0 1050625.0 20.0 38 regionprofile 0.007155 0.007155 0.007155 0.007155 3.372660e+07 1.050728e+07 3152900.0 4.0 1050625.0 20.0 35.0 regionprofile 0.008014 0.008014 0.008014 0.008014 3.372660e+07 1.050728e+07 3152900.0 4.0 1050625.0 20.0 36.0 regionprofile 0.013533 0.013533 0.013533 0.013533 3.372660e+07 1.050728e+07 3152900.0 4.0 1050625.0 20.0 Polybench_GEMVER
4194304.0 35.0 regionprofile 0.014683 0.014683 0.014683 0.014683 1.345619e+08 4.198606e+07 12597252.0 4.0 4198401.0 20.0 38 regionprofile 0.015327 0.015327 0.015327 0.015327 1.345619e+08 4.198606e+07 12597252.0 4.0 4198401.0 20.0 35.0 regionprofile 0.017301 0.017301 0.017301 0.017301 1.345619e+08 4.198606e+07 12597252.0 4.0 4198401.0 20.0 36.0 regionprofile 0.027680 0.027680 0.027680 0.027680 1.345619e+08 4.198606e+07 12597252.0 4.0 4198401.0 20.0 Polybench_GEMVER
{'name': 'Polybench_GESUMMV', 'type': 'function'} 1048576.0 36.0 regionprofile 0.026386 0.026386 0.026386 0.026386 1.683460e+07 4.205575e+06 1025.0 1.0 1050625.0 120.0 39 regionprofile 0.028663 0.028663 0.028663 0.028663 1.683460e+07 4.205575e+06 1025.0 1.0 1050625.0 120.0 36.0 regionprofile 0.055810 0.055810 0.055810 0.055810 1.683460e+07 4.205575e+06 1025.0 1.0 1050625.0 120.0 37.0 regionprofile 0.151902 0.151902 0.151902 0.151902 1.683460e+07 4.205575e+06 1025.0 1.0 1050625.0 120.0 Polybench_GESUMMV
4194304.0 36.0 regionprofile 0.056391 0.056391 0.056391 0.056391 6.722359e+07 1.679975e+07 2049.0 1.0 4198401.0 120.0 39 regionprofile 0.059838 0.059838 0.059838 0.059838 6.722359e+07 1.679975e+07 2049.0 1.0 4198401.0 120.0 36.0 regionprofile 0.109365 0.109365 0.109365 0.109365 6.722359e+07 1.679975e+07 2049.0 1.0 4198401.0 120.0 37.0 regionprofile 0.308204 0.308204 0.308204 0.308204 6.722359e+07 1.679975e+07 2049.0 1.0 4198401.0 120.0 Polybench_GESUMMV
{'name': 'Polybench_HEAT_3D', 'type': 'function'} 1048576.0 37.0 regionprofile 0.021932 0.021932 0.021932 0.021932 6.592000e+08 6.000000e+08 40000000.0 40.0 1000000.0 20.0 40 regionprofile 0.021845 0.021845 0.021845 0.021845 6.592000e+08 6.000000e+08 40000000.0 40.0 1000000.0 20.0 37.0 regionprofile 0.022019 0.022019 0.022019 0.022019 6.592000e+08 6.000000e+08 40000000.0 40.0 1000000.0 20.0 38.0 regionprofile 0.022779 0.022779 0.022779 0.022779 6.592000e+08 6.000000e+08 40000000.0 40.0 1000000.0 20.0 Polybench_HEAT_3D
4194304.0 37.0 regionprofile 0.077673 0.077673 0.077673 0.077673 2.670592e+09 2.457600e+09 163840000.0 40.0 4096000.0 20.0 40 regionprofile 0.077731 0.077731 0.077731 0.077731 2.670592e+09 2.457600e+09 163840000.0 40.0 4096000.0 20.0 37.0 regionprofile 0.077833 0.077833 0.077833 0.077833 2.670592e+09 2.457600e+09 163840000.0 40.0 4096000.0 20.0 38.0 regionprofile 0.077977 0.077977 0.077977 0.077977 2.670592e+09 2.457600e+09 163840000.0 40.0 4096000.0 20.0 Polybench_HEAT_3D
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} 1048576.0 38.0 regionprofile 0.078164 0.078164 0.078164 0.078164 5.368704e+08 1.006631e+08 33554368.0 32.0 1048574.0 100.0 41 regionprofile 0.077995 0.077995 0.077995 0.077995 5.368704e+08 1.006631e+08 33554368.0 32.0 1048574.0 100.0 38.0 regionprofile 0.078657 0.078657 0.078657 0.078657 5.368704e+08 1.006631e+08 33554368.0 32.0 1048574.0 100.0 39.0 regionprofile 0.079618 0.079618 0.079618 0.079618 5.368704e+08 1.006631e+08 33554368.0 32.0 1048574.0 100.0 Polybench_JACOBI_1D
4194304.0 38.0 regionprofile 0.277418 0.277418 0.277418 0.277418 2.147483e+09 4.026530e+08 134217664.0 32.0 4194302.0 100.0 41 regionprofile 0.277588 0.277588 0.277588 0.277588 2.147483e+09 4.026530e+08 134217664.0 32.0 4194302.0 100.0 38.0 regionprofile 0.278394 0.278394 0.278394 0.278394 2.147483e+09 4.026530e+08 134217664.0 32.0 4194302.0 100.0 39.0 regionprofile 0.279469 0.279469 0.279469 0.279469 2.147483e+09 4.026530e+08 134217664.0 32.0 4194302.0 100.0 Polybench_JACOBI_1D
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} 1048576.0 39.0 regionprofile 0.102414 0.102414 0.102414 0.102414 1.342176e+09 4.186116e+08 83722320.0 2.0 1046529.0 50.0 42 regionprofile 0.103129 0.103129 0.103129 0.103129 1.342176e+09 4.186116e+08 83722320.0 2.0 1046529.0 50.0 39.0 regionprofile 0.102936 0.102936 0.102936 0.102936 1.342176e+09 4.186116e+08 83722320.0 2.0 1046529.0 50.0 40.0 regionprofile 0.103622 0.103622 0.103622 0.103622 1.342176e+09 4.186116e+08 83722320.0 2.0 1046529.0 50.0 Polybench_JACOBI_2D
4194304.0 39.0 regionprofile 0.370495 0.370495 0.370495 0.370495 5.368708e+09 1.676084e+09 335216720.0 2.0 4190209.0 50.0 42 regionprofile 0.370471 0.370471 0.370471 0.370471 5.368708e+09 1.676084e+09 335216720.0 2.0 4190209.0 50.0 39.0 regionprofile 0.370860 0.370860 0.370860 0.370860 5.368708e+09 1.676084e+09 335216720.0 2.0 4190209.0 50.0 40.0 regionprofile 0.373282 0.373282 0.373282 0.373282 5.368708e+09 1.676084e+09 335216720.0 2.0 4190209.0 50.0 Polybench_JACOBI_2D
{'name': 'Polybench_MVT', 'type': 'function'} 1048576.0 40.0 regionprofile 0.021796 0.021796 0.021796 0.021796 1.685920e+07 4.202500e+06 2050.0 2.0 1050625.0 100.0 43 regionprofile 0.025190 0.025190 0.025190 0.025190 1.685920e+07 4.202500e+06 2050.0 2.0 1050625.0 100.0 40.0 regionprofile 0.032569 0.032569 0.032569 0.032569 1.685920e+07 4.202500e+06 2050.0 2.0 1050625.0 100.0 41.0 regionprofile 0.065664 0.065664 0.065664 0.065664 1.685920e+07 4.202500e+06 2050.0 2.0 1050625.0 100.0 Polybench_MVT
4194304.0 40.0 regionprofile 0.046011 0.046011 0.046011 0.046011 6.727277e+07 1.679360e+07 4098.0 2.0 4198401.0 100.0 43 regionprofile 0.052310 0.052310 0.052310 0.052310 6.727277e+07 1.679360e+07 4098.0 2.0 4198401.0 100.0 40.0 regionprofile 0.067230 0.067230 0.067230 0.067230 6.727277e+07 1.679360e+07 4098.0 2.0 4198401.0 100.0 41.0 regionprofile 0.131107 0.131107 0.131107 0.131107 6.727277e+07 1.679360e+07 4098.0 2.0 4198401.0 100.0 Polybench_MVT
{'name': 'Stream', 'type': 'function'} 1048576.0 41.0 regionprofile 0.259377 0.259377 0.259377 0.259377 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 2000.0 44 regionprofile 0.259331 0.259331 0.259331 0.259331 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 2000.0 41.0 regionprofile 0.258056 0.258056 0.258056 0.258056 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 2000.0 42.0 regionprofile 0.260428 0.260428 0.260428 0.260428 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 2000.0 Stream
4194304.0 41.0 regionprofile 0.772383 0.772383 0.772383 0.772383 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 2000.0 44 regionprofile 0.772250 0.772250 0.772250 0.772250 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 2000.0 41.0 regionprofile 0.775312 0.775312 0.775312 0.775312 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 2000.0 42.0 regionprofile 0.770613 0.770613 0.770613 0.770613 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 2000.0 Stream
{'name': 'Stream_ADD', 'type': 'function'} 1048576.0 42.0 regionprofile 0.033683 0.033683 0.033683 0.033683 2.516582e+07 1.048576e+06 1048576.0 1.0 1048576.0 1000.0 45 regionprofile 0.033667 0.033667 0.033667 0.033667 2.516582e+07 1.048576e+06 1048576.0 1.0 1048576.0 1000.0 42.0 regionprofile 0.033741 0.033741 0.033741 0.033741 2.516582e+07 1.048576e+06 1048576.0 1.0 1048576.0 1000.0 43.0 regionprofile 0.034200 0.034200 0.034200 0.034200 2.516582e+07 1.048576e+06 1048576.0 1.0 1048576.0 1000.0 Stream_ADD
4194304.0 42.0 regionprofile 0.122838 0.122838 0.122838 0.122838 1.006633e+08 4.194304e+06 4194304.0 1.0 4194304.0 1000.0 45 regionprofile 0.122784 0.122784 0.122784 0.122784 1.006633e+08 4.194304e+06 4194304.0 1.0 4194304.0 1000.0 42.0 regionprofile 0.123120 0.123120 0.123120 0.123120 1.006633e+08 4.194304e+06 4194304.0 1.0 4194304.0 1000.0 43.0 regionprofile 0.123213 0.123213 0.123213 0.123213 1.006633e+08 4.194304e+06 4194304.0 1.0 4194304.0 1000.0 Stream_ADD
{'name': 'Stream_COPY', 'type': 'function'} 1048576.0 43.0 regionprofile 0.043248 0.043248 0.043248 0.043248 1.677722e+07 0.000000e+00 1048576.0 1.0 1048576.0 1800.0 46 regionprofile 0.043435 0.043435 0.043435 0.043435 1.677722e+07 0.000000e+00 1048576.0 1.0 1048576.0 1800.0 43.0 regionprofile 0.043811 0.043811 0.043811 0.043811 1.677722e+07 0.000000e+00 1048576.0 1.0 1048576.0 1800.0 44.0 regionprofile 0.044358 0.044358 0.044358 0.044358 1.677722e+07 0.000000e+00 1048576.0 1.0 1048576.0 1800.0 Stream_COPY
4194304.0 43.0 regionprofile 0.154253 0.154253 0.154253 0.154253 6.710886e+07 0.000000e+00 4194304.0 1.0 4194304.0 1800.0 46 regionprofile 0.154664 0.154664 0.154664 0.154664 6.710886e+07 0.000000e+00 4194304.0 1.0 4194304.0 1800.0 43.0 regionprofile 0.155116 0.155116 0.155116 0.155116 6.710886e+07 0.000000e+00 4194304.0 1.0 4194304.0 1800.0 44.0 regionprofile 0.155421 0.155421 0.155421 0.155421 6.710886e+07 0.000000e+00 4194304.0 1.0 4194304.0 1800.0 Stream_COPY
{'name': 'Stream_DOT', 'type': 'function'} 1048576.0 44.0 regionprofile 0.105313 0.105313 0.105313 0.105313 1.677723e+07 2.097152e+06 1048576.0 1.0 1048576.0 2000.0 47 regionprofile 0.104737 0.104737 0.104737 0.104737 1.677723e+07 2.097152e+06 1048576.0 1.0 1048576.0 2000.0 44.0 regionprofile 0.102990 0.102990 0.102990 0.102990 1.677723e+07 2.097152e+06 1048576.0 1.0 1048576.0 2000.0 45.0 regionprofile 0.103427 0.103427 0.103427 0.103427 1.677723e+07 2.097152e+06 1048576.0 1.0 1048576.0 2000.0 Stream_DOT
4194304.0 44.0 regionprofile 0.218152 0.218152 0.218152 0.218152 6.710888e+07 8.388608e+06 4194304.0 1.0 4194304.0 2000.0 47 regionprofile 0.217043 0.217043 0.217043 0.217043 6.710888e+07 8.388608e+06 4194304.0 1.0 4194304.0 2000.0 44.0 regionprofile 0.218356 0.218356 0.218356 0.218356 6.710888e+07 8.388608e+06 4194304.0 1.0 4194304.0 2000.0 45.0 regionprofile 0.213373 0.213373 0.213373 0.213373 6.710888e+07 8.388608e+06 4194304.0 1.0 4194304.0 2000.0 Stream_DOT
{'name': 'Stream_MUL', 'type': 'function'} 1048576.0 45.0 regionprofile 0.043266 0.043266 0.043266 0.043266 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 1800.0 48 regionprofile 0.043424 0.043424 0.043424 0.043424 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 1800.0 45.0 regionprofile 0.043797 0.043797 0.043797 0.043797 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 1800.0 46.0 regionprofile 0.044307 0.044307 0.044307 0.044307 1.677722e+07 1.048576e+06 1048576.0 1.0 1048576.0 1800.0 Stream_MUL
4194304.0 45.0 regionprofile 0.154268 0.154268 0.154268 0.154268 6.710886e+07 4.194304e+06 4194304.0 1.0 4194304.0 1800.0 48 regionprofile 0.154717 0.154717 0.154717 0.154717 6.710886e+07 4.194304e+06 4194304.0 1.0 4194304.0 1800.0 45.0 regionprofile 0.155135 0.155135 0.155135 0.155135 6.710886e+07 4.194304e+06 4194304.0 1.0 4194304.0 1800.0 46.0 regionprofile 0.155466 0.155466 0.155466 0.155466 6.710886e+07 4.194304e+06 4194304.0 1.0 4194304.0 1800.0 Stream_MUL
{'name': 'Stream_TRIAD', 'type': 'function'} 1048576.0 46.0 regionprofile 0.033838 0.033838 0.033838 0.033838 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 1000.0 49 regionprofile 0.034037 0.034037 0.034037 0.034037 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 1000.0 46.0 regionprofile 0.033686 0.033686 0.033686 0.033686 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 1000.0 47.0 regionprofile 0.034100 0.034100 0.034100 0.034100 2.516582e+07 2.097152e+06 1048576.0 1.0 1048576.0 1000.0 Stream_TRIAD
4194304.0 46.0 regionprofile 0.122837 0.122837 0.122837 0.122837 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 1000.0 49 regionprofile 0.123006 0.123006 0.123006 0.123006 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 1000.0 46.0 regionprofile 0.123550 0.123550 0.123550 0.123550 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 1000.0 47.0 regionprofile 0.123103 0.123103 0.123103 0.123103 1.006633e+08 8.388608e+06 4194304.0 1.0 4194304.0 1000.0 Stream_TRIAD
{'name': 'Basic_INDEXLIST', 'type': 'function'} 1048576.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN 5 regionprofile 0.004352 0.004352 0.004352 0.004352 1.048578e+07 0.000000e+00 1048576.0 1.0 1048576.0 100.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN Basic_INDEXLIST
4194304.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN 5 regionprofile 0.010043 0.010043 0.010043 0.010043 4.194306e+07 0.000000e+00 4194304.0 1.0 4194304.0 100.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN Basic_INDEXLIST
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} 1048576.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN 6 regionprofile 0.008052 0.008052 0.008052 0.008052 2.726300e+07 0.000000e+00 3145729.0 3.0 1048576.0 100.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN Basic_INDEXLIST_3LOOP
4194304.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN 6 regionprofile 0.024382 0.024382 0.024382 0.024382 1.090519e+08 0.000000e+00 12582913.0 3.0 4194304.0 100.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN Basic_INDEXLIST_3LOOP
{'name': 'Basic_MAT_MAT_SHARED', 'type': 'function'} 1048576.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN 10 regionprofile 0.006440 0.006440 0.006440 0.006440 1.677722e+07 2.147484e+09 1048576.0 1.0 1048576.0 5.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN 8.0 regionprofile 0.005955 0.005955 0.005955 0.005955 1.677722e+07 2.147484e+09 1048576.0 1.0 1048576.0 5.0 Basic_MAT_MAT_SHARED
4194304.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN 10 regionprofile 0.042978 0.042978 0.042978 0.042978 6.710886e+07 1.717987e+10 4194304.0 1.0 4194304.0 5.0 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN 8.0 regionprofile 0.040467 0.040467 0.040467 0.040467 6.710886e+07 1.717987e+10 4194304.0 1.0 4194304.0 5.0 Basic_MAT_MAT_SHARED

Filter with respect to metadata

The metadata table of a thicket lets you select a subset of profiles based on specific metadata values, for example the profiles built with a particular compiler. In this example, we filter the metadata to select the profiles generated with a block size of 128.

[11]:
# select the profiles generated with a GPU block size of 128
filter_metadata_func = lambda x: x["gpu_targets_block_sizes"] == "128"
th_example = th_obj.filter_metadata(filter_metadata_func)
display(HTML(th_example.metadata.to_html()))
spot.channels spot.options spot.format.version spot.timeseries.metrics spot.metrics cali.caliper.version variant Algorithm_MEMCPY Algorithm_MEMSET Algorithm_REDUCE_SUM Algorithm_SORTPAIRS Algorithm_SORT Algorithm_SCAN Apps_VOL3D Apps_PRESSURE Apps_NODAL_ACCUMULATION_3D Apps_MASS3DPA Apps_LTIMES_NOVIEW Apps_LTIMES Apps_HALOEXCHANGE_FUSED Apps_HALOEXCHANGE Apps_FIR Apps_ENERGY Apps_DIFFUSION3DPA Apps_DEL_DOT_VEC_2D Apps_CONVECTION3DPA Stream_TRIAD Stream_MUL Stream_DOT Stream_COPY Stream_ADD Polybench_MVT Polybench_JACOBI_2D Polybench_JACOBI_1D Polybench_HEAT_3D Polybench_GESUMMV Polybench_GEMVER Polybench_GEMM Polybench_FLOYD_WARSHALL Polybench_FDTD_2D Polybench_ATAX Polybench_ADI Polybench_3MM Polybench_2MM Lcals_TRIDIAG_ELIM Lcals_PLANCKIAN Lcals_INT_PREDICT Lcals_HYDRO_2D Lcals_HYDRO_1D Lcals_GEN_LIN_RECUR Lcals_FIRST_SUM Lcals_FIRST_MIN Lcals_FIRST_DIFF Lcals_EOS Lcals_DIFF_PREDICT Basic_TRAP_INT Basic_REDUCE_STRUCT Basic_REDUCE3_INT Basic_PI_REDUCE Basic_PI_ATOMIC Basic_NESTED_INIT Basic_MULADDSUB Basic_MAT_MAT_SHARED Basic_INIT_VIEW1D_OFFSET Basic_INIT_VIEW1D Basic_INIT3 Basic_INDEXLIST_3LOOP Basic_INDEXLIST Basic_IF_QUAD Basic_DAXPY_ATOMIC Basic_DAXPY SizeMeaning Tuning machine_build systype_build gpu_targets_block_sizes cuda_compiler_version Compiler_path_version compiler compiler_version rajaperf_compiler_options rajaperf_compiler cmake_exe_linker_flags cmake_cxx_flags cmake_build_type raja_version perfsuite_version cluster cmdline libraries launchdate user cali.channel
profile
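
The table above has a header but no rows, so the predicate matched no profile. One plausible cause, offered here as an assumption rather than a confirmed diagnosis, is a type mismatch: the grouping keys printed in the next section show the block size as the integer 128, while the filter compares against the string "128". The sketch below mimics a row-wise predicate with plain Python dictionaries to show how the two comparisons differ. It does not call Thicket; `filter_rows`, the profile labels, and the 256 entry are hypothetical.

```python
# Stand-in for a metadata table: one dict per profile (hypothetical values).
metadata_rows = {
    "profile_a": {"gpu_targets_block_sizes": 128, "cluster": "lassen"},
    "profile_b": {"gpu_targets_block_sizes": 128, "cluster": "lassen"},
    "profile_c": {"gpu_targets_block_sizes": 256, "cluster": "lassen"},
}


def filter_rows(rows, predicate):
    """Keep the profiles whose metadata row satisfies the predicate."""
    return {profile: row for profile, row in rows.items() if predicate(row)}


# Comparing an integer value against a string never matches.
as_string = filter_rows(
    metadata_rows, lambda x: x["gpu_targets_block_sizes"] == "128"
)

# Comparing against an integer selects the intended profiles.
as_integer = filter_rows(
    metadata_rows, lambda x: x["gpu_targets_block_sizes"] == 128
)

print(sorted(as_string))   # []
print(sorted(as_integer))  # ['profile_a', 'profile_b']
```

If the column is indeed integer-typed, inspecting `th_obj.metadata["gpu_targets_block_sizes"].dtype` before writing the predicate would show which literal to compare against.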

Group with the metadata

The metadata table also supports grouping a thicket by the unique values in one or more columns. The following example creates one sub-thicket for each unique combination of values in the launchdate and gpu_targets_block_sizes columns.
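
As a rough illustration of the grouping semantics (not Thicket's implementation), the sketch below groups plain Python metadata rows by the tuple of values found in the chosen columns. The launch dates and block size follow the output of the cell below; `group_profiles` and the profile labels are hypothetical.

```python
from collections import defaultdict

# Stand-in metadata rows; launchdate and block-size values follow the
# notebook output, the profile labels are hypothetical.
metadata_rows = {
    "profile_a": {"launchdate": 1668559508, "gpu_targets_block_sizes": 128},
    "profile_b": {"launchdate": 1668559517, "gpu_targets_block_sizes": 128},
    "profile_c": {"launchdate": 1668559527, "gpu_targets_block_sizes": 128},
}


def group_profiles(rows, columns):
    """Map each unique combination of column values to its profiles."""
    groups = defaultdict(list)
    for profile, row in rows.items():
        key = tuple(row[col] for col in columns)
        groups[key].append(profile)
    return dict(groups)


groups = group_profiles(metadata_rows, ["launchdate", "gpu_targets_block_sizes"])
print(f"{len(groups)} groups")
print(sorted(groups))
```

Because every launch date here is unique, each group holds exactly one profile, which is consistent with the single-row metadata tables that the cell below displays.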

[12]:
# create sub-thickets from unique combinations of launchdate and gpu_targets_block_sizes
grouping_metadata_cols = ["launchdate", "gpu_targets_block_sizes"]
sub_thickets = th_lassen.groupby(grouping_metadata_cols)
for th in sub_thickets:
    display(th.metadata)
3  thickets created...
[(1668559508, 128), (1668559517, 128), (1668559527, 128)]
spot.channels spot.options spot.format.version spot.timeseries.metrics spot.metrics cali.caliper.version variant Algorithm_MEMCPY Algorithm_MEMSET Algorithm_REDUCE_SUM ... cmake_cxx_flags cmake_build_type raja_version perfsuite_version cluster cmdline libraries launchdate user cali.channel
profile
2118176828577267326 regionprofile 2 min#inclusive#sum#time.duration,max#inclusive#... 2.9.0-dev Base_CUDA {'Problem size': 1048576,'Reps':100,'Iteration... {'Problem size': 1048576,'Reps':100,'Iteration... {'Problem size': 1048576,'Reps':50,'Iterations... ... -qthreaded Release 2022.03.0 2022.3.0 lassen [./my-xl-block-128-build/bin/raja-perf.exe, --... [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/wo... 1668559508 burgess9 spot

1 rows × 92 columns

spot.channels spot.options spot.format.version spot.timeseries.metrics spot.metrics cali.caliper.version variant Algorithm_MEMCPY Algorithm_MEMSET Algorithm_REDUCE_SUM ... cmake_cxx_flags cmake_build_type raja_version perfsuite_version cluster cmdline libraries launchdate user cali.channel
profile
7599706706110163426 regionprofile 2 min#inclusive#sum#time.duration,max#inclusive#... 2.9.0-dev Base_CUDA {'Problem size': 1048576,'Reps':100,'Iteration... {'Problem size': 1048576,'Reps':100,'Iteration... {'Problem size': 1048576,'Reps':50,'Iterations... ... -qthreaded Release 2022.03.0 2022.3.0 lassen [./my-xl-block-128-build/bin/raja-perf.exe, --... [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/wo... 1668559517 burgess9 spot

1 rows × 92 columns

spot.channels spot.options spot.format.version spot.timeseries.metrics spot.metrics cali.caliper.version variant Algorithm_MEMCPY Algorithm_MEMSET Algorithm_REDUCE_SUM ... cmake_cxx_flags cmake_build_type raja_version perfsuite_version cluster cmdline libraries launchdate user cali.channel
profile
5411856859942141273 regionprofile 2 min#inclusive#sum#time.duration,max#inclusive#... 2.9.0-dev Base_CUDA {'Problem size': 1048576,'Reps':100,'Iteration... {'Problem size': 1048576,'Reps':100,'Iteration... {'Problem size': 1048576,'Reps':50,'Iterations... ... -qthreaded Release 2022.03.0 2022.3.0 lassen [./my-xl-block-128-build/bin/raja-perf.exe, --... [/lib64/libdl.so.2, /lib64/librt.so.1, /usr/wo... 1668559527 burgess9 spot

1 rows × 92 columns
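
To make the grouping rule concrete, here is a minimal pandas sketch of splitting a metadata table by each unique combination of values in two columns. The table is a hypothetical stand-in whose values loosely mirror the output above. It illustrates the idea only and is not how Thicket implements groupby.

```python
import pandas as pd

# Hypothetical metadata table: three profiles with distinct launch dates.
metadata = pd.DataFrame(
    {
        "launchdate": [1668559508, 1668559517, 1668559527],
        "gpu_targets_block_sizes": [128, 128, 128],
        "cluster": ["lassen", "lassen", "lassen"],
    },
    index=pd.Index(["p1", "p2", "p3"], name="profile"),
)

cols = ["launchdate", "gpu_targets_block_sizes"]

# One sub-table per unique (launchdate, block size) combination.
groups = {
    tuple(int(v) for v in key): sub
    for key, sub in metadata.groupby(cols)
}

print(len(groups))  # 3
for key, sub in groups.items():
    print(key, list(sub.index))
```

Because every profile in this toy table has a different launchdate, each group holds exactly one profile, which matches the "1 rows" tables shown above.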

The aggregated statistics table in a thicket is a GraphFrame, so it contains a graph and a corresponding dataframe component. The table supports an order-reduction mechanism and stores processed application performance data. Each row holds data aggregated across all profiles associated with a particular call tree node. Below is an example of an empty aggregated statistics table, which so far contains only the name column.

View aggregated statistics table

[13]:
display(HTML(th_lassen.statsframe.dataframe.to_html()))
name
node
{'name': 'Base_CUDA', 'type': 'function'} Base_CUDA
{'name': 'Algorithm', 'type': 'function'} Algorithm
{'name': 'Algorithm_MEMCPY', 'type': 'function'} Algorithm_MEMCPY
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} Algorithm_MEMCPY.block_128
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} Algorithm_MEMCPY.library
{'name': 'Algorithm_MEMSET', 'type': 'function'} Algorithm_MEMSET
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} Algorithm_MEMSET.block_128
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} Algorithm_MEMSET.library
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} Algorithm_REDUCE_SUM
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} Algorithm_REDUCE_SUM.block_128
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} Algorithm_REDUCE_SUM.cub
{'name': 'Algorithm_SCAN', 'type': 'function'} Algorithm_SCAN
{'name': 'Algorithm_SCAN.default', 'type': 'function'} Algorithm_SCAN.default
{'name': 'Apps', 'type': 'function'} Apps
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} Apps_CONVECTION3DPA
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} Apps_CONVECTION3DPA.block_64
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} Apps_DEL_DOT_VEC_2D
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} Apps_DEL_DOT_VEC_2D.block_128
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} Apps_DIFFUSION3DPA
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} Apps_DIFFUSION3DPA.block_64
{'name': 'Apps_ENERGY', 'type': 'function'} Apps_ENERGY
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} Apps_ENERGY.block_128
{'name': 'Apps_FIR', 'type': 'function'} Apps_FIR
{'name': 'Apps_FIR.block_128', 'type': 'function'} Apps_FIR.block_128
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} Apps_HALOEXCHANGE
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} Apps_HALOEXCHANGE.block_128
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} Apps_HALOEXCHANGE_FUSED
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} Apps_HALOEXCHANGE_FUSED.block_128
{'name': 'Apps_LTIMES', 'type': 'function'} Apps_LTIMES
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} Apps_LTIMES.block_128
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} Apps_LTIMES_NOVIEW
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} Apps_LTIMES_NOVIEW.block_128
{'name': 'Apps_MASS3DPA', 'type': 'function'} Apps_MASS3DPA
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} Apps_MASS3DPA.block_25
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} Apps_NODAL_ACCUMULATION_3D
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} Apps_NODAL_ACCUMULATION_3D.block_128
{'name': 'Apps_PRESSURE', 'type': 'function'} Apps_PRESSURE
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} Apps_PRESSURE.block_128
{'name': 'Apps_VOL3D', 'type': 'function'} Apps_VOL3D
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} Apps_VOL3D.block_128
{'name': 'Basic', 'type': 'function'} Basic
{'name': 'Basic_DAXPY', 'type': 'function'} Basic_DAXPY
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} Basic_DAXPY.block_128
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} Basic_DAXPY_ATOMIC
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} Basic_DAXPY_ATOMIC.block_128
{'name': 'Basic_IF_QUAD', 'type': 'function'} Basic_IF_QUAD
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} Basic_IF_QUAD.block_128
{'name': 'Basic_INDEXLIST', 'type': 'function'} Basic_INDEXLIST
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} Basic_INDEXLIST.block_256
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} Basic_INDEXLIST_3LOOP
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} Basic_INDEXLIST_3LOOP.block_256
{'name': 'Basic_INIT3', 'type': 'function'} Basic_INIT3
{'name': 'Basic_INIT3.block_128', 'type': 'function'} Basic_INIT3.block_128
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} Basic_INIT_VIEW1D
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} Basic_INIT_VIEW1D.block_128
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} Basic_INIT_VIEW1D_OFFSET
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} Basic_INIT_VIEW1D_OFFSET.block_128
{'name': 'Basic_MULADDSUB', 'type': 'function'} Basic_MULADDSUB
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} Basic_MULADDSUB.block_128
{'name': 'Basic_NESTED_INIT', 'type': 'function'} Basic_NESTED_INIT
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} Basic_NESTED_INIT.block_128
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} Basic_PI_ATOMIC
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} Basic_PI_ATOMIC.block_128
{'name': 'Basic_PI_REDUCE', 'type': 'function'} Basic_PI_REDUCE
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} Basic_PI_REDUCE.block_128
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} Basic_REDUCE3_INT
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} Basic_REDUCE3_INT.block_128
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} Basic_REDUCE_STRUCT
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} Basic_REDUCE_STRUCT.block_128
{'name': 'Basic_TRAP_INT', 'type': 'function'} Basic_TRAP_INT
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} Basic_TRAP_INT.block_128
{'name': 'Lcals', 'type': 'function'} Lcals
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} Lcals_DIFF_PREDICT
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} Lcals_DIFF_PREDICT.block_128
{'name': 'Lcals_EOS', 'type': 'function'} Lcals_EOS
{'name': 'Lcals_EOS.block_128', 'type': 'function'} Lcals_EOS.block_128
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} Lcals_FIRST_DIFF
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} Lcals_FIRST_DIFF.block_128
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} Lcals_FIRST_MIN
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} Lcals_FIRST_MIN.block_128
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} Lcals_FIRST_SUM
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} Lcals_FIRST_SUM.block_128
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} Lcals_GEN_LIN_RECUR
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} Lcals_GEN_LIN_RECUR.block_128
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} Lcals_HYDRO_1D
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} Lcals_HYDRO_1D.block_128
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} Lcals_HYDRO_2D
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} Lcals_HYDRO_2D.block_128
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} Lcals_INT_PREDICT
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} Lcals_INT_PREDICT.block_128
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} Lcals_PLANCKIAN
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} Lcals_PLANCKIAN.block_128
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} Lcals_TRIDIAG_ELIM
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} Lcals_TRIDIAG_ELIM.block_128
{'name': 'Polybench', 'type': 'function'} Polybench
{'name': 'Polybench_2MM', 'type': 'function'} Polybench_2MM
{'name': 'Polybench_2MM.block_128', 'type': 'function'} Polybench_2MM.block_128
{'name': 'Polybench_3MM', 'type': 'function'} Polybench_3MM
{'name': 'Polybench_3MM.block_128', 'type': 'function'} Polybench_3MM.block_128
{'name': 'Polybench_ADI', 'type': 'function'} Polybench_ADI
{'name': 'Polybench_ADI.block_128', 'type': 'function'} Polybench_ADI.block_128
{'name': 'Polybench_ATAX', 'type': 'function'} Polybench_ATAX
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} Polybench_ATAX.block_128
{'name': 'Polybench_FDTD_2D', 'type': 'function'} Polybench_FDTD_2D
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} Polybench_FDTD_2D.block_128
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} Polybench_FLOYD_WARSHALL
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} Polybench_FLOYD_WARSHALL.block_128
{'name': 'Polybench_GEMM', 'type': 'function'} Polybench_GEMM
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} Polybench_GEMM.block_128
{'name': 'Polybench_GEMVER', 'type': 'function'} Polybench_GEMVER
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} Polybench_GEMVER.block_128
{'name': 'Polybench_GESUMMV', 'type': 'function'} Polybench_GESUMMV
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} Polybench_GESUMMV.block_128
{'name': 'Polybench_HEAT_3D', 'type': 'function'} Polybench_HEAT_3D
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} Polybench_HEAT_3D.block_128
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} Polybench_JACOBI_1D
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} Polybench_JACOBI_1D.block_128
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} Polybench_JACOBI_2D
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} Polybench_JACOBI_2D.block_128
{'name': 'Polybench_MVT', 'type': 'function'} Polybench_MVT
{'name': 'Polybench_MVT.block_128', 'type': 'function'} Polybench_MVT.block_128
{'name': 'Stream', 'type': 'function'} Stream
{'name': 'Stream_ADD', 'type': 'function'} Stream_ADD
{'name': 'Stream_ADD.block_128', 'type': 'function'} Stream_ADD.block_128
{'name': 'Stream_COPY', 'type': 'function'} Stream_COPY
{'name': 'Stream_COPY.block_128', 'type': 'function'} Stream_COPY.block_128
{'name': 'Stream_DOT', 'type': 'function'} Stream_DOT
{'name': 'Stream_DOT.block_128', 'type': 'function'} Stream_DOT.block_128
{'name': 'Stream_MUL', 'type': 'function'} Stream_MUL
{'name': 'Stream_MUL.block_128', 'type': 'function'} Stream_MUL.block_128
{'name': 'Stream_TRIAD', 'type': 'function'} Stream_TRIAD
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} Stream_TRIAD.block_128

Filter with respect to aggregated statistics

The aggregated statistics table also supports a filter function. In the example below, we filter the table to select the nodes with the names Base_CUDA, Algorithm, and Stream_MUL.

[14]:
stats_nodes = ["Base_CUDA", "Algorithm", "Stream_MUL"]
th_stats_name = th_obj.filter_stats(lambda x: x["name"] in stats_nodes)
display(HTML(th_stats_name.statsframe.dataframe.to_html()))
name
node
{'name': 'Base_CUDA', 'type': 'function'} Base_CUDA
{'name': 'Algorithm', 'type': 'function'} Algorithm
{'name': 'Stream_MUL', 'type': 'function'} Stream_MUL
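
The predicate above uses exact membership, so a node such as Algorithm_MEMCPY is not selected even though its name starts with "Algorithm". A small pandas sketch of the same selection rule follows. The `stats` table is a hypothetical stand-in, and this is an analogy rather than Thicket's own code.

```python
import pandas as pd

# Hypothetical stand-in for the aggregated statistics table.
stats = pd.DataFrame(
    {"name": ["Base_CUDA", "Algorithm", "Algorithm_MEMCPY", "Stream_MUL"]}
)
stats_nodes = ["Base_CUDA", "Algorithm", "Stream_MUL"]

# Row-wise predicate, in the same style as the lambda passed to filter_stats.
row_wise = stats[stats.apply(lambda x: x["name"] in stats_nodes, axis=1)]

# Equivalent vectorized form using Series.isin.
vectorized = stats[stats["name"].isin(stats_nodes)]

print(row_wise["name"].tolist())    # ['Base_CUDA', 'Algorithm', 'Stream_MUL']
print(row_wise.equals(vectorized))  # True
```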

Calculate the median and mean of a performance data column and append them to the aggregated statistics table

The aggregated statistics table lets users compute summary statistics over a column of the performance data. Calling median() and mean() on a column appends two new columns to the aggregated statistics table, holding the median and mean values of the column(s) provided by the user.
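
As a rough pandas analogy (an illustrative sketch with made-up numbers, not Thicket's implementation), the aggregation can be pictured as grouping the performance data by node, reducing one metric across profiles, and storing each result in a new column named after the metric with a suffix:

```python
import pandas as pd

# Hypothetical performance data: one row per (node, profile) pair.
perf = pd.DataFrame(
    {
        "node": ["A", "A", "A", "B", "B", "B"],
        "profile": [1, 2, 3, 1, 2, 3],
        "Total time (exc)": [1.0, 2.0, 6.0, 4.0, 4.0, 7.0],
    }
).set_index(["node", "profile"])

# Hypothetical statistics table: one row per node, initially only a name.
stats = pd.DataFrame(
    {"name": ["A", "B"]}, index=pd.Index(["A", "B"], name="node")
)

col = "Total time (exc)"
per_node = perf.groupby(level="node")[col]

# Reduce across profiles and append one new column per statistic.
stats[col + "_median"] = per_node.median()
stats[col + "_mean"] = per_node.mean()

print(stats)
```

In this toy data, node A has a median of 2.0 and a mean of 3.0, which shows how a single slow profile pulls the mean above the median.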

Below is an example where we calculate the mean and median of the values in the Total time (exc) column, which is the total exclusive time corresponding to the respective nodes.

[15]:
metrics = ["Total time (exc)"]
tt.median(th_lassen, columns=metrics)
display(HTML(th_lassen.statsframe.dataframe.to_html()))
name Total time (exc)_median
node
{'name': 'Base_CUDA', 'type': 'function'} Base_CUDA 0.000636
{'name': 'Algorithm', 'type': 'function'} Algorithm 0.000048
{'name': 'Algorithm_MEMCPY', 'type': 'function'} Algorithm_MEMCPY 0.000016
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} Algorithm_MEMCPY.block_128 0.002440
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} Algorithm_MEMCPY.library 0.002609
{'name': 'Algorithm_MEMSET', 'type': 'function'} Algorithm_MEMSET 0.000016
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} Algorithm_MEMSET.block_128 0.001488
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} Algorithm_MEMSET.library 0.001281
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} Algorithm_REDUCE_SUM 0.000019
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} Algorithm_REDUCE_SUM.block_128 0.002683
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} Algorithm_REDUCE_SUM.cub 0.001527
{'name': 'Algorithm_SCAN', 'type': 'function'} Algorithm_SCAN 0.000008
{'name': 'Algorithm_SCAN.default', 'type': 'function'} Algorithm_SCAN.default 0.002917
{'name': 'Apps', 'type': 'function'} Apps 0.000086
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} Apps_CONVECTION3DPA 0.000008
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} Apps_CONVECTION3DPA.block_64 0.003161
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} Apps_DEL_DOT_VEC_2D 0.000009
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} Apps_DEL_DOT_VEC_2D.block_128 0.007006
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} Apps_DIFFUSION3DPA 0.000010
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} Apps_DIFFUSION3DPA.block_64 0.004926
{'name': 'Apps_ENERGY', 'type': 'function'} Apps_ENERGY 0.000009
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} Apps_ENERGY.block_128 0.039303
{'name': 'Apps_FIR', 'type': 'function'} Apps_FIR 0.000010
{'name': 'Apps_FIR.block_128', 'type': 'function'} Apps_FIR.block_128 0.004157
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} Apps_HALOEXCHANGE 0.000009
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} Apps_HALOEXCHANGE.block_128 0.037478
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} Apps_HALOEXCHANGE_FUSED 0.000010
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} Apps_HALOEXCHANGE_FUSED.block_128 0.009430
{'name': 'Apps_LTIMES', 'type': 'function'} Apps_LTIMES 0.000008
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} Apps_LTIMES.block_128 0.012088
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} Apps_LTIMES_NOVIEW 0.000011
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} Apps_LTIMES_NOVIEW.block_128 0.012085
{'name': 'Apps_MASS3DPA', 'type': 'function'} Apps_MASS3DPA 0.000008
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} Apps_MASS3DPA.block_25 0.001904
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} Apps_NODAL_ACCUMULATION_3D 0.000009
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} Apps_NODAL_ACCUMULATION_3D.block_128 0.007408
{'name': 'Apps_PRESSURE', 'type': 'function'} Apps_PRESSURE 0.000009
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} Apps_PRESSURE.block_128 0.048251
{'name': 'Apps_VOL3D', 'type': 'function'} Apps_VOL3D 0.000009
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} Apps_VOL3D.block_128 0.005903
{'name': 'Basic', 'type': 'function'} Basic 0.000097
{'name': 'Basic_DAXPY', 'type': 'function'} Basic_DAXPY 0.000011
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} Basic_DAXPY.block_128 0.016924
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} Basic_DAXPY_ATOMIC 0.000009
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} Basic_DAXPY_ATOMIC.block_128 0.016947
{'name': 'Basic_IF_QUAD', 'type': 'function'} Basic_IF_QUAD 0.000008
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} Basic_IF_QUAD.block_128 0.012622
{'name': 'Basic_INDEXLIST', 'type': 'function'} Basic_INDEXLIST 0.000008
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} Basic_INDEXLIST.block_256 0.004506
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} Basic_INDEXLIST_3LOOP 0.000009
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} Basic_INDEXLIST_3LOOP.block_256 0.008175
{'name': 'Basic_INIT3', 'type': 'function'} Basic_INIT3 0.000009
{'name': 'Basic_INIT3.block_128', 'type': 'function'} Basic_INIT3.block_128 0.028197
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} Basic_INIT_VIEW1D 0.000009
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} Basic_INIT_VIEW1D.block_128 0.041780
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} Basic_INIT_VIEW1D_OFFSET 0.000009
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} Basic_INIT_VIEW1D_OFFSET.block_128 0.041795
{'name': 'Basic_MULADDSUB', 'type': 'function'} Basic_MULADDSUB 0.000011
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} Basic_MULADDSUB.block_128 0.019746
{'name': 'Basic_NESTED_INIT', 'type': 'function'} Basic_NESTED_INIT 0.000009
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} Basic_NESTED_INIT.block_128 0.020670
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} Basic_PI_ATOMIC 0.000009
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} Basic_PI_ATOMIC.block_128 0.124113
{'name': 'Basic_PI_REDUCE', 'type': 'function'} Basic_PI_REDUCE 0.000008
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} Basic_PI_REDUCE.block_128 0.002602
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} Basic_REDUCE3_INT 0.000008
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} Basic_REDUCE3_INT.block_128 0.002380
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} Basic_REDUCE_STRUCT 0.000008
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} Basic_REDUCE_STRUCT.block_128 0.016917
{'name': 'Basic_TRAP_INT', 'type': 'function'} Basic_TRAP_INT 0.000007
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} Basic_TRAP_INT.block_128 0.002596
{'name': 'Lcals', 'type': 'function'} Lcals 0.000074
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} Lcals_DIFF_PREDICT 0.000009
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} Lcals_DIFF_PREDICT.block_128 0.061523
{'name': 'Lcals_EOS', 'type': 'function'} Lcals_EOS 0.000010
{'name': 'Lcals_EOS.block_128', 'type': 'function'} Lcals_EOS.block_128 0.022685
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} Lcals_FIRST_DIFF 0.000009
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} Lcals_FIRST_DIFF.block_128 0.048455
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} Lcals_FIRST_MIN 0.000013
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} Lcals_FIRST_MIN.block_128 0.048713
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} Lcals_FIRST_SUM 0.000009
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} Lcals_FIRST_SUM.block_128 0.048516
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} Lcals_GEN_LIN_RECUR 0.000009
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} Lcals_GEN_LIN_RECUR.block_128 0.048865
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} Lcals_HYDRO_1D 0.000009
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} Lcals_HYDRO_1D.block_128 0.033971
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} Lcals_HYDRO_2D 0.000010
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} Lcals_HYDRO_2D.block_128 0.022995
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} Lcals_INT_PREDICT 0.000010
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} Lcals_INT_PREDICT.block_128 0.046876
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} Lcals_PLANCKIAN 0.000010
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} Lcals_PLANCKIAN.block_128 0.002759
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} Lcals_TRIDIAG_ELIM 0.000010
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} Lcals_TRIDIAG_ELIM.block_128 0.044681
{'name': 'Polybench', 'type': 'function'} Polybench 0.000086
{'name': 'Polybench_2MM', 'type': 'function'} Polybench_2MM 0.000008
{'name': 'Polybench_2MM.block_128', 'type': 'function'} Polybench_2MM.block_128 0.005949
{'name': 'Polybench_3MM', 'type': 'function'} Polybench_3MM 0.000008
{'name': 'Polybench_3MM.block_128', 'type': 'function'} Polybench_3MM.block_128 0.008885
{'name': 'Polybench_ADI', 'type': 'function'} Polybench_ADI 0.000009
{'name': 'Polybench_ADI.block_128', 'type': 'function'} Polybench_ADI.block_128 0.033592
{'name': 'Polybench_ATAX', 'type': 'function'} Polybench_ATAX 0.000009
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} Polybench_ATAX.block_128 0.037403
{'name': 'Polybench_FDTD_2D', 'type': 'function'} Polybench_FDTD_2D 0.000009
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} Polybench_FDTD_2D.block_128 0.037109
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} Polybench_FLOYD_WARSHALL 0.000008
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} Polybench_FLOYD_WARSHALL.block_128 0.209053
{'name': 'Polybench_GEMM', 'type': 'function'} Polybench_GEMM 0.000010
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} Polybench_GEMM.block_128 0.006741
{'name': 'Polybench_GEMVER', 'type': 'function'} Polybench_GEMVER 0.000008
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} Polybench_GEMVER.block_128 0.008432
{'name': 'Polybench_GESUMMV', 'type': 'function'} Polybench_GESUMMV 0.000009
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} Polybench_GESUMMV.block_128 0.037410
{'name': 'Polybench_HEAT_3D', 'type': 'function'} Polybench_HEAT_3D 0.000009
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} Polybench_HEAT_3D.block_128 0.021826
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} Polybench_JACOBI_1D 0.000011
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} Polybench_JACOBI_1D.block_128 0.077718
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} Polybench_JACOBI_2D 0.000010
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} Polybench_JACOBI_2D.block_128 0.102580
{'name': 'Polybench_MVT', 'type': 'function'} Polybench_MVT 0.000009
{'name': 'Polybench_MVT.block_128', 'type': 'function'} Polybench_MVT.block_128 0.035797
{'name': 'Stream', 'type': 'function'} Stream 0.000033
{'name': 'Stream_ADD', 'type': 'function'} Stream_ADD 0.000009
{'name': 'Stream_ADD.block_128', 'type': 'function'} Stream_ADD.block_128 0.033798
{'name': 'Stream_COPY', 'type': 'function'} Stream_COPY 0.000009
{'name': 'Stream_COPY.block_128', 'type': 'function'} Stream_COPY.block_128 0.043261
{'name': 'Stream_DOT', 'type': 'function'} Stream_DOT 0.000009
{'name': 'Stream_DOT.block_128', 'type': 'function'} Stream_DOT.block_128 0.113655
{'name': 'Stream_MUL', 'type': 'function'} Stream_MUL 0.000011
{'name': 'Stream_MUL.block_128', 'type': 'function'} Stream_MUL.block_128 0.043271
{'name': 'Stream_TRIAD', 'type': 'function'} Stream_TRIAD 0.000008
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} Stream_TRIAD.block_128 0.033730
[16]:
tt.mean(th_lassen, columns=metrics)
display(HTML(th_lassen.statsframe.dataframe.to_html()))
name Total time (exc)_median Total time (exc)_mean
node
{'name': 'Base_CUDA', 'type': 'function'} Base_CUDA 0.000636 0.000632
{'name': 'Algorithm', 'type': 'function'} Algorithm 0.000048 0.000048
{'name': 'Algorithm_MEMCPY', 'type': 'function'} Algorithm_MEMCPY 0.000016 0.000016
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} Algorithm_MEMCPY.block_128 0.002440 0.002442
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} Algorithm_MEMCPY.library 0.002609 0.002609
{'name': 'Algorithm_MEMSET', 'type': 'function'} Algorithm_MEMSET 0.000016 0.000015
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} Algorithm_MEMSET.block_128 0.001488 0.001487
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} Algorithm_MEMSET.library 0.001281 0.001279
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} Algorithm_REDUCE_SUM 0.000019 0.000019
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} Algorithm_REDUCE_SUM.block_128 0.002683 0.002686
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} Algorithm_REDUCE_SUM.cub 0.001527 0.001526
{'name': 'Algorithm_SCAN', 'type': 'function'} Algorithm_SCAN 0.000008 0.000008
{'name': 'Algorithm_SCAN.default', 'type': 'function'} Algorithm_SCAN.default 0.002917 0.002917
{'name': 'Apps', 'type': 'function'} Apps 0.000086 0.000088
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} Apps_CONVECTION3DPA 0.000008 0.000008
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} Apps_CONVECTION3DPA.block_64 0.003161 0.003161
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} Apps_DEL_DOT_VEC_2D 0.000009 0.000009
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} Apps_DEL_DOT_VEC_2D.block_128 0.007006 0.007019
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} Apps_DIFFUSION3DPA 0.000010 0.000009
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} Apps_DIFFUSION3DPA.block_64 0.004926 0.004932
{'name': 'Apps_ENERGY', 'type': 'function'} Apps_ENERGY 0.000009 0.000009
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} Apps_ENERGY.block_128 0.039303 0.039285
{'name': 'Apps_FIR', 'type': 'function'} Apps_FIR 0.000010 0.000010
{'name': 'Apps_FIR.block_128', 'type': 'function'} Apps_FIR.block_128 0.004157 0.004165
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} Apps_HALOEXCHANGE 0.000009 0.000009
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} Apps_HALOEXCHANGE.block_128 0.037478 0.037577
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} Apps_HALOEXCHANGE_FUSED 0.000010 0.000010
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} Apps_HALOEXCHANGE_FUSED.block_128 0.009430 0.009341
{'name': 'Apps_LTIMES', 'type': 'function'} Apps_LTIMES 0.000008 0.000008
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} Apps_LTIMES.block_128 0.012088 0.012090
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} Apps_LTIMES_NOVIEW 0.000011 0.000011
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} Apps_LTIMES_NOVIEW.block_128 0.012085 0.012087
{'name': 'Apps_MASS3DPA', 'type': 'function'} Apps_MASS3DPA 0.000008 0.000008
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} Apps_MASS3DPA.block_25 0.001904 0.001906
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} Apps_NODAL_ACCUMULATION_3D 0.000009 0.000008
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} Apps_NODAL_ACCUMULATION_3D.block_128 0.007408 0.007401
{'name': 'Apps_PRESSURE', 'type': 'function'} Apps_PRESSURE 0.000009 0.000009
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} Apps_PRESSURE.block_128 0.048251 0.048255
{'name': 'Apps_VOL3D', 'type': 'function'} Apps_VOL3D 0.000009 0.000009
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} Apps_VOL3D.block_128 0.005903 0.005877
{'name': 'Basic', 'type': 'function'} Basic 0.000097 0.000098
{'name': 'Basic_DAXPY', 'type': 'function'} Basic_DAXPY 0.000011 0.000011
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} Basic_DAXPY.block_128 0.016924 0.016901
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} Basic_DAXPY_ATOMIC 0.000009 0.000009
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} Basic_DAXPY_ATOMIC.block_128 0.016947 0.016938
{'name': 'Basic_IF_QUAD', 'type': 'function'} Basic_IF_QUAD 0.000008 0.000008
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} Basic_IF_QUAD.block_128 0.012622 0.012651
{'name': 'Basic_INDEXLIST', 'type': 'function'} Basic_INDEXLIST 0.000008 0.000009
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} Basic_INDEXLIST.block_256 0.004506 0.004434
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} Basic_INDEXLIST_3LOOP 0.000009 0.000009
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} Basic_INDEXLIST_3LOOP.block_256 0.008175 0.008132
{'name': 'Basic_INIT3', 'type': 'function'} Basic_INIT3 0.000009 0.000009
{'name': 'Basic_INIT3.block_128', 'type': 'function'} Basic_INIT3.block_128 0.028197 0.028248
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} Basic_INIT_VIEW1D 0.000009 0.000008
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} Basic_INIT_VIEW1D.block_128 0.041780 0.040132
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} Basic_INIT_VIEW1D_OFFSET 0.000009 0.000009
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} Basic_INIT_VIEW1D_OFFSET.block_128 0.041795 0.040145
{'name': 'Basic_MULADDSUB', 'type': 'function'} Basic_MULADDSUB 0.000011 0.000011
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} Basic_MULADDSUB.block_128 0.019746 0.019738
{'name': 'Basic_NESTED_INIT', 'type': 'function'} Basic_NESTED_INIT 0.000009 0.000009
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} Basic_NESTED_INIT.block_128 0.020670 0.019845
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} Basic_PI_ATOMIC 0.000009 0.000009
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} Basic_PI_ATOMIC.block_128 0.124113 0.120364
{'name': 'Basic_PI_REDUCE', 'type': 'function'} Basic_PI_REDUCE 0.000008 0.000008
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} Basic_PI_REDUCE.block_128 0.002602 0.002619
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} Basic_REDUCE3_INT 0.000008 0.000008
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} Basic_REDUCE3_INT.block_128 0.002380 0.002404
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} Basic_REDUCE_STRUCT 0.000008 0.000008
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} Basic_REDUCE_STRUCT.block_128 0.016917 0.017113
{'name': 'Basic_TRAP_INT', 'type': 'function'} Basic_TRAP_INT 0.000007 0.000007
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} Basic_TRAP_INT.block_128 0.002596 0.002601
{'name': 'Lcals', 'type': 'function'} Lcals 0.000074 0.000075
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} Lcals_DIFF_PREDICT 0.000009 0.000009
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} Lcals_DIFF_PREDICT.block_128 0.061523 0.061192
{'name': 'Lcals_EOS', 'type': 'function'} Lcals_EOS 0.000010 0.000010
{'name': 'Lcals_EOS.block_128', 'type': 'function'} Lcals_EOS.block_128 0.022685 0.022687
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} Lcals_FIRST_DIFF 0.000009 0.000009
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} Lcals_FIRST_DIFF.block_128 0.048455 0.048426
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} Lcals_FIRST_MIN 0.000013 0.000015
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} Lcals_FIRST_MIN.block_128 0.048713 0.058216
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} Lcals_FIRST_SUM 0.000009 0.000009
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} Lcals_FIRST_SUM.block_128 0.048516 0.048493
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} Lcals_GEN_LIN_RECUR 0.000009 0.000009
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} Lcals_GEN_LIN_RECUR.block_128 0.048865 0.048918
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} Lcals_HYDRO_1D 0.000009 0.000009
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} Lcals_HYDRO_1D.block_128 0.033971 0.033970
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} Lcals_HYDRO_2D 0.000010 0.000010
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} Lcals_HYDRO_2D.block_128 0.022995 0.023016
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} Lcals_INT_PREDICT 0.000010 0.000010
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} Lcals_INT_PREDICT.block_128 0.046876 0.046896
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} Lcals_PLANCKIAN 0.000010 0.000010
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} Lcals_PLANCKIAN.block_128 0.002759 0.002759
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} Lcals_TRIDIAG_ELIM 0.000010 0.000010
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} Lcals_TRIDIAG_ELIM.block_128 0.044681 0.044794
{'name': 'Polybench', 'type': 'function'} Polybench 0.000086 0.000086
{'name': 'Polybench_2MM', 'type': 'function'} Polybench_2MM 0.000008 0.000008
{'name': 'Polybench_2MM.block_128', 'type': 'function'} Polybench_2MM.block_128 0.005949 0.005949
{'name': 'Polybench_3MM', 'type': 'function'} Polybench_3MM 0.000008 0.000008
{'name': 'Polybench_3MM.block_128', 'type': 'function'} Polybench_3MM.block_128 0.008885 0.008885
{'name': 'Polybench_ADI', 'type': 'function'} Polybench_ADI 0.000009 0.000009
{'name': 'Polybench_ADI.block_128', 'type': 'function'} Polybench_ADI.block_128 0.033592 0.033582
{'name': 'Polybench_ATAX', 'type': 'function'} Polybench_ATAX 0.000009 0.000009
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} Polybench_ATAX.block_128 0.037403 0.037419
{'name': 'Polybench_FDTD_2D', 'type': 'function'} Polybench_FDTD_2D 0.000009 0.000009
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} Polybench_FDTD_2D.block_128 0.037109 0.037101
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} Polybench_FLOYD_WARSHALL 0.000008 0.000008
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} Polybench_FLOYD_WARSHALL.block_128 0.209053 0.209057
{'name': 'Polybench_GEMM', 'type': 'function'} Polybench_GEMM 0.000010 0.000010
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} Polybench_GEMM.block_128 0.006741 0.006742
{'name': 'Polybench_GEMVER', 'type': 'function'} Polybench_GEMVER 0.000008 0.000008
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} Polybench_GEMVER.block_128 0.008432 0.008436
{'name': 'Polybench_GESUMMV', 'type': 'function'} Polybench_GESUMMV 0.000009 0.000009
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} Polybench_GESUMMV.block_128 0.037410 0.037365
{'name': 'Polybench_HEAT_3D', 'type': 'function'} Polybench_HEAT_3D 0.000009 0.000009
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} Polybench_HEAT_3D.block_128 0.021826 0.021828
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} Polybench_JACOBI_1D 0.000011 0.000011
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} Polybench_JACOBI_1D.block_128 0.077718 0.077743
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} Polybench_JACOBI_2D 0.000010 0.000010
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} Polybench_JACOBI_2D.block_128 0.102580 0.102596
{'name': 'Polybench_MVT', 'type': 'function'} Polybench_MVT 0.000009 0.000010
{'name': 'Polybench_MVT.block_128', 'type': 'function'} Polybench_MVT.block_128 0.035797 0.035768
{'name': 'Stream', 'type': 'function'} Stream 0.000033 0.000034
{'name': 'Stream_ADD', 'type': 'function'} Stream_ADD 0.000009 0.000009
{'name': 'Stream_ADD.block_128', 'type': 'function'} Stream_ADD.block_128 0.033798 0.033840
{'name': 'Stream_COPY', 'type': 'function'} Stream_COPY 0.000009 0.000009
{'name': 'Stream_COPY.block_128', 'type': 'function'} Stream_COPY.block_128 0.043261 0.043184
{'name': 'Stream_DOT', 'type': 'function'} Stream_DOT 0.000009 0.000009
{'name': 'Stream_DOT.block_128', 'type': 'function'} Stream_DOT.block_128 0.113655 0.112893
{'name': 'Stream_MUL', 'type': 'function'} Stream_MUL 0.000011 0.000011
{'name': 'Stream_MUL.block_128', 'type': 'function'} Stream_MUL.block_128 0.043271 0.043180
{'name': 'Stream_TRIAD', 'type': 'function'} Stream_TRIAD 0.000008 0.000008
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} Stream_TRIAD.block_128 0.033730 0.033717

Calculate the percentiles of a performance data column and append them to the aggregated statistics table

The percentiles() function takes one or more columns from the performance data and appends a new column to the aggregated statistics table. For each node, the new column holds the 25th, 50th, and 75th percentiles of the values in the selected column.

The example below calculates the percentiles of the same Total time (exc) column used for the median and mean.

[17]:
tt.percentiles(th_lassen, columns=metrics)
display(HTML(th_lassen.statsframe.dataframe.to_html()))
name Total time (exc)_median Total time (exc)_mean Total time (exc)_percentiles
node
{'name': 'Base_CUDA', 'type': 'function'} Base_CUDA 0.000636 0.000632 [0.0006265, 0.000636, 0.000639]
{'name': 'Algorithm', 'type': 'function'} Algorithm 0.000048 0.000048 [4.6500000000000005e-05, 4.8e-05, 4.9e-05]
{'name': 'Algorithm_MEMCPY', 'type': 'function'} Algorithm_MEMCPY 0.000016 0.000016 [1.6e-05, 1.6e-05, 1.6e-05]
{'name': 'Algorithm_MEMCPY.block_128', 'type': 'function'} Algorithm_MEMCPY.block_128 0.002440 0.002442 [0.0024395, 0.00244, 0.0024435]
{'name': 'Algorithm_MEMCPY.library', 'type': 'function'} Algorithm_MEMCPY.library 0.002609 0.002609 [0.0026085, 0.002609, 0.002609]
{'name': 'Algorithm_MEMSET', 'type': 'function'} Algorithm_MEMSET 0.000016 0.000015 [1.4999999999999999e-05, 1.6e-05, 1.6e-05]
{'name': 'Algorithm_MEMSET.block_128', 'type': 'function'} Algorithm_MEMSET.block_128 0.001488 0.001487 [0.001486, 0.001488, 0.0014889999999999999]
{'name': 'Algorithm_MEMSET.library', 'type': 'function'} Algorithm_MEMSET.library 0.001281 0.001279 [0.001278, 0.001281, 0.0012815]
{'name': 'Algorithm_REDUCE_SUM', 'type': 'function'} Algorithm_REDUCE_SUM 0.000019 0.000019 [1.8500000000000002e-05, 1.9e-05, 1.95e-05]
{'name': 'Algorithm_REDUCE_SUM.block_128', 'type': 'function'} Algorithm_REDUCE_SUM.block_128 0.002683 0.002686 [0.0026804999999999997, 0.002683, 0.0026895]
{'name': 'Algorithm_REDUCE_SUM.cub', 'type': 'function'} Algorithm_REDUCE_SUM.cub 0.001527 0.001526 [0.0015225, 0.001527, 0.0015305]
{'name': 'Algorithm_SCAN', 'type': 'function'} Algorithm_SCAN 0.000008 0.000008 [7.499999999999999e-06, 8e-06, 8.5e-06]
{'name': 'Algorithm_SCAN.default', 'type': 'function'} Algorithm_SCAN.default 0.002917 0.002917 [0.002909, 0.002917, 0.0029245]
{'name': 'Apps', 'type': 'function'} Apps 0.000086 0.000088 [8.55e-05, 8.6e-05, 8.950000000000001e-05]
{'name': 'Apps_CONVECTION3DPA', 'type': 'function'} Apps_CONVECTION3DPA 0.000008 0.000008 [8e-06, 8e-06, 8e-06]
{'name': 'Apps_CONVECTION3DPA.block_64', 'type': 'function'} Apps_CONVECTION3DPA.block_64 0.003161 0.003161 [0.0031590000000000003, 0.003161, 0.0031635]
{'name': 'Apps_DEL_DOT_VEC_2D', 'type': 'function'} Apps_DEL_DOT_VEC_2D 0.000009 0.000009 [9e-06, 9e-06, 9e-06]
{'name': 'Apps_DEL_DOT_VEC_2D.block_128', 'type': 'function'} Apps_DEL_DOT_VEC_2D.block_128 0.007006 0.007019 [0.0069984999999999995, 0.007006, 0.007033]
{'name': 'Apps_DIFFUSION3DPA', 'type': 'function'} Apps_DIFFUSION3DPA 0.000010 0.000009 [9e-06, 1e-05, 1e-05]
{'name': 'Apps_DIFFUSION3DPA.block_64', 'type': 'function'} Apps_DIFFUSION3DPA.block_64 0.004926 0.004932 [0.0049250000000000006, 0.004926, 0.004935999999999999]
{'name': 'Apps_ENERGY', 'type': 'function'} Apps_ENERGY 0.000009 0.000009 [8.5e-06, 9e-06, 9.5e-06]
{'name': 'Apps_ENERGY.block_128', 'type': 'function'} Apps_ENERGY.block_128 0.039303 0.039285 [0.039274, 0.039303, 0.039305]
{'name': 'Apps_FIR', 'type': 'function'} Apps_FIR 0.000010 0.000010 [9.5e-06, 1e-05, 1e-05]
{'name': 'Apps_FIR.block_128', 'type': 'function'} Apps_FIR.block_128 0.004157 0.004165 [0.0041554999999999995, 0.004157, 0.004171]
{'name': 'Apps_HALOEXCHANGE', 'type': 'function'} Apps_HALOEXCHANGE 0.000009 0.000009 [9e-06, 9e-06, 9e-06]
{'name': 'Apps_HALOEXCHANGE.block_128', 'type': 'function'} Apps_HALOEXCHANGE.block_128 0.037478 0.037577 [0.037467, 0.037478, 0.037637000000000004]
{'name': 'Apps_HALOEXCHANGE_FUSED', 'type': 'function'} Apps_HALOEXCHANGE_FUSED 0.000010 0.000010 [1e-05, 1e-05, 1e-05]
{'name': 'Apps_HALOEXCHANGE_FUSED.block_128', 'type': 'function'} Apps_HALOEXCHANGE_FUSED.block_128 0.009430 0.009341 [0.0092775, 0.00943, 0.009448499999999999]
{'name': 'Apps_LTIMES', 'type': 'function'} Apps_LTIMES 0.000008 0.000008 [8e-06, 8e-06, 8.5e-06]
{'name': 'Apps_LTIMES.block_128', 'type': 'function'} Apps_LTIMES.block_128 0.012088 0.012090 [0.0120835, 0.012088, 0.012095999999999999]
{'name': 'Apps_LTIMES_NOVIEW', 'type': 'function'} Apps_LTIMES_NOVIEW 0.000011 0.000011 [1.0500000000000001e-05, 1.1e-05, 1.1e-05]
{'name': 'Apps_LTIMES_NOVIEW.block_128', 'type': 'function'} Apps_LTIMES_NOVIEW.block_128 0.012085 0.012087 [0.0120825, 0.012085, 0.01209]
{'name': 'Apps_MASS3DPA', 'type': 'function'} Apps_MASS3DPA 0.000008 0.000008 [8e-06, 8e-06, 8e-06]
{'name': 'Apps_MASS3DPA.block_25', 'type': 'function'} Apps_MASS3DPA.block_25 0.001904 0.001906 [0.0019030000000000002, 0.001904, 0.001908]
{'name': 'Apps_NODAL_ACCUMULATION_3D', 'type': 'function'} Apps_NODAL_ACCUMULATION_3D 0.000009 0.000008 [8e-06, 9e-06, 9e-06]
{'name': 'Apps_NODAL_ACCUMULATION_3D.block_128', 'type': 'function'} Apps_NODAL_ACCUMULATION_3D.block_128 0.007408 0.007401 [0.0073785, 0.007408, 0.0074265]
{'name': 'Apps_PRESSURE', 'type': 'function'} Apps_PRESSURE 0.000009 0.000009 [9e-06, 9e-06, 9.5e-06]
{'name': 'Apps_PRESSURE.block_128', 'type': 'function'} Apps_PRESSURE.block_128 0.048251 0.048255 [0.0482315, 0.048251, 0.048277]
{'name': 'Apps_VOL3D', 'type': 'function'} Apps_VOL3D 0.000009 0.000009 [8.5e-06, 9e-06, 9.5e-06]
{'name': 'Apps_VOL3D.block_128', 'type': 'function'} Apps_VOL3D.block_128 0.005903 0.005877 [0.005823, 0.005903, 0.005944]
{'name': 'Basic', 'type': 'function'} Basic 0.000097 0.000098 [9.65e-05, 9.7e-05, 9.9e-05]
{'name': 'Basic_DAXPY', 'type': 'function'} Basic_DAXPY 0.000011 0.000011 [1.0500000000000001e-05, 1.1e-05, 1.1e-05]
{'name': 'Basic_DAXPY.block_128', 'type': 'function'} Basic_DAXPY.block_128 0.016924 0.016901 [0.016877000000000003, 0.016924, 0.016936]
{'name': 'Basic_DAXPY_ATOMIC', 'type': 'function'} Basic_DAXPY_ATOMIC 0.000009 0.000009 [9e-06, 9e-06, 9e-06]
{'name': 'Basic_DAXPY_ATOMIC.block_128', 'type': 'function'} Basic_DAXPY_ATOMIC.block_128 0.016947 0.016938 [0.0169195, 0.016947, 0.0169605]
{'name': 'Basic_IF_QUAD', 'type': 'function'} Basic_IF_QUAD 0.000008 0.000008 [8e-06, 8e-06, 8.5e-06]
{'name': 'Basic_IF_QUAD.block_128', 'type': 'function'} Basic_IF_QUAD.block_128 0.012622 0.012651 [0.012606, 0.012622, 0.012681]
{'name': 'Basic_INDEXLIST', 'type': 'function'} Basic_INDEXLIST 0.000008 0.000009 [8e-06, 8e-06, 9e-06]
{'name': 'Basic_INDEXLIST.block_256', 'type': 'function'} Basic_INDEXLIST.block_256 0.004506 0.004434 [0.004391, 0.004506, 0.004512499999999999]
{'name': 'Basic_INDEXLIST_3LOOP', 'type': 'function'} Basic_INDEXLIST_3LOOP 0.000009 0.000009 [9e-06, 9e-06, 9e-06]
{'name': 'Basic_INDEXLIST_3LOOP.block_256', 'type': 'function'} Basic_INDEXLIST_3LOOP.block_256 0.008175 0.008132 [0.008101500000000001, 0.008175, 0.0081845]
{'name': 'Basic_INIT3', 'type': 'function'} Basic_INIT3 0.000009 0.000009 [8.5e-06, 9e-06, 9e-06]
{'name': 'Basic_INIT3.block_128', 'type': 'function'} Basic_INIT3.block_128 0.028197 0.028248 [0.0281965, 0.028197, 0.0282745]
{'name': 'Basic_INIT_VIEW1D', 'type': 'function'} Basic_INIT_VIEW1D 0.000009 0.000008 [8e-06, 9e-06, 9e-06]
{'name': 'Basic_INIT_VIEW1D.block_128', 'type': 'function'} Basic_INIT_VIEW1D.block_128 0.041780 0.040132 [0.0391845, 0.04178, 0.041902999999999996]
{'name': 'Basic_INIT_VIEW1D_OFFSET', 'type': 'function'} Basic_INIT_VIEW1D_OFFSET 0.000009 0.000009 [9e-06, 9e-06, 9e-06]
{'name': 'Basic_INIT_VIEW1D_OFFSET.block_128', 'type': 'function'} Basic_INIT_VIEW1D_OFFSET.block_128 0.041795 0.040145 [0.039199, 0.041795, 0.041916499999999995]
{'name': 'Basic_MULADDSUB', 'type': 'function'} Basic_MULADDSUB 0.000011 0.000011 [1.0500000000000001e-05, 1.1e-05, 1.1e-05]
{'name': 'Basic_MULADDSUB.block_128', 'type': 'function'} Basic_MULADDSUB.block_128 0.019746 0.019738 [0.019731, 0.019746, 0.019748500000000002]
{'name': 'Basic_NESTED_INIT', 'type': 'function'} Basic_NESTED_INIT 0.000009 0.000009 [8.5e-06, 9e-06, 9e-06]
{'name': 'Basic_NESTED_INIT.block_128', 'type': 'function'} Basic_NESTED_INIT.block_128 0.020670 0.019845 [0.019377, 0.02067, 0.020726]
{'name': 'Basic_PI_ATOMIC', 'type': 'function'} Basic_PI_ATOMIC 0.000009 0.000009 [8.5e-06, 9e-06, 9e-06]
{'name': 'Basic_PI_ATOMIC.block_128', 'type': 'function'} Basic_PI_ATOMIC.block_128 0.124113 0.120364 [0.11722250000000001, 0.124113, 0.1253795]
{'name': 'Basic_PI_REDUCE', 'type': 'function'} Basic_PI_REDUCE 0.000008 0.000008 [7.499999999999999e-06, 8e-06, 8e-06]
{'name': 'Basic_PI_REDUCE.block_128', 'type': 'function'} Basic_PI_REDUCE.block_128 0.002602 0.002619 [0.0025905, 0.002602, 0.0026395000000000004]
{'name': 'Basic_REDUCE3_INT', 'type': 'function'} Basic_REDUCE3_INT 0.000008 0.000008 [8e-06, 8e-06, 8e-06]
{'name': 'Basic_REDUCE3_INT.block_128', 'type': 'function'} Basic_REDUCE3_INT.block_128 0.002380 0.002404 [0.002376, 0.00238, 0.0024200000000000003]
{'name': 'Basic_REDUCE_STRUCT', 'type': 'function'} Basic_REDUCE_STRUCT 0.000008 0.000008 [8e-06, 8e-06, 8.5e-06]
{'name': 'Basic_REDUCE_STRUCT.block_128', 'type': 'function'} Basic_REDUCE_STRUCT.block_128 0.016917 0.017113 [0.016814000000000003, 0.016917, 0.0173145]
{'name': 'Basic_TRAP_INT', 'type': 'function'} Basic_TRAP_INT 0.000007 0.000007 [7e-06, 7e-06, 7.499999999999999e-06]
{'name': 'Basic_TRAP_INT.block_128', 'type': 'function'} Basic_TRAP_INT.block_128 0.002596 0.002601 [0.0025944999999999996, 0.002596, 0.0026055]
{'name': 'Lcals', 'type': 'function'} Lcals 0.000074 0.000075 [7.25e-05, 7.4e-05, 7.7e-05]
{'name': 'Lcals_DIFF_PREDICT', 'type': 'function'} Lcals_DIFF_PREDICT 0.000009 0.000009 [9e-06, 9e-06, 9.5e-06]
{'name': 'Lcals_DIFF_PREDICT.block_128', 'type': 'function'} Lcals_DIFF_PREDICT.block_128 0.061523 0.061192 [0.0610085, 0.061523, 0.0615415]
{'name': 'Lcals_EOS', 'type': 'function'} Lcals_EOS 0.000010 0.000010 [9.5e-06, 1e-05, 1e-05]
{'name': 'Lcals_EOS.block_128', 'type': 'function'} Lcals_EOS.block_128 0.022685 0.022687 [0.0226795, 0.022685, 0.022692999999999998]
{'name': 'Lcals_FIRST_DIFF', 'type': 'function'} Lcals_FIRST_DIFF 0.000009 0.000009 [8.5e-06, 9e-06, 9e-06]
{'name': 'Lcals_FIRST_DIFF.block_128', 'type': 'function'} Lcals_FIRST_DIFF.block_128 0.048455 0.048426 [0.0483995, 0.048455, 0.0484675]
{'name': 'Lcals_FIRST_MIN', 'type': 'function'} Lcals_FIRST_MIN 0.000013 0.000015 [1.2499999999999999e-05, 1.3e-05, 1.6e-05]
{'name': 'Lcals_FIRST_MIN.block_128', 'type': 'function'} Lcals_FIRST_MIN.block_128 0.048713 0.058216 [0.0474975, 0.048713, 0.0641825]
{'name': 'Lcals_FIRST_SUM', 'type': 'function'} Lcals_FIRST_SUM 0.000009 0.000009 [9e-06, 9e-06, 9.5e-06]
{'name': 'Lcals_FIRST_SUM.block_128', 'type': 'function'} Lcals_FIRST_SUM.block_128 0.048516 0.048493 [0.048479999999999995, 0.048516, 0.0485175]
{'name': 'Lcals_GEN_LIN_RECUR', 'type': 'function'} Lcals_GEN_LIN_RECUR 0.000009 0.000009 [9e-06, 9e-06, 9e-06]
{'name': 'Lcals_GEN_LIN_RECUR.block_128', 'type': 'function'} Lcals_GEN_LIN_RECUR.block_128 0.048865 0.048918 [0.0488645, 0.048865, 0.0489445]
{'name': 'Lcals_HYDRO_1D', 'type': 'function'} Lcals_HYDRO_1D 0.000009 0.000009 [9e-06, 9e-06, 9.5e-06]
{'name': 'Lcals_HYDRO_1D.block_128', 'type': 'function'} Lcals_HYDRO_1D.block_128 0.033971 0.033970 [0.033904500000000004, 0.033971, 0.034036]
{'name': 'Lcals_HYDRO_2D', 'type': 'function'} Lcals_HYDRO_2D 0.000010 0.000010 [1e-05, 1e-05, 1.0500000000000001e-05]
{'name': 'Lcals_HYDRO_2D.block_128', 'type': 'function'} Lcals_HYDRO_2D.block_128 0.022995 0.023016 [0.0229875, 0.022995, 0.0230345]
{'name': 'Lcals_INT_PREDICT', 'type': 'function'} Lcals_INT_PREDICT 0.000010 0.000010 [9.5e-06, 1e-05, 1e-05]
{'name': 'Lcals_INT_PREDICT.block_128', 'type': 'function'} Lcals_INT_PREDICT.block_128 0.046876 0.046896 [0.046875, 0.046876, 0.046907500000000005]
{'name': 'Lcals_PLANCKIAN', 'type': 'function'} Lcals_PLANCKIAN 0.000010 0.000010 [1e-05, 1e-05, 1e-05]
{'name': 'Lcals_PLANCKIAN.block_128', 'type': 'function'} Lcals_PLANCKIAN.block_128 0.002759 0.002759 [0.0027565000000000003, 0.002759, 0.002762]
{'name': 'Lcals_TRIDIAG_ELIM', 'type': 'function'} Lcals_TRIDIAG_ELIM 0.000010 0.000010 [1e-05, 1e-05, 1e-05]
{'name': 'Lcals_TRIDIAG_ELIM.block_128', 'type': 'function'} Lcals_TRIDIAG_ELIM.block_128 0.044681 0.044794 [0.0446265, 0.044681, 0.044905]
{'name': 'Polybench', 'type': 'function'} Polybench 0.000086 0.000086 [8.55e-05, 8.6e-05, 8.7e-05]
{'name': 'Polybench_2MM', 'type': 'function'} Polybench_2MM 0.000008 0.000008 [8e-06, 8e-06, 8.5e-06]
{'name': 'Polybench_2MM.block_128', 'type': 'function'} Polybench_2MM.block_128 0.005949 0.005949 [0.005937, 0.005949, 0.0059615]
{'name': 'Polybench_3MM', 'type': 'function'} Polybench_3MM 0.000008 0.000008 [8e-06, 8e-06, 8e-06]
{'name': 'Polybench_3MM.block_128', 'type': 'function'} Polybench_3MM.block_128 0.008885 0.008885 [0.008884, 0.008885, 0.008885500000000001]
{'name': 'Polybench_ADI', 'type': 'function'} Polybench_ADI 0.000009 0.000009 [9e-06, 9e-06, 9.5e-06]
{'name': 'Polybench_ADI.block_128', 'type': 'function'} Polybench_ADI.block_128 0.033592 0.033582 [0.0335465, 0.033592, 0.033622]
{'name': 'Polybench_ATAX', 'type': 'function'} Polybench_ATAX 0.000009 0.000009 [9e-06, 9e-06, 9e-06]
{'name': 'Polybench_ATAX.block_128', 'type': 'function'} Polybench_ATAX.block_128 0.037403 0.037419 [0.037385, 0.037403, 0.037445]
{'name': 'Polybench_FDTD_2D', 'type': 'function'} Polybench_FDTD_2D 0.000009 0.000009 [8.5e-06, 9e-06, 9.5e-06]
{'name': 'Polybench_FDTD_2D.block_128', 'type': 'function'} Polybench_FDTD_2D.block_128 0.037109 0.037101 [0.037089, 0.037109, 0.0371165]
{'name': 'Polybench_FLOYD_WARSHALL', 'type': 'function'} Polybench_FLOYD_WARSHALL 0.000008 0.000008 [8e-06, 8e-06, 8.5e-06]
{'name': 'Polybench_FLOYD_WARSHALL.block_128', 'type': 'function'} Polybench_FLOYD_WARSHALL.block_128 0.209053 0.209057 [0.208826, 0.209053, 0.2092865]
{'name': 'Polybench_GEMM', 'type': 'function'} Polybench_GEMM 0.000010 0.000010 [1e-05, 1e-05, 1.0500000000000001e-05]
{'name': 'Polybench_GEMM.block_128', 'type': 'function'} Polybench_GEMM.block_128 0.006741 0.006742 [0.0067405, 0.006741, 0.0067434999999999995]
{'name': 'Polybench_GEMVER', 'type': 'function'} Polybench_GEMVER 0.000008 0.000008 [8e-06, 8e-06, 8.5e-06]
{'name': 'Polybench_GEMVER.block_128', 'type': 'function'} Polybench_GEMVER.block_128 0.008432 0.008436 [0.0084285, 0.008432, 0.008441500000000001]
{'name': 'Polybench_GESUMMV', 'type': 'function'} Polybench_GESUMMV 0.000009 0.000009 [8.5e-06, 9e-06, 9e-06]
{'name': 'Polybench_GESUMMV.block_128', 'type': 'function'} Polybench_GESUMMV.block_128 0.037410 0.037365 [0.0372855, 0.03741, 0.037467]
{'name': 'Polybench_HEAT_3D', 'type': 'function'} Polybench_HEAT_3D 0.000009 0.000009 [9e-06, 9e-06, 9.5e-06]
{'name': 'Polybench_HEAT_3D.block_128', 'type': 'function'} Polybench_HEAT_3D.block_128 0.021826 0.021828 [0.0218135, 0.021826, 0.021842]
{'name': 'Polybench_JACOBI_1D', 'type': 'function'} Polybench_JACOBI_1D 0.000011 0.000011 [9.999999999999999e-06, 1.1e-05, 1.15e-05]
{'name': 'Polybench_JACOBI_1D.block_128', 'type': 'function'} Polybench_JACOBI_1D.block_128 0.077718 0.077743 [0.07769899999999999, 0.077718, 0.077775]
{'name': 'Polybench_JACOBI_2D', 'type': 'function'} Polybench_JACOBI_2D 0.000010 0.000010 [9.5e-06, 1e-05, 1e-05]
{'name': 'Polybench_JACOBI_2D.block_128', 'type': 'function'} Polybench_JACOBI_2D.block_128 0.102580 0.102596 [0.1025345, 0.10258, 0.10265]
{'name': 'Polybench_MVT', 'type': 'function'} Polybench_MVT 0.000009 0.000010 [9e-06, 9e-06, 1.0500000000000001e-05]
{'name': 'Polybench_MVT.block_128', 'type': 'function'} Polybench_MVT.block_128 0.035797 0.035768 [0.0357435, 0.035797, 0.035806500000000005]
{'name': 'Stream', 'type': 'function'} Stream 0.000033 0.000034 [3.3e-05, 3.3e-05, 3.45e-05]
{'name': 'Stream_ADD', 'type': 'function'} Stream_ADD 0.000009 0.000009 [9e-06, 9e-06, 9e-06]
{'name': 'Stream_ADD.block_128', 'type': 'function'} Stream_ADD.block_128 0.033798 0.033840 [0.0337565, 0.033798, 0.033902]
{'name': 'Stream_COPY', 'type': 'function'} Stream_COPY 0.000009 0.000009 [8.5e-06, 9e-06, 9e-06]
{'name': 'Stream_COPY.block_128', 'type': 'function'} Stream_COPY.block_128 0.043261 0.043184 [0.043116, 0.043261, 0.043290999999999996]
{'name': 'Stream_DOT', 'type': 'function'} Stream_DOT 0.000009 0.000009 [9e-06, 9e-06, 9e-06]
{'name': 'Stream_DOT.block_128', 'type': 'function'} Stream_DOT.block_128 0.113655 0.112893 [0.11216000000000001, 0.113655, 0.11400650000000001]
{'name': 'Stream_MUL', 'type': 'function'} Stream_MUL 0.000011 0.000011 [1.0500000000000001e-05, 1.1e-05, 1.15e-05]
{'name': 'Stream_MUL.block_128', 'type': 'function'} Stream_MUL.block_128 0.043271 0.043180 [0.043106, 0.043271, 0.0432995]
{'name': 'Stream_TRIAD', 'type': 'function'} Stream_TRIAD 0.000008 0.000008 [8e-06, 8e-06, 8.5e-06]
{'name': 'Stream_TRIAD.block_128', 'type': 'function'} Stream_TRIAD.block_128 0.033730 0.033717 [0.033696000000000004, 0.03373, 0.0337445]
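
To make the percentile triples in the table above easier to read, here is a minimal standalone sketch of how the 25th, 50th, and 75th percentiles of a small sample can be computed. It uses only the Python standard library rather than thicket, the helper name quartiles and the sample values are illustrative assumptions, and it assumes linear interpolation between sorted values. The table appears consistent with that choice, since with three profiles the 25th percentile falls midway between the smallest and middle value, but the source does not state it.

```python
import statistics

# Illustrative per-profile timings for one node (not taken from the tutorial data).
timings = [1.0, 2.0, 4.0]


def quartiles(values):
    """Return the 25th, 50th, and 75th percentiles of values.

    method="inclusive" interpolates linearly between the sorted samples and
    treats the smallest and largest values as the 0th and 100th percentiles.
    """
    return statistics.quantiles(values, n=4, method="inclusive")


q25, q50, q75 = quartiles(timings)
print([q25, q50, q75])  # [1.5, 2.0, 3.0]
```

The middle entry always equals the median, which is why the second value of each percentile list in the table matches the corresponding Total time (exc)_median column.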

View aggregated statistics call tree

[18]:
print(th_lassen.statsframe.tree(metric_column="Total time (exc)_median"))
    __          __       __         __
   / /_  ____ _/ /______/ /_  ___  / /_
  / __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
 / / / / /_/ / /_/ /__/ / / /  __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/  v2023.1.0

0.001 Base_CUDA
├─ 0.000 Algorithm
│  ├─ 0.000 Algorithm_MEMCPY
│  │  ├─ 0.002 Algorithm_MEMCPY.block_128
│  │  └─ 0.003 Algorithm_MEMCPY.library
│  ├─ 0.000 Algorithm_MEMSET
│  │  ├─ 0.001 Algorithm_MEMSET.block_128
│  │  └─ 0.001 Algorithm_MEMSET.library
│  ├─ 0.000 Algorithm_REDUCE_SUM
│  │  ├─ 0.003 Algorithm_REDUCE_SUM.block_128
│  │  └─ 0.002 Algorithm_REDUCE_SUM.cub
│  └─ 0.000 Algorithm_SCAN
│     └─ 0.003 Algorithm_SCAN.default
├─ 0.000 Apps
│  ├─ 0.000 Apps_CONVECTION3DPA
│  │  └─ 0.003 Apps_CONVECTION3DPA.block_64
│  ├─ 0.000 Apps_DEL_DOT_VEC_2D
│  │  └─ 0.007 Apps_DEL_DOT_VEC_2D.block_128
│  ├─ 0.000 Apps_DIFFUSION3DPA
│  │  └─ 0.005 Apps_DIFFUSION3DPA.block_64
│  ├─ 0.000 Apps_ENERGY
│  │  └─ 0.039 Apps_ENERGY.block_128
│  ├─ 0.000 Apps_FIR
│  │  └─ 0.004 Apps_FIR.block_128
│  ├─ 0.000 Apps_HALOEXCHANGE
│  │  └─ 0.037 Apps_HALOEXCHANGE.block_128
│  ├─ 0.000 Apps_HALOEXCHANGE_FUSED
│  │  └─ 0.009 Apps_HALOEXCHANGE_FUSED.block_128
│  ├─ 0.000 Apps_LTIMES
│  │  └─ 0.012 Apps_LTIMES.block_128
│  ├─ 0.000 Apps_LTIMES_NOVIEW
│  │  └─ 0.012 Apps_LTIMES_NOVIEW.block_128
│  ├─ 0.000 Apps_MASS3DPA
│  │  └─ 0.002 Apps_MASS3DPA.block_25
│  ├─ 0.000 Apps_NODAL_ACCUMULATION_3D
│  │  └─ 0.007 Apps_NODAL_ACCUMULATION_3D.block_128
│  ├─ 0.000 Apps_PRESSURE
│  │  └─ 0.048 Apps_PRESSURE.block_128
│  └─ 0.000 Apps_VOL3D
│     └─ 0.006 Apps_VOL3D.block_128
├─ 0.000 Basic
│  ├─ 0.000 Basic_DAXPY
│  │  └─ 0.017 Basic_DAXPY.block_128
│  ├─ 0.000 Basic_DAXPY_ATOMIC
│  │  └─ 0.017 Basic_DAXPY_ATOMIC.block_128
│  ├─ 0.000 Basic_IF_QUAD
│  │  └─ 0.013 Basic_IF_QUAD.block_128
│  ├─ 0.000 Basic_INDEXLIST
│  │  └─ 0.005 Basic_INDEXLIST.block_256
│  ├─ 0.000 Basic_INDEXLIST_3LOOP
│  │  └─ 0.008 Basic_INDEXLIST_3LOOP.block_256
│  ├─ 0.000 Basic_INIT3
│  │  └─ 0.028 Basic_INIT3.block_128
│  ├─ 0.000 Basic_INIT_VIEW1D
│  │  └─ 0.042 Basic_INIT_VIEW1D.block_128
│  ├─ 0.000 Basic_INIT_VIEW1D_OFFSET
│  │  └─ 0.042 Basic_INIT_VIEW1D_OFFSET.block_128
│  ├─ 0.000 Basic_MULADDSUB
│  │  └─ 0.020 Basic_MULADDSUB.block_128
│  ├─ 0.000 Basic_NESTED_INIT
│  │  └─ 0.021 Basic_NESTED_INIT.block_128
│  ├─ 0.000 Basic_PI_ATOMIC
│  │  └─ 0.124 Basic_PI_ATOMIC.block_128
│  ├─ 0.000 Basic_PI_REDUCE
│  │  └─ 0.003 Basic_PI_REDUCE.block_128
│  ├─ 0.000 Basic_REDUCE3_INT
│  │  └─ 0.002 Basic_REDUCE3_INT.block_128
│  ├─ 0.000 Basic_REDUCE_STRUCT
│  │  └─ 0.017 Basic_REDUCE_STRUCT.block_128
│  └─ 0.000 Basic_TRAP_INT
│     └─ 0.003 Basic_TRAP_INT.block_128
├─ 0.000 Lcals
│  ├─ 0.000 Lcals_DIFF_PREDICT
│  │  └─ 0.062 Lcals_DIFF_PREDICT.block_128
│  ├─ 0.000 Lcals_EOS
│  │  └─ 0.023 Lcals_EOS.block_128
│  ├─ 0.000 Lcals_FIRST_DIFF
│  │  └─ 0.048 Lcals_FIRST_DIFF.block_128
│  ├─ 0.000 Lcals_FIRST_MIN
│  │  └─ 0.049 Lcals_FIRST_MIN.block_128
│  ├─ 0.000 Lcals_FIRST_SUM
│  │  └─ 0.049 Lcals_FIRST_SUM.block_128
│  ├─ 0.000 Lcals_GEN_LIN_RECUR
│  │  └─ 0.049 Lcals_GEN_LIN_RECUR.block_128
│  ├─ 0.000 Lcals_HYDRO_1D
│  │  └─ 0.034 Lcals_HYDRO_1D.block_128
│  ├─ 0.000 Lcals_HYDRO_2D
│  │  └─ 0.023 Lcals_HYDRO_2D.block_128
│  ├─ 0.000 Lcals_INT_PREDICT
│  │  └─ 0.047 Lcals_INT_PREDICT.block_128
│  ├─ 0.000 Lcals_PLANCKIAN
│  │  └─ 0.003 Lcals_PLANCKIAN.block_128
│  └─ 0.000 Lcals_TRIDIAG_ELIM
│     └─ 0.045 Lcals_TRIDIAG_ELIM.block_128
├─ 0.000 Polybench
│  ├─ 0.000 Polybench_2MM
│  │  └─ 0.006 Polybench_2MM.block_128
│  ├─ 0.000 Polybench_3MM
│  │  └─ 0.009 Polybench_3MM.block_128
│  ├─ 0.000 Polybench_ADI
│  │  └─ 0.034 Polybench_ADI.block_128
│  ├─ 0.000 Polybench_ATAX
│  │  └─ 0.037 Polybench_ATAX.block_128
│  ├─ 0.000 Polybench_FDTD_2D
│  │  └─ 0.037 Polybench_FDTD_2D.block_128
│  ├─ 0.000 Polybench_FLOYD_WARSHALL
│  │  └─ 0.209 Polybench_FLOYD_WARSHALL.block_128
│  ├─ 0.000 Polybench_GEMM
│  │  └─ 0.007 Polybench_GEMM.block_128
│  ├─ 0.000 Polybench_GEMVER
│  │  └─ 0.008 Polybench_GEMVER.block_128
│  ├─ 0.000 Polybench_GESUMMV
│  │  └─ 0.037 Polybench_GESUMMV.block_128
│  ├─ 0.000 Polybench_HEAT_3D
│  │  └─ 0.022 Polybench_HEAT_3D.block_128
│  ├─ 0.000 Polybench_JACOBI_1D
│  │  └─ 0.078 Polybench_JACOBI_1D.block_128
│  ├─ 0.000 Polybench_JACOBI_2D
│  │  └─ 0.103 Polybench_JACOBI_2D.block_128
│  └─ 0.000 Polybench_MVT
│     └─ 0.036 Polybench_MVT.block_128
└─ 0.000 Stream
   ├─ 0.000 Stream_ADD
   │  └─ 0.034 Stream_ADD.block_128
   ├─ 0.000 Stream_COPY
   │  └─ 0.043 Stream_COPY.block_128
   ├─ 0.000 Stream_DOT
   │  └─ 0.114 Stream_DOT.block_128
   ├─ 0.000 Stream_MUL
   │  └─ 0.043 Stream_MUL.block_128
   └─ 0.000 Stream_TRIAD
      └─ 0.034 Stream_TRIAD.block_128

Legend (Metric: Total time (exc)_median Min: 0.00 Max: 0.21)
0.19 - 0.21
0.15 - 0.19
0.10 - 0.15
0.06 - 0.10
0.02 - 0.06
0.00 - 0.02

name User code     Only in left graph     Only in right graph

Use the Query Language

Thicket’s query language lets users select specific nodes based on the structure of the thicket’s call tree. The performance data is updated to match the selected nodes as part of the operation.

Initial call tree:

[19]:
print(th_lassen.statsframe.tree("Total time (exc)_median"))
    __          __       __         __
   / /_  ____ _/ /______/ /_  ___  / /_
  / __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
 / / / / /_/ / /_/ /__/ / / /  __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/  v2023.1.0

0.001 Base_CUDA
├─ 0.000 Algorithm
│  ├─ 0.000 Algorithm_MEMCPY
│  │  ├─ 0.002 Algorithm_MEMCPY.block_128
│  │  └─ 0.003 Algorithm_MEMCPY.library
│  ├─ 0.000 Algorithm_MEMSET
│  │  ├─ 0.001 Algorithm_MEMSET.block_128
│  │  └─ 0.001 Algorithm_MEMSET.library
│  ├─ 0.000 Algorithm_REDUCE_SUM
│  │  ├─ 0.003 Algorithm_REDUCE_SUM.block_128
│  │  └─ 0.002 Algorithm_REDUCE_SUM.cub
│  └─ 0.000 Algorithm_SCAN
│     └─ 0.003 Algorithm_SCAN.default
├─ 0.000 Apps
│  ├─ 0.000 Apps_CONVECTION3DPA
│  │  └─ 0.003 Apps_CONVECTION3DPA.block_64
│  ├─ 0.000 Apps_DEL_DOT_VEC_2D
│  │  └─ 0.007 Apps_DEL_DOT_VEC_2D.block_128
│  ├─ 0.000 Apps_DIFFUSION3DPA
│  │  └─ 0.005 Apps_DIFFUSION3DPA.block_64
│  ├─ 0.000 Apps_ENERGY
│  │  └─ 0.039 Apps_ENERGY.block_128
│  ├─ 0.000 Apps_FIR
│  │  └─ 0.004 Apps_FIR.block_128
│  ├─ 0.000 Apps_HALOEXCHANGE
│  │  └─ 0.037 Apps_HALOEXCHANGE.block_128
│  ├─ 0.000 Apps_HALOEXCHANGE_FUSED
│  │  └─ 0.009 Apps_HALOEXCHANGE_FUSED.block_128
│  ├─ 0.000 Apps_LTIMES
│  │  └─ 0.012 Apps_LTIMES.block_128
│  ├─ 0.000 Apps_LTIMES_NOVIEW
│  │  └─ 0.012 Apps_LTIMES_NOVIEW.block_128
│  ├─ 0.000 Apps_MASS3DPA
│  │  └─ 0.002 Apps_MASS3DPA.block_25
│  ├─ 0.000 Apps_NODAL_ACCUMULATION_3D
│  │  └─ 0.007 Apps_NODAL_ACCUMULATION_3D.block_128
│  ├─ 0.000 Apps_PRESSURE
│  │  └─ 0.048 Apps_PRESSURE.block_128
│  └─ 0.000 Apps_VOL3D
│     └─ 0.006 Apps_VOL3D.block_128
├─ 0.000 Basic
│  ├─ 0.000 Basic_DAXPY
│  │  └─ 0.017 Basic_DAXPY.block_128
│  ├─ 0.000 Basic_DAXPY_ATOMIC
│  │  └─ 0.017 Basic_DAXPY_ATOMIC.block_128
│  ├─ 0.000 Basic_IF_QUAD
│  │  └─ 0.013 Basic_IF_QUAD.block_128
│  ├─ 0.000 Basic_INDEXLIST
│  │  └─ 0.005 Basic_INDEXLIST.block_256
│  ├─ 0.000 Basic_INDEXLIST_3LOOP
│  │  └─ 0.008 Basic_INDEXLIST_3LOOP.block_256
│  ├─ 0.000 Basic_INIT3
│  │  └─ 0.028 Basic_INIT3.block_128
│  ├─ 0.000 Basic_INIT_VIEW1D
│  │  └─ 0.042 Basic_INIT_VIEW1D.block_128
│  ├─ 0.000 Basic_INIT_VIEW1D_OFFSET
│  │  └─ 0.042 Basic_INIT_VIEW1D_OFFSET.block_128
│  ├─ 0.000 Basic_MULADDSUB
│  │  └─ 0.020 Basic_MULADDSUB.block_128
│  ├─ 0.000 Basic_NESTED_INIT
│  │  └─ 0.021 Basic_NESTED_INIT.block_128
│  ├─ 0.000 Basic_PI_ATOMIC
│  │  └─ 0.124 Basic_PI_ATOMIC.block_128
│  ├─ 0.000 Basic_PI_REDUCE
│  │  └─ 0.003 Basic_PI_REDUCE.block_128
│  ├─ 0.000 Basic_REDUCE3_INT
│  │  └─ 0.002 Basic_REDUCE3_INT.block_128
│  ├─ 0.000 Basic_REDUCE_STRUCT
│  │  └─ 0.017 Basic_REDUCE_STRUCT.block_128
│  └─ 0.000 Basic_TRAP_INT
│     └─ 0.003 Basic_TRAP_INT.block_128
├─ 0.000 Lcals
│  ├─ 0.000 Lcals_DIFF_PREDICT
│  │  └─ 0.062 Lcals_DIFF_PREDICT.block_128
│  ├─ 0.000 Lcals_EOS
│  │  └─ 0.023 Lcals_EOS.block_128
│  ├─ 0.000 Lcals_FIRST_DIFF
│  │  └─ 0.048 Lcals_FIRST_DIFF.block_128
│  ├─ 0.000 Lcals_FIRST_MIN
│  │  └─ 0.049 Lcals_FIRST_MIN.block_128
│  ├─ 0.000 Lcals_FIRST_SUM
│  │  └─ 0.049 Lcals_FIRST_SUM.block_128
│  ├─ 0.000 Lcals_GEN_LIN_RECUR
│  │  └─ 0.049 Lcals_GEN_LIN_RECUR.block_128
│  ├─ 0.000 Lcals_HYDRO_1D
│  │  └─ 0.034 Lcals_HYDRO_1D.block_128
│  ├─ 0.000 Lcals_HYDRO_2D
│  │  └─ 0.023 Lcals_HYDRO_2D.block_128
│  ├─ 0.000 Lcals_INT_PREDICT
│  │  └─ 0.047 Lcals_INT_PREDICT.block_128
│  ├─ 0.000 Lcals_PLANCKIAN
│  │  └─ 0.003 Lcals_PLANCKIAN.block_128
│  └─ 0.000 Lcals_TRIDIAG_ELIM
│     └─ 0.045 Lcals_TRIDIAG_ELIM.block_128
├─ 0.000 Polybench
│  ├─ 0.000 Polybench_2MM
│  │  └─ 0.006 Polybench_2MM.block_128
│  ├─ 0.000 Polybench_3MM
│  │  └─ 0.009 Polybench_3MM.block_128
│  ├─ 0.000 Polybench_ADI
│  │  └─ 0.034 Polybench_ADI.block_128
│  ├─ 0.000 Polybench_ATAX
│  │  └─ 0.037 Polybench_ATAX.block_128
│  ├─ 0.000 Polybench_FDTD_2D
│  │  └─ 0.037 Polybench_FDTD_2D.block_128
│  ├─ 0.000 Polybench_FLOYD_WARSHALL
│  │  └─ 0.209 Polybench_FLOYD_WARSHALL.block_128
│  ├─ 0.000 Polybench_GEMM
│  │  └─ 0.007 Polybench_GEMM.block_128
│  ├─ 0.000 Polybench_GEMVER
│  │  └─ 0.008 Polybench_GEMVER.block_128
│  ├─ 0.000 Polybench_GESUMMV
│  │  └─ 0.037 Polybench_GESUMMV.block_128
│  ├─ 0.000 Polybench_HEAT_3D
│  │  └─ 0.022 Polybench_HEAT_3D.block_128
│  ├─ 0.000 Polybench_JACOBI_1D
│  │  └─ 0.078 Polybench_JACOBI_1D.block_128
│  ├─ 0.000 Polybench_JACOBI_2D
│  │  └─ 0.103 Polybench_JACOBI_2D.block_128
│  └─ 0.000 Polybench_MVT
│     └─ 0.036 Polybench_MVT.block_128
└─ 0.000 Stream
   ├─ 0.000 Stream_ADD
   │  └─ 0.034 Stream_ADD.block_128
   ├─ 0.000 Stream_COPY
   │  └─ 0.043 Stream_COPY.block_128
   ├─ 0.000 Stream_DOT
   │  └─ 0.114 Stream_DOT.block_128
   ├─ 0.000 Stream_MUL
   │  └─ 0.043 Stream_MUL.block_128
   └─ 0.000 Stream_TRIAD
      └─ 0.034 Stream_TRIAD.block_128

Legend (Metric: Total time (exc)_median Min: 0.00 Max: 0.21)
0.19 - 0.21
0.15 - 0.19
0.10 - 0.15
0.06 - 0.10
0.02 - 0.06
0.00 - 0.02

name User code     Only in left graph     Only in right graph

Example 1

In the example below, we use a thicket query that selects only the Algorithm node, its parent nodes, and its children nodes, maintaining the structure of the call tree.

NOTE: With the newest version of Hatchet, “old-style” queries (i.e., queries built with QueryMatcher) generate a DeprecationWarning. In the future, queries will be defined with Query.

[20]:
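alg_query_ex1 = (
    ht.QueryMatcher()
    # "*": zero or more nodes of any kind (the ancestors)
    .match("*")
    # ".": exactly one node whose name starts with "Algorithm"
    .rel(
        ".",
        lambda row: row["name"]
        .apply(lambda x: re.match(r"Algorithm.*", x) is not None)
        .all(),
    )
)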

# applying the query on the lassen thicket
th_algorithm_ex1 = th_lassen.query(alg_query_ex1)
tt.median(th_algorithm_ex1, columns=["Total time (exc)"])
print(th_algorithm_ex1.statsframe.tree("Total time (exc)_median"))
/var/folders/g4/9_n9kks14t17ldr01p_rgsw00000gn/T/ipykernel_29938/3034768233.py:2: DeprecationWarning: Old-style queries are deprecated and will be removed in the                     future. Please use new-style queries instead.                     For QueryMatcher, the equivalent new-style queries are                     hatchet.query.Query for base-syntax queries and                     hatchet.query.ObjectQuery for the object-dialect.
  ht.QueryMatcher()
    __          __       __         __
   / /_  ____ _/ /______/ /_  ___  / /_
  / __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
 / / / / /_/ / /_/ /__/ / / /  __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/  v2023.1.0

0.001 Base_CUDA
└─ 0.000 Algorithm
   ├─ 0.000 Algorithm_MEMCPY
   │  ├─ 0.002 Algorithm_MEMCPY.block_128
   │  └─ 0.003 Algorithm_MEMCPY.library
   ├─ 0.000 Algorithm_MEMSET
   │  ├─ 0.001 Algorithm_MEMSET.block_128
   │  └─ 0.001 Algorithm_MEMSET.library
   ├─ 0.000 Algorithm_REDUCE_SUM
   │  ├─ 0.003 Algorithm_REDUCE_SUM.block_128
   │  └─ 0.002 Algorithm_REDUCE_SUM.cub
   └─ 0.000 Algorithm_SCAN
      └─ 0.003 Algorithm_SCAN.default

Legend (Metric: Total time (exc)_median Min: 0.00 Max: 0.00)
0.00 - 0.00
0.00 - 0.00
0.00 - 0.00
0.00 - 0.00
0.00 - 0.00
0.00 - 0.00

name User code     Only in left graph     Only in right graph

Example 2

In the example below, we use a thicket query that selects only the Algorithm node and its children nodes, maintaining the structure of the call tree.

NOTE: With the newest version of Hatchet, “old-style” queries (i.e., queries built with QueryMatcher) generate a DeprecationWarning. In the future, queries will be defined with Query.

[21]:
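alg_query_ex2 = (
    ht.QueryMatcher()
    # ".": exactly one node whose name starts with "Algorithm"
    .match(
        ".",
        lambda row: row["name"]
        .apply(lambda x: re.match(r"Algorithm.*", x) is not None)
        .all(),
    )
    # "*": zero or more nodes of any kind (the descendants)
    .rel("*")
)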

# applying the second query on the lassen thicket
th_algorithm_ex2 = th_lassen.query(alg_query_ex2)
tt.median(th_algorithm_ex2, columns=["Total time (exc)"])
print(th_algorithm_ex2.statsframe.tree("Total time (exc)_median"))
    __          __       __         __
   / /_  ____ _/ /______/ /_  ___  / /_
  / __ \/ __ `/ __/ ___/ __ \/ _ \/ __/
 / / / / /_/ / /_/ /__/ / / /  __/ /_
/_/ /_/\__,_/\__/\___/_/ /_/\___/\__/  v2023.1.0

0.000 Algorithm
├─ 0.000 Algorithm_MEMCPY
│  ├─ 0.002 Algorithm_MEMCPY.block_128
│  └─ 0.003 Algorithm_MEMCPY.library
├─ 0.000 Algorithm_MEMSET
│  ├─ 0.001 Algorithm_MEMSET.block_128
│  └─ 0.001 Algorithm_MEMSET.library
├─ 0.000 Algorithm_REDUCE_SUM
│  ├─ 0.003 Algorithm_REDUCE_SUM.block_128
│  └─ 0.002 Algorithm_REDUCE_SUM.cub
└─ 0.000 Algorithm_SCAN
   └─ 0.003 Algorithm_SCAN.default

Legend (Metric: Total time (exc)_median Min: 0.00 Max: 0.00)
0.00 - 0.00
0.00 - 0.00
0.00 - 0.00
0.00 - 0.00
0.00 - 0.00
0.00 - 0.00

name User code     Only in left graph     Only in right graph
/var/folders/g4/9_n9kks14t17ldr01p_rgsw00000gn/T/ipykernel_29938/3679505811.py:2: DeprecationWarning: Old-style queries are deprecated and will be removed in the                     future. Please use new-style queries instead.                     For QueryMatcher, the equivalent new-style queries are                     hatchet.query.Query for base-syntax queries and                     hatchet.query.ObjectQuery for the object-dialect.
  ht.QueryMatcher()
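
The difference between Examples 1 and 2 comes down to where the wildcard "*" sits relative to the predicate node ".". The following is a minimal sketch of that idea on a toy call tree, using only the standard library. It is not how Hatchet evaluates queries; the tree, `PARENT`, and the helper functions are hypothetical names introduced for illustration.

```python
import re

# Toy call tree (hypothetical, loosely modeled on the output above): node -> parent.
PARENT = {
    "Base_CUDA": None,
    "Algorithm": "Base_CUDA",
    "Algorithm_SCAN": "Algorithm",
    "Algorithm_SCAN.default": "Algorithm_SCAN",
    "Stream": "Base_CUDA",
    "Stream_ADD": "Stream",
}


def is_algorithm(name):
    """Same test as the query predicate: the name starts with 'Algorithm'."""
    return re.match(r"Algorithm.*", name) is not None


def ancestors(name):
    """Return all ancestors of a node, nearest first."""
    found = []
    parent = PARENT[name]
    while parent is not None:
        found.append(parent)
        parent = PARENT[parent]
    return found


def example1_nodes():
    """'*' then '.': any chain of nodes ending in a node that passes the predicate."""
    kept = set()
    for name in PARENT:
        if is_algorithm(name):
            kept.add(name)
            kept.update(ancestors(name))
    return kept


def example2_nodes():
    """'.' then '*': a node that passes the predicate, followed by any descendants."""
    kept = set()
    for name in PARENT:
        if is_algorithm(name) or any(is_algorithm(a) for a in ancestors(name)):
            kept.add(name)
    return kept


print(sorted(example1_nodes()))
print(sorted(example2_nodes()))
```

In this toy model the only node kept by the first pattern but not the second is `Base_CUDA`, which mirrors the two trees printed above.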

Display histogram

The display_histogram() function allows users to select a node and metric value (a column in the performance data table) for which a histogram is generated.

Some available keyword arguments are the following:

  • height: height (in inches) of each facet.

  • aspect: aspect ratio of each facet, aspect * height will give you the width of each facet.

  • bins: generic bin parameter; most commonly used to set the number of bins.

  • binwidth: width of each bin, overrides bins but can be used with binrange.

  • binrange: lowest and highest value for bin edges; can be used either with bins or binwidth. Defaults to extremes.

  • color: color of the bars.

An exhaustive list of available arguments can be found here.
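
To make the interaction between bins, binwidth, and binrange concrete, here is a minimal standard-library sketch of how bin edges and facet width could be derived from these arguments. It assumes the semantics described in the list above and only handles an integer bins value; `facet_width` and `bin_edges` are hypothetical helpers, not part of thicket or its plotting backend.

```python
import math


def facet_width(height, aspect):
    """Width of one facet in inches: aspect * height."""
    return aspect * height


def bin_edges(values, bins=10, binwidth=None, binrange=None):
    """Return histogram bin edges for the given settings (illustrative only)."""
    # binrange defaults to the extremes of the data.
    start, stop = binrange if binrange is not None else (min(values), max(values))
    if binwidth is not None:
        # binwidth overrides bins, but still respects binrange.
        n_bins = max(1, math.ceil((stop - start) / binwidth))
        step = binwidth
    else:
        n_bins = bins
        step = (stop - start) / n_bins
    return [start + i * step for i in range(n_bins + 1)]


times = [0.001, 0.002, 0.003, 0.003]  # made-up sample values

print(facet_width(height=3, aspect=1.5))
print(bin_edges(times, bins=4, binrange=(0.0, 1.0)))
print(bin_edges(times, bins=4, binwidth=0.5, binrange=(0.0, 1.0)))
```

In the last call, bins=4 is ignored because binwidth is given, so the range 0.0 to 1.0 is split into two bins of width 0.5.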

[22]:
n = pd.unique(th_algorithm_ex1.dataframe.reset_index()["node"])[0]
[23]:
tt.display_histogram(th_algorithm_ex1, node=n, column="Total time (exc)")
[23]:
<seaborn.axisgrid.FacetGrid at 0x7f932b0ab190>
_images/thicket_tutorial_39_1.png

Display heatmap

The display_heatmap() function allows users to select column(s) from the performance data table, for which a heatmap is generated based on the values of the column.

Some available keyword arguments are the following:

  • vmax: maximum value to anchor the color map.

  • vmin: minimum value to anchor the color map.

  • linecolor: color of the lines that will divide each cell.

  • linewidths: width of the lines that will divide each cell.

An exhaustive list of available arguments can be found here.
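
As a rough illustration of what "anchoring" the color map with vmin and vmax means, the sketch below maps a value to a position between 0 and 1 on the color scale and clamps anything outside the anchors. It assumes a linear color scale; `color_position` is a hypothetical helper, not a thicket function.

```python
def color_position(value, vmin, vmax):
    """Map value to [0, 1] on a linear color scale anchored at vmin and vmax."""
    if vmax <= vmin:
        raise ValueError("vmax must be greater than vmin")
    fraction = (value - vmin) / (vmax - vmin)
    # Values below vmin or above vmax take the color at the nearest end of the scale.
    return min(1.0, max(0.0, fraction))


print(color_position(5, vmin=0, vmax=10))
print(color_position(20, vmin=0, vmax=10))
print(color_position(-5, vmin=0, vmax=10))
```

Fixing vmin and vmax to the same values across several heatmaps keeps their colors comparable, since each value then maps to the same position on the scale.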

[24]:
th_algorithm_ex1.dataframe.columns
[24]:
Index(['nid', 'spot.channel', 'Min time/rank', 'Max time/rank',
       'Avg time/rank', 'Total time', 'Min time/rank (exc)',
       'Max time/rank (exc)', 'Avg time/rank (exc)', 'Total time (exc)',
       'name', 'Avg time/rank (inc)', 'Total time (inc)',
       'Min time/rank (inc)', 'Max time/rank (inc)'],
      dtype='object')
[25]:
plt.figure(figsize=(30,30))
metrics = ["Total time (exc)_median"]
tt.display_heatmap(th_algorithm_ex1, columns=metrics)
[25]:
<AxesSubplot: ylabel='node'>
_images/thicket_tutorial_42_1.png