intel-vtune Questions

3

Solved

Running through all the questions on profiling tools, I was surprised to discover VTune by Intel that I hadn't heard of before. At $700, it is even more expensive than AQTime. But before I make th...
Travistravus asked 21/12, 2009 at 7:20

3

I am using Intel(R) VTune(TM) Amplifier XE 2013 Update 5 (build 274450) for my linux application hotspot collect, but the report says the "[Outside any known module]" consume most of the time, so i...
Bunt asked 18/4, 2013 at 2:25

4

Solved

Total Time spent by a function in an application can be broadly divided in to two components: Time spent on actual computation (Tcomp) Time spent on memory accesses (Tmem) Typically profilers p...
Condyloid asked 15/11, 2016 at 9:6

6

Solved

Continuing on from my first question, I am trying to optimize a memory hotspot found via VTune profiling a 64-bit C program. In particular, I'd like to find the fastest way to test if a 128-byte b...
Bijugate asked 2/3, 2013 at 7:43

2

Solved

I am running a program on both Windows and Linux (x86-64). It has been compiled with the same compiler (Intel Parallel Studio XE 2017) with the same options, and the Windows version is 3 times fast...
Radiative asked 10/11, 2016 at 8:18

2

Solved

Edit 3: The images are links to the full-size versions. Sorry for the pictures-of-text, but the graphs would be hard to copy/paste into a text table. I have the following VTune profile for a pro...
Bouie asked 7/10, 2016 at 9:11

2

I have a routine that performs a few MKL calls on small matrices (50-100 x 1000 elements) to fit a model, which I then call for different models. In pseudo-code: double doModelFit(int model, ...) ...
Degrease asked 1/11, 2013 at 19:36

1

Solved

I wrote a multi-thread program with pthread, using the producer-consumer model. When I use Intel VTune profiler to profile my program, I found the producer and consumer spend lots of time on pthr...
Dani asked 3/6, 2013 at 1:50

1

Solved

I am trying to optimize this code. static lvh_distance levenshtein_distance( const std::string & s1, const std::string & s2 ) { const size_t len1 = s1.size(), len2 = s2.size(); std::vect...
Tributary asked 22/4, 2013 at 10:37

1

Solved

I want to optimize my code which is written in c++ on linux platform.For that i am using Intel VTune Performance Analyzer Profiler .When i am identifying Hotspots , it successfully runs the binary ...
Betteanne asked 6/3, 2013 at 10:15

4

Solved

I have a profiling issue - imagine I have the following code... void main() { well_written_function(); badly_written_function(); } void well_written_function() { for (a small number) { highly...
Yellowhammer asked 15/6, 2010 at 16:51

2

Solved

I'm currently developing a C-module for a Java-application that needs some performance improvements (see Improving performance of network coding-encoding for a background). I've tried to optimize t...
Transpierce asked 17/10, 2011 at 14:4
1

© 2022 - 2024 — McMap. All rights reserved.