Performance Troubleshooting

Use the methods defined in the following table to troubleshoot performance issues. When all data is gathered, include the information on a Jira ticket assigned to R&D.

Performance Test

Windows

Linux / Solaris

IO usage

Process Explorer

sar - d

sar - d - f /var/adm/sa/sa dd

(where dd is the day of month)

iostat - zdxnk 2

CPU usage

Process Explorer

Task Manager

sar - u

sar - f /var/adm/sa/sa dd

(where dd is the day of month) top, mpstat - P - ALL 2

Memory usage

Process Explorer

v mstat sar - r - f /var/adm/sa/sa dd

(where dd is the day of month)

Ping

ping dbserver

ping – l 8000 dbserver

ping – l 32000 dbserver

ping – c5 dbserver

ping – c5 – s8000 dbserver

ping – c5 – s32000 dbserver

Latency Requirements

Additional latency information is included in the Server Network Latency topic here.

Identify the Problem Layer

  1. From the Start Page, click the STEP System Administration button and supply the login credentials.
  2. On the Profiler tab, set the From and To date / time parameters to define when the performance issues were noticed.
  3. Click the Generate button to load the data.
  4. In the results section, expand the service methods to determine the issue:

Database Server

After reviewing the profiling data (as defined above), when the issue is a long-running SQL statement, follow these steps.

  1. From the Start Page, click the STEP System Administration button and supply the login credentials.
  2. On the Activity tab, set the Duration and Date / Time parameters according to when the performance issues were noticed.
  1. Click the Fetch data button to load the data.
  2. In the Details section, on the SQL tab, identify the same SQL statement with the same duration.
  3. Attach the SQL statement, including the execution plan, to the Jira issue.

For more information, see the Database Long-Lasting SQL Queries topic here.

Additional Steps

With the DB ToolBox (typically saved in the '/opt/stibo/step/admin/app-server-toolbox/' folder), you may be able to access the additional information outlined below.

Record this information using the Technical Support Tasks table in the Troubleshooting Checklists topic here.

DTP Server

Perform the following checks:

In case of emergency, use the restart action command: restart dtp-servers.

Services representing sidecars can be restarted or even the server itself. Services should be checked after a restart.

Record this information using the Technical Support Tasks table in the Troubleshooting Checklists topic here.

Application Server

Perform the following checks:

If constantly high with only small 'garbage collections'

If CPU usage is high, check if image GraphicsMagick processing (gm.exe, etc.) are starting and stopping. If so, this can be caused by excessive image conversions. Increasing the sizes of image caches can potentially fix this.

Compare the number of threads to the number of CPU cores in the application server. If the number of threads is higher, processes will queue and waits are experienced.

Add all of this information to the Technical Support Tasks table in the Troubleshooting Checklists topic here.

User Interface / Client Application

Perform the following checks:

If these checks and adjustments do not solve the problem, add this information to the Technical Support Tasks table in the Troubleshooting Checklists topic here.