TOPIC
Customer says that their Apple Network Server 500 or 700 running AIX appears to be slow.
DISCUSSION
Please keep in mind that performance tuning is an art, and at present we do not have any numbers as to what constitutes healthy performance on the Network Servers. We will have better information after we and our customers have spent considerable time exercising these boxes. Also consider that any good UNIX tuning guide is usually between 200 and 400 pages long, and that IBM offers a 3 to 4 day class in this area. The Info Explorer List of Books also contains "AIX Versions 4.1 Problem Solving Guide and Reference." This FAQ is only a brief introduction.

Four Areas To Check
===================

If an AIX system appears to be slow, there are four general areas that need to be examined over time before making any suggestions to improve performance: CPU Usage, Memory Usage, Disk and Local Peripheral I/O Performance, and Network Performance.

CPU Usage

"ps aux" will show you the CPU and memory usage of processes presently running.

# ps aux
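Because "ps aux" prints one line per process, a short filter can rank processes by the RSS column (real memory, described next). The following is only a sketch: top_rss is a hypothetical helper name, not an AIX command, and it assumes the output starts with a header row that names the PID and RSS columns and that no two columns run together.

```shell
# Rank processes by resident memory (RSS), largest first.
# Reads "ps aux"-style text on stdin and prints "RSS PID" pairs.
# top_rss is a hypothetical helper, not part of AIX.
top_rss() {
    awk '
        NR == 1 {
            # Locate the columns by name instead of assuming positions.
            for (i = 1; i <= NF; i++) {
                if ($i == "RSS") rss = i
                if ($i == "PID") pid = i
            }
            next
        }
        rss && pid { print $rss, $pid }
    ' | sort -rn | head -n "${1:-5}"
}
```

Typical use would be "ps aux | top_rss 10" to list the ten largest processes.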
The columns of interest are SZ and RSS. Processes in UNIX consist of text (code), data, and stack segments. SZ is a measure of the virtual memory allocated for the data and stack segments of a running process, plus the text segment if it is not shared code. RSS is a measure of the actual (real) memory allocated for a process. Processes that use a large percentage of the available memory might be candidates for program optimization, or for running when system use is low by means of the cron or batch facilities. Also, nice or renice could be used to lower the priority of these processes.

"iostat" can tell you in general whether CPU usage is high. If it is, "sar -q" will show you the run queue size under the heading runq-sz.

# sar -q 1 2
"w" can tell you the load average. This is a measure of the size of the run queue and can indicate whether the CPU can handle the number of processes that are attempting to run at any one time. If this number is too high, some jobs may be good candidates to run at times when system use is low, by using the cron or batch facilities. Also, nice or renice could be used to lower their priority.

# w
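To watch the load average from a script, the figure can be pulled out of the first line that "w" prints. This is a minimal sketch: load_avg_1min and load_exceeds are hypothetical helper names, the sketch assumes the first line contains the text "load average:" followed by numbers, and the threshold is whatever the administrator chooses, since this FAQ gives no figure for "too high".

```shell
# Print the first load-average figure from the first line of "w" output
# read on stdin. Hypothetical helper, not an AIX command.
load_avg_1min() {
    sed -n '1s/.*load average[s]*: *\([0-9.]*\).*/\1/p'
}

# Exit status 0 if that figure is greater than the threshold in $1,
# and 1 otherwise (including when no load average could be found).
load_exceeds() {
    load_avg_1min | awk -v max="$1" '
        BEGIN { rc = 1 }
        { if ($1 + 0 > max + 0) rc = 0 }
        END { exit rc }
    '
}
```

For example, "w | load_exceeds 4 && echo busy" would print "busy" only when the first figure is above 4.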
Another method of controlling the use of system resources by processes is the /etc/security/limits file. Please see the man page on limits for more details.

Memory Usage

vmstat will show memory usage. Remember to throw out the first entry, since it is the sum total of activity since the system booted.

# vmstat 2
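The advice to throw out the first entry can be built into a small filter. The sketch below averages the po (page-out) column of captured vmstat output while skipping the first data row. avg_po is a hypothetical helper name, and the sketch assumes only that one header row contains a column literally named "po"; it finds that column by name rather than by position.

```shell
# Average the po column of "vmstat <interval>" output read on stdin,
# ignoring the first data row (the totals since boot).
# avg_po is a hypothetical helper, not an AIX command.
avg_po() {
    awk '
        col == 0 {
            # Still looking for the header row that names the columns.
            for (i = 1; i <= NF; i++) if ($i == "po") col = i
            next
        }
        $col ~ /^[0-9]+$/ {
            # Count numeric rows only; repeated headers are skipped.
            if (++rows > 1) { sum += $col; n++ }
        }
        END { if (n) printf "%.1f\n", sum / n; else print "n/a" }
    '
}
```

One way to use it is to save a run of "vmstat 2" to a file and then run "avg_po < file".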
vmstat can indicate whether a high paging rate is slowing down the system. The pi and po fields under the page heading are of particular importance. pi may be meaningless, since some processes page in at start time. A large po count, on the other hand, could be an indication of paging problems. This may mean that more memory is needed if all the present processes must run at the same time. One possible solution is to run some jobs at later times using the cron and/or batch facilities. If code is written in house, it might help to make sure that code optimization techniques, such as shared libraries, are used.

Make sure that there is sufficient paging space on all the disks on the system. As a general rule, paging space should be spread across the first 4 or 5 disks on a system to minimize paging problems.

Disk and Local Peripheral I/O Performance

iostat can be used to determine usage. Remember to throw out the first entry, since it is the sum total of activity since the system was booted.

# iostat
iostat can indicate whether disk usage is well balanced. It may be possible to increase performance by moving certain heavily used logical volumes from a heavily used disk to a less used disk. If disk usage is well balanced, iostat can also indicate whether there are possible SCSI or disk hardware problems.

Network Performance

netstat can indicate that there are excessive network errors. The Ierrs and Oerrs columns from "netstat -i" are of particular interest here. Ierrs and Oerrs should not be greater than 1% of Ipkts or Opkts, respectively. The Coll (collision) column should generally not be more than 5 or 10 percent of the network bandwidth with Ethernet. (There is some question as to whether AIX keeps track of this, which we need to review.) Higher values may be an indication of faulty network components or network congestion.

# netstat -i
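The 1% guideline for Ierrs and Oerrs is simple arithmetic that can be applied to captured "netstat -i" output. The following is a sketch only: check_errs is a hypothetical helper name, and it assumes the last five columns of each line are Ipkts, Ierrs, Opkts, Oerrs, and Coll, in that order, so that a blank Address column does not shift the fields.

```shell
# Report interfaces whose input or output error rate exceeds a percentage
# limit (default 1). Reads "netstat -i" text on stdin.
# check_errs is a hypothetical helper, not an AIX command.
check_errs() {
    awk -v limit="${1:-1}" '
        NR == 1 || NF < 6 { next }   # skip the header and short lines
        seen[$1]++ { next }          # an interface may be listed per address
        {
            # Count columns from the right: Ipkts Ierrs Opkts Oerrs Coll.
            ipkts = $(NF-4); ierrs = $(NF-3)
            opkts = $(NF-2); oerrs = $(NF-1)
            in_pct  = (ipkts > 0) ? 100 * ierrs / ipkts : 0
            out_pct = (opkts > 0) ? 100 * oerrs / opkts : 0
            if (in_pct > limit || out_pct > limit)
                printf "%s in=%.2f%% out=%.2f%%\n", $1, in_pct, out_pct
        }
    '
}
```

Running "netstat -i | check_errs" prints nothing when every interface is within the limit.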
nfsstat can indicate that there are excessive network errors. This can be caused by overloaded NFS servers, or by possible network congestion or hardware problems.

# nfsstat
Which Processes to Kill

What processes can I safely kill on my system to perhaps make its performance a little faster?
Document Information | |
Product Area: | Computers |
Category: | Apple Network Server |
Sub Category: | Network Server 500; Network Server 700 |
Keywords: | knts, ksts |
Copyright © 2000 Apple Computer, Inc. All rights reserved.