Test Tools FAQ

Medusa Labs Test Tools FAQ

Installation

I am getting the error message, “Unable to determine the product installation directory”, after I install and try to run the Tools.

The Tools look for an environment variable that points to the installation directory. This variable is created during installation. Try opening a new command prompt or terminal window and see if the problem persists. It is sometimes necessary to reboot the system to completely register the system environment variable. Make sure that the variable is present in the system environment variable list. Ex. MEDUSA_MLTT_INSTALL_DIR=C:\Program Files\Finisar\Medusa Labs\Test Tools\

I get a pop-up error dialog stating that a DLL can’t be located when I try to run the Windows GUI.

The GUI requires the Microsoft .NET GUI. You will need to run the setup for the .NET framework, which is provided on the installation CD (dotnetfx.exe.)

General

What is the difference between the Pain and Maim tools?

The primary difference between the Tools is their I/O dispatch method. Pain uses a threaded engine, where each thread sends and waits on a single I/O at a time (synchronous I/O.) Maim uses a single thread to send multiple I/Os at a time and processes the I/Os as they individually complete. Each Tool has its own strengths.

Pain

The threaded model makes Pain a good all-round system tester (multi-threading works processors and memory busses better with all the context switching.)

Pain is very fast with low thread counts and small file sizes. It can slow down as the numbers increase. I/O to disks becomes somewhat random, as each thread has its own file or disk extent of the file size.

Pain has a memory only I/O mode.

Synchronous I/O is usually better suited for file system tests, especially with network shares.

Maim

Asynchronous model can offer more precision I/O traffic control with lower overhead.

Can produce bursty, fluctuating queue depth or static queue depth (great for maintaining max stress or pushing queue full.)

May produce better performance out of target by working in a smaller disk area (single file area in single worker thread.)

I just typed “pain” or “maim” at my command line at hit enter. This appears to have started a test. Where is this running to?

When you run one of the Tools without any arguments, they will start an I/O test with the default switch configurations to a data file in the current working directory. To display the online help, enter “pain –h” or “maim –h”.

How many concurrent threads or asynchronous I/Os do the tools support?

The tools support a maximum of 512 threads or I/Os per instance.

What is the maximum file size that the tools can access?

In general the tools are capable of accessing a maximum file size of 2GB, per thread. The asynchronous tool, Maim has an I/O mode (-m18) that is capable of addressing virtually limitless storage extents for full target coverage.

Is there a way to control the LBA range which I run my traffic?


Not exactly. Currently, the only location input the tools accept is offset in MB. This is the –x switch. Ex. –x10 for 10MB. The tools automatically start at a base offset of 1MB (LBA 0x800), to avoid overwriting critical parts of the drive (signature.) You can override the base offset with the –O switch. File size can be used to set the ending LBA (just divide file size by sector size add the starting offset to figure out where the tools will stop.) Note that file size is per thread. So, to be precise with starting and stopping points, you have to use a single Pain thread or use Maim (which uses a single worker thread.)

Is there a lookup table of errors generated by the test suite?

Please see Appendix E of the User’s Guide for a listing of error codes returned by the tools. These are our internal error codes. The Tools will also report any OS error codes and messages (when available) to the console and log files.

Why am I getting an xterm or dtterm error when I try to start a test with Catapult from a telnet or ssh session into my UNIX system?

Currently, Catapult requires a GUI desktop session on UNIX platforms (CDE on Solaris, Gnome/KDE on Linux.)

When I start a test with Catapult (ex. catapult –p pain), it runs to all of my drives. Is there a way to select only the drives I want?


Yes, Catapult has two switches that can be used to explicitly set what drives to run to. The include switch (-i) will cause a test to run to only the drives indicated. The exclude switch (-x) will cause a test to run to all drives except the drives indicated. Drives may be specified individually or as a range. Ex. catapult –p –i2,4,7-10 pain

Why do I get a message from Catapult about drives being excluded due to no drive signature?

Catapult uses drive signatures to track which drives contain the operating system and any active partitions. This is a protection mechanism to prevent accidental destruction of critical OS areas. It is best practice to make sure all drives have a signature. Catapult does have a switch to override the signature check (-o), but we recommend that you allow Catapult to perform the default check.

Why won’t the Tools exit when I hit CTRL-C or try to kill the process?

In test environments, there is always the chance that one or more I/Os sent by the Tools may be held up in a lower level. In such a case, the Tools may not be able to respond. This is more likely to be seen in logical or physical drive tests. For example, a target may reset while an I/O is pending, leaving a device driver to initiate a recovery routine that can easily take many seconds or minutes. Particularly when using the Maim Tool, an I/O may be held up a kernel level with the application locked up waiting for release.

What does it mean when I get I/O halt or I/O timeout error messages?

The Tools automatically monitor I/O traffic for any interruptions that would warrant further investigation and display an error message. There are two conditions that are checked in each performance sample. First, the Tools will report if there are no completions at all in a performance sample (sample interval is 5 seconds by default.) This is considered to be an I/O halt (all traffic has stopped for one or more sample intervals.) Second, the Tools will report if one or more I/Os have not completed in a performance sample, but other completions are occurring. This is considered to be an I/O timeout or “stuck” I/O scenario. The time interval for detecting these conditions can be increased or disabled with the –M switch on the Pain or Maim command line.

Licensing

I want to run a scripted test for an extended period, but my license checkout only has a short time remaining. Is there any way I can extend the checkout time?

Yes, you can perform a check-in of your current license with the –Z switch (ex. Pain –Z.) Then, perform a new checkout of the desired duration (ex. Pain –Z7 to checkout for 7 days.)

I’m getting an Error 18 licensing error on the console when I try to run the tools. What does this mean?

This error essentially means that the license state stored on the system is invalid. This may be the result of a natural expiration because the checkout time has been exceeded. A hardware change, such as the removal of a NIC can cause this. It may also indicate that the license state is damaged due to tampering with the system clock or license lock files. Ensure that the system clock is correct. The tools will always try to contact a license server when a local license check fails. The license server should issue a license if one is available. Alternatively a remote license may be installed if a license server cannot be reached.

The tools cannot checkout a license from my license server.


There are a number of possible causes for a failure to get a license from the server. Make sure that system date and time is correct. A restart of the license server may correct the problem. If you still cannot check out a license, check the following.

Is the server accessible over the network? From the client system, can you ping the server by name or IP address?

YES: Check the server name listed in the MedusaTools.cfg file in the config directory. We recommend using IP address to avoid name resolution issues.

NO: There is a communication issue between the client and server that is independent of the Test Tools.

Is the license service responding? From the client system, run “lsmon server_ipaddress” (Ex. lsmon 192.168.1.2). Does the license server respond with a status output?

YES: Check the status output. Are there available license tokens? Has the license expired?
NO: The service may not be running. Make sure that the SentinelLM service is running on the license server. Make sure that nothing is blocking port 5093.

From the client system, run Wlmadmin. Enter the license server ip address as a defined server. Does the Feature Name (MLTestTools) appear under the license server?

YES: Make sure that there are free tokens (users.) Make sure that the license has not expired.
NO: Try re-adding the license file (right-click the server and add license to server and its file.) If the license can’t be added, the USB dongle may not be attached or the driver is not running. There may also be a problem with the license file.

Does the license file exist on the server? Go to the Windows Service manager and check the properties of the SentinelLM service to find the directory that it is running from. Go to this directory and check for this file.

YES: Check the file for expiration (should have the expiration in clean text.) Run “lsdecode –s lservrc” and check license details for accuracy.
NO: Try copying the license file to this directory and restarting SentinelLM. If the Feature Name is still not visible from Wlmadmin, the USB dongle may not be attached or the driver may not be running.

Is the Server’s USB dongle attached and working? Make sure the USB dongle is attached to the system. Check for its existence in the Windows Device Manager under USB devices (Rainbow SuperPro.)

YES: Run Wechoid.exe on the license server and uncheck all boxes except the computer ID. Note the locking data at the bottom of the screen. At a command prompt, change to the directory where the license server is installed and run “lsdecode –s lservrc”. The locking values should match what is displayed in Wechoid.exe..
NO: Try reinstalling the USB driver

The Tools are taking a very long to perform a license check on my Linux system.

Make sure that the host name is correctly set. If your system is reporting “localhost” as the host name, there is a problem with the host name configuration. Refer to the documentation for your Linux distribution for information on setting the host name.

I did a remote checkout with GetKey to get the Tools running on a system off of my network. Is there a way to check the license back in?

No, remote checkouts must expire naturally. Make sure you only perform a remote checkout for the number of days required to avoid tying up a license unnecessarily.

Performance

The performance numbers reported by the tools are too high to be real. Is there a problem with the calculations performed by the tools?

Higher than expected performance is typically the result of caching by the host operating system. This can happen when running to file systems on any operating system or to physical block devices on UNIX platforms. This behavior may be seen in spite of I/O operation flags being set to a non-buffered mode (default for the test tools.) Using the –c (commit) switch on the tools may help alleviate some performance anomalies. Whenever possible, Medusa Labs recommends running the test tools to physical (raw) devices for optimal performance and accurate reporting. On Linux systems, the “raw” utility should be used (run “man raw” for more information.) On Solaris systems, the /dev/rdsk device paths should be used.

Why do I sometimes see the performance samples on screen vary from the set interval?

The performance samples are created with a dedicated worker thread. Due to extreme system load, there may instances where the thread is interrupted during the performance sample calculations. In extreme cases, it may take considerable time for the thread to be serviced again and execute the console and log print functions. This will result in a sample interval that exceeds the set interval and produces skewed sample output. Thread starvation may be alleviated by decreasing the number of worker threads and/or target devices.

Data Patterns

Can I create custom data patterns?

The tools can read any existing pattern file you might have.
Ex. pain –l0 -@c:\pattern.dat.
There are a number of switches that allow customizing of our stock patterns (-L, -e, -E, -I, -F.) The –l99 pattern is extremely useful in this capacity.

Data Integrity

Will "stale" data corruption scenarios on a target device be detected by the tools?

The test tools utilize a number of methods to maintain a focused, yet varying write data stream. Most data patterns are “reversible”, i.e. they run in one direction on one write pass and the opposite direction on the next pass, continuously alternating. A few data patterns change on the fly with each write pass. By default, all patterns are overlaid with a continuously changing unique I/O signature in every sector. These measures are designed to catch any instances of data corruption as a result of stale data being returned on a read-back operation.

I started running some tests on physical drives on a Windows system. The Tools are reporting data corruptions that I don’t believe are real.

Before you run a physical drive test, make sure that any old partitions are deleted and reboot your system. We have seen cases where a test is started on a physical drive that used to host a file system and the NTFS file system manager is not aware that the partition has been removed. Latent file table updates may be sent down over data written by the Tools. This is usually seen as data corruptions 4K in size.

Test Configurations

Why am I seeing writes when I specified a read only test?

When you use the –r switch with data comparisons enabled (this is the default behavior), there will be one write pass through the entire file size specified, followed by continuous read only traffic. The purpose of this write pass is to lay down the indicated data pattern (or a default pattern, if one is not specified.) In this manner, the data pattern will be continuously verified during the read only testing. If data integrity is not a concern, as in pure performance testing, the –r and –n switches can be used together. This will turn off data compares and the reads will return whatever information is currently on the target.




Since 1997, Gillaspy Associates has built a solid reputation for developing strong relationships with our customers by providing quality solutions and ongoing support.

Downloads

Medusa Labs Test Tools Suite Datasheet


Test Services

Medusa Labs Test Services

Initial Evaluation

Functional

Product

Enterprise

Performance &
Benchmarks


Lab Overview


Test Tools


Medusa Test Tools Suite

FAQ


Training


Overview

Certification

Courses


Course Calendar


Register


Training Locations


Medusa Labs

Home | Products | Protocols | Employment | Link to Us | Privacy | Contact Us

Copyright © 1997 - 2008 Gillaspy Associates. All Rights Reserved