Skip to content

Switch scalability test with idle MT Cbench switches

Konstantinos Papadopoulos edited this page May 18, 2018 · 46 revisions

Test description

This is a switch scalability test with switches emulated using MT-Cbench. The switches are in idle mode, meaning that they do not initiate any traffic to the controller.

The goal is to explore the maximum number of idle switches the controller can sustain, and how a certain-sized topology should be connected to the controller so that the latter can successfully discover it at the minimum time. MT-Cbench switches typically sit idle during main test execution, without sending or receiving any kind of messages.

The controller should be configured to start with the drop-test feature installed to be able to reply to initial OF messages. The emulated switches are arranged in a disconnected topology, meaning they do not have any interconnection between them. This, along with the limited protocol support, constitute MT-Cbench a special-purpose OF emulator and not a full-fledged, realistic OF switch emulator.

The test gradually boots bigger topologies and queries the controller operational DataStore for installed switches, via the NorthBound REST interface. Discovered switches and discovery (installation) times are then reported.

The goal of this test is to investigate the controller idle switch scalability, i.e. the ability to sustain a switch topology for various switch numbers. Specifically, the objectives are:

  • to find the largest possible number of active connections a controller can accept and maintain
  • to find the combination of boot-up-related configuration keys that leads to the fastest successful network boot-up. We consider a boot-up as successful when all network switches have become visible in the operational DataStore of the controller.

To find the number of switches visible to the controller we query its operational DataStore using RESTCONF calls from the NorthBound interface.

Usage

A switch scalability test with idle MT-Cbench switches can be started by specifying the following options in NSTAT command line:

  • --test=sb_idle_scalability
  • --sb-generator-base-dir=<MT-Cbench dir>

Under the stress_test/sample_test_confs/<controller_name>/ directory, the JSON files ending in _sb_idle_scalability_mtcbench can be handled as template configuration files for this kind of test scenario. You can specify them to the --json-config option to run a sample test. For larger-scale stress tests, have a look at the corresponding files under the stress_test/stress_test_confs/<controller_name>/ directory.

Nodes deployment

For this test 3 nodes are required.

  • NSTAT node
  • controller node
  • SouthBound emulator node (MT-Cbench)

In order to deploy these nodes, based on docker containers, we have two options

  • download the prebuilt environment from DockerHub
  • build your own container locally using the provided Dockerfiles for proxy and no-proxy environments, under the path deploy/docker

In both cases, docker has to be installed and any user that will manipulate docker containers, must be added to the docker group. To deploy the required nodes, see installation wiki.

After deployment of docker nodes, update the NSTAT repository using the following steps

  • open a new terminal and execute the command

    docker ps -a

    the output of the above command will be similar to the following

     CONTAINER ID       IMAGE                                  COMMAND               CREATED              STATUS              PORTS    NAMES
    
     4c05473bb7c8       intracom/nstat-sdn-controllers:proxy   "/usr/sbin/sshd -D"   About a minute ago   Up About a minute   22/tcp   controller
     72e4572878e2       intracom/mtcbench:proxy                "/usr/sbin/sshd -D"   About a minute ago   Up About a minute   22/tcp   mtcbench
     60db64735a26       intracom/nstat:proxy                   "/usr/sbin/sshd -D"   About a minute ago   Up About a minute   22/tcp   nstat

    get the container names of all docker containers you created

  • for each docker name execute the following command

       WAIT_UNTIL_RETRY=2
       docker exec -i $container_name /bin/bash -c "rm -rf /opt/nstat; \
          cd /opt; \
          until git clone https://github.com/intracom-telecom-sdn/nstat.git -b master; do \
              echo 'Fail git clone NSTAT. Sleep for $WAIT_UNTIL_RETRY and retry'; \
          done"

    where you should replace the $container_name with the container name of the corresponded docker node, acquired from previous step.

Download the prebuild environment

Edit json test configuration file

The IP addresses of all deployed VMs and the credentials to open SSH connections, must be configured in the json configuration file of the sample test we want to run. This action must be done in nstat_node.

  • Run the command

    docker ps -a
    
    get container names of all created nodes
     - NSTAT node
     - Controller node
     - SouthBound emulator node (MT-Cbench)
    
  • Get IP Addresses of all nodes

    docker exec -i $container_name /bin/bash -c "ifconfig"
  • SSH into nstat_node using the IP Address of this node that was acquired from the previous step

    ssh root@<NSTAT_node_ip>

    the password to connect is root123.

  • Edit json file /opt/nstat/stress_test/sample_test_confs/boron/boron_RPC_sb_active_scalability_mtcbench.json and change the following lines changing IP addresses and SSH credentials:

    "nstat_node_ip":"<NSTAT_node_ip>",
    "nstat_node_ssh_port":"22",
    "nstat_node_username":"root",
    "nstat_node_password":"root123",
    
    "controller_node_ip":"<Controller_node_ip>",
    "controller_node_ssh_port":"22",
    "controller_node_username":"root",
    "controller_node_password":"root123",
    
    "sb_emulator_name":"MTCBENCH",
    "sb_emulator_node_ip":"<MT-Cbench_node_ip>",
    "sb_emulator_node_ssh_port":22,
    "sb_emulator_node_username":"root",
    "sb_emulator_node_password":"root123",

Run the test

In order to run the test

  • Open a new terminal and execute the following command

    docker exec -i nstat /bin/bash -c "export PYTHONPATH=/opt/nstat; source /opt/venv_nstat/bin/activate; \
    python3.4 /opt/nstat/stress_test/nstat.py \
      --test=sb_idle_scalability \
      --ctrl-base-dir=/opt/nstat/controllers/odl_boron_pb/ \
      --sb-emulator-base-dir=/opt/nstat/emulators/sbemu/mtcbench/ \
      --json-config=/opt/nstat/stress_test/sample_test_confs/boron/boron_RPC_sb_idle_scalability_mtcbench.json \
      --json-output=/opt/nstat/results.json \
      --html-report=/opt/nstat/report.html \
      --output-dir=/opt/nstat/results_boron_RPC_sb_idle_scalability_mtcbench/"

Inspect results

Once test execution is over, inspect the results under

/opt/nstat/results_boron_RPC_sb_idle_scalability_mtcbench

Configuration keys

The configuration keys that must be specified in the JSON configuration file are:

Config key type description
nstat_node_ip string IP Address of the NSTAT VM
nstat_node_ssh_port string the ssh port of the NSTAT VM
nstat_node_username string username for ssh login in the NSTAT VM
nstat_node_password string password for ssh login in the NSTAT VM
controller_name string name of the used controller. This value is used in Controller Factory method to return the appropriate controller object. For this test it should be ODL
controller_node_ip string IP Address of the Controller VM
controller_node_ssh_port string The ssh port of the Controller VM
controller_node_username string Username for ssh login in the Controller VM
controller_node_password string Password for ssh login in the Controller VM
sb_emulator_name string The name of SouthBound emulator. This value is used in Generator Factory method to return the appropriate SouthBound emulator object. For this test it should be MTCBENCH
sb_emulator_node_ip string IP Address of the MT-Cbench VM
sb_emulator_node_ssh_port string The ssh port of the MT-Cbench VM
sb_emulator_node_username string username for ssh login in the MT-Cbench VM
sb_emulator_node_password string password for ssh login in the MT-Cbench VM
controller_build_handler string executable for building controller (relative to --ctrl-base-dir command line parameter)
controller_clean_handler string executable for cleaning up controller directory (relative to --ctrl-base-dir command line parameter)
controller_get_handler string executable for downloading the controller prebuild version from its repository and extracts it.
controller_start_handler string executable for starting controller (relative to --ctrl-base-dir command line parameter)
controller_stop_handler string executable for stopping controller (relative to --ctrl-base-dir command line parameter)
controller_status_handler string executable for querying controller status (relative to ctrl-base-dir command line parameter)
controller_statistics_handler string executable for changing the period that the controller collects topology statistics (relative to --ctrl-base-dir command line parameter)
controller_persistent_handler string disables persistence of controller. This can be acchieved by adding the attribute persistent=false in file <controller_base_dir>/etc/org.opendaylight.controller.cluster.datastore.cfg
controller_oper_hosts_handler string makes a RESTCALL to the NorthBound interface of the controller in order to get the number of hosts from the operational datastore
controller_oper_links_handler string makes a RESTCALL to the NorthBound interface of the controller in order to get the number of links from the operational datastore
controller_oper_switches_handler string makes a RESTCALL to the NorthBound interface of the controller in order to get the number of switches from the operational datastore
controller_oper_flows_handler string makes a RESTCALL to the NorthBound interface of the controller in order to get the number of flows from the operational datastore
controller_flowmods_conf_handler string configures the controller plugins to respond with flow modifications on any PacketIN message with ARP payload
controller_logs_dir string controllers logs directory (relative to --ctrl-base-dir command line parameter)
controller_port number controller port number where OF switches should connect
controller_statistics_period_ms array of numbers controller different statistics period values (in (ms))
sb_emulator_build_handler string executable for building MT-Cbench (relative to --sb-emulator-base-dir command line parameter)
sb_emulator_run_handler string executable for running MT-Cbench (relative to --sb-emulator-base-dir command line parameter)
sb_emulator_clean_handler string executable for cleaning up MT-Cbench (relative to --sb-emulator-base-dir command line parameter)
sb_emulator_cleanup boolean whether to cleanup MT-Cbench after test completion
mtcbench_simulated_hosts array of numbers _number of hosts (MACs) simulated by the MT-Cbench
mtcbench_threads array of numbers number of total MT-Cbench threads
mtcbench_switches_per_thread array of numbers number of OF switches simulated per MT-Cbench thread
mtcbench_thread_creation_delay_ms array of numbers delay (in ms) between creation of consecutive MT-Cbench threads
mtcbench_delay_before_traffic_ms array of numbers delay (in ms) before MT-Cbench threads start transmitting OF traffic
mtcbench_mode string MT-Cbench mode ("Latency" or "Throughput")
mtcbench_warmup number number of initial internal iterations that should be treated as "warmup" and are not considered when computing aggregate performance results
mtcbench_ms_per_test number duration (in ms) of generator internal iteration
mtcbench_internal_repeats number number of internal iterations during traffic transmission where performance and other statistics are sampled
java_opts array of strings Java options to initialize JAVA_OPTS env variable
test_repeats number number of external iterations for a test, i.e. the number of times a test should be repeated to derive aggregate results (average, min, max, etc.)
plots array of plot objects configurations for plots to be produced after the test

The array-valued configuration keys shown in bold are the test dimensions of the test scenario. The stress test will be repeated over all possible combinations of their values.

The most important configuration keys are:

  • mtcbench_threads
  • mtcbench_switches_per_thread
  • mtcbench_thread_creation_delay_ms

These keys determine the parameters for progressively booting switches into an SDN network, allowing in this way to find the combination of values for booting a topology of certain size, optimally. The values of generator_threads and generator_switches_per_thread, define the overall number of network nodes (topology size), connected on the controller. This number is equal to (mtcbench_threads * mtcbench_switches_per_thread).

In order to prevent switches from initiating traffic, a large value should be provided to the generator_delay_before_traffic_ms configuration key to prevent the generator threads from sending traffic between the discovery deadline has ended.

Plot configuration

See the plotting page.

Result keys

The result keys produced by this kind of test scenario and which can be used subsequently to generate custom plots, are the following:

Result key type description
global_sample_id number unique (serial) ID for this sample
timestamp number unique timestamp for this sample
date string date this sample was taken
generator_internal_repeats number number of internal iterations during traffic transmission where performance and other statistics were sampled
discovered_switches_error_code number identification of what went wrong during switch discovery (0 successful switch discovery, 201 maximum polling deadline was reached without any new switch discovery. This deadline resets to its initial value if new switches are discovered from the polling function)
successful_bootup_time number has bootup times of successful test iterations. This is helper to make easy ploting of successful cases. For unsuccessful cases it has -1 value.
bootup_time_secs number time in secs in order to discover switches. We can see the number of discovered switches in discovered_switches result key.
max_discovered_switches number the maximum number of discovered switches recorded in the controller datastore from the polling function.
discovered_switches number the final number of recorded switches recorded from the polling function. If this number is 0 it means that the controller was collapsed before the topology is booted up. Polling function starts after topology bootup. The time we get for bootup_time_secs in this case is the sum of time it took for the topology to bootup plus the discovery deadline time.
mtcbench_switches number number of switches discovered for this test sample
mtcbench_simulated_hosts number number of hosts (MACs) simulated by the generator
mtcbench_switches number total number of generator simulated switches (equals to #threads*#switches_per_thread
mtcbench_threads number number of total generator threads
mtcbench_switches_per_thread number number of OF switches simulated per generator thread
mtcbench_thread_creation_delay_ms number delay (in (ms)) between creation of consecutive threads
mtcbench_delay_before_traffic_ms number delay (in (ms)) before we start sending generated traffic (In this test we set this parameter to a big value and we stop the generator before it actually starts generating traffic)
mtcbench_ms_per_test number duration (in (ms)) of generator internal iteration
mtcbench_warmup number number of initial internal iterations that were treated as "warmup" and are not considered when computing aggregate performance results
mtcbench_mode string generator mode (Latency or Throughput)
mtcontroller_ip string controller IP address where OF switches were connected
mtcontroller_port number controller port number where OF switches should connect
controller_java_xopts array of strings controller Java optimization flags (-X)
one_minute_load number one-minute average system load
five_minute_load number five-minute average system load
fifteen_minute_load number fifteen-minute average system load
used_memory_bytes number system used memory in bytes
total_memory_bytes number system total memory in bytes
controller_cpu_system_time number CPU system time for controller
controller_cpu_user_time number CPU user time for controller
controller_num_threads number number of controller threads measured when this sample was taken
controller_num_fds number number of controller open files
controller_statistics_period_ms number the interval (in (ms)) of the statistics period of the controller

The result key in bold (successful_bootup_time) is the main performance metric produced by this test scenario. Another important parameter is multinet_size. In cases of successful topology bootup the number of discovered_switches is the same with multinet_size.

Sample experimental results

The following figures show sample results from switch scalability stress tests with the OpenDaylight controller operating in two modes:

  • RPC mode: the controller is configured to directly reply to the switches with a predefined Flow-Mod message at the OpenFlow plugin level (use of start_droptestRPC.sh handler)

  • DataStore mode: the controller additionally performs updates in its DataStore (use of start_droptestDS.sh handlers)

Clone this wiki locally