CLAMS App User Manual

Using CLAMS App

This document provides general instructions for installing and using CLAMS Apps. App developers may provide additional information specific to their app, hence it’s advised also to look up the app website (or code repository) to get the additional information.

Requirements

Generally, a CLAMS App requires

To run the app in a container (as an HTTP server), container management software such as docker or podman. This is the recommended way to use CLAMS Apps.
- (the CLAMS team is using docker for development and testing, hence the instructions are based on docker commands.)
To run the app locally, Python3 with the clams-python module installed. Python 3.8 or higher is required.
To invoke and execute analysis, HTTP client utility (such as curl).

For Python dependencies, usually CLAMS Apps come with requirements.txt files that list up the Python library. However, there could be other non-Python software/library that are required by the app.

Installation

CLAMS Apps available on the CLAMS App Directory. Currently, all CLAMS Apps are open-source projects and are distributed as

source code downloadable from code repository
pre-built container image

Please visit the app-directory to see which apps are available and where you can download them.

In most cases, you can “install” a CLAMS App by either

downloading pre-built container image directly (quick-and-easy way)
downloading source code from the app code repository and manually building a container image (more flexible way if you want to modify the app, or have to build for a specific HW)

Download prebuilt image

This is the quickest (and recommended) way to get started with a CLAMS App. CLAMS apps in the App Directory come with public prebuilt container images, available in a container registry.

docker pull <prebulit_image_name>

The image name can be found on the App Directory entry of the app.

Build an image from source code

Alternatively, you can build a container image from the source code. This is useful when you want to modify the app itself, or when you want to change the image building process to adjust to your hardware environment (e.g., specific compute engine), or additional software dependencies (e.g. MMIF plugins). To download the source code, you can either use git clone command or download a zip file from the source code repository. The source code repository address can be found on the App Directory entry of the app.

From the locally downloaded project directory, run the following in your terminal to build an image from the included container specification file.

(Assuming you are using docker as your container manager)

$ docker build . -f Containerfile -t <IMAGE_NAME_YOU_PICK>

Running CLAMS App

CLAMS Apps are primarily designed to run as an HTTP server, but some apps written based on clams-python SDK additionally provide CLI equivalent to the HTTP requests. In this session, we will first cover the usage of CLAMS apps as an HTTP server, and then cover the (optional) CLI.

Starting the HTTP server as a container

Once the image is built (by docker build) or downloaded (by docker pull), to create and start a container, run:

$ docker run -v /path/to/data/directory:/data -p <PORT>:5000 <IMAGE_NAME>

where /path/to/data/directory is the local location of your media files or MMIF objects and PORT is the host port number you want your container to be listening to. The HTTP inside the container will be listening to 5000 by default, so the second part of the -p argument is always 5000. Usually any number above 1024 is fine for the host port number, and you can use the same 5000 number for the host port number.

The mount point for the data directory inside the container can be any path, and we used /data just as an example. However, it is very important to understand that the file location in the input MMIF file must be a valid and available path inside the container (see below for more details).

Note If you are using a Mac, on recent versions of macOS, port 5000 is used by Airplay Receiver by default. So you may need to use a different port number, or turn off the Airplay Receiver in the System Preferences to release 5000. For more information on safe port numbers, see IANA Port Number Registry or Wikipedia.

Note Another note for users of recent Macs with Apple Silicon (M1, M2, etc) CPU: you might see the following error message when you run the container image.
The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm64/v8) and no specific platform was requested
This is because the image you are trying to run is built for Intel/AMD x64 CPUs. To force the container to run on an emulation layer, you can add --platform linux/amd64 option to the docker run command.

Additionally, you can mount a directory to /cache/ inside the container to persist the cache data between container runs. This is particularly handy when the app you are using downloads a fairly large pretrained model file on the first run, and you want to keep it for the next run.

Unlike the data directory, the cache directory is not required to be mounted, but if you want to persist the cache data, you can mount a local directory to /cache/ inside the container (fixed path).

docker run -v /path/to/data/directory:/data -v /path/to/cache/directory:/cache -p <port>:5000 <image_name>

Note One might be tempted bind-mount their entire local cache directory (usually ~/.cache in Linux systems) to re-use locally downloaded model files, across different apps. However, doing so will expose all the cached data, not just model files, to the container. This can include sensitive information such as browser cache, authentication tokens, etc, hence will pose a great security risk. It is recommended to create a separate directory to use as a cache directory for CLAMS containers.

Invoking the app server

To get app metadata

Once the app is running as an HTTP server, visit the server address (localhost:5000, or the remote host name if running on a remote computer) to get the app metadata. App metadata is also available at the App Directory entry of the app if the app is published on the App Directory. App metadata contains important information about the app that we will use in the following sections.

To process input media

To actually run the app and process input media through computational analysis, simply send a POST request to the app with a MMIF input as the request body.

MMIF input files can be obtained from outputs of other CLAMS apps, or you can create an empty MMIF only with source media locations using clams source command. See the help message for a more detailed instructions. (Make sure you have installed clams-python package version from PyPI.)

$ pip install clams-python
$ clams source --help

For example; by running

$ clams source audio:/data/audio/some-audio-file.mp3

You will get

{
  "metadata": {
    "mmif": "http://mmif.clams.ai/X.Y.Z"
  },
  "documents": [
    {
      "@type": "http://mmif.clams.ai/vocabulary/AudioDocument/v1",
      "properties": {
        "mime": "audio",
        "id": "d1",
        "location": "file:///data/audio/some-audio-file.mp3"
      }
    }
  ],
  "views": []
}

If an app requires just Document inputs (see input section of the app metadata), an empty MMIF with required media file locations will suffice. The location has to be a URL or an absolute path, and it is important to ensure that it exists. Especially when running the app in a container, and the document location is specified as a file system path, the file must be available inside the container. In the above, we bind-mounted /path/to/data/directory (host) to /data (container). That is why we used /data/audio/some-audio-file.mp3 as the location when generating this MMIF input. So in this example, the file /path/to/data/directory/audio/some-audio-file.mp3 must exist on the host side, so that inside the container, it can be accessed as /data/audio/some-audio-file.mp3.

Some apps only works with input MMIF that already contains some annotations of specific types. To run such apps, you need to run different apps in a sequence.

(TODO: added CLAMS workflow documentation link here.)

When an input MMIF is ready, you can send it to the app server. Here’s an example of how to use the curl command, and store the response in a file output.mmif.

$ clams source audio:/data/audio/some-audio-file.mp3 > input.mmif
$ curl -H "Accept: application/json" -X POST -d@input.mmif -s http://localhost:5000 > output.mmif

# or using a bash pipeline 
$ clams source audio:/data/audio/some-audio-file.mp3 | curl -X POST -d@- -s http://localhost:5000 > output.mmif

Windows PowerShell users may encounter an Invoke-WebRequest exception when attempting to send an input file with curl. This can be resolved for the duration of the current session by using the command remove-item alias:curl before proceeding to use curl.

Configuring the app

Running as an HTTP server, CLAMS Apps are stateless, but can be configured for each HTTP request by providing configuration parameters as query string.

For example, appending ?pretty=True to the URL will result in a JSON output with indentation for better readability.

Note When you’re using curl from a shell session, you need to escape the ? or & characters with \ to prevent the shell from interpreting it as a special character.

Different apps have different configurability. For configuration parameters of an app, please refer to parameter section of the app metadata. In addition to app-specific parameters, all apps support universal parameters (e.g., pretty for formatted output). Check the app metadata for the complete and up-to-date list.

Using CLAMS App as a CLI program

First and foremost, not all CLAMS Apps support command line interface (CLI). At the minimum, a CLAMS app is required to support HTTP interfaces described in the previous section. If any of the following instructions do not work for an app, it is likely that the app does not support CLI.

Python entry points

Apps written on clams-python SDK have three python entry points by default: app.py, metadata.py, and cli.py.

`app.py`: Running app as a local HTTP server

app.py is the main entry point for running the app as an HTTP server. To run the app as a local HTTP server without containerization, you can run the following command from the source code directory.

$ python app.py

By default, the app will be listening to port 5000, but you can change the port number by passing --port <NUMBER> option.
Be default, the app will be running in debugging mode, but you can change it to production mode by passing --production option to support larger traffic volume.
As you might have noticed, the default CMD in the prebuilt containers is python app.py --production --port 5000.

Environment variables for production mode

When running in production mode, the following environment variables can be used to configure the app server:

Variable	Description	Default
`CLAMS_GUNICORN_WORKERS`	Number of gunicorn worker processes	Auto-calculated based on CPU cores and GPU memory
`CLAMS_LOGLEVEL`	Logging verbosity level (`debug`, `info`, `warning`, `error`)	`warning`

By default, the number of workers is calculated as (CPU cores × 2) + 1. For GPU-based apps, see GPU Memory Management for details on automatic worker scaling and VRAM management.

`metadata.py`: Getting app metadata

Running metadata.py will print out the app metadata in JSON format.

`cli.py`: Running as a CLI program

cli.py is completely optional for app developers, and unlike the other two above that are guaranteed to be available, cli.py may not be available for some apps. When running an app as a HTTP app, the input MMIF must be passed as POST request’s body, and the output MMIF will be returned as the response body. To mimic this behavior in a CLI, cli.py has two positional arguments;

$ python cli.py <INPUT_MMIF> <OUTPUT_MMIF>  # will read INPUT_MMIF file, process it, and write the result to OUTPUT_MMIF file

<INPUT_MMIF> and <OUTPUT_MMIF> are file paths to the input and output MMIF files, respectively. Following the common unix CLI practice, you can use - to represent STDIN and/or STDOUT

# will read from STDIN, process it, and write the result to STDOUT
$ python cli.py - -  

# or equivalently
$ python cli.py 

# read from a file, write to STDOUT
$ python cli.py input.mmif -

# or equivalently
$ python cli.py input.mmif

# read from STDIN, write to a file
$ cat input.mmif | python cli.py - output.mmif

As with the HTTP server, you can pass configuration parameters to the CLI program. All parameter names are the same as the HTTP query parameters, but you need to use -- prefix to indicate that it is a parameter.

$ python cli.py --pretty True input.mmif output.mmif

Finally, when running the app as a container, you can override the default CMD (app.py) by passing a cli.py command to the docker run command.

$ cat input.mmif | docker run -i -v /path/to/data/directory:/data <IMAGE_NAME> python cli.py 

Note that input.mmif is in the host machine, and the container is reading it from the STDIN. You can also pass the input MMIF file as a volume to the container. However, when you do so, you need to make sure that the file path in the MMIF is correctly set to the file path in the container.

Note Here, make sure to pass -i option to the docker run command to make host’s STDIN work properly with the container.

CLAMS Team