CLAMS NFA Wrapper (v0.1)

About this version

About this app (See raw metadata.json)

**Wraps the NVIDIA NeMo Forced Aligner tool to temporally align transcribed text with its audio source.**

Inputs

(Note: “*” as a property value means that the property is required but can be any value.)

One of the following is required: [

]

Configurable Parameters

(Note: Multivalued means the parameter can have one or more values.)

  • model: optional, defaults to fc_hybrid

    • Type: string
    • Multivalued: False
    • Choices: fc_hybrid, parakeet, conformer, fc_ctc

    NeMo ASR model to use for alignment. If no value is given, the fc_hybrid model is used.

  • pretty: optional, defaults to false

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    The JSON body of the HTTP response will be re-formatted with 2-space indentation.

  • runningTime: optional, defaults to false

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    The running time of the app will be recorded in the view metadata.

  • hwFetch: optional, defaults to false

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    The hardware information (architecture, GPU and vRAM) will be recorded in the view metadata.
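The parameters above can be collected into a request URL, as in the minimal sketch below. It assumes the app is served over HTTP and reads its parameters from the URL query string; this page only hints at that through the "HTTP response" wording. The host, port, and the helper name build_request_url are hypothetical, and sending the input in the request body is not shown.

```python
from urllib.parse import urlencode

# Parameter names, choices, and defaults as listed on this page.
MODEL_CHOICES = ("fc_hybrid", "parakeet", "conformer", "fc_ctc")
DEFAULTS = {
    "model": "fc_hybrid",
    "pretty": False,
    "runningTime": False,
    "hwFetch": False,
}


def build_request_url(base_url, **params):
    """Hypothetical helper: append non-default parameters as a query string."""
    query = {}
    for name, value in params.items():
        if name not in DEFAULTS:
            raise ValueError(f"unknown parameter: {name}")
        if name == "model":
            if value not in MODEL_CHOICES:
                raise ValueError(f"unsupported model: {value}")
        elif not isinstance(value, bool):
            raise ValueError(f"{name} must be a boolean")
        if value == DEFAULTS[name]:
            continue  # defaults do not need to be sent
        # Booleans are written as lowercase "true"/"false", matching the listed choices.
        query[name] = str(value).lower() if isinstance(value, bool) else value
    return f"{base_url}?{urlencode(query)}" if query else base_url


# Assumed address; replace with wherever the app is actually running.
url = build_request_url("http://localhost:5000", model="parakeet", pretty=True)
print(url)
```

Leaving defaults out keeps the URL short and makes it obvious which settings differ from the documented behavior.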

Outputs

(Note: “*” as a property value means that the property is required but can be any value.)

(Note: Not all output annotations are always generated.)

  • http://mmif.clams.ai/vocabulary/Token/v1 (of any properties)

    A token from the original text, split on whitespace. The text property stores the string value of the token. The start and end properties indicate the position of the token in the entire text. The document property identifies the source text document.

  • http://mmif.clams.ai/vocabulary/TimeFrame/v6
    • frameType = “speech”
    • timeUnit = “milliseconds”

    TimeFrame annotation representing the source audio segment corresponding to a given transcribed token, with start and end times given in milliseconds.

  • http://mmif.clams.ai/vocabulary/Alignment/v1 (of any properties)

    Alignment between Token and TimeFrame annotations.
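The three output types can be combined to recover a start and end time for each token. The sketch below is illustrative only: the sample annotations are hand-written rather than real app output, and the layout (an "@type" key, a "properties" object with "id", and Alignment properties named "source" and "target") is an assumption based on the general MMIF annotation convention, which this page does not show. Because the page does not state which end of an Alignment holds the Token, the code checks both.

```python
TOKEN = "http://mmif.clams.ai/vocabulary/Token/v1"
TIMEFRAME = "http://mmif.clams.ai/vocabulary/TimeFrame/v6"
ALIGNMENT = "http://mmif.clams.ai/vocabulary/Alignment/v1"

# Hand-written sample data; IDs, offsets, and times are made up for illustration.
annotations = [
    {"@type": TOKEN, "properties": {
        "id": "t1", "text": "hello", "start": 0, "end": 5, "document": "d1"}},
    {"@type": TIMEFRAME, "properties": {
        "id": "tf1", "frameType": "speech", "timeUnit": "milliseconds",
        "start": 120, "end": 480}},
    {"@type": ALIGNMENT, "properties": {"id": "a1", "source": "tf1", "target": "t1"}},
    {"@type": TOKEN, "properties": {
        "id": "t2", "text": "world", "start": 6, "end": 11, "document": "d1"}},
    {"@type": TIMEFRAME, "properties": {
        "id": "tf2", "frameType": "speech", "timeUnit": "milliseconds",
        "start": 500, "end": 910}},
    {"@type": ALIGNMENT, "properties": {"id": "a2", "source": "tf2", "target": "t2"}},
]


def token_timings(annotations):
    """Return (token text, start ms, end ms) for each Token/TimeFrame alignment."""
    by_id = {ann["properties"]["id"]: ann for ann in annotations}
    timings = []
    for ann in annotations:
        if ann["@type"] != ALIGNMENT:
            continue
        props = ann["properties"]
        ends = [by_id[props["source"]], by_id[props["target"]]]
        # Accept either direction, since the page does not specify it.
        token = next((e for e in ends if e["@type"] == TOKEN), None)
        frame = next((e for e in ends if e["@type"] == TIMEFRAME), None)
        if token is None or frame is None:
            continue  # not a Token/TimeFrame pair
        timings.append((
            token["properties"]["text"],
            frame["properties"]["start"],
            frame["properties"]["end"],
        ))
    return timings


for text, start_ms, end_ms in token_timings(annotations):
    print(f"{text}: {start_ms} ms to {end_ms} ms")
```

Real output may also qualify annotation IDs with a view identifier, in which case the ID lookup would need to account for that prefix.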