CLAMS NFA Wrapper (v0.1)
About this version
- Submitter: keighrim
- Submission Time: 2025-11-09T18:36:09+00:00
- Prebuilt Container Image: ghcr.io/clamsproject/app-nfa-wrapper:v0.1
Release Notes
Experimental first release of the app.
About this app (See raw metadata.json)
**Wraps the NVIDIA NeMo Forced Aligner tool to temporally align transcribed text with its audio source.**
- App ID: http://apps.clams.ai/nfa-wrapper/v0.1
- App License: Apache 2.0
- Source Repository: https://github.com/clamsproject/app-nfa-wrapper (source tree of the submitted version)
- Analyzer Version: 454fabc
- Analyzer License: Apache 2.0
Inputs
(Note: “*” as a property value means that the property is required but can be any value.)
One of the following is required: [
- http://mmif.clams.ai/vocabulary/AudioDocument/v1 (required) (of any properties)
- http://mmif.clams.ai/vocabulary/VideoDocument/v1 (required) (of any properties)
]
- http://mmif.clams.ai/vocabulary/TextDocument/v1 (required) (of any properties)
  Text content transcribed from audio input with no existing annotations.
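The input requirements above can be sketched as a minimal MMIF-shaped payload holding one audio document and one text document. This is an illustration only: the helper name `build_input_mmif`, the MMIF version string, the document ids, the mime types, and the file location are all assumptions, not values taken from the app's documentation.

```python
import json

# Vocabulary URIs copied from the Inputs list.
AUDIO_DOCUMENT = "http://mmif.clams.ai/vocabulary/AudioDocument/v1"
TEXT_DOCUMENT = "http://mmif.clams.ai/vocabulary/TextDocument/v1"


def build_input_mmif(audio_location, transcript, mmif_version="1.0.5"):
    """Return a MMIF-shaped dict with one audio and one text document.

    Hypothetical helper: the version string, ids ("d1", "d2"), and mime
    types are placeholders chosen for this sketch.
    """
    return {
        "metadata": {"mmif": f"http://mmif.clams.ai/{mmif_version}"},
        "documents": [
            {
                "@type": AUDIO_DOCUMENT,
                "properties": {
                    "id": "d1",
                    "mime": "audio/wav",
                    "location": audio_location,
                },
            },
            {
                "@type": TEXT_DOCUMENT,
                "properties": {
                    "id": "d2",
                    "mime": "text/plain",
                    # Transcript carried inline; no views (annotations) yet.
                    "text": {"@value": transcript},
                },
            },
        ],
        "views": [],
    }


mmif = build_input_mmif("file:///data/example.wav", "hello world")
print(json.dumps(mmif, indent=2))
```

A VideoDocument could stand in for the AudioDocument, since either one satisfies the "one of the following" requirement.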
Configurable Parameters
(Note: Multivalued means the parameter can have one or more values.)
- model: optional, defaults to fc_hybrid
  - Type: string
  - Multivalued: False
  - Choices: fc_hybrid, parakeet, conformer, fc_ctc

  NeMo ASR model to use. Choices: fc_hybrid, parakeet, conformer, fc_ctc. By default, the fc_hybrid model will be used.
- pretty: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true

  The JSON body of the HTTP response will be re-formatted with 2-space indentation
- runningTime: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true

  The running time of the app will be recorded in the view metadata
- hwFetch: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true

  The hardware information (architecture, GPU and vRAM) will be recorded in the view metadata
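As a rough sketch of how these parameters might be supplied, the snippet below validates a set of values against the choices listed above and encodes them as a URL query string. It assumes the app is served over HTTP and reads parameters from the query string; the base URL `http://localhost:5000` and the helper `build_request_url` are hypothetical and not taken from this page.

```python
from urllib.parse import urlencode

# Parameter names and allowed values copied from the list above.
MODEL_CHOICES = ("fc_hybrid", "parakeet", "conformer", "fc_ctc")
BOOL_CHOICES = ("false", "true")
ALLOWED = {
    "model": MODEL_CHOICES,
    "pretty": BOOL_CHOICES,
    "runningTime": BOOL_CHOICES,
    "hwFetch": BOOL_CHOICES,
}


def build_request_url(base_url, **params):
    """Validate params against the documented choices and append them.

    Hypothetical helper; omitted parameters fall back to the app defaults.
    """
    for name, value in params.items():
        if name not in ALLOWED:
            raise ValueError(f"unknown parameter: {name}")
        if value not in ALLOWED[name]:
            raise ValueError(f"invalid value for {name}: {value}")
    query = urlencode(params)
    return f"{base_url}?{query}" if query else base_url


# Assumed local address; adjust to wherever the container is exposed.
url = build_request_url("http://localhost:5000", model="parakeet", pretty="true")
print(url)  # http://localhost:5000?model=parakeet&pretty=true
```

Validating on the client side is a design choice of this sketch; it surfaces a misspelled model name before any audio is sent.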
Outputs
(Note: “*” as a property value means that the property is required but can be any value.)
(Note: Not all output annotations are always generated.)
- http://mmif.clams.ai/vocabulary/Token/v1 (of any properties)
  Token from original text split on whitespace. `text` property stores the string value of the token. `start` and `end` properties indicate position of token in entire text. `document` property identifies source text document.
- http://mmif.clams.ai/vocabulary/TimeFrame/v6
  - frameType = “speech”
  - timeUnit = “milliseconds”

  TimeFrame annotation representing the source audio segment corresponding to a given transcribed token, with `start` and `end` times given in milliseconds.
- http://mmif.clams.ai/vocabulary/Alignment/v1 (of any properties)
  Alignment between `Token` and `TimeFrame` annotations.
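To make the relationship between the three output types concrete, here is a small sketch that (1) splits text on whitespace while tracking character offsets and (2) pairs each Token with its TimeFrame by following Alignment annotations. Several details are assumptions: that `end` offsets are exclusive, that Alignment annotations carry `source` and `target` ids, and that ids are unique within the view. The sample annotations and their timing values are invented for illustration and are not real app output.

```python
import re

TOKEN = "http://mmif.clams.ai/vocabulary/Token/v1"
TIMEFRAME = "http://mmif.clams.ai/vocabulary/TimeFrame/v6"
ALIGNMENT = "http://mmif.clams.ai/vocabulary/Alignment/v1"


def whitespace_tokens(text):
    """Split on whitespace, keeping character offsets (end exclusive, assumed)."""
    return [
        {"text": m.group(), "start": m.start(), "end": m.end()}
        for m in re.finditer(r"\S+", text)
    ]


def token_timings(annotations):
    """Return (token text, start ms, end ms) for each Alignment annotation."""
    by_id = {a["properties"]["id"]: a for a in annotations}
    rows = []
    for ann in annotations:
        if ann["@type"] != ALIGNMENT:
            continue
        first = by_id[ann["properties"]["source"]]
        second = by_id[ann["properties"]["target"]]
        # The alignment direction is not stated here, so accept either order.
        pair = {first["@type"]: first, second["@type"]: second}
        token = pair[TOKEN]["properties"]
        frame = pair[TIMEFRAME]["properties"]
        rows.append((token["text"], frame["start"], frame["end"]))
    return rows


# Invented sample view contents, for illustration only.
sample = [
    {"@type": TOKEN,
     "properties": {"id": "tk1", "text": "hello", "start": 0, "end": 5,
                    "document": "d2"}},
    {"@type": TIMEFRAME,
     "properties": {"id": "tf1", "frameType": "speech",
                    "timeUnit": "milliseconds", "start": 120, "end": 480}},
    {"@type": ALIGNMENT,
     "properties": {"id": "al1", "source": "tf1", "target": "tk1"}},
]

print(whitespace_tokens("hello  big world"))
print(token_timings(sample))
```

Real output may prefix annotation ids with a view id or nest them differently, in which case the `by_id` lookup would need adjusting.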