Gentle Forced Aligner Wrapper (v1.0)

About this version

Submitter: keighrim
Submission Time: 2023-07-24T20:01:10+00:00
Prebuilt Container Image: ghcr.io/clamsproject/app-gentle-forced-aligner-wrapper:v1.0
Release Notes

(no notes provided by the developer)

About this app (See raw metadata.json)

This CLAMS app aligns a transcript with an audio track using Gentle. Gentle is a robust yet lenient forced aligner built on Kaldi. This app only works when Gentle is already installed locally. Unfortunately, Gentle is not distributed as a Python package. For Gentle installation instructions, see https://lowerquality.com/gentle/. Make sure to install Gentle from the git commit specified in analyzer_version in this metadata.

App ID: http://apps.clams.ai/gentle-forced-aligner-wrapper/v1.0
App License: MIT
Source Repository: https://github.com/clamsproject/app-gentle-forced-aligner-wrapper (source tree of the submitted version)
Analyzer Version: f29245a
Analyzer License: MIT

Inputs

(Note: “*” as a property value means that the property is required but can be any value.)

http://mmif.clams.ai/vocabulary/TextDocument/v1 (required)

(any properties)

http://mmif.clams.ai/vocabulary/AudioDocument/v1 (required)

(any properties)

http://mmif.clams.ai/vocabulary/TimeFrame/v1
- frameType = “speech”
http://vocab.lappsgrid.org/Token

(any properties)
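
As a rough illustration of the two required inputs, the following Python sketch builds a minimal MMIF-like payload holding one TextDocument and one AudioDocument. The @type URIs are the ones listed above. The MMIF version URI, the document ids, the MIME types, the file locations, and the helper name build_input_mmif are assumptions made for illustration and should be checked against the MMIF specification.

```python
import json


def build_input_mmif(transcript_location, audio_location):
    """Return a minimal MMIF-like dict with one transcript and one audio document.

    Hypothetical helper: the metadata/documents/views layout and the property
    names are assumptions about MMIF, not something this page specifies.
    """
    return {
        "metadata": {"mmif": "http://mmif.clams.ai/1.0.0"},  # assumed version URI
        "documents": [
            {
                "@type": "http://mmif.clams.ai/vocabulary/TextDocument/v1",
                "properties": {
                    "id": "d1",  # illustrative id
                    "mime": "text/plain",
                    "location": transcript_location,
                },
            },
            {
                "@type": "http://mmif.clams.ai/vocabulary/AudioDocument/v1",
                "properties": {
                    "id": "d2",  # illustrative id
                    "mime": "audio/wav",
                    "location": audio_location,
                },
            },
        ],
        # No views yet: speech TimeFrame and Token annotations are optional inputs.
        "views": [],
    }


if __name__ == "__main__":
    # Both paths are placeholders, not files shipped with the app.
    payload = build_input_mmif("file:///data/transcript.txt", "file:///data/audio.wav")
    print(json.dumps(payload, indent=2))
```

The optional TimeFrame and Token inputs would normally arrive as annotations in views produced by upstream apps, so the sketch leaves the views list empty.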

Configurable Parameters

(Note: Multivalued means the parameter can have one or more values.)

use_speech_segmentation: optional, defaults to true
- Type: boolean
- Multivalued: False
- Choices: false, true
When set to true, use existing “speech”-typed TimeFrame annotations and run the aligner only on those frames instead of on the entire audio file.
use_tokenization: optional, defaults to true
- Type: boolean
- Multivalued: False
- Choices: false, true
When set to true, the Alignment annotation output honors the latest existing tokenization (Token annotations). Due to a limitation in the way Kaldi reads English tokens, existing tokens must not contain whitespace.
pretty: optional, defaults to false
- Type: boolean
- Multivalued: False
- Choices: false, true
The JSON body of the HTTP response will be re-formatted with 2-space indentation.
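
The three parameters above can be collected into a request URL, as in the Python sketch below. It assumes that the app is reachable over HTTP and accepts parameters as URL query strings, which is how CLAMS apps are commonly invoked but is not stated on this page. The base URL http://localhost:5000 and the helper name build_request_url are hypothetical.

```python
from urllib.parse import urlencode

# Parameter names and default values as listed in this section.
DEFAULTS = {
    "use_speech_segmentation": True,
    "use_tokenization": True,
    "pretty": False,
}


def build_request_url(base_url, **overrides):
    """Return base_url with the app parameters appended as a query string.

    Hypothetical helper: passing parameters in the query string is an assumption.
    """
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameter(s): {sorted(unknown)}")
    for name, value in overrides.items():
        # All three parameters are boolean and not multivalued.
        if not isinstance(value, bool):
            raise TypeError(f"{name} must be a bool, got {type(value).__name__}")
    params = {**DEFAULTS, **overrides}
    # Serialize booleans using the listed choices: "true" / "false".
    query = urlencode({name: str(value).lower() for name, value in params.items()})
    return f"{base_url}?{query}"


if __name__ == "__main__":
    # Placeholder address; adjust to wherever the app container is listening.
    print(build_request_url("http://localhost:5000", pretty=True))
```

Under these assumptions, the resulting URL would be the target of an HTTP POST whose body is the input MMIF JSON.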

Outputs

(Note: “*” as a property value means that the property is required but can be any value.)

(Note: Not all output annotations are always generated.)

http://vocab.lappsgrid.org/Token

(any properties)

http://mmif.clams.ai/vocabulary/TimeFrame/v1
- frameType = “speech”
- timeUnit = “milliseconds”
http://mmif.clams.ai/vocabulary/Alignment/v1

(any properties)
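
Because not all output annotations are always generated, it can help to check which types a given response actually contains. The Python sketch below counts annotations by @type. It assumes the output MMIF stores annotations under views[].annotations[], each with an "@type" key. The sample data is made up for illustration and is not real output from this app, and the helper name count_annotation_types is hypothetical.

```python
from collections import Counter


def count_annotation_types(mmif):
    """Count annotations per @type across all views of an MMIF-like dict.

    Hypothetical helper: assumes a views -> annotations -> "@type" layout.
    """
    counts = Counter()
    for view in mmif.get("views", []):
        for annotation in view.get("annotations", []):
            counts[annotation.get("@type", "unknown")] += 1
    return counts


if __name__ == "__main__":
    # Made-up sample with the three output types listed above; ids are illustrative.
    sample = {
        "views": [
            {
                "annotations": [
                    {
                        "@type": "http://mmif.clams.ai/vocabulary/TimeFrame/v1",
                        "properties": {
                            "id": "tf1",
                            "frameType": "speech",
                            "timeUnit": "milliseconds",
                        },
                    },
                    {
                        "@type": "http://vocab.lappsgrid.org/Token",
                        "properties": {"id": "t1"},
                    },
                    {
                        "@type": "http://mmif.clams.ai/vocabulary/Alignment/v1",
                        "properties": {"id": "a1"},
                    },
                ]
            }
        ]
    }
    for type_uri, count in count_annotation_types(sample).items():
        print(type_uri, count)
```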