Gentle Forced Aligner Wrapper (v1.0)
About this version
- Submitter: keighrim
- Submission Time: 2023-07-24T20:01:10+00:00
- Prebuilt Container Image: ghcr.io/clamsproject/app-gentle-forced-aligner-wrapper:v1.0
Release Notes
(no notes provided by the developer)
About this app (See raw metadata.json)
This CLAMS app aligns a transcript with an audio track using Gentle. Gentle is a robust yet lenient forced aligner built on Kaldi. This app only works when Gentle is already installed locally. Unfortunately, Gentle is not distributed as a Python package. For Gentle installation instructions, see https://lowerquality.com/gentle/. Make sure to install Gentle from the git commit specified in analyzer_version in this metadata.
- App ID: http://apps.clams.ai/gentle-forced-aligner-wrapper/v1.0
- App License: MIT
- Source Repository: https://github.com/clamsproject/app-gentle-forced-aligner-wrapper (source tree of the submitted version)
- Analyzer Version: f29245a
- Analyzer License: MIT
Inputs
(Note: “*” as a property value means that the property is required but can be any value.)
(any properties)
(any properties)
- http://mmif.clams.ai/vocabulary/TimeFrame/v1
- frameType = “speech”
- http://vocab.lappsgrid.org/Token
(any properties)
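The input requirements above can be checked before a file is sent to the app. The following is a minimal sketch, assuming the input MMIF has been loaded as a plain Python dict with the usual "views" and "annotations" layout. The helper names are hypothetical, and the "word" property used for token text is an assumption; adjust text_key to whatever your tokenizer app actually emits.

```python
TIMEFRAME = "http://mmif.clams.ai/vocabulary/TimeFrame/v1"
TOKEN = "http://vocab.lappsgrid.org/Token"


def annotations_of_type(mmif, at_type):
    """Yield every annotation in every view whose @type matches exactly."""
    for view in mmif.get("views", []):
        for ann in view.get("annotations", []):
            if ann.get("@type") == at_type:
                yield ann


def speech_frames(mmif):
    """Return TimeFrame annotations whose frameType is "speech"."""
    return [
        ann for ann in annotations_of_type(mmif, TIMEFRAME)
        if ann.get("properties", {}).get("frameType") == "speech"
    ]


def tokens_with_whitespace(mmif, text_key="word"):
    """Return ids of Token annotations whose text contains whitespace.

    text_key is an assumption, not taken from the app metadata.
    """
    bad = []
    for ann in annotations_of_type(mmif, TOKEN):
        props = ann.get("properties", {})
        text = props.get(text_key, "")
        if any(ch.isspace() for ch in text):
            bad.append(props.get("id"))
    return bad


# Hand-written sample data for illustration only.
sample = {
    "views": [
        {
            "annotations": [
                {"@type": TIMEFRAME, "properties": {"id": "tf1", "frameType": "speech"}},
                {"@type": TIMEFRAME, "properties": {"id": "tf2", "frameType": "noise"}},
                {"@type": TOKEN, "properties": {"id": "t1", "word": "hello"}},
                {"@type": TOKEN, "properties": {"id": "t2", "word": "New York"}},
            ]
        }
    ]
}
```

Running tokens_with_whitespace on the input first is a cheap way to find tokens that would need to be re-tokenized before alignment.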
Configurable Parameters
(Note: Multivalued means the parameter can have one or more values.)
- use_speech_segmentation: optional, defaults to true
  - Type: boolean
  - Multivalued: False
  - Choices: false, true
  When set to true, use existing “speech”-typed TimeFrame annotations and run the aligner only on those frames, instead of on entire audio files.
- use_tokenization: optional, defaults to true
  - Type: boolean
  - Multivalued: False
  - Choices: false, true
  When set to true, the Alignment annotation output will honor the latest existing tokenization (Token annotations). Due to a limitation in the way Kaldi reads English tokens, existing tokens must not contain whitespace.
- pretty: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true
  When set to true, the JSON body of the HTTP response will be re-formatted with 2-space indentation.
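As a rough illustration of how these parameters might be supplied, the sketch below builds a request URL with the three parameters encoded as a query string. It assumes the app is running as an HTTP service and reads its parameters from the URL query string; the base URL and the function name build_app_url are hypothetical and not taken from the app metadata.

```python
from urllib.parse import urlencode


def build_app_url(base_url, use_speech_segmentation=True,
                  use_tokenization=True, pretty=False):
    """Encode the three boolean parameters as lowercase query-string values."""
    params = {
        "use_speech_segmentation": str(use_speech_segmentation).lower(),
        "use_tokenization": str(use_tokenization).lower(),
        "pretty": str(pretty).lower(),
    }
    return f"{base_url}?{urlencode(params)}"


# Hypothetical local address; replace with wherever the app is actually served.
url = build_app_url("http://localhost:5000", use_tokenization=False, pretty=True)
```

The resulting URL would then be the target of a request whose body is the input MMIF file. The defaults in the function mirror the defaults listed above, so omitting an argument matches the app's documented behavior.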
Outputs
(Note: “*” as a property value means that the property is required but can be any value.)
(Note: Not all output annotations are always generated.)
(any properties)
- http://mmif.clams.ai/vocabulary/TimeFrame/v1
- frameType = “speech”
- timeUnit = “milliseconds”
- http://mmif.clams.ai/vocabulary/Alignment/v1
(any properties)
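To show how the outputs listed above might be consumed, here is a minimal sketch that reads Alignment annotations and converts TimeFrame boundaries from milliseconds to seconds. It assumes the output MMIF is loaded as a plain dict, that Alignment annotations carry "source" and "target" properties, and that TimeFrame annotations carry "start" and "end" properties; these property names and both helper names are assumptions, so verify them against an actual output file.

```python
ALIGNMENT = "http://mmif.clams.ai/vocabulary/Alignment/v1"


def collect_alignments(mmif):
    """Return (source, target) id pairs for every Alignment annotation."""
    pairs = []
    for view in mmif.get("views", []):
        for ann in view.get("annotations", []):
            if ann.get("@type") == ALIGNMENT:
                props = ann.get("properties", {})
                pairs.append((props.get("source"), props.get("target")))
    return pairs


def frame_seconds(timeframe):
    """Convert a TimeFrame's millisecond start/end to seconds."""
    props = timeframe.get("properties", {})
    return props["start"] / 1000, props["end"] / 1000


# Hand-written sample data for illustration only.
output_sample = {
    "views": [
        {
            "annotations": [
                {
                    "@type": "http://mmif.clams.ai/vocabulary/TimeFrame/v1",
                    "properties": {"id": "tf1", "frameType": "speech",
                                   "start": 1500, "end": 2250},
                },
                {
                    "@type": ALIGNMENT,
                    "properties": {"id": "a1", "source": "tf1", "target": "t1"},
                },
            ]
        }
    ]
}
```

Because not all output annotations are always generated, collect_alignments simply returns an empty list when no Alignment annotations are present.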