# Scenes-with-text Detection (v6.0)
## About this version
- Submitter: keighrim
- Submission Time: 2024-07-25T16:11:42+00:00
- Prebuilt Container Image: ghcr.io/clamsproject/app-swt-detection:v6.0
## Release Notes
- updated to the latest SDK (adding some runtime recording params)
- added `usePosModel` parameter to toggle positional encoding
- general re-implementation and bug fix for positional encoding
## About this app (see raw `metadata.json`)
Detects scenes with text, like slates, chyrons and credits.
- App ID: http://apps.clams.ai/swt-detection/v6.0
- App License: Apache 2.0
- Source Repository: https://github.com/clamsproject/app-swt-detection (source tree of the submitted version)
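
The app runs as a standard CLAMS HTTP service, so a MMIF file can be POSTed to a running instance of the prebuilt container. The following is a minimal sketch, assuming the conventional CLAMS setup (container started with something like `docker run --rm -p 5000:5000 ghcr.io/clamsproject/app-swt-detection:v6.0`, serving on port 5000); the localhost URL and input path are illustrative, and the video file referenced in the MMIF must be readable from inside the container (e.g., via a volume mount).

```python
# Minimal sketch of calling a running instance of this app over HTTP.
# Assumes the conventional CLAMS interface: GET returns the app metadata,
# POST with a MMIF body runs the app. URL and paths are illustrative.
import requests

with open("input.mmif") as f:   # a MMIF file wrapping a VideoDocument
    mmif_json = f.read()

# POST the MMIF body; the response body is the annotated MMIF.
resp = requests.post("http://localhost:5000", data=mmif_json)
resp.raise_for_status()
print(resp.text[:500])          # peek at the returned MMIF
```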
## Inputs
(Note: “*” as a property value means that the property is required but can be any value.)
- http://mmif.clams.ai/vocabulary/VideoDocument/v1 (required; any properties accepted)
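
For orientation, a minimal input MMIF wrapping a single VideoDocument might look like the sketch below; the document id, mime type, file location, and MMIF spec version are placeholders rather than values mandated by this app.

```python
# Illustrative shape of the single required input: a MMIF document
# wrapping a VideoDocument. All concrete values here are placeholders.
import json

input_mmif = {
    "metadata": {"mmif": "http://mmif.clams.ai/1.0.0"},
    "documents": [
        {
            "@type": "http://mmif.clams.ai/vocabulary/VideoDocument/v1",
            "properties": {
                "id": "d1",
                "mime": "video/mp4",
                "location": "file:///data/example.mp4",
            },
        }
    ],
    "views": [],
}

print(json.dumps(input_mmif, indent=2))
```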
## Configurable Parameters
(Note: Multivalued means the parameter can have one or more values.)
- `startAt`: optional, defaults to `0`
  - Type: integer
  - Multivalued: False
  - Number of milliseconds into the video to start processing
- `stopAt`: optional, defaults to `9223372036854775807`
  - Type: integer
  - Multivalued: False
  - Number of milliseconds into the video to stop processing
- `sampleRate`: optional, defaults to `1000`
  - Type: integer
  - Multivalued: False
  - Milliseconds between sampled frames
- `minFrameScore`: optional, defaults to `0.01`
  - Type: number
  - Multivalued: False
  - Minimum score for a still frame to be included in a TimeFrame
- `minTimeframeScore`: optional, defaults to `0.5`
  - Type: number
  - Multivalued: False
  - Minimum score for a TimeFrame
- `minFrameCount`: optional, defaults to `2`
  - Type: integer
  - Multivalued: False
  - Minimum number of sampled frames required for a TimeFrame
- `modelName`: optional, defaults to `convnext_lg`
  - Type: string
  - Multivalued: False
  - Choices: `convnext_tiny`, `convnext_lg`
  - Model name to use for classification
- `usePosModel`: optional, defaults to `true`
  - Type: boolean
  - Multivalued: False
  - Choices: `false`, `true`
  - Use the model trained with positional features
- `useStitcher`: optional, defaults to `true`
  - Type: boolean
  - Multivalued: False
  - Choices: `false`, `true`
  - Use the stitcher after classifying the TimePoints
- `allowOverlap`: optional, defaults to `true`
  - Type: boolean
  - Multivalued: False
  - Choices: `false`, `true`
  - Allow overlapping time frames
- `map`: optional, defaults to `['B:bars', 'S:slate', 'I:chyron', 'N:chyron', 'Y:chyron', 'C:credits', 'R:credits', 'W:other_opening', 'L:other_opening', 'O:other_opening', 'M:other_opening', 'E:other_text', 'K:other_text', 'G:other_text', 'T:other_text', 'F:other_text']`
  - Type: map
  - Multivalued: True
  - Maps a label in the input annotations to a new label. Must be formatted as IN_LABEL:OUT_LABEL (with a colon). To pass multiple mappings, pass this parameter multiple times (see the request sketch after this list). By default, all input labels are passed through as is, including any negative labels (the default value amounts to no remapping at all). However, once at least one label is remapped, all other "unset" labels are discarded as negative labels.
- `pretty`: optional, defaults to `false`
  - Type: boolean
  - Multivalued: False
  - Choices: `false`, `true`
  - The JSON body of the HTTP response will be re-formatted with 2-space indentation
- `runningTime`: optional, defaults to `false`
  - Type: boolean
  - Multivalued: False
  - Choices: `false`, `true`
  - The running time of the app will be recorded in the view metadata
- `hwFetch`: optional, defaults to `false`
  - Type: boolean
  - Multivalued: False
  - Choices: `false`, `true`
  - The hardware information (architecture, GPU, and vRAM) will be recorded in the view metadata
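
Runtime parameters are passed as URL query parameters on the POST request, following the usual CLAMS convention; a multivalued parameter such as `map` is repeated once per value. A sketch (the URL and the particular parameter values are illustrative, not recommendations):

```python
# Sketch of passing configurable parameters as query parameters on the
# POST request. Repeating `map` supplies multiple label mappings, since
# `map` is multivalued. URL and values are illustrative.
import requests

params = [
    ("sampleRate", "500"),          # sample a frame every 500 ms
    ("modelName", "convnext_tiny"),
    ("usePosModel", "false"),
    ("map", "S:slate"),             # remap input label S to "slate"
    ("map", "B:bars"),              # remap input label B to "bars"
    ("pretty", "true"),
]

with open("input.mmif") as f:
    resp = requests.post("http://localhost:5000", params=params, data=f.read())
resp.raise_for_status()
```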
## Outputs
(Note: “*” as a property value means that the property is required but can be any value.)
(Note: Not all output annotations are always generated.)
- http://mmif.clams.ai/vocabulary/TimeFrame/v5
  - `timeUnit` = "milliseconds"
- http://mmif.clams.ai/vocabulary/TimePoint/v4
  - `timeUnit` = "milliseconds"
  - `labelset` = ["B", "S", "W", "L", "O", "M", "I", "N", "E", "P", "Y", "K", "G", "T", "F", "C", "R"]
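
To inspect the output, it is enough to walk the generic MMIF JSON structure (a list of views, each holding annotations with an `@type` and a `properties` map). The sketch below reads property names defensively, since the exact TimeFrame/TimePoint property sets are defined by the MMIF vocabulary pages linked above rather than reproduced here; the output path is illustrative.

```python
# Sketch: list TimeFrame annotations from the app's output MMIF using
# only the generic MMIF JSON layout (views -> annotations). Property
# names beyond "@type" are read with .get() since they are defined by
# the MMIF vocabulary, not guaranteed here.
import json

with open("output.mmif") as f:      # illustrative path
    mmif = json.load(f)

for view in mmif.get("views", []):
    for ann in view.get("annotations", []):
        if "TimeFrame" in ann["@type"]:
            props = ann.get("properties", {})
            print(props.get("label"), props.get("targets"))
```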