Scenes-with-text Detection (v6.0)

About this version

  • Submitter: keighrim
  • Submission Time: 2024-07-25T16:11:42+00:00
  • Prebuilt Container Image: ghcr.io/clamsproject/app-swt-detection:v6.0
  • Release Notes

    • updated to the latest SDK (adding some runtime recording params)
    • added usePosModel param to toggle positional encoding
    • general re-implementation and bugfix for positional encoding

About this app (See raw metadata.json)

Detects scenes with text, such as slates, chyrons, and credits.
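
The app runs as an HTTP service inside the prebuilt container image listed above. The snippet below is a minimal sketch of invoking it from Python; it assumes a locally running instance on port 5000 (a conventional port for CLAMS apps, but your setup may differ) and an input MMIF file named input.mmif that contains a video document.

    import requests

    # Hypothetical local endpoint; adjust to wherever the container is running.
    APP_URL = "http://localhost:5000"

    # input.mmif is assumed to already exist and to reference the video to process.
    with open("input.mmif", "r", encoding="utf-8") as f:
        mmif_in = f.read()

    # POST the MMIF body; the app responds with a new MMIF that includes
    # the TimePoint/TimeFrame annotations it produced.
    resp = requests.post(APP_URL, data=mmif_in)
    resp.raise_for_status()

    with open("output.mmif", "w", encoding="utf-8") as f:
        f.write(resp.text)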

Inputs

(Note: “*” as a property value means that the property is required but can be any value.)

Configurable Parameters

(Note: Multivalued means the parameter can have one or more values.)

  • startAt: optional, defaults to 0

    • Type: integer
    • Multivalued: False

    Number of milliseconds into the video to start processing

  • stopAt: optional, defaults to 9223372036854775807

    • Type: integer
    • Multivalued: False

    Number of milliseconds into the video to stop processing

  • sampleRate: optional, defaults to 1000

    • Type: integer
    • Multivalued: False

    Milliseconds between sampled frames

  • minFrameScore: optional, defaults to 0.01

    • Type: number
    • Multivalued: False

    Minimum score for a still frame to be included in a TimeFrame

  • minTimeframeScore: optional, defaults to 0.5

    • Type: number
    • Multivalued: False

    Minimum score for a TimeFrame

  • minFrameCount: optional, defaults to 2

    • Type: integer
    • Multivalued: False

    Minimum number of sampled frames required for a TimeFrame

  • modelName: optional, defaults to convnext_lg

    • Type: string
    • Multivalued: False
    • Choices: convnext_tiny, convnext_lg

    Model name to use for classification

  • usePosModel: optional, defaults to true

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    Use the model trained with positional features

  • useStitcher: optional, defaults to true

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    Use the stitcher after classifying the TimePoints

  • allowOverlap: optional, defaults to true

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    Allow overlapping time frames

  • map: optional, defaults to ['B:bars', 'S:slate', 'I:chyron', 'N:chyron', 'Y:chyron', 'C:credits', 'R:credits', 'W:other_opening', 'L:other_opening', 'O:other_opening', 'M:other_opening', 'E:other_text', 'K:other_text', 'G:other_text', 'T:other_text', 'F:other_text']

    • Type: map
    • Multivalued: True

    Mapping of a label in the input annotations to a new label. Must be formatted as IN_LABEL:OUT_LABEL (with a colon). To pass multiple mappings, use this parameter multiple times (see the invocation example after this parameter list). By default, all input labels are passed through as-is, including any negative labels (the default value means no remapping at all). However, once at least one label is remapped, every other “unset” label is discarded as a negative label.

  • pretty: optional, defaults to false

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    The JSON body of the HTTP response will be re-formatted with 2-space indentation

  • runningTime: optional, defaults to false

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    The running time of the app will be recorded in the view metadata

  • hwFetch: optional, defaults to false

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    The hardware information (architecture, GPU and vRAM) will be recorded in the view metadata
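
When the app is invoked over HTTP, the parameters above are passed as URL query parameters alongside the MMIF request body. The sketch below assumes a locally running instance on port 5000 (hypothetical host/port); boolean values are written as the lowercase strings listed under Choices, and a multivalued parameter such as map is repeated once per IN_LABEL:OUT_LABEL pair.

    import requests

    APP_URL = "http://localhost:5000"  # hypothetical local instance

    # Parameter names mirror the documentation above; the multivalued "map"
    # parameter is passed multiple times, once per mapping.
    params = [
        ("sampleRate", "500"),         # sample a frame every 500 ms
        ("minTimeframeScore", "0.6"),  # raise the minimum TimeFrame score
        ("usePosModel", "false"),      # skip the positionally trained model
        ("pretty", "true"),            # indent the JSON response
        ("map", "S:slate"),            # remap only these two labels; all
        ("map", "Y:chyron"),           # other labels become negative
    ]

    with open("input.mmif", "r", encoding="utf-8") as f:
        mmif_in = f.read()

    resp = requests.post(APP_URL, params=params, data=mmif_in)
    resp.raise_for_status()
    print(resp.text)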

Outputs

(Note: “*” as a property value means that the property is required but can be any value.)

(Note: Not all output annotations are always generated.)