Heuristic Chyron Understanding (v0.2)
About this version
- Submitter: keighrim
- Submission Time: 2025-07-14T23:10:14+00:00
- Prebuilt Container Image: ghcr.io/clamsproject/app-heuristic-chyron-understanding:v0.2
-
Release Notes
Initial release with naive line parsing and name normalization
About this app (See raw metadata.json)
Prototype to convert chyron text from docTR/Tesseract/LLaVA MMIF outputinto a name and list of attributes.
- App ID: http://apps.clams.ai/heuristic-chyron-understanding/v0.2
- App License: Apache 2.0
- Source Repository: https://github.com/clamsproject/app-heuristic-chyron-understanding (source tree of the submitted version)
Inputs
(Note: “*” as a property value means that the property is required but can be any value.)
-
http://mmif.clams.ai/vocabulary/TextDocument/v1 (required) (of any properties)
Text content transcribed from video input by docTR/Tesseract/LLAVA.
Configurable Parameters
(Note: Multivalued means the parameter can have one or more values.)
-
note4mode: optional, defaults tofalse- Type: boolean
- Multivalued: False
- Choices:
false,true
Boolean to set the app to run in “note-4” mode and to take the second line (if available) from the input text to be the
name-normalizedvalue. The default is false, which means the app will try to generate normalization fromname-as-written(from the first line) value. -
pretty: optional, defaults tofalse- Type: boolean
- Multivalued: False
- Choices:
false,true
The JSON body of the HTTP response will be re-formatted with 2-space indentation
-
runningTime: optional, defaults tofalse- Type: boolean
- Multivalued: False
- Choices:
false,true
The running time of the app will be recorded in the view metadata
-
hwFetch: optional, defaults tofalse- Type: boolean
- Multivalued: False
- Choices:
false,true
The hardware information (architecture, GPU and vRAM) will be recorded in the view metadata
Outputs
(Note: “*” as a property value means that the property is required but can be any value.)
(Note: Not all output annotations are always generated.)
- http://mmif.clams.ai/vocabulary/TextDocument/v1
- document = “*”
- origin = “*”
- provenance = “derived”
- mime = “application/json”
Reformatted chyron text.
documentproperty stores the ID of the original sourceVideoDocument.originproperty stores the ID of the original OCRTextDocumentannotation. Reformatted text is escaped JSON string with three fields:name-as-written,name-normalized, andattributes.