Heuristic Chyron Understanding (v0.2)
About this version
- Submitter: keighrim
- Submission Time: 2025-07-14T23:10:14+00:00
- Prebuilt Container Image: ghcr.io/clamsproject/app-heuristic-chyron-understanding:v0.2
- Release Notes: Initial release with naive line parsing and name normalization.
About this app (See raw metadata.json)
Prototype to convert chyron text from docTR/Tesseract/LLaVA MMIF output into a name and a list of attributes.
- App ID: http://apps.clams.ai/heuristic-chyron-understanding/v0.2
- App License: Apache 2.0
- Source Repository: https://github.com/clamsproject/app-heuristic-chyron-understanding (source tree of the submitted version)
Inputs
(Note: “*” as a property value means that the property is required but can be any value.)
- http://mmif.clams.ai/vocabulary/TextDocument/v1 (required) (of any properties)
  Text content transcribed from video input by docTR/Tesseract/LLaVA.
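As a rough illustration of this input, the sketch below collects the text of every TextDocument annotation in an MMIF file using only the standard json module. The sample MMIF (view ID, annotation ID, OCR app URL, chyron text) is made up, and the assumed layout, with the text stored under properties.text.@value, is based on the MMIF specification rather than on this app's source. The app itself may read its input differently, for example through the mmif-python SDK.

```python
import json

TEXT_DOCUMENT = "http://mmif.clams.ai/vocabulary/TextDocument/v1"

# Hypothetical, heavily trimmed MMIF: one view holding one OCR result.
SAMPLE_MMIF = json.dumps({
    "metadata": {"mmif": "http://mmif.clams.ai/1.0.0"},
    "documents": [],
    "views": [{
        "id": "v_0",
        "metadata": {
            "app": "http://apps.clams.ai/some-ocr-app/v1.0",  # placeholder app ID
            "contains": {TEXT_DOCUMENT: {}},
        },
        "annotations": [{
            "@type": TEXT_DOCUMENT,
            "properties": {
                "id": "td_1",
                "text": {"@value": "JANE DOE\nCity Council Member"},
            },
        }],
    }],
})


def collect_chyron_texts(mmif_json):
    """Return (view_id, annotation_id, text) for each TextDocument annotation."""
    mmif = json.loads(mmif_json)
    found = []
    for view in mmif.get("views", []):
        for annotation in view.get("annotations", []):
            if annotation.get("@type") != TEXT_DOCUMENT:
                continue  # skip any other annotation type
            props = annotation.get("properties", {})
            text = props.get("text", {}).get("@value", "")
            found.append((view.get("id"), props.get("id"), text))
    return found


texts = collect_chyron_texts(SAMPLE_MMIF)
```

Each returned tuple keeps the view and annotation IDs next to the text, which is the information an output annotation would need to point back to its OCR source.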
Configurable Parameters
(Note: Multivalued means the parameter can have one or more values.)
- note4mode: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true
  Boolean to set the app to run in “note-4” mode and to take the second line (if available) from the input text as the name-normalized value. The default is false, which means the app will try to generate the normalization from the name-as-written value (taken from the first line).
- pretty: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true
  The JSON body of the HTTP response will be re-formatted with 2-space indentation.
- runningTime: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true
  The running time of the app will be recorded in the view metadata.
- hwFetch: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true
  The hardware information (architecture, GPU and vRAM) will be recorded in the view metadata.
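To make the effect of note4mode concrete, here is a minimal sketch of line-based chyron parsing. It is not the app's actual code: the function names are hypothetical, the normalization step (collapsing whitespace and title-casing) is a stand-in for whatever the app really does, and treating every line after the name line(s) as an attribute is an assumption drawn from the description above.

```python
import re


def normalize_name(name):
    """Placeholder normalization (an assumption): collapse whitespace, title-case."""
    return re.sub(r"\s+", " ", name).strip().title()


def parse_chyron(text, note4mode=False):
    """Split chyron text into a name and a list of attributes, line by line."""
    # Drop blank lines and surrounding whitespace left over from OCR.
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if not lines:
        return {"name-as-written": "", "name-normalized": "", "attributes": []}

    name_as_written = lines[0]
    if note4mode and len(lines) > 1:
        # "note-4" mode: the second line is taken as the normalized name.
        name_normalized = lines[1]
        attributes = lines[2:]
    else:
        # Default mode: derive the normalized name from the first line.
        name_normalized = normalize_name(name_as_written)
        attributes = lines[1:]

    return {
        "name-as-written": name_as_written,
        "name-normalized": name_normalized,
        "attributes": attributes,
    }


default_result = parse_chyron("JANE DOE\nCity Council Member")
note4_result = parse_chyron("JANE DOE\nDoe, Jane\nCity Council Member", note4mode=True)
```

In this sketch, note4mode falls back to the default behavior when the input has only one line, mirroring the "if available" wording in the parameter description.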
Outputs
(Note: “*” as a property value means that the property is required but can be any value.)
(Note: Not all output annotations are always generated.)
- http://mmif.clams.ai/vocabulary/TextDocument/v1
- document = “*”
- origin = “*”
- provenance = “derived”
- mime = “application/json”
Reformatted chyron text. The document property stores the ID of the original source VideoDocument. The origin property stores the ID of the original OCR TextDocument annotation. The reformatted text is an escaped JSON string with three fields: name-as-written, name-normalized, and attributes.
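Because the reformatted text is a JSON string nested inside the MMIF JSON, reading it back takes two decoding steps. The sketch below shows the round trip with a made-up annotation. All IDs and values are hypothetical, and storing the payload under properties.text.@value is an assumption based on how MMIF TextDocument objects usually carry text.

```python
import json

# Hypothetical payload with the three fields described above.
payload = {
    "name-as-written": "JANE DOE",
    "name-normalized": "Jane Doe",
    "attributes": ["City Council Member"],
}

# Hypothetical output annotation; the payload is embedded as a string.
text_document = {
    "@type": "http://mmif.clams.ai/vocabulary/TextDocument/v1",
    "properties": {
        "id": "td_2",
        "document": "d1",      # ID of the source VideoDocument (made up)
        "origin": "v_0:td_1",  # ID of the OCR TextDocument (made up)
        "provenance": "derived",
        "mime": "application/json",
        "text": {"@value": json.dumps(payload)},
    },
}

# Serializing the annotation escapes the quotes of the embedded string.
serialized = json.dumps(text_document)

# First decode the annotation, then decode the embedded payload.
annotation = json.loads(serialized)
chyron = json.loads(annotation["properties"]["text"]["@value"])
```

A consumer that skips the second json.loads call gets a plain string rather than a dictionary, which is a common source of confusion with payloads of this kind.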