CLAMS wrapper for spaCy NLP (v1.2)

About this version

  • Submitter: marcverhagen
  • Submission Time: 2024-06-11T12:30:19+00:00
  • Prebuilt Container Image: ghcr.io/clamsproject/app-spacy-wrapper:v1.2
  • Release Notes

    Bumped the Python SDK version, with bug fixes and documentation updates

    • Updated to clams-python 1.2.2
    • Fixed token length (issue #30)
    • Fixed problems with the pretokenized parameter (issue #32)
    • Various documentation fixes

About this app (See raw metadata.json)

Apply spaCy NLP to all text documents in a MMIF file.
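As a rough illustration of what "all text documents in a MMIF file" can mean, the sketch below picks out text documents from a MMIF JSON object using only the standard library. It is not the app's own code. The helper name `inline_text_documents` and the sample MMIF are hypothetical, and the sketch assumes text is stored inline under `properties.text.@value` in the top-level `documents` list; text documents that point to a file location or that live inside views are ignored here.

```python
import json


def inline_text_documents(mmif):
    """Yield (id, text) for each top-level TextDocument with inline text.

    Hypothetical helper for illustration only, not part of the app.
    """
    for doc in mmif.get("documents", []):
        # Assumption: the document type URI contains "/TextDocument/".
        if "/TextDocument/" not in doc.get("@type", ""):
            continue
        props = doc.get("properties", {})
        text = props.get("text", {}).get("@value")
        if text is not None:
            yield props.get("id"), text


# Made-up sample MMIF with one video document and one text document.
example = json.loads("""
{
  "metadata": {"mmif": "http://mmif.clams.ai/1.0.0"},
  "documents": [
    {"@type": "http://mmif.clams.ai/vocabulary/VideoDocument/v1",
     "properties": {"id": "d1", "location": "file:///video.mp4"}},
    {"@type": "http://mmif.clams.ai/vocabulary/TextDocument/v1",
     "properties": {"id": "d2", "text": {"@value": "Hello world."}}}
  ],
  "views": []
}
""")

for doc_id, text in inline_text_documents(example):
    print(doc_id, text)
```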

Inputs

(Note: “*” as a property value means that the property is required but can be any value.)

Configurable Parameters

(Note: Multivalued means the parameter can have one or more values.)

  • pretokenized: optional, defaults to false

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    When true, the app uses existing tokenization, if available, for text documents during NLP processing. Useful for processing ASR documents, for example.

  • pretty: optional, defaults to false

    • Type: boolean
    • Multivalued: False
    • Choices: false, true

    When true, the JSON body of the HTTP response is re-formatted with 2-space indentation.
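The two parameters above can be passed when calling a running instance of the app. The following is a minimal sketch, assuming the container is reachable at `http://localhost:5000` and accepts a MMIF file as the body of a POST request with parameters given as a query string. The base URL and the helper names `build_app_url` and `run_app` are assumptions for illustration, not part of the app.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_app_url(base_url, pretokenized=False, pretty=False):
    """Append the two documented parameters to the app URL as a query string."""
    params = {
        "pretokenized": str(pretokenized).lower(),
        "pretty": str(pretty).lower(),
    }
    return f"{base_url}?{urlencode(params)}"


def run_app(mmif_json, base_url="http://localhost:5000", **params):
    """POST a MMIF string to the app and return the response body as text.

    Assumption: the app is already running at base_url.
    """
    request = Request(
        build_app_url(base_url, **params),
        data=mmif_json.encode("utf-8"),
        method="POST",
    )
    with urlopen(request) as response:
        return response.read().decode("utf-8")


# Building the URL needs no running server.
print(build_app_url("http://localhost:5000", pretokenized=True, pretty=True))
```

With a running container, `run_app(open("input.mmif").read(), pretty=True)` would return the output MMIF as an indented JSON string; `input.mmif` is a placeholder file name.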

Outputs

(Note: “*” as a property value means that the property is required but can be any value.)

(Note: Not all output annotations are always generated.)