CLAMS wrapper for spaCy NLP (v1.2)
About this version
- Submitter: marcverhagen
- Submission Time: 2024-06-11T12:30:19+00:00
- Prebuilt Container Image: ghcr.io/clamsproject/app-spacy-wrapper:v1.2
Release Notes
Bumping Python SDK version, bug fixes and documentation updates
- Updated to clams-python 1.2.2
- Fixed token length (issue #30)
- Fixed problems with the pretokenized parameter (issue #32)
- Various documentation fixes.
About this app (See raw metadata.json)
Apply spaCy NLP to all text documents in an MMIF file.
- App ID: http://apps.clams.ai/spacy-wrapper/v1.2
- App License: Apache 2.0
- Source Repository: https://github.com/clamsproject/app-spacy-wrapper (source tree of the submitted version)
- Analyzer Version: 3.6
- Analyzer License: MIT
Inputs
(Note: “*” as a property value means that the property is required but can be any value.)
- http://mmif.clams.ai/vocabulary/TextDocument/v1 (required) (of any properties)
- http://vocab.lappsgrid.org/Token (of any properties)
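As an illustration of the required TextDocument input, the sketch below assembles a minimal MMIF payload with one inline text document. It assumes the usual MMIF layout (top-level "metadata", "documents" and "views", with the text stored under "text" / "@value"). The version URI and the make_input_mmif helper are hypothetical and should be adjusted to the MMIF version actually in use.

```python
import json

# Assumption: illustrative MMIF version URI; replace with the version your pipeline targets.
MMIF_VERSION = "http://mmif.clams.ai/1.0.0"
# Type URI taken from the Inputs list above.
TEXT_DOCUMENT = "http://mmif.clams.ai/vocabulary/TextDocument/v1"


def make_input_mmif(text: str, doc_id: str = "d1") -> str:
    """Return a MMIF JSON string holding a single inline text document.

    Hypothetical helper, not part of the app or the CLAMS SDK.
    """
    mmif = {
        "metadata": {"mmif": MMIF_VERSION},
        "documents": [
            {
                "@type": TEXT_DOCUMENT,
                "properties": {"id": doc_id, "text": {"@value": text}},
            }
        ],
        # No views yet: the app is expected to add its own view with annotations.
        "views": [],
    }
    return json.dumps(mmif, indent=2)


if __name__ == "__main__":
    print(make_input_mmif("Jane Doe visited Boston in June."))
```

A file with existing Token annotations, as used with the pretokenized parameter, would additionally carry a view containing those tokens; that case is not shown here.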
Configurable Parameters
(Note: Multivalued means the parameter can have one or more values.)
- pretokenized: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true
  Boolean parameter to set the app to use existing tokenization, if available, for text documents for NLP processing. Useful to process ASR documents, for example.
- pretty: optional, defaults to false
  - Type: boolean
  - Multivalued: False
  - Choices: false, true
  The JSON body of the HTTP response will be re-formatted with 2-space indentation.
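The parameters above can be passed as query-string values when posting a MMIF file to a running instance of the app. The following is a minimal sketch using only the Python standard library. It assumes the container is reachable at http://localhost:5000 and accepts a POST of the MMIF body at the root path; the address and the build_request and send helpers are hypothetical and not taken from the app's documentation.

```python
import urllib.parse
import urllib.request

# Assumption: hypothetical address; adjust to wherever the container is listening.
APP_URL = "http://localhost:5000"


def build_request(mmif_json: str, pretokenized: bool = False, pretty: bool = False,
                  base_url: str = APP_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying a MMIF document."""
    # Booleans are written as lowercase strings to match the listed choices.
    params = {
        "pretokenized": str(pretokenized).lower(),
        "pretty": str(pretty).lower(),
    }
    url = f"{base_url}/?{urllib.parse.urlencode(params)}"
    return urllib.request.Request(
        url,
        data=mmif_json.encode("utf-8"),
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the response body (requires a running app)."""
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    req = build_request("{}", pretokenized=True, pretty=True)
    print(req.get_method(), req.full_url)
```

Building the request separately from sending it makes it possible to inspect the final URL before any network call is made.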
Outputs
(Note: “*” as a property value means that the property is required but can be any value.)
(Note: Not all output annotations are always generated.)
- http://vocab.lappsgrid.org/Token (of any properties)
- http://vocab.lappsgrid.org/Token#pos (of any properties)
- http://vocab.lappsgrid.org/Token#lemma (of any properties)
- http://vocab.lappsgrid.org/NounChunk (of any properties)
- http://vocab.lappsgrid.org/Sentence (of any properties)
- http://vocab.lappsgrid.org/NamedEntity (of any properties)
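Since not all output annotations are always generated, it can be useful to check which types a given response actually contains. The sketch below tallies annotations by their "@type" value, assuming the usual MMIF layout in which each entry under "views" holds an "annotations" list. The SAMPLE payload is hand-written for illustration and is not real output from the app; count_annotation_types is a hypothetical helper.

```python
import json
from collections import Counter


def count_annotation_types(mmif_text: str) -> Counter:
    """Count annotations per "@type" across all views of a MMIF JSON string."""
    mmif = json.loads(mmif_text)
    counts = Counter()
    for view in mmif.get("views", []):
        for annotation in view.get("annotations", []):
            counts[annotation.get("@type")] += 1
    return counts


# Hand-written illustrative payload, NOT actual output of the app.
SAMPLE = json.dumps({
    "views": [
        {
            "id": "v1",
            "annotations": [
                {"@type": "http://vocab.lappsgrid.org/Token", "properties": {"id": "t1"}},
                {"@type": "http://vocab.lappsgrid.org/Token", "properties": {"id": "t2"}},
                {"@type": "http://vocab.lappsgrid.org/Sentence", "properties": {"id": "s1"}},
            ],
        }
    ]
})

if __name__ == "__main__":
    for type_uri, n in count_annotation_types(SAMPLE).items():
        print(n, type_uri)
```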