GETTING STARTED

Last updated: 30-Jun-2022 16:22

This document describes a session, using either HTML or command-line calls, that uses the gate service to obtain the annotations resulting from applying GATE to a newspaper article harvested from the Boston Globe on March 3, 2023.

This sequence is harvested from the tests used to confirm the expected behavior of gate and gated.

Use URlEncoder to URL-encode parameter values for each URL.

The sequence of operations is the same for both HTML and the command line:

  1. Load the ANNIE plugin
  2. Load a document from a file
  3. Create a corpus
  4. Add the loaded document to the corpus
  5. Create a pipeline
  6. Load the "AnnotationDeletePR" processing resource
  7. Load the "DefaultTokenizer" processing resource
  8. Set the corpus of the pipeline
  9. Add the "AnnotationDeletePR" to the pipeline
  10. Add the DefaultTokenizer" to the pipeline
  11. Configure the runtime parameters of "AnnotationDeletePR"
  12. Run the pipeline
  13. Get the annotations from the document
  14. Cleanup gate daemon (gated)
  15. Reset shared state

HTML

Each numbered item is a link that will invoke the operation when clicked.

  1. Load the ANNIE plugin

  2. Load a document from a file

  3. Create a corpus

  4. Add the loaded document to the corpus

  5. Create a pipeline

  6. Load the "AnnotationDeletePR" processing resource

  7. Load the "DefaultTokenizer" processing resource

  8. Set the corpus of the pipeline

  9. Add the "AnnotationDeletePR" to the pipeline

  10. Add the DefaultTokenizer" to the pipeline

  11. Configure the setsToKeep parameter of "AnnotationDeletePR"

  12. Run the pipeline

  13. Get the annotations from the document

  14. Cleanup gate daemon (gated)

  15. Reset shared state

Command Line

  1. Load the ANNIE plugin

  2. Load a document from a file

  3. Create a corpus

  4. Add the loaded document to the corpus

  5. Create a pipeline

  6. Load the "AnnotationDeletePR" processing resource

  7. Load the "DefaultTokenizer" processing resource

  8. Set the corpus of the pipeline

  9. Add the "AnnotationDeletePR" to the pipeline

  10. Add the DefaultTokenizer" to the pipeline

  11. Configure the setsToKeep parameter of "AnnotationDeletePR"

  12. Run the pipeline

  13. Get the annotations from the document

  14. Cleanup gate daemon (gated)

  15. Reset shared state