Skip to content

Integrate with spaCy¶

spaCy is a library for training advanced NLP models for use cases such as tagging, parsing, named entity recognition, text classification, and more.

Start Logging¶

Connect Comet to your existing code by using the spaCy Comet logger, here is an example configuration file:

[training.logger]
@loggers = "comet_ml.spacy.logger.v1"

Log automatically¶

By default, Comet will log the following items:

  • Metrics
    • The metrics defined in the project .cfg file
  • Parameters
    • hyper-parameters defined in the project .cfg file
  • Model
    • the best-performing model

End-to-end example¶

Following is a basic example of using Comet with spaCy.

Install dependencies¶

python -m pip install 'comet_ml>=3.31.19' spacy

Download example project¶

spacy project clone pipelines/tagger_parser_ud
spacy project assets tagger_parser_ud

Login to Comet¶

There are two ways to configure Comet

You can either set your credentials through environment variables

Environment Variables

export COMET_API_KEY=<Your Comet API Key>
export COMET_PROJECT_NAME=<Your Comet Project Name>

Or create a .comet.config file in your working directory and set your credentials there.

Comet Configuration File

[comet]
api_key=<Your Comet API Key>
project_name=<Your Comet Project Name>

Configure Comet as your logger¶

Logging in spaCy is defined in the projects training.logger in the project configs/default.cfg file, replace the section [training.logger] with:

[training.logger]
@loggers = "comet_ml.spacy.logger.v1"
workspace = "your_workspace"
project_name = "your_project_name"
tags = ["your_tags"]
remove_config_values = ["paths.train", "paths.dev", "corpora.train.path", "corpora.dev.path"]

Run your spaCy Training Pipeline¶

spacy project run all tagger_parser_ud
Nov. 18, 2024