About GitHub Copilot telemetry

GitHub Copilot collects and relies on additional telemetry data beyond what other GitHub products and services collect.

What data is collected

The GitHub Copilot collects activity from the user’s Visual Studio Code editor, tied to a timestamp, and metadata. This metadata consists of the extension settings and the standard metadata collected by the Visual Studio Code extension telemetry package:

  • Visual Studio Code machine ID (pseudonymized identifier)
  • Visual Studio Code session ID (pseudonymized identifier)
  • Visual Studio Code version
  • Geolocation from IP address (country, state/province and city, but not the IP address itself)
  • Operating system and version
  • Extension version
  • The VS Code UI (web or desktop)

The activity collected consists of events that are triggered when:

  • An error occurs (it records the error kind and relevant background; e.g. if it’s an authentication error the key expiry date is recorded)
  • Our models are accessed to ask for code suggestions (it records editor state like position of cursor and snippets of code)—this includes cases when the user takes an action to request code suggestions
  • Code suggestions are received or displayed (it records the suggestions, post-processing, and metadata like model certainty and latency)
  • Code suggestions are redacted due to filters that ensure AI safety
  • The user acts on code suggestions (e.g. to accept or reject them)
  • The user has acted on code suggestions and then it records whether or how they persisted in the code

How the data is used

This data will only be used by GitHub for:

  • Directly improving the product, including assessing different strategies in processing and predicting which suggestions users may find helpful
  • Directly evaluating the product, e.g. by measuring the positive impact it has on the user
  • Improving the underlying code generation models, e.g. by providing positive and negative examples (but always so that your private code is not used as input to suggest code for other users of GitHub Copilot)
  • Guiding closely related GitHub products
  • Investigating and detecting potential abuse of the GitHub Copilot service
  • Other purposes related to improving the GitHub Copilot service

How the data is shared

The telemetry data is stored securely on GitHub systems, with appropriate encryption in place.

We know user edit actions and source code snippets are very sensitive data, and access is strictly controlled. The data can only be accessed by (1) named GitHub personnel (employees and contractors) working on the GitHub Copilot team or on the GitHub platform health team, (2) select Microsoft personnel (employees and contractors) working on or with the GitHub Copilot team, and (3) select employees of OpenAI who work on GitHub Copilot.

Did this doc help you?Privacy policy

Help us make these docs great!

All GitHub docs are open source. See something that's wrong or unclear? Submit a pull request.

Make a contribution

Or, learn how to contribute.