This page contains specifications of statistics files used on this website.
All per-graph statistics files are available for download via a URL of the form:
https://metrics.torproject.org/identifier.csv
These URLs all support a set of optional parameters that can be used to further customize their content. Typically, these are start and end plus additional parameters as specified below. Including a parameter in a URL typically filters the resulting statistics file by the given parameter value. Conversely, omitting a parameter produces a larger statistics file that is not filtered by that parameter.
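As a rough illustration of how such URLs can be assembled, here is a minimal sketch. The helper name build_stats_url is hypothetical, the identifier userstats-bridge-combined is borrowed from the example file mentioned on this page, and the YYYY-MM-DD date format for start and end is an assumption based on the dates that appear in the data.

```python
from urllib.parse import urlencode

BASE_URL = "https://metrics.torproject.org"


def build_stats_url(identifier, **params):
    """Return the CSV URL for a per-graph statistics file (hypothetical helper).

    Parameters set to None are left out of the URL, so the resulting
    file is not filtered by them.
    """
    query = urlencode({k: v for k, v in params.items() if v is not None})
    url = f"{BASE_URL}/{identifier}.csv"
    return f"{url}?{query}" if query else url


# Assumed date format: YYYY-MM-DD, as used in the data lines.
filtered_url = build_stats_url(
    "userstats-bridge-combined", start="2022-04-20", end="2022-05-07"
)
# No parameters: the larger, unfiltered file.
unfiltered_url = build_stats_url("userstats-bridge-combined")
```

Leaving out start and end yields the plain identifier URL, which matches the behavior described above for omitted parameters.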
Each per-graph statistics file starts with a comment section, followed by a header line and then the actual data lines. Columns are pre-defined and specified further down below. The rule of thumb for columns is that neither the choice of parameters nor availability of data should affect the set of columns, but that only a code change can add, change, or remove a column. This rule of thumb is not yet implemented for all per-graph statistics files.
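The three-part layout described above (comment section, header line, data lines) can be separated with the standard library alone. This is only a sketch: it assumes that comment lines start with "#", and the comment text in the sample is made up for illustration.

```python
import csv


def split_stats_file(text, comment_prefix="#"):
    """Split a statistics file into (comments, header, data rows)."""
    comments, body = [], []
    for line in text.splitlines():
        if line.startswith(comment_prefix):
            comments.append(line)
        elif line.strip():
            body.append(line)
    rows = list(csv.reader(body))
    if not rows:
        return comments, [], []
    return comments, rows[0], rows[1:]


# The comment lines below are hypothetical; the data lines are from this page.
sample = """# hypothetical comment line
# another hypothetical comment line
date,country,users,frac
2022-04-20,,69560,94
2022-04-21,,69068,87
"""

comments, header, data = split_stats_file(sample)
```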
Applications must not rely on the order of columns, as this order may change when columns are removed. Instead, applications should refer to columns by their name. Applications should be able to handle newly added columns and fail gracefully in case of removed columns.
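One way to follow this advice is to look columns up by name and to check for required columns before processing any rows. The sketch below uses csv.DictReader from the Python standard library; the names REQUIRED_COLUMNS, MissingColumnError, and read_columns are hypothetical, as is the extra column in the sample input, and comment lines are assumed to start with "#".

```python
import csv

REQUIRED_COLUMNS = ("date", "users")


class MissingColumnError(Exception):
    """Raised when a column the application depends on has been removed."""


def read_columns(text, required=REQUIRED_COLUMNS):
    """Return the required columns of each row, looked up by name."""
    lines = [ln for ln in text.splitlines() if ln and not ln.startswith("#")]
    reader = csv.DictReader(lines)
    missing = [c for c in required if c not in (reader.fieldnames or [])]
    if missing:
        raise MissingColumnError("missing columns: " + ", ".join(missing))
    # Unknown (newly added) columns are simply ignored.
    return [{c: row[c] for c in required} for row in reader]


# Reordered columns plus a hypothetical new column still parse fine.
rows = read_columns("users,extra,date\n69560,x,2022-04-20\n")

# A removed column produces a clear error instead of wrong data.
try:
    read_columns("date,country\n2022-04-20,\n")
    error_message = None
except MissingColumnError as exc:
    error_message = str(exc)
```

Because rows are accessed by column name, reordering or adding columns does not change the result, and a removed column is reported up front rather than surfacing later as a confusing failure.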
Changes to columns will be announced on this page and on the tor-dev@ mailing list at least a couple of weeks in advance.
The following changes have been made in the past or are scheduled to become effective in the near future:
We use tidy ("long") data formats for our CSV files. For example, userstats-bridge-combined.csv contains the following lines:
date,country,users,frac
2022-04-20,,69560,94
2022-04-21,,69068,87
2022-04-22,,70663,94
2022-04-23,,69647,92
2022-04-24,,69993,92
2022-04-25,,71348,92
2022-04-26,,69745,93
2022-04-27,,69497,94
2022-04-28,,69043,92
2022-04-29,,68440,89
2022-04-30,,68368,92
2022-05-01,,67647,93
2022-05-02,,69142,93
2022-05-03,,70942,90
2022-05-04,,68913,90
2022-05-05,,71004,92
2022-05-06,,68430,85
2022-05-07,,67643,93
The tidy format is more flexible and somewhat easier to handle with statistics tools and languages like R or Python. See also Hadley Wickham's Tidy Data paper, which is a really good read on this topic.
If you are familiar with programming in Python, you can easily parse our CSV files with pandas:
  import pandas as pd
  # Lines starting with '#' belong to the comment section and are skipped.
  df = pd.read_csv('userstats-bridge-combined-2022-04-20-all-2022-07-19.csv', comment='#')
This will load the data into a DataFrame, in which you can easily manipulate columns and rows. For example:
  date_th = '2022-05-01'
  df = df[df.date >= date_th]
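The above snippet keeps only the rows whose date is on or after the given date. Comparing the dates as strings works here because they are in YYYY-MM-DD format, which sorts chronologically.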
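The same idea extends to date ranges and simple aggregates. The following sketch parses the date column into real timestamps and averages the users column over a three-day window. It reads a few of the sample lines shown earlier from an in-memory string rather than a downloaded file, and the comment line in the sample is hypothetical.

```python
import io

import pandas as pd

# Data lines copied from the example above; the comment line is made up.
sample_csv = """# hypothetical comment line
date,country,users,frac
2022-04-20,,69560,94
2022-04-21,,69068,87
2022-04-22,,70663,94
2022-04-23,,69647,92
2022-04-24,,69993,92
2022-04-25,,71348,92
"""

# parse_dates turns the date column into datetime64 values.
df = pd.read_csv(io.StringIO(sample_csv), comment="#", parse_dates=["date"])

# Keep rows inside an inclusive date window.
start, end = pd.Timestamp("2022-04-22"), pd.Timestamp("2022-04-24")
window = df[(df["date"] >= start) & (df["date"] <= end)]

# Average of the users column over the window.
mean_users = window["users"].mean()
```

Parsing the dates is optional for plain filtering, but it makes date arithmetic and resampling available if they are needed later.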
"obfs2", "obfs3", "obfs4", "scramblesuit", and "fte", and which will change in the future.
"https", "email", and "moat", and which may change in the future.
"Guard" relay flag assigned ("t") or not ("f").
"Exit" relay flag assigned and at the same time the "BadExit" not assigned ("t") or not ("f").
torproject.org web servers have been logged.
'%/torbrowser/%.exe', '%/torbrowser/%.dmg', and '%/torbrowser/%.tar.xz' and response code 200.
'%/torbrowser/%.exe.asc', '%/torbrowser/%.dmg.asc', and '%/torbrowser/%.tar.xz.asc' and response code 200.
'%/torbrowser/update\__/%' and response code 200.
'%/torbrowser/%.mar' and response code 302.
torproject.org web servers have been logged.
torproject.org web servers have been logged.
torproject.org web servers have been logged.
torproject.org web servers have been logged.
'%/tormessenger/%.exe', '%/tormessenger/%.dmg', and '%/tormessenger/%.tar.xz' and response code 200.
'%/tormessenger/update\__/%' and response code 200.
© 2009–2023 The Tor Project
This material is supported in part by the National Science Foundation under Grant No. CNS-0959138. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation. "Tor" and the "Onion Logo" are registered trademarks of The Tor Project, Inc. Data on this site is freely available under a CC0 no copyright declaration: To the extent possible under law, the Tor Project has waived all copyright and related or neighboring rights in the data. Graphs are licensed under a Creative Commons Attribution 3.0 United States License.