# Things to note when doing metrics code review.
[TOC]
This covers how to review metrics code in the Chromium codebase.
## UMA Histograms
### What is covered under review?
During code review ensure the following -
#### XML metadata in histograms.xml is correct and descriptive.
* Verify the histogram is created under the right histogram 'namespace'.
(i.e if the histogram name is A.B.C, then A is the histogram namespace).
If this is a new one, check if there a similar one that already exists?
* Histogram owners match the
[histogram owners guidelines](https://chromium.googlesource.com/chromium/src/+/HEAD/tools/metrics/histograms/README.md#owners).
* If generating multiple histograms programmatically or defining common set of
histogram, guide them to use
[patterned-histograms](https://chromium.googlesource.com/chromium/src/tools/+/HEAD/metrics/histograms/README.md#Patterned-Histograms).
* Verify that expires_after is reasonable. CL author should be able to justify
it. See guidance at on
[histogram-expiry](https://chromium.googlesource.com/chromium/src/+/HEAD/tools/metrics/histograms/README.md#Histogram-Expiry).
* Summary section should describe what is measured, when is it recorded and
when not. The summary should generally describe a single emission of sample
and not the statistics in aggregate.
* If the histogram is recorded only for some platforms, then it should be
included in the summary (unless part of its name).
* Histogram and enum names don't include special characters besides dot,
underscores or slashes.
#### Histogram is designed correctly.
* Verify that the histogram follows the
[UMA histogram design recommendation](https://chromium.googlesource.com/chromium/src.git/+/HEAD/tools/metrics/histograms/README.md).
If issues are found please link to docs in your review comments to
increase their visibility.
#### Histogram is setup correctly.
* If modifying an existing histogram, request that the histogram be
renamed if its meaning has changed significantly. Common practices are to
add suffix such as 2 to the name. When doing so, the existing entry for the
histogram should be removed and an obsoletion message should be added
following the [guidance](https://chromium.googlesource.com/chromium/src/tools/+/HEAD/metrics/histograms/README.md#obsolete) to point to the new histogram.
* Re-numbering enum bucket values is not allowed as these break backward
compatibility with respect to the data stored.
* When modifying enums it is better to add new values to the enum instead of
re-purposing existing enum values.
* Modifying enum labels / summary is safe and allowed without review. However,
if reviewing these changes make sure the semantic meaning of the bucket
remain unchanged. e.g re-labeling 'Has Error' to 'Has Warning' should not be
allowed, while re-labeling 'Has Error' to 'Has Error (e.g this and that
error)' is okay.
* Verify that histogram buckets are not a privacy risk. Some of the types
forbidden are if the buckets are encoding page contents, URL, domain name,
or is including any other type of personally identifying or sensitive
information. If during review you are unsure then do not hesitate to request
that Chrome Privacy Team review the change.
* Check that the histogram bucket space of all possible values for all clients
will be limited to 50 generally. It should not exceed 100 unless the CL
author has a justification for having extra buckets.
* If the lower bucket counts prove insufficient (e.g after reviewing
the data for 50 buckets proves to be unhelpful, more buckets can be
requested in a separate review).
* When logging hashes in a sparse histogram make sure the global space
meet this criterion (not just a client). Another valid case is when
logging API error codes which in theory could be potentially large,
but on a particular day it is not the case.
* Check that recorded samples will be within 0 and 2^31-1.
* If you expect majority of values to be under a certain value, say X,
then it's good practice to pick the next order of magnitude - e.g. 10X for
the max. This ensures that even outliers will not overflow the distribution.
* Verify that for an enum histogram, the enum described in enums.xml and the
enum defined in the client code matches. Furthermore, the enum in the code
should have a comment mentioning that the values must not be changes and
also that additions to the enum should be synced to enums.xml.
* See the sample comment
[here](https://cs.chromium.org/chromium/src/base/metrics/histogram_macros.h?rcl=2c99f35f64380ba63c928787834661fbc1fa4234&l=46).
The comment should be identical or nearly so.
* If the histogram is logged via a macro (rather than a call to the function),
check that the names will be constant at runtime.
### What is not covered under review?
* Metrics team members need not be versed with code where histogram is
recorded and are not reviewing all the corner cases associated with
collecting the histogram.
* enums.xml changes don't need a review. However, it is still useful to verify
that the changes match the guidelines mentioned above.
## User Actions
* Verify that the user action logged is actually user triggered. If they
are not then advise cl author to convert them to a histogram.
* However if ordering of actions is the interesting part of their analysis
then maybe see if each order combination can become a histogram bucket
instead of user action.
* Don't allow logging of noisy user actions (like scroll events). Typical
allowed frequency is to be less frequent than PageLoad or MobilePageLoaded
event.
## UKMs
* UKM metrics are to be reviewed by UKM
[data privacy owners](https://cs.chromium.org/chromium/src/tools/metrics/ukm/PRIVACY_OWNERS).
* The metrics must follow the
[data collection guideline](/analysis/uma/g3doc/ukm/ukm.md#adding-ukms).
<!--TODO(ukm-team): Add other guidelines for reviewing UKM metrics changes. -->
## Other specialized metrics
The ChromeUserMetricsExtension proto includes a variety of other fields such as
records for Omnibox, Profiler, Stability, etc. These are specialized reviews and
should be routed to relevant owner. The guidelines here don't cover these cases
and typically require a server-side review first to change the proto.