Reproducibility checklist

Use this checklist when an NL Analytics output will support a paper, policy note, appendix, replication file, monitoring output, report, or dashboard.

Search definition

Record:

  • Exact query string.
  • Date range.
  • Selected metrics.
  • Search options used where applicable.
  • Any document-level or within-document filters used.
  • Merge logic if searches were combined.

Review evidence

Keep notes from matched-sentence review:

  • Examples of true positives.
  • False positives and exclusions added.
  • Missing terms added during refinement.
  • Ambiguous terms and how they were handled.

Matched-sentence review supports auditability. It does not, by itself, prove construct validity.

Export context

Record:

  • Export date.
  • Corpus coverage statement used for the analysis.
  • File downloaded.
  • Search metadata or configuration file where available.
  • Product share link where used.

Do not rely on a saved search or share link alone as a replication package. Keep exported files, query metadata, and downstream code or notes together.

Downstream analysis

Record:

  • Join key.
  • Identifier checks.
  • Sample restrictions.
  • Whether raw counts were normalized by nr_of_sentences.
  • Any filtering of zero rows.
  • Version of the downstream dataset.

Zero Exposure means the selected query did not match sentences in that transcript. It does not mean the company had no real-world exposure to the topic.

Was this page helpful?