Reproducibility checklist
Use this checklist when an NL Analytics output will support a paper, policy note, appendix, replication file, monitoring output, report, or dashboard.
Search definition
Record:
- Exact query string.
- Date range.
- Selected metrics.
- Search options used where applicable.
- Any document-level or within-document filters used.
- Merge logic if searches were combined.
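The items above can be kept in one machine-readable record stored next to the export. The following is a minimal sketch only: the field names, the query syntax, and the metric name are hypothetical illustrations, not the product's own schema.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class SearchRecord:
    """One search definition. All field names here are hypothetical."""
    query: str                      # exact query string, copied verbatim
    date_start: str                 # ISO date, start of the range
    date_end: str                   # ISO date, end of the range
    metrics: list                   # selected metrics
    search_options: dict = field(default_factory=dict)
    filters: list = field(default_factory=list)   # document-level or within-document
    merge_logic: str = ""           # how searches were combined, if they were


def to_json(record: SearchRecord) -> str:
    # sort_keys gives a stable layout, so two records diff cleanly.
    return json.dumps(asdict(record), indent=2, sort_keys=True)


# Illustrative values only.
record = SearchRecord(
    query='"supply chain" AND disruption',
    date_start="2019-01-01",
    date_end="2023-12-31",
    metrics=["exposure"],
    filters=["example document-level filter"],
)
search_json = to_json(record)
print(search_json)
```

Storing the query as a verbatim string, rather than retyping it in notes, avoids silent changes to quoting or operators.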
Review evidence
Keep notes from matched-sentence review:
- Examples of true positives.
- False positives and exclusions added.
- Missing terms added during refinement.
- Ambiguous terms and how they were handled.
Matched-sentence review supports auditability. It does not, by itself, prove construct validity.
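Review notes are easier to audit when they are kept in a structured log rather than in free text. The sketch below writes one row per reviewed sentence to CSV. The column names, labels, and example sentences are all hypothetical and chosen for illustration.

```python
import csv
import io
from collections import Counter

# Hypothetical columns for a review log.
FIELDS = ["sentence", "label", "action", "note"]

# Invented example rows, not real matched sentences.
review_notes = [
    {"sentence": "Example sentence that is on topic.",
     "label": "true_positive", "action": "", "note": ""},
    {"sentence": "Example sentence that matched by accident.",
     "label": "false_positive", "action": "exclusion added",
     "note": "term used in an unrelated sense"},
    {"sentence": "Example sentence with an unclear term.",
     "label": "ambiguous", "action": "kept",
     "note": "decision recorded for reviewers"},
]


def write_review_log(notes, handle):
    """Write review notes as CSV to any text file handle."""
    writer = csv.DictWriter(handle, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(notes)


buffer = io.StringIO()          # stands in for a file opened with newline=""
write_review_log(review_notes, buffer)
label_counts = Counter(row["label"] for row in review_notes)
print(buffer.getvalue())
print(dict(label_counts))
```

The label counts summarize what was reviewed. Consistent with the caveat above, they document the review and are not a validity estimate.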
Export context
Record:
- Export date.
- Corpus coverage statement used for the analysis.
- Name of each file downloaded.
- Search metadata or configuration file where available.
- Product share link where used.
Do not rely on a saved search or share link alone as a replication package. Keep exported files, query metadata, and downstream code or notes together.
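One way to keep these pieces together is a manifest that lists every file in the replication folder with a checksum, alongside the export context. This is a sketch under stated assumptions: the folder layout, manifest keys, and file names are hypothetical, and the demo writes placeholder files to a temporary directory.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(package_dir, export_date, coverage_statement, share_link=None):
    """Describe a replication folder. Keys are hypothetical, not a product format."""
    root = Path(package_dir)
    files = {
        path.relative_to(root).as_posix(): sha256_of(path)
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }
    return {
        "export_date": export_date,
        "coverage_statement": coverage_statement,
        "share_link": share_link,   # optional; not sufficient on its own
        "files": files,
    }


# Demo with placeholder content in a temporary folder.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "export.csv").write_bytes(b"placeholder export\n")
    (root / "code").mkdir()
    (root / "code" / "analysis.py").write_bytes(b"# placeholder script\n")
    manifest = build_manifest(
        root,
        export_date="2024-01-31",                       # illustrative date
        coverage_statement="placeholder: copy the statement used",
    )

print(json.dumps(manifest, indent=2))
```

The checksums let a later reader confirm that the files they hold are the ones the analysis used, which a share link cannot show.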
Downstream analysis
Record:
- Join key.
- Identifier checks.
- Sample restrictions.
- Whether raw counts were normalized by nr_of_sentences.
- Any filtering of zero rows.
- Version of the downstream dataset.
Zero Exposure means the selected query did not match any sentences in that transcript. It does not mean the company had no real-world exposure to the topic.
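The downstream items can be made explicit in code so that each choice is visible in the replication file. The sketch below checks a join key for duplicates, normalizes raw counts by nr_of_sentences, and counts zero rows without dropping them. Only nr_of_sentences comes from this checklist; transcript_id, raw_count, share, and the sample values are hypothetical.

```python
from collections import Counter

# Invented sample rows; only the nr_of_sentences column name is from the export.
rows = [
    {"transcript_id": "T1", "raw_count": 3, "nr_of_sentences": 120},
    {"transcript_id": "T2", "raw_count": 0, "nr_of_sentences": 80},
    {"transcript_id": "T3", "raw_count": 5, "nr_of_sentences": 0},
]


def duplicate_keys(records, key="transcript_id"):
    """Return join-key values that appear more than once."""
    counts = Counter(record[key] for record in records)
    return sorted(value for value, n in counts.items() if n > 1)


def normalize(records, count_key="raw_count", total_key="nr_of_sentences"):
    """Add a 'share' column: raw count divided by sentence count.

    Rows with a zero or missing sentence count get None instead of a number.
    """
    out = []
    for record in records:
        total = record.get(total_key)
        share = record[count_key] / total if total else None
        out.append({**record, "share": share})
    return out


duplicates = duplicate_keys(rows)
normalized = normalize(rows)

# Zero rows are kept and counted so that any later filter is a recorded decision.
zero_rows = [r for r in normalized if r["raw_count"] == 0]

print("duplicate join keys:", duplicates)
print("zero rows kept:", len(zero_rows), "of", len(normalized))
```

Keeping zero rows in the table and filtering them only in a named, recorded step makes it clear whether a result depends on treating non-matches as observations.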