Export a research panel
Export a research panel after the query has been reviewed and the matched sentences support the intended measure. This page lists the core files, their columns, and the identifiers for downstream joins.
Before exporting
Confirm that:
- The query matches the intended construct.
- The date range matches the analysis sample.
- The topic has enough coverage and variation for the intended use.
- Matched sentences have been inspected in the Snippet Tool.
Download output
- Open the completed Risk Tool search (or its entry in search history).
- Download the main call-level file, firmlevel.csv.
- Download unconditional.csv if the analysis needs overall risk or sentiment counts.
- Download gvkeys.csv if the downstream workflow needs a Compustat join.
- Download the search metadata where available.
Keep every downloaded file together with the search definition per the reproducibility checklist.
firmlevel.csv
The main export. One row per earnings call in the selected date range, identified by earningscallID, including calls with zero matches.
| Column | Definition |
|---|---|
| earningscallID | Unique identifier for the earnings call. Use it for joins, deduplication, and citations. |
| company_name | English company name. |
| date | Date of the earnings call, based on UTC time. |
| event_name | Name of the earnings-call event, usually including the fiscal period discussed. |
| start_time | ISO 8601 timestamp for when the call started, in UTC. Use it when precise timing matters. |
| exposure, risk, positive, negative, sentiment | The topic-conditioned measures. See metric definitions. |
| nr_of_sentences | Total number of sentences in the full transcript. |
| tickersymbol | Ticker symbol. |
| permid | Refinitiv PermID, where available in the corpus metadata. |
| ric | Refinitiv Instrument Code, where available in the corpus metadata. |
| isin | International Securities Identification Number. |
| cik | SEC Central Index Key, mainly for U.S. entities. |
| date_q | First day of the calendar quarter of the call. Calendar quarter, not fiscal quarter. |
| headquarterscountry | Registered headquarters country. |
| trbc | Thomson Reuters Business Classification code. |
| economic_sector | Label for the first two digits of trbc. |
| business_sector | Label for the third and fourth digits of trbc. |
| nr_of_sentences_filtered | Number of sentences passing the section, speaker-affiliation, or speaker-title restrictions. Present only when such a restriction was used. |
| matched_speaker_names | Names of speakers whose sentences matched, when a speaker-title restriction was used. |
The file can also contain provenance columns describing the scoring mode and columns for metrics under development, which these docs do not define.
Identifier columns other than earningscallID can be empty for some calls. Check identifier completeness in your target sample before designing a join around one of them.
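The completeness check can be sketched in pandas along the following lines. This is a minimal illustration, not part of the tool: the helper name identifier_completeness and the demo rows are hypothetical, and the identifier values are placeholders rather than real codes. In practice the frame would come from reading firmlevel.csv, restricted to the target sample.

```python
import pandas as pd

# Identifier columns listed in the firmlevel.csv table.
ID_COLUMNS = ["tickersymbol", "permid", "ric", "isin", "cik"]


def identifier_completeness(firmlevel, columns=ID_COLUMNS):
    """Return the share of rows with a non-empty value per identifier column.

    Hypothetical helper: columns absent from the frame are skipped, and
    whitespace-only strings are treated as missing alongside NaN/None.
    """
    present = [c for c in columns if c in firmlevel.columns]

    def is_filled(col):
        return col.notna() & (col.astype(str).str.strip() != "")

    return firmlevel[present].apply(is_filled).mean()


# Illustrative rows only; values are placeholders, not real identifiers.
demo = pd.DataFrame(
    {
        "earningscallID": ["c1", "c2", "c3", "c4"],
        "isin": ["isin-a", None, "", "isin-b"],
        "cik": ["cik-a", "cik-b", None, "cik-c"],
    }
)

completeness = identifier_completeness(demo)
print(completeness)
```

A low share for a column suggests choosing a different join key for that sample, or documenting the coverage loss.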
unconditional.csv
Overall risk and sentiment counts, not restricted to the query. One row per earnings call.
| Column | Definition |
|---|---|
| earningscallID | Unique identifier for the earnings call. |
| nr_of_sentences | Total number of sentences in the full transcript. |
| unconditional_positive | Sentences with at least one positive sentiment word. |
| unconditional_negative | Sentences with at least one negative sentiment word. |
| unconditional_risk | Sentences with at least one risk or uncertainty synonym. |
| unconditional_sentiment | unconditional_positive - unconditional_negative, included when sentiment is among the selected measures. |
Merge it with firmlevel.csv on earningscallID to express topic-conditioned measures relative to all risk or sentiment language in the call. See Overall risk and sentiment for the definitions.
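One way to perform this merge in pandas is sketched below. The helper name merge_unconditional, the derived column risk_share, and the demo values are hypothetical. The ratio also assumes that risk in firmlevel.csv is on the same sentence-count scale as unconditional_risk, which this page does not state; check the metric definitions before relying on it.

```python
import pandas as pd


def merge_unconditional(firmlevel, unconditional):
    """Left-join unconditional counts onto the call-level panel.

    Hypothetical helper. nr_of_sentences appears in both files, so it is
    dropped from the right-hand side to avoid _x/_y suffixes. The
    validate argument raises if earningscallID is duplicated on either side.
    """
    extra = unconditional.drop(columns=["nr_of_sentences"], errors="ignore")
    return firmlevel.merge(
        extra, on="earningscallID", how="left", validate="one_to_one"
    )


# Illustrative values only.
firmlevel = pd.DataFrame(
    {
        "earningscallID": ["c1", "c2", "c3"],
        "nr_of_sentences": [100, 80, 120],
        "risk": [2, 0, 1],
    }
)
unconditional = pd.DataFrame(
    {
        "earningscallID": ["c1", "c2", "c3"],
        "nr_of_sentences": [100, 80, 120],
        "unconditional_risk": [10, 0, 4],
    }
)

merged = merge_unconditional(firmlevel, unconditional)

# Assumed-scale ratio: topic risk relative to all risk language in the call.
# Calls with no risk language at all get NaN instead of a division by zero.
denominator = merged["unconditional_risk"].where(merged["unconditional_risk"] > 0)
merged["risk_share"] = merged["risk"] / denominator
print(merged)
```

The left join keeps every row of firmlevel.csv, so zero rows survive the merge.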
gvkeys.csv
A convenience crosswalk for Compustat joins. One row per earnings call.
| Column | Definition |
|---|---|
| earningscallID | Unique identifier for the earnings call. |
| gvkey_compustat | Compustat GVKey, where a match exists. |
| sic_compustat | Four-digit SIC code, where available. |
| sic_compustat_description | Label for the SIC code. |
Treat the crosswalk as an input to verify, not as a guaranteed match: check match quality in the downstream dataset and document the checks you performed.
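A first-pass coverage check might look like the sketch below. It only counts how many calls in the panel receive a GVKey; it does not verify that the matches are correct, which still has to be checked against the downstream dataset. The helper name gvkey_match_report and the demo values are hypothetical. When loading the real file, reading the key columns as strings (for example with dtype=str) is a reasonable precaution so that any leading zeros are preserved.

```python
import pandas as pd


def gvkey_match_report(firmlevel, gvkeys):
    """Summarize how many panel calls have a GVKey in the crosswalk.

    Hypothetical helper. Returns counts suitable for a data appendix.
    """
    merged = firmlevel[["earningscallID"]].merge(
        gvkeys,
        on="earningscallID",
        how="left",
        validate="one_to_one",
        indicator=True,  # adds a _merge column: "both" or "left_only"
    )
    key = merged["gvkey_compustat"]
    has_gvkey = key.notna() & (key.astype(str).str.strip() != "")
    return {
        "calls": len(merged),
        "missing_from_crosswalk": int((merged["_merge"] == "left_only").sum()),
        "with_gvkey": int(has_gvkey.sum()),
        "match_rate": float(has_gvkey.mean()),
    }


# Illustrative values only; the GVKeys are placeholders.
firmlevel = pd.DataFrame({"earningscallID": ["c1", "c2", "c3", "c4"]})
gvkeys = pd.DataFrame(
    {
        "earningscallID": ["c1", "c2", "c3"],
        "gvkey_compustat": ["g1", None, "g3"],
    }
)

report = gvkey_match_report(firmlevel, gvkeys)
print(report)
```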
Search metadata
The search metadata records the search definition: the query, date range, selected options, and the search identifier. Download it where available and store it with the exported files — it is the audit trail that lets a coauthor or referee see exactly which definition produced the panel.
Keep zero rows
firmlevel.csv contains every earnings call in the selected date range, including calls where Exposure or other metrics are zero.
A zero row records that the topic was absent from a call in the selected sample. Removing these rows changes averages, merges, and aggregate statistics. See Search quality and zero rows.
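A small pandas sketch with made-up numbers shows the effect on an average. The values are illustrative only and do not come from any real export.

```python
import pandas as pd

# Illustrative panel: two calls never mention the topic, two do.
demo = pd.DataFrame(
    {
        "earningscallID": ["c1", "c2", "c3", "c4"],
        "exposure": [0.0, 0.0, 2.0, 4.0],
    }
)

# Mean over the full sample, zero rows included.
mean_all = demo["exposure"].mean()

# Mean after dropping zero rows: a different quantity, conditional on
# the topic being mentioned at all.
mean_nonzero = demo.loc[demo["exposure"] > 0, "exposure"].mean()

# Share of calls with any exposure, which is only computable when the
# zero rows are kept.
share_exposed = (demo["exposure"] > 0).mean()

print(mean_all, mean_nonzero, share_exposed)
```

Here the full-sample mean is 1.5, while the mean over non-zero rows is 3.0. Both can be legitimate, but they answer different questions, so state which one a reported statistic uses.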
Next step
Join the export to the rest of the empirical design: Join and normalize outputs.