Corpus scope and coverage

The current public documentation scope is English earnings-call transcripts.

Current scope

NL Analytics currently documents workflows around text-based corporate disclosures and earnings-call transcripts. It is not documented as a generic upload-any-text system.

The documented corpus source universe is LSEG earnings-call transcripts available to NL Analytics.

Coverage snapshot

These figures summarize corpus-level metadata available to the documentation.

Coverage statisticSnapshot value
Earnings-call transcripts457,261
Date coverage2002-01-14 to 2026-04-30
Covered entities15,869
Headquarters countries93
Total sentences175.9 million
Median transcript length378 sentences
Latest complete year27,529 calls in 2025
Largest country shareUnited States, 59.4% of calls
RIC coverage98.3% of call rows
GVKey coverage96.7% of call rows
PermID coverage86.9% of call rows
CIK coverage83.3% of call rows
ISIN coverage75.2% of call rows

Snapshot generated from corpus metadata on May 6, 2026.

Covered entities use the internal company key with company-name fallback. Identifier coverage is row-level coverage in the corpus metadata. Use these numbers as a starting point, not as proof that a specific sample is covered. After running a search, check the exported calls, firms, countries, dates, and identifiers that remain in your target sample.

Coverage varies

Coverage can vary by country, sector, firm, and time. Check coverage before interpreting variation in a topic measure.

For research use, compare the available transcript coverage against the intended sample. A topic can appear in the broad corpus but still be too sparse or too concentrated in the target firms, countries, sectors, or periods.

Refresh timing

NL Analytics materials describe a roughly biweekly refresh cadence, typically every 13 to 16 days. Treat that timing as approximate rather than guaranteed.

When publishing research, record the coverage statement and export date used for the analysis.

Future data sources

Some NL Analytics materials discuss possible future expansion to annual reports, quarterly reports, job postings, patent filings, or user-owned documents. Those sources are not documented as current product scope.

Was this page helpful?