Primary Keys
Keys are unique identifiers for individual records
- Simple (one column) or compound (multiple columns together)
The history of corporate IT is largely one of (failed) primary keys
- Finance: Tickers, Tickers + Exchange, Tickers + Share Class, CUSIP, ISIN, SEDOL, …
Meaningful true keys are vanishingly rare - cherish them when you find them
Imperfect keys are often ‘unique enough’ for an analysis
- dplyr::count is helpful for checking how unique a candidate key is
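
A minimal sketch of that check, assuming a made-up table called listings with hypothetical ticker, exchange and price columns (none of these names come from a real data set). Counting rows per candidate key and keeping only counts above one shows where the key fails to identify a single record:

```r
library(dplyr)

# Hypothetical toy data: the same ticker appears on more than one exchange,
# and one ticker + exchange pair is duplicated outright
listings <- tibble(
  ticker   = c("ABC", "ABC", "XYZ", "XYZ"),
  exchange = c("NYSE", "LSE", "NYSE", "NYSE"),
  price    = c(10.0, 8.1, 25.5, 25.7)
)

# Candidate key 1: ticker alone. Any n > 1 means the key is not unique.
dupes_ticker <- listings %>%
  count(ticker) %>%
  filter(n > 1)

# Candidate key 2: compound key of ticker + exchange
dupes_compound <- listings %>%
  count(ticker, exchange) %>%
  filter(n > 1)

print(dupes_ticker)
print(dupes_compound)
```

An empty result would mean the candidate key is unique in this data. Here both candidates still have duplicates, so the remaining rows are the ones to inspect before deciding whether the key is ‘unique enough’.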