Data Format
Every product delivers different information and thus has a different structure needed to deliver the information.
Products of the same category will share a similar structure. For example two trackers will have the same hierarchy of information but will provide different attributes relevant to the technology tracked in each of the trackers. The same is true for Spending Guides or Black Books.
The format can change between versions
The constant product innovation leads to changes in the structure of the data products. For example, with the rise of AI enabled laptops IDC has added new fields into the Personal Device Tracker to provide the key information to IDC customers. In other cases some information that is no longer relevant is dropped or replaced.
This results not only into updates in the structure of IDC data products but it is also cascaded into the structure of the data delivered through REST API and SFTP integrations.
IDC will notify all the clients ahead of time if there are any changes in the structure of the products. To make sure you will receive the notification make sure your team is registered to receive the notifications. The process is defined in Staying up to Date with Recent Changes.
Concepts Used Accross Data Products
Data Product: An IDC offering consisting of structured data sets (as opposed to written research). Examples: Trackers, Spending Guides, Forecasts, etc. A data product typically has a defined schema (dimensions/measures) and a release schedule.
Data Set: Sometimes used interchangeably with product or part of a product. In IDC exports, a data product might encompass multiple data sets (e.g., one for historical data, one for forecast, or different segments). The API’s JSON may refer to a dataset name for a specific piece of a product.
Data Cut: A one-time delivery of data, outside the ongoing subscription stream. For example, a custom report or a special snapshot not regularly updated. (In contrast, a Subscription implies you get continuous updates).
Dimension: A categorical descriptor in a data set – essentially a column that represents an attribute of the data. Dimensions in IDC trackers might include things like Geography, Vendor, Product Category, Year, Quarter, Customer Segment, etc. Dimensions are used for grouping/filtering data.
Measure: A quantitative metric in the data – essentially a column representing a value being measured. Measures in IDC data often include Units, Value (Revenue), Spend, Shipments, etc. Measures are typically numeric.
Trackers, Spending Guides, and Balck Books
- The files have .csv.gz extension, meaning they are GZIP archives containing a CSV file. You will need to decompress them after download to get the CSV. The compression significantly reduces file size for transfer (often by ~90%).
- The CSV inside is typically comma-separated (standard CSV) with the first row as column headers (captions). Those headers correspond to dimensions and measures.
- The data is usually in a flat table form. Each row represents a specific combination of dimension values (e.g., a specific country/quarter/product segment) and the measure values for that combination.
- Character encoding is usually UTF-8 (so it can handle international characters).
- Dataset may be split into multiple CSV files by IDC (hence multiple .gz files) – you will then need to handle them separately. IDC uses logical splits (like by segment, channel, etc.).
Wallet: Spend by Market and Spend by Tech Provider
There is a GZIP archive for each product: Wallet: Spend by Market and Wallet: Spend by Tech Provider.
Each archive contains a set of CSV files that represent a full accessible dataset of a respective product.
Wallet: Spend by Market
Tech Spend by Technology columns:
Column Name |
Description |
Company Name |
Company Name |
Company Type |
Public company or private company |
Region (Headquarters) |
Region of Headquarters |
Country (Headquarters) |
Country of Headquarters |
State/Province (Headquarters) |
State/Province of Headquarters |
City (Headquarters) |
City of Headquarters |
Address |
Address of Headquarters |
ZIP Code |
ZIP Code of Headquarters |
Website |
Website |
Phone Number |
Phone Number of Headquarters |
Ticker |
Ticker (if a public company) |
Exchange |
Exchange (if a public company) |
Year Founded |
Year Founded |
Employee Band |
IDC defined Employee Band |
Revenue Band |
IDC defined Revenue Band |
Sector |
IDC defined Sector (parent of Industry) |
Industry Detail |
IDC defined Industry (child of Sector) |
Spending Region |
Spending Region |
Spending Country |
Spending Country |
Deployment Type |
Public cloud vs. On-premise spending (if applicable) |
Technology Group |
IDC Technology – Level 1 (least granular) |
Technology Category |
IDC Technology – Level 2 |
Technology |
IDC Technology – Level 3 |
Technology Detail |
IDC Technology – Level 4 (most granular) |
Year |
Calendar Year of spending estimates |
Value (US$M) |
Value of spending estimates in millions of US dollars |
Tech Spend by Technology
Column Name |
Description |
Company Name |
Company Name |
Company Type |
Public company or private company |
Region (Headquarters) |
Region of Headquarters |
Country (Headquarters) |
Country of Headquarters |
State/Province (Headquarters) |
State/Province of Headquarters |
City (Headquarters) |
City of Headquarters |
Address |
Address of Headquarters |
ZIP Code |
ZIP Code of Headquarters |
Website |
Website |
Phone Number |
Phone Number of Headquarters |
Ticker |
Ticker (if a public company) |
Exchange |
Exchange (if a public company) |
Year Founded |
Year Founded |
Employee Band |
IDC defined Employee Band |
Revenue Band |
IDC defined Revenue Band |
Sector |
IDC defined Sector (parent of Industry) |
Industry Detail |
IDC defined Industry (child of Sector) |
Spending Region |
Spending Region |
Spending Country |
Spending Country |
Deployment Type |
Public cloud vs. On-premise spending (if applicable) |
Technology Group |
IDC Technology – Level 1 (least granular) |
Platform Group |
IDC Platform – Level 1 (least granular) |
Platform Category |
IDC Platform – Level 2 |
Platform Detail |
IDC Platform – Level 3 (most granular) |
Year |
Calendar Year of spending estimates |
Value (US$M) |
Value of spending estimates in millions of US dollars |