Skip to Main Content
* UC Irvine access only

Business:  Datasets 

Your comprehensive station for UCI's business research content and beyond!
URL: https://guides.lib.uci.edu/business

Datasets


Get Pre-built Datasets

 

MarketLine Advantage
Access: UCI

Geography: Global

Country Statistics, Financial Deals, Company Prospector, Investment & Advisory Prospector, Company Report Generator, and Market Data Analytics.

  • Home > Databases [top menu] 

 

Global Financial Data
Access: UCI
Geography: Global

Current and historic downloadable economic and financial time series for the U.S. and over 150 foreign countries.

  • After creating an account, make sure to CLICK the Login button (do not use an Enter/Return key) to get into the database.

 

OECD iLibrary
Access: UCI
Geography: Global

All publications and datasets released by OECD (Organisation for Economic Cooperation and Development), International Energy Agency (IEA), Nuclear Energy Agency (NEA), OECD Development Centre, PISA (Programme for International Student Assessment), and International Transport Forum (ITF) since 1998.

 

Roper iPoll
Access: UCI

Geography: US

Provided by the Roper Center for Public Opinion Research at Cornell University, Roper iPoll is the largest collection of poll data anywhere—from 1935 to present. Contains data from U.S. and international polling firms with broad topical coverage of opinions and behavior on social issues, politics, pop culture, international affairs, and more. Questions, charts, demographic crosstabs, and dataset downloads are immediately available.

 

ICPSR Data Archive
Access: UCI
Geography: Global

An international consortium of more than 700 academic institutions and research organizations.  ICPSR maintains a data archive of more than 500,000 files of research in the social sciences.   

  • Find Data > View All [link to the right of the search bar] > Filters, i.e. the left navigation to limit to datasets that meet your needs.  

 


Build Datasets

 
 

Passport Global Market Information Database
Access: UCI - Authorized Affiliates

Geography: Global 

Statistical data on numerous B2C markets:  Alcoholic Drinks, Apparel and Footwear, Automotive, Beauty and Personal Care, Consumer Appliances, Consumer Electronics, Consumer Health, Eyewear, Fresh Food, Health and Wellness, Home and Garden, Home Care, Hot Drinks, Luxury Goods, Packaged Food, Personal Accessories, Pet Care, Soft Drinks, Tissue and Hygiene, Tobacco, and Toys and Games.

  • Learn this search!  < Get coffee market sizes over time for countries worldwide. (Please refresh if the content doesn't load after a few seconds)

 

IBISWorld
Access: UCI

Geography: US

Current and historic Industry Reports and Risk Rating reports on 700+ well-established U.S. industries (organized by NAICS code), including operating conditions, outlook, trends, and key players.  Also has current and historic Specialized Industry Reports reports for small niche industries.  Especially useful if you want to build a dataset with categorical variables!

  • Learn this search!  < Filter and compare US industries. (Please refresh if the content doesn't load after a few seconds)

 

SimplyAnalytics
Access: UCI

Geography: US

Maximum 3 users at once! An easy-to-use interface offering thousands of social and economic data variables that can be used when building custom maps and reports (i.e. spreadsheets).  Register for a free account to save and return to your work. 

  • Learn this search!  < Get consumer group data by region. (Please refresh if the content doesn't load after a few seconds)

 

S&P Capital IQ
Access: UCI - Authorized Affiliates
Geography: Global

A comprehensive company intelligence database, offering granular financial and capital structure information on millions of companies worldwide, including data and graphs for historic stock prices, searchable SEC filings, mergers & acquisitions activities, and venture capital/private equity data.  

  • Create custom tables of data; content is downloadable as Excel files.   

 

Vivvix 
Access: UCI

Geography: US

Advertising expenditures and occurrence information for millions of brands across a variety of national-level media, with data going back 5 years.  Part of the Kantar Media portfolio.  

 

Wharton Research Data Services (WRDS)
Access: UCI - Authorized Affiliates
Geography: Global

For PhD and faculty-level research.  Important databases in: finance, accounting, banking, economics, management, marketing and public policy. Merage users login through Catalyst.


Time Series Data Files

 

NOTE: 
The UCI Libraries have purchased access to these historical files of data, but the UCI Libraries do not host the data online.  If you're a UCI Authorized Affiliate (a user with an active UCInetID and password, i.e. a current UCI student, faculty, or staff member), then you can request access to the data files, and they will be transferred to you.  

 

CI Technology Historical Data
Access: UCI - Authorized Affiliates
Geography: Global 

Also known as Harte-Hanks.  Data on the installed technology and purchase plans of businesses, governments, and institutions at the local level; contains company profile information that includes annual spending/expenditure on IT and telecommunications, and counts on Installed Technology, such as: computer hardware, software, networks, storage and telecommunications, IT staff/employee counts.  

 

RateWatch Scholar Historical Data
Access: UCI - Authorized Affiliates
Geography: US

Information for U.S. financial institutions for research and analysis.  Data covers over 96,000 branch locations, depending on time period and data type, all provided voluntarily.  Data is gathered from institutions of all types and sizes, including banks, credit unions, savings and loan associations, etc. 

 

ReferenceUSA Historical Data
Access: UCI - Authorized Affiliates
Geography: US

Datasets consisting of annual snapshots of historic company data from Infogroup's U.S. Business Database. Includes company name, mailing address, SIC and NAICS codes, employee size, sales volume, and location (to zip code). 

 

Corporate Affiliations Historical Data
Access: UCI - Authorized Affiliates
Geography: Global (limited)

Historical hierarchies and profiles of over 1.9 million of the most prominent parent companies, subsidiaries, affiliates and divisions worldwide.  

 

Nielsen Datasets  
Access: UCI - Limited Affiliates

Geography: US

For PhD students and Tenure Track Faculty only!  The UCI Libraries' subscription includes the Consumer Panel dataset and the Retail Scanner dataset.  Data is from a partnership between Nielsen and the Kilts Center for Marketing at the Chicago Booth School of Business.


More Data

 

Tip  
Zillions of websites—academic, government, nonprofit, and corporate—offer datasets for research. 
Below are select academic sites with social science data. To find more datasets out in the web, try
adding site:.domain to a keyword search in Google.  That returns results only from websites with that
domain (.com, .gov, .org, .edu, etc.).  For example, paste these search strings into Google:

  • renewable energy data site:.org 
  • sustainable development site:.data.gov


UCI Machine Learning Repository
A collection of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms. It is used by students, educators, and researchers all over the world as a primary source of machine learning data sets. 

 

IPUMS
Census and survey data from around the world integrated across time and space. IPUMS integration and documentation makes it easy to study change, conduct comparative research, merge information across data types, and analyze individuals within family and community context.  From the University of Minnesota.  (IPUMS was an acronym for Integrated Public Use Microdata Series, but in 2016 it became a standalone name, not an acronym).  

 

re3data.org
The Registry of Research Data Repositories.  Identifies and lists 1,500+ research data repositories, making it the largest and most comprehensive registry of data repositories available on the web.

 

Academic Torrents
A community-maintained distributed repository for datasets and scientific knowledge.  The service is designed to facilitate storage of all the data used in research, including datasets as well as publications.

 

SNAP
The SNAP library is being actively developed since 2004 and is organically growing as a result of Stanford University's research pursuits in analysis of large social and information networks.

 

Billion Prices Project
An academic initiative that uses prices collected from hundreds of online retailers around the world on a daily basis to conduct research in macro and international economics.

 

DocNow
Documenting the Now, a collaborative effort among multiple universities, is building a variety of tools to help researchers work with Twitter data.  Twitter's terms of service don't allow tweet datasets to be published on the web, but they do allow tweet identifier datasets to be shared.   DocNow maintains a catalog of Twitter datasets that are publicly available on the web.

 

California Policy Lab
A non-partisan research institute based at the University of California. Work is focused in six policy areas: education, criminal justice reform, poverty and the social safety net, labor and employment, health, and homelessness and high needs populations.  Offers and streamlines access to data for policy-related research.  (Note: some data may include fees or an approval process prior to use.)


What's the jargon?


Database Dataset
A collection of information and/or data, usually covering a broad theme (e.g., company information, psychology, etc.).  Info/data can exist in any medium- text, images, spreadsheets, video, etc. 

The info/data in a database may come from one producer (e.g. Mintel produces its own market research reports), or it may come from several producers (e.g. Factiva compiles news articles from thousands of different publications worldwide). 
A narrower collection of information and/or data, usually describing a specific topic or phenomenon (e.g., the stock price of a company, or consumers' responses to surveys, etc.). 

In business and economics, as academic disciplines, datasets are often presented as spreadsheets in filetypes such as .xlsx or .csv.
  • Database vs Dataset
    A database is usually the container for datasets.  Our databases are resources that either 1) contain pre-made datasets, 2) or help you create custom datasets by requesting specific data (i.e. running a query) within the database.

  • Data
    "Related items of (chiefly numerical) information considered collectively, typically obtained by scientific work and used for reference, analysis, or calculation." (OED Online)

  • Raw Data
    This term is fuzzy because almost any observable or recorded phenomena could theoretically be raw data, but people often use it when referring to a dataset in a file format (e.g. .xlsx, .csv, .sav) that's often presented as a spreadsheet. 
    • Caveat: raw data is not always cleaned data.  
       
  • Cleaned Data
    To clean data, people or computer programs assess data, organize it, and correct errors, omissions, duplicates, inaccuracies, etc., thereby making the data ready for analysis.  Determining whether data is clean enough for your needs can be tricky.  Read the supporting documentation that accompanies a dataset, and contact the data producer or custodian if you have questions.  
     
  • Time Series Data
    To clean data, people or computer programs assess data, organize it, and correct errors, omissions, duplicates, inaccuracies, etc. thereby making the data ready for analysis.  Determining whether data is clean enough for your needs can be tricky.  Read the supporting documentation that accompanies a dataset, and contact the data producer if you have additional questions.  
     
  • Statistics
    This generally refers to data that has already been processed and analyzed.  For example, if you have sales data from several local car dealerships, then you could use that to generate statistics about local consumers preferences for vehicles.  

     

So, you want an Excel spreadsheet...


Get Pre-built Datasets Build Datasets

Use this page if you want to explore resources that let you download a pre-made Excel file.  

This is generally an easier way to get a data file!

Use this page if you want to explore resources that let you customize what's in the Excel file before you download it.  

This is generally a more challenging way to get a data file!

 

Return to the top of the page.

 Box title- Access Answers 


What can I access? 


Access is generally available to all users. Registration or account creation might be required to access.

For commercial websites, the UCI Libraries do not offer premium memberships or subscriptions.

 


Access is available to all users ON the UCI campus and at GML.

OFF campus access requires Authorized Affiliates to log into the VPN with their active UCInetID and password. Authorized Affiliates are users with an active UCInetID and password, i.e. current UCI students, faculty, and staff.

 


Access requires an active UCInetID and password.

Authorized Affiliates are users with an active UCInetID and password, i.e. current UCI students, faculty, and staff.

 


These resources are not licensed by the UCI Libraries, but librarians occasionally promote them when they are relevant for certain types of research.

Access is available only for Authorized Affiliates, who are also affiliated with the Paul Merage School of Business.

 

The resources are limited to select UCI populations, based on the user’s status, e.g. current UCI Faculty or PhD students. Please refer to the UCI Libraries for access instructions.

Examples for why content may be limited include: a vendor set restrictions on who may access their information; alternatively, information may be sensitive, identifying, or embargoed;

 


How do I access? 
-  Students
-  Employees
-  Alumni
-  Visitors


New browser window icon.Am I on the UCI network?
Test my UCI connection.


New browser window icon.Am I responsibly using
what I access?

Typically acceptable vs. unacceptable use.

Does UCI offer software for data analysis (e.g. SPSS or ArcGIS)? 
Yes!  

Can the UCI Libraries help me learn how to prepare, analyze, and understand data? 
Yes!