"Related items of (chiefly numerical) information considered collectively, typically obtained by scientific work and used for
reference, analysis, or calculation." (OED Online)
This term is fuzzy because almost any observable or recorded phenomenon could theoretically be raw data, but people
often use it to refer to a dataset in a file format (e.g., an .xlsx or .csv file) that is presented as a spreadsheet.
Caveat: raw data is not always cleaned data.
To clean data, people or computer programs assess the data and correct any errors, omissions, duplicates, inaccuracies, etc.
so that the data is ready for analysis. Determining whether data is clean enough for your needs can be tricky. Read the
supporting documentation that accompanies a dataset, and contact the data producer if you have additional questions.
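As a rough illustration of what a program-based check can look like, the Python sketch below reads a small CSV dataset, drops exact duplicate rows, and flags rows with empty values. The sample data, the column names, and the check_rows function are all made up for this example; real datasets usually need checks tailored to their documentation, and this sketch assumes every row has the same number of fields as the header.

```python
import csv
import io

# Made-up sample data standing in for a downloaded .csv file.
SAMPLE_CSV = """site,year,count
A,2020,10
A,2020,10
B,2020,
C,2020,7
"""


def check_rows(csv_text):
    """Return (unique_rows, duplicate_count, incomplete_rows) for CSV text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    seen = set()
    unique_rows = []
    duplicate_count = 0
    incomplete_rows = []
    for row in reader:
        key = tuple(row.items())
        if key in seen:
            # Exact repeat of an earlier row: count it and skip it.
            duplicate_count += 1
            continue
        seen.add(key)
        unique_rows.append(row)
        # An empty or missing field marks the row as incomplete.
        if any(value is None or value.strip() == "" for value in row.values()):
            incomplete_rows.append(row)
    return unique_rows, duplicate_count, incomplete_rows


unique_rows, duplicate_count, incomplete_rows = check_rows(SAMPLE_CSV)
print(f"{len(unique_rows)} unique rows, {duplicate_count} duplicate(s) removed")
for row in incomplete_rows:
    print("Missing value in row:", row)
```

A check like this only finds duplicates and gaps. Judging whether a value is inaccurate still depends on the dataset's documentation or on the data producer.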
Many of UCI Libraries' data and mapping databases are designed for researchers in the social sciences, but they may also be highly relevant to digital humanists.
UCI's Apporto Virtual Computer Lab (VCL) allows UCI undergraduates to access ArcGIS via the Internet with a valid UCINetID and Microsoft Remote Desktop. Read more about this virtual service, including detailed instructions, on OIT's website.
ArcGIS is also available at the Multimedia Resource Center (MRC) in the Science Library.
Contact the MRC for current hours and access information.
Need more information about GIS? Check out the GIS Research Guide.
Off-campus? Please use the Software VPN and choose the group UCIFull to access licensed content. For more information, please click here.
Software VPN is not available for guests, so they may not have access to some content when connecting from off-campus.