Linguistic Data Consortium

A UCI guide for accessing corpora from the Linguistic Data Consortium.

Connect from Off Campus

Researching from home? Remote access to the UCI Libraries' licensed online resources is available to current UC Irvine students, faculty & staff. Visit our Connect from Off-Campus page for more information!

Librarian for Interdisciplinary Studies

Melissa Beuoy

Linguistic Data Consortium

The Linguistic Data Consortium (LDC) is an open consortium of universities, libraries, corporations, and government research laboratories. LDC creates and distributes a wide array of language resources. It also supports sponsored research programs and language-based technology evaluations by providing resources and contributing organizational expertise. LDC is hosted by the University of Pennsylvania and is a center within the University’s School of Arts and Sciences.

Accessing Corpora in Linguistic Data Consortium

UCI Libraries is a member of LDC. The LDC catalog can be searched without an account. However, to gain access to corpora in LDC, you must have an active UCI Net ID and you must be approved for "user" status.

If you are already an approved user, please log in with your existing UCI account. If you are a new user, please create a new account; your UCI status will be verified and your account approved within 48 hours. Additional guidance will be sent in the account confirmation email. For more information, please contact Melissa Beuoy at

Please note: Access will be approved for corpora that carry no additional fees and fall under UCI Libraries' general licensing agreement. Corpora with special licensing terms will be reviewed on a case-by-case basis. Any corpus fees not covered by our membership must be paid by the user.