401 to 450 of 1,855 Results
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 1.6 GB - MD5: 2ea18d48beccfed5dde4745b43dd6258
Data
ISO disc image containing all documentation and data
Apr 26, 2023
Huang, Shudong; Walker, Kevin; Graff, David, 2023, "Mixer 3 Speech", https://hdl.handle.net/11272.1/AB2/A9UZNY, Abacus Data Network, V1
Abstract Introduction Mixer 3 Speech was developed by the Linguistic Data Consortium (LDC) and comprises 3,200 hours of audio recordings of conversational telephone speech involving 3,875 speakers and 26 distinct languages. This material was collected by LDC from 2005-2007 as par...
Apr 26, 2023 - Mixer 3 Speech
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Apr 26, 2023 - Mixer 3 Speech
Plain Text - 909.2 KB - MD5: eed5d0e5ecf52821df3d67d33af2944a
Data
File manifest
Apr 26, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Tamil Representative Language Pack", https://hdl.handle.net/11272.1/AB2/TXXE33, Abacus Data Network, V1
Abstract Introduction LORELEI Tamil Representative Language Pack (LDC2023T03) consists of Tamil monolingual text, Tamil-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI pr...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 1.6 GB - MD5: fe006f3885df2cf21201dcc918e4c55e
Data
ISO disc image containing all documentation and data
Plain Text - 1.3 MB - MD5: fb57c7828a3f6bcbf0138128eca8aa5d
Documentation
File manifest
Apr 26, 2023
Choi, Jinho D.; Han, Na-Rae; Hwang, Jena D.; Kim, Hansaem, 2023, "Penn Korean Universal Dependency Treebank", https://hdl.handle.net/11272.1/AB2/ZW25WL, Abacus Data Network, V1
Abstract Introduction Penn Korean Universal Dependency Treebank contains 5,010 sentences and 132,041 tokens annotated in dependency format under the Universal Dependencies framework. It is a conversion of Korean Treebank Annotations Version 2.0 (LDC2006T09) which was produced in...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 9.0 MB - MD5: b6ae55528175de94879c18a74d200b86
Data
ISO disc image containing all documentation and data
Plain Text - 4.6 KB - MD5: 8491544ad53335bc1c5b4cbfc1233929
Documentation
File manifest
Apr 26, 2023
Chen, Song; Bies, Ann; Griffitt, Kira; Ellis, Joe; Strassel, Stephanie, 2023, "DEFT English Light and Rich ERE Annotation", https://hdl.handle.net/11272.1/AB2/7KH7V4, Abacus Data Network, V1
Abstract Introduction DEFT English Light and Rich ERE Annotation was developed by the Linguistic Data Consortium (LDC) and consists of 1190 English discussion forum, newswire and proxy documents annotated for entities, relations and events (ERE). DARPA's Deep Exploration and Filt...
Optical Disc Image - 53.4 MB - MD5: b83a9dd0657ec76aa76eabae4bed76b3
Data
ISO disc image containing all documentation and data
Plain Text - 156.6 KB - MD5: b8a60461e8c768e0d0a07933f803f1b9
Documentation
File manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Mar 20, 2023
Delgado, Dana; Walker, Kevin; Graff, David; Strassel, Stephanie, 2023, "AIDA Ukrainian Broadcast and Telephone Speech Audio and Transcripts", https://hdl.handle.net/11272.1/AB2/CKALC2, Abacus Data Network, V1
Abstract Introduction AIDA Ukrainian Broadcast and Telephone Speech Audio and Transcripts was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 156 hours of Ukrainian conversational telephone speech (CTS) and broadcast news audio (BN) with 1.2 mi...
Optical Disc Image - 3.8 GB - MD5: e8c5e391a5f0ca9b8ea63a12426d5de8
Data
ISO disc image containing all documentation and data: disc 1
Optical Disc Image - 2.7 GB - MD5: cd43c203c73c2c3634c34b0b26ebf508
Data
ISO disc image containing all documentation and data: disc 3
Optical Disc Image - 3.2 GB - MD5: 558e6a25c75c7f9f618994eed0ea7385
Data
ISO disc image containing all documentation and data: disc 2
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Plain Text - 20.2 KB - MD5: 95249351378992d0b079cb29cc6ff206
Documentation
File manifest for disc 1
Plain Text - 27.0 KB - MD5: 97bd28faa2f2034825c56b4aae764ea1
Documentation
File manifest for disc 2
Plain Text - 24.9 KB - MD5: 43ac7d8b28a27696fc4796404b04cb8a
Documentation
File manifest for disc 3
Mar 17, 2023
Sadjadi, Omid; Greenberg, Craig; Li, Xuansong; Strassel, Stephanie, 2023, "2019 NIST Speaker Recognition Evaluation Test Set -- Audio-Visual", https://hdl.handle.net/11272.1/AB2/RWQNK7, Abacus Data Network, V1
Abstract Introduction 2019 NIST Speaker Recognition Evaluation Test Set -- Audio-Visual was developed by the Linguistic Data Consortium (LDC) and NIST (National Institute of Standards and Technology). It contains approximately 64 hours of English audio-visual data for development...
Optical Disc Image - 3.0 GB - MD5: 97eae9989b6f04dcb6df0659062c6685
Data
ISO disc image containing all documentation and data: disc 1
Plain Text - 7.9 KB - MD5: f410f7b5c10ad549c1e6fa5e0a24a393
Documentation
File manifest for disc 1
Optical Disc Image - 3.4 GB - MD5: b3ddbf03957150ed15bfdce977d28bc3
Data
ISO disc image containing all documentation and data: disc 2
Optical Disc Image - 3.2 GB - MD5: 79d97b5b3b0b19fe94be9f0ef7a25af6
Data
ISO disc image containing all documentation and data: disc 3
Optical Disc Image - 3.0 GB - MD5: 2dcf86e20e44fac5e9aa2eeed6223e96
Data
ISO disc image containing all documentation and data: disc 4
Optical Disc Image - 3.1 GB - MD5: 7029dffd5783b6216eb604026d6a003e
Data
ISO disc image containing all documentation and data: disc 5
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Plain Text - 6.6 KB - MD5: 799a9456304ca34621f2194694c87f76
Documentation
File manifest for disc 2
Plain Text - 8.2 KB - MD5: 38ea33515042e75cd136c272f6dbf1e5
Documentation
File manifest for disc 3
Plain Text - 7.9 KB - MD5: aa15ed0acfb83e0f7b9e2db2caa57dfd
Documentation
File manifest for disc 4
Plain Text - 7.9 KB - MD5: 8d5bbb9fb2a182ba4fa4f49084c8d739
Documentation
File manifest for disc 5
Mar 17, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Tagalog Representative Language Pack", https://hdl.handle.net/11272.1/AB2/IALRRN, Abacus Data Network, V1
Abstract Introduction LORELEI Tagalog Representative Language Pack consists of Tagalog monolingual text, Tagalog-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LO...
Optical Disc Image - 423.2 MB - MD5: 28506483cf236683fa465901edd2a4ab
Data
ISO disc image containing all documentation and data
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Plain Text - 1.0 MB - MD5: afd5f51c5372625ac712e02be0c17b4f
Documentation
File manifest
Mar 17, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Swahili Representative Language Pack", https://hdl.handle.net/11272.1/AB2/RPNXXU, Abacus Data Network, V1
Abstract Introduction LORELEI Swahili Representative Language Pack consists of Swahili monolingual text, Swahili-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LO...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 467.1 MB - MD5: 07a901ab7074796e16e742725e01fd89
Data
ISO disc image containing all documentation and data
Plain Text - 1.1 MB - MD5: 7393a9caae5552abc6b1254e35b5f598
Documentation
File manifest
Feb 14, 2023
Chay, Kevin; Elizalde, Cecilia; Ziemski, Michal, 2023, "United Nations Proceedings Speech", https://hdl.handle.net/11272.1/AB2/3LTQ01, Abacus Data Network, V1
Abstract Introduction United Nations Proceedings Speech was developed by the United Nations (UN) and contains approximately 8,500 hours of recorded proceedings in the six official UN languages, Arabic, Chinese, English, French, Russian and Spanish. The data was recorded in 2009-2...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Plain Text - 2.8 MB - MD5: 28f8d1318d7a797659c0c005f754dbe4
Documentation
File manifest
Jan 26, 2023
Arrigo, Michael; Strassel, Stephanie; Caruso, Christopher, 2023, "CAMIO Transcription Languages", https://hdl.handle.net/11272.1/AB2/IEJLCN, Abacus Data Network, V1
Abstract Introduction CAMIO Transcription Languages was developed by the Linguistic Data Consortium and contains nearly 70,000 images of machine printed text with corresponding annotations and transcripts in the following 13 languages: Arabic, Chinese, English, Farsi, Hindi, Japa...
Optical Disc Image - 3.1 GB - MD5: eecf370251324a271b774ab8a7312675
Data
ISO disc image containing all documentation and data: disc 2