Jun 16, 2023 - LORELEI Zulu Representative Language Pack
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images
Jun 16, 2023 - LORELEI Zulu Representative Language Pack
Optical Disc Image - 1.6 GB - MD5: 2ea18d48beccfed5dde4745b43dd6258
ISO disc image containing all documentation and data
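Each file in this listing is shown with an MD5 value, which can be used to confirm that a download arrived intact. The following is a minimal Python sketch of that check, not part of the repository's or LDC's own instructions; the function names `md5_of_file` and `matches_listed_md5` are hypothetical, chosen only for illustration.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use low for multi-gigabyte ISO images.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed_md5):
    """Compare a file's MD5 with the value copied from the listing.

    The listed value is trimmed and lower-cased so that stray whitespace
    or upper-case hex digits do not cause a false mismatch.
    """
    return md5_of_file(path) == listed_md5.strip().lower()
```

For example, `matches_listed_md5("zulu_pack.iso", "2ea18d48beccfed5dde4745b43dd6258")` would check a downloaded copy of the Zulu disc image, assuming it was saved under the hypothetical local name `zulu_pack.iso`.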
Apr 26, 2023
Huang, Shudong; Walker, Kevin; Graff, David, 2023, "Mixer 3 Speech", https://hdl.handle.net/11272.1/AB2/A9UZNY, Abacus Data Network, V1
Abstract: Mixer 3 Speech was developed by the Linguistic Data Consortium (LDC) and comprises 3,200 hours of audio recordings of conversational telephone speech involving 3,875 speakers and 26 distinct languages. This material was collected by LDC from 2005-2007 as par...
Apr 26, 2023 - Mixer 3 Speech
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Apr 26, 2023 - Mixer 3 Speech
Plain Text - 909.2 KB - MD5: eed5d0e5ecf52821df3d67d33af2944a
File manifest
Apr 26, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Tamil Representative Language Pack", https://hdl.handle.net/11272.1/AB2/TXXE33, Abacus Data Network, V1
Abstract: LORELEI Tamil Representative Language Pack (LDC2023T03) consists of Tamil monolingual text, Tamil-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI pr...
Apr 26, 2023 - LORELEI Tamil Representative Language Pack
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images
Apr 26, 2023 - LORELEI Tamil Representative Language Pack
Optical Disc Image - 1.6 GB - MD5: fe006f3885df2cf21201dcc918e4c55e
ISO disc image containing all documentation and data
Apr 26, 2023 - LORELEI Tamil Representative Language Pack
Plain Text - 1.3 MB - MD5: fb57c7828a3f6bcbf0138128eca8aa5d
File manifest
Apr 26, 2023
Choi, Jinho D.; Han, Na-Rae; Hwang, Jena D.; Kim, Hansaem, 2023, "Penn Korean Universal Dependency Treebank", https://hdl.handle.net/11272.1/AB2/ZW25WL, Abacus Data Network, V1
Abstract: Penn Korean Universal Dependency Treebank contains 5,010 sentences and 132,041 tokens annotated in dependency format under the Universal Dependencies framework. It is a conversion of Korean Treebank Annotations Version 2.0 (LDC2006T09) which was produced in...
Apr 26, 2023 - Penn Korean Universal Dependency Treebank
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images
Apr 26, 2023 - Penn Korean Universal Dependency Treebank
Optical Disc Image - 9.0 MB - MD5: b6ae55528175de94879c18a74d200b86
ISO disc image containing all documentation and data
Apr 26, 2023 - Penn Korean Universal Dependency Treebank
Plain Text - 4.6 KB - MD5: 8491544ad53335bc1c5b4cbfc1233929
File manifest
Apr 26, 2023
Chen, Song; Bies, Ann; Griffitt, Kira; Ellis, Joe; Strassel, Stephanie, 2023, "DEFT English Light and Rich ERE Annotation", https://hdl.handle.net/11272.1/AB2/7KH7V4, Abacus Data Network, V1
Abstract: DEFT English Light and Rich ERE Annotation was developed by the Linguistic Data Consortium (LDC) and consists of 1190 English discussion forum, newswire and proxy documents annotated for entities, relations and events (ERE). DARPA's Deep Exploration and Filt...
Apr 26, 2023 - DEFT English Light and Rich ERE Annotation
Optical Disc Image - 53.4 MB - MD5: b83a9dd0657ec76aa76eabae4bed76b3
ISO disc image containing all documentation and data
Apr 26, 2023 - DEFT English Light and Rich ERE Annotation
Plain Text - 156.6 KB - MD5: b8a60461e8c768e0d0a07933f803f1b9
File manifest
Apr 26, 2023 - DEFT English Light and Rich ERE Annotation
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images
Mar 20, 2023
Delgado, Dana; Walker, Kevin; Graff, David; Strassel, Stephanie, 2023, "AIDA Ukrainian Broadcast and Telephone Speech Audio and Transcripts", https://hdl.handle.net/11272.1/AB2/CKALC2, Abacus Data Network, V1
Abstract: AIDA Ukrainian Broadcast and Telephone Speech Audio and Transcripts was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 156 hours of Ukrainian conversational telephone speech (CTS) and broadcast news audio (BN) with 1.2 mi...
Optical Disc Image - 3.8 GB - MD5: e8c5e391a5f0ca9b8ea63a12426d5de8
ISO disc image containing all documentation and data: disc 1
Optical Disc Image - 2.7 GB - MD5: cd43c203c73c2c3634c34b0b26ebf508
ISO disc image containing all documentation and data: disc 3
Optical Disc Image - 3.2 GB - MD5: 558e6a25c75c7f9f618994eed0ea7385
ISO disc image containing all documentation and data: disc 2
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images
Plain Text - 20.2 KB - MD5: 95249351378992d0b079cb29cc6ff206
File manifest for disc 1
Plain Text - 27.0 KB - MD5: 97bd28faa2f2034825c56b4aae764ea1
File manifest for disc 2
Plain Text - 24.9 KB - MD5: 43ac7d8b28a27696fc4796404b04cb8a
File manifest for disc 3
Mar 17, 2023
Sadjadi, Omid; Greenberg, Craig; Li, Xuansong; Strassel, Stephanie, 2023, "2019 NIST Speaker Recognition Evaluation Test Set -- Audio-Visual", https://hdl.handle.net/11272.1/AB2/RWQNK7, Abacus Data Network, V1
Abstract: 2019 NIST Speaker Recognition Evaluation Test Set -- Audio-Visual was developed by the Linguistic Data Consortium (LDC) and NIST (National Institute of Standards and Technology). It contains approximately 64 hours of English audio-visual data for development...
Optical Disc Image - 3.0 GB - MD5: 97eae9989b6f04dcb6df0659062c6685
ISO disc image containing all documentation and data: disc 1
Plain Text - 7.9 KB - MD5: f410f7b5c10ad549c1e6fa5e0a24a393
File manifest for disc 1
Optical Disc Image - 3.4 GB - MD5: b3ddbf03957150ed15bfdce977d28bc3
ISO disc image containing all documentation and data: disc 2
Optical Disc Image - 3.2 GB - MD5: 79d97b5b3b0b19fe94be9f0ef7a25af6
ISO disc image containing all documentation and data: disc 3
Optical Disc Image - 3.0 GB - MD5: 2dcf86e20e44fac5e9aa2eeed6223e96
ISO disc image containing all documentation and data: disc 4
Optical Disc Image - 3.1 GB - MD5: 7029dffd5783b6216eb604026d6a003e
ISO disc image containing all documentation and data: disc 5
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images
Plain Text - 6.6 KB - MD5: 799a9456304ca34621f2194694c87f76
File manifest for disc 2
Plain Text - 8.2 KB - MD5: 38ea33515042e75cd136c272f6dbf1e5
File manifest for disc 3
Plain Text - 7.9 KB - MD5: aa15ed0acfb83e0f7b9e2db2caa57dfd
File manifest for disc 4
Plain Text - 7.9 KB - MD5: 8d5bbb9fb2a182ba4fa4f49084c8d739
File manifest for disc 5
Mar 17, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Tagalog Representative Language Pack", https://hdl.handle.net/11272.1/AB2/IALRRN, Abacus Data Network, V1
Abstract: LORELEI Tagalog Representative Language Pack consists of Tagalog monolingual text, Tagalog-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LO...
Mar 17, 2023 - LORELEI Tagalog Representative Language Pack
Optical Disc Image - 423.2 MB - MD5: 28506483cf236683fa465901edd2a4ab
ISO disc image containing all documentation and data
Mar 17, 2023 - LORELEI Tagalog Representative Language Pack
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images
Mar 17, 2023 - LORELEI Tagalog Representative Language Pack
Plain Text - 1.0 MB - MD5: afd5f51c5372625ac712e02be0c17b4f
File manifest
Mar 17, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Swahili Representative Language Pack", https://hdl.handle.net/11272.1/AB2/RPNXXU, Abacus Data Network, V1
Abstract: LORELEI Swahili Representative Language Pack consists of Swahili monolingual text, Swahili-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LO...
Mar 17, 2023 - LORELEI Swahili Representative Language Pack
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images
Mar 17, 2023 - LORELEI Swahili Representative Language Pack
Optical Disc Image - 467.1 MB - MD5: 07a901ab7074796e16e742725e01fd89
ISO disc image containing all documentation and data
Mar 17, 2023 - LORELEI Swahili Representative Language Pack
Plain Text - 1.1 MB - MD5: 7393a9caae5552abc6b1254e35b5f598
File manifest
Feb 14, 2023
Chay, Kevin; Elizalde, Cecilia; Ziemski, Michal, 2023, "United Nations Proceedings Speech", https://hdl.handle.net/11272.1/AB2/3LTQ01, Abacus Data Network, V1
Abstract: United Nations Proceedings Speech was developed by the United Nations (UN) and contains approximately 8,500 hours of recorded proceedings in the six official UN languages, Arabic, Chinese, English, French, Russian and Spanish. The data was recorded in 2009-2...
Feb 14, 2023 - United Nations Proceedings Speech
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Feb 14, 2023 - United Nations Proceedings Speech
Plain Text - 2.8 MB - MD5: 28f8d1318d7a797659c0c005f754dbe4
File manifest
Jan 26, 2023
Arrigo, Michael; Strassel, Stephanie; Caruso, Christopher, 2023, "CAMIO Transcription Languages", https://hdl.handle.net/11272.1/AB2/IEJLCN, Abacus Data Network, V1
Abstract: CAMIO Transcription Languages was developed by the Linguistic Data Consortium and contains nearly 70,000 images of machine printed text with corresponding annotations and transcripts in the following 13 languages: Arabic, Chinese, English, Farsi, Hindi, Japa...
Jan 26, 2023 - CAMIO Transcription Languages
Optical Disc Image - 3.1 GB - MD5: eecf370251324a271b774ab8a7312675
ISO disc image containing all documentation and data: disc 2