351 to 400 of 1,819 Results
Jun 16, 2023
NIST Multimodal Information Group, 2023, "NIST 2002 Open Machine Translation (OpenMT) Evaluation", https://hdl.handle.net/11272.1/AB2/AO1F7Z, Abacus Data Network, V1
Abstract Introduction NIST 2002 Open Machine Translation (OpenMT) Evaluation is a package containing source data, reference translations, and scoring software used in the NIST 2002 OpenMT evaluation. It is designed to help evaluate the effectiveness of machine translation systems... |
Jun 16, 2023 -
NIST 2002 Open Machine Translation (OpenMT) Evaluation
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jun 16, 2023 -
NIST 2002 Open Machine Translation (OpenMT) Evaluation
Optical Disc Image - 4.5 MB -
MD5: 72ef750de760f633cf88461c2b5e1d45
ISO disc image containing all documentation and data |
Jun 16, 2023 -
NIST 2002 Open Machine Translation (OpenMT) Evaluation
Plain Text - 1.4 KB -
MD5: 6ec3b67024258149c67eedce85069301
File manifest |
Jun 16, 2023
Ma, Xiaoyi, 2023, "Chinese News Translation Text Part 1", https://hdl.handle.net/11272.1/AB2/1AHIZ3, Abacus Data Network, V1
Abstract Introduction Chinese News Translation Text Part 1 was developed by the Linguistic Data Consortium (LDC) and contains approximately 474,000 characters of Chinese text and corresponding English translations, totalling approximately 285,000 words. All the stories in this co... |
Jun 16, 2023 -
Chinese News Translation Text Part 1
Optical Disc Image - 13.0 MB -
MD5: a43b361d8bb9ca4bf01e48b085337dbc
ISO disc image containing all documentation and data |
Jun 16, 2023 -
Chinese News Translation Text Part 1
Plain Text - 103.6 KB -
MD5: 7f9fcbcad1f0afe3e45cc61c2e20b780
File manifest |
Jun 16, 2023 -
Chinese News Translation Text Part 1
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jun 16, 2023
Ma, Xiaoyi, 2023, "Multiple-Translation Chinese (MTC) Part 3", https://hdl.handle.net/11272.1/AB2/NYIMDR, Abacus Data Network, V1
Abstract Introduction Multiple-Translation Chinese (MTC) Part 3 was produced by Linguistic Data Consortium (LDC) catalog number LDC2004T07 and ISBN 1-58563-289-9. To support the development of automatic means for evaluating translation quality, the LDC was sponsored to solicit fo... |
Jun 16, 2023 -
Multiple-Translation Chinese (MTC) Part 3
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jun 16, 2023 -
Multiple-Translation Chinese (MTC) Part 3
Optical Disc Image - 3.9 MB -
MD5: ffc745681b5d88c7dd71ce683533cd03
ISO disc image containing all documentation and data |
Jun 16, 2023 -
Multiple-Translation Chinese (MTC) Part 3
Plain Text - 26.7 KB -
MD5: 4f33dbe7df8355ce20e580d48ed09b02
LDC2004T07_File_Manifest |
Jun 16, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Zulu Representative Language Pack", https://hdl.handle.net/11272.1/AB2/TYSP2P, Abacus Data Network, V1
Abstract Introduction LORELEI Zulu Representative Language Pack consists of Zulu monolingual text, Zulu-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI program. The LOREL... |
Jun 16, 2023 -
LORELEI Zulu Representative Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jun 16, 2023 -
LORELEI Zulu Representative Language Pack
Optical Disc Image - 1.6 GB -
MD5: 2ea18d48beccfed5dde4745b43dd6258
ISO disc image containing all documentation and data |
Jun 16, 2023 -
LORELEI Zulu Representative Language Pack
Plain Text - 3.7 MB -
MD5: 36e3d6994c4b78d958c5a45d9b28436e
File manifest |
Apr 26, 2023
Huang, Shudong; Walker, Kevin; Graff, David, 2023, "Mixer 3 Speech", https://hdl.handle.net/11272.1/AB2/A9UZNY, Abacus Data Network, V1
Abstract Introduction Mixer 3 Speech was developed by the Linguistic Data Consortium (LDC) and comprises 3,200 hours of audio recordings of conversational telephone speech involving 3,875 speakers and 26 distinct languages. This material was collected by LDC from 2005-2007 as par... |
Apr 26, 2023 -
Mixer 3 Speech
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Apr 26, 2023 -
Mixer 3 Speech
Plain Text - 909.2 KB -
MD5: eed5d0e5ecf52821df3d67d33af2944a
File manifest |
Apr 26, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Tamil Representative Language Pack", https://hdl.handle.net/11272.1/AB2/TXXE33, Abacus Data Network, V1
Abstract Introduction LORELEI Tamil Representative Language Pack (LDC2023T03) consists of Tamil monolingual text, Tamil-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI pr... |
Apr 26, 2023 -
LORELEI Tamil Representative Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Apr 26, 2023 -
LORELEI Tamil Representative Language Pack
Optical Disc Image - 1.6 GB -
MD5: fe006f3885df2cf21201dcc918e4c55e
ISO disc image containing all documentation and data |
Apr 26, 2023 -
LORELEI Tamil Representative Language Pack
Plain Text - 1.3 MB -
MD5: fb57c7828a3f6bcbf0138128eca8aa5d
File manifest |
Apr 26, 2023
Choi, Jinho D.; Han, Na-Rae; Hwang, Jena D.; Kim, Hansaem, 2023, "Penn Korean Universal Dependency Treebank", https://hdl.handle.net/11272.1/AB2/ZW25WL, Abacus Data Network, V1
Abstract Introduction Penn Korean Universal Dependency Treebank contains 5,010 sentences and 132,041 tokens annotated in dependency format under the Universal Dependencies framework. It is a conversion of Korean Treebank Annotations Version 2.0 (LDC2006T09) which was produced in... |
Apr 26, 2023 -
Penn Korean Universal Dependency Treebank
Optical Disc Image - 9.0 MB -
MD5: b6ae55528175de94879c18a74d200b86
ISO disc image containing all documentation and data |
Apr 26, 2023 -
Penn Korean Universal Dependency Treebank
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Apr 26, 2023 -
Penn Korean Universal Dependency Treebank
Plain Text - 4.6 KB -
MD5: 8491544ad53335bc1c5b4cbfc1233929
File manifest |
Apr 26, 2023
Chen, Song; Bies, Ann; Griffitt, Kira; Ellis, Joe; Strassel, Stephanie, 2023, "DEFT English Light and Rich ERE Annotation", https://hdl.handle.net/11272.1/AB2/7KH7V4, Abacus Data Network, V1
Abstract Introduction DEFT English Light and Rich ERE Annotation was developed by the Linguistic Data Consortium (LDC) and consists of 1190 English discussion forum, newswire and proxy documents annotated for entities, relations and events (ERE). DARPA's Deep Exploration and Filt... |
Apr 26, 2023 -
DEFT English Light and Rich ERE Annotation
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Apr 26, 2023 -
DEFT English Light and Rich ERE Annotation
Optical Disc Image - 53.4 MB -
MD5: b83a9dd0657ec76aa76eabae4bed76b3
ISO disc image containing all documentation and data |
Apr 26, 2023 -
DEFT English Light and Rich ERE Annotation
Plain Text - 156.6 KB -
MD5: b8a60461e8c768e0d0a07933f803f1b9
File manifest |
Mar 20, 2023
Delgado, Dana; Walker, Kevin; Graff, David; Strassel, Stephanie, 2023, "AIDA Ukrainian Broadcast and Telephone Speech Audio and Transcripts", https://hdl.handle.net/11272.1/AB2/CKALC2, Abacus Data Network, V1
Abstract Introduction AIDA Ukrainian Broadcast and Telephone Speech Audio and Transcripts was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 156 hours of Ukrainian conversational telephone speech (CTS) and broadcast news audio (BN) with 1.2 mi... |
Optical Disc Image - 3.8 GB -
MD5: e8c5e391a5f0ca9b8ea63a12426d5de8
ISO disc image containing all documentation and data: disc 1 |
Optical Disc Image - 2.7 GB -
MD5: cd43c203c73c2c3634c34b0b26ebf508
ISO disc image containing all documentation and data: disc 3 |
Optical Disc Image - 3.2 GB -
MD5: 558e6a25c75c7f9f618994eed0ea7385
ISO disc image containing all documentation and data: disc 2 |
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Plain Text - 20.2 KB -
MD5: 95249351378992d0b079cb29cc6ff206
File manifest for disc 1 |
Plain Text - 27.0 KB -
MD5: 97bd28faa2f2034825c56b4aae764ea1
File manifest for disc 2 |
Plain Text - 24.9 KB -
MD5: 43ac7d8b28a27696fc4796404b04cb8a
File manifest for disc 3 |
Mar 17, 2023
Sadjadi, Omid; Greenberg, Craig; Li, Xuansong; Strassel, Stephanie, 2023, "2019 NIST Speaker Recognition Evaluation Test Set -- Audio-Visual", https://hdl.handle.net/11272.1/AB2/RWQNK7, Abacus Data Network, V1
Abstract Introduction 2019 NIST Speaker Recognition Evaluation Test Set -- Audio-Visual was developed by the Linguistic Data Consortium (LDC) and NIST (National Institute of Standards and Technology). It contains approximately 64 hours of English audio-visual data for development... |
Optical Disc Image - 3.0 GB -
MD5: 97eae9989b6f04dcb6df0659062c6685
ISO disc image containing all documentation and data: disc 1 |
Optical Disc Image - 3.4 GB -
MD5: b3ddbf03957150ed15bfdce977d28bc3
ISO disc image containing all documentation and data: disc 2 |
Plain Text - 7.9 KB -
MD5: f410f7b5c10ad549c1e6fa5e0a24a393
File manifest for disc 1 |
Plain Text - 6.6 KB -
MD5: 799a9456304ca34621f2194694c87f76
File manifest for disc 2 |
Optical Disc Image - 3.2 GB -
MD5: 79d97b5b3b0b19fe94be9f0ef7a25af6
ISO disc image containing all documentation and data: disc 3 |
Optical Disc Image - 3.0 GB -
MD5: 2dcf86e20e44fac5e9aa2eeed6223e96
ISO disc image containing all documentation and data: disc 4 |
Optical Disc Image - 3.1 GB -
MD5: 7029dffd5783b6216eb604026d6a003e
ISO disc image containing all documentation and data: disc 5 |
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Plain Text - 8.2 KB -
MD5: 38ea33515042e75cd136c272f6dbf1e5
File manifest for disc 3 |
Plain Text - 7.9 KB -
MD5: aa15ed0acfb83e0f7b9e2db2caa57dfd
File manifest for disc 4 |