Skip to main content
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

351 to 400 of 1,855 Results
Optical Disc Image - 109.5 MB - MD5: 7f83062e784dd37aa2ab1bf622c4df47
Data
ISO disc image containing all documentation and data
Plain Text - 245.8 KB - MD5: f91b7ee45012ac7b51674fd2b273db67
Documentation
File manifest
Aug 17, 2023
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2023, "Second DIHARD Challenge Development - SEEDLingS", https://hdl.handle.net/11272.1/AB2/PKMDCL, Abacus Data Network, V1
Abstract Introduction Second DIHARD Challenge Development - SEEDLinGS was developed by Duke University and LDC and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the Second DIHARD Challenge. This relea...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Data
Working with ISO disc images
Optical Disc Image - 119.4 MB - MD5: 48c9e4ccfc1e23620ab8d3846491ea69
Data
ISO disc image containing all documentation and data
Plain Text - 6.0 KB - MD5: 1d0b300e763142e711955c12a60e80c2
Documentation
File manifest
Aug 17, 2023
Hirschberg, Julia; Gravano, Agustin; Benus, Stefan; Ward, Gregory; German, Elisa Sneed, 2023, "Columbia Games Corpus", https://hdl.handle.net/11272.1/AB2/TGPSBO, Abacus Data Network, V1
Abstract Introduction Columbia Games Corpus was developed by the Spoken Language Group, Columbia University and the Department of Linguistics, Northwestern University. It consists of approximately 10 hours of spontaneous English conversation along with corresponding orthographic...
Aug 17, 2023 - Columbia Games Corpus
Optical Disc Image - 930.6 MB - MD5: 183b236ed8d84c811b46314234125f1c
Data
ISO disc image containing all documentation and data
Aug 17, 2023 - Columbia Games Corpus
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Aug 17, 2023 - Columbia Games Corpus
Plain Text - 23.8 KB - MD5: fc73ab6ba7fcea294a1bac5d922b42f2
Documentation
File manifest
Jul 24, 2023
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2023, "Second DIHARD Challenge Evaluation - SEEDLingS", https://hdl.handle.net/11272.1/AB2/CXOTQ3, Abacus Data Network, V1
Abstract Introduction Second DIHARD Challenge Evaluation - SEEDLingS was developed by Duke University and the Linguistic Data Consortium (LDC) and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the Sec...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 119.8 MB - MD5: 928f8fe3f7aaa951c34719f9e86ea9be
Data
ISO disc image containing all documentation and data
Plain Text - 3.4 KB - MD5: 5b26747d79ee406018ec18f56f301fdc
Documentation
File manifest
Jul 24, 2023
Amith, Jonathan D.; Alcántara, Amelia Domínguez; Osollo, Hermelindo Salazar; Castañeda, Ceferino Salgado; Salgado, Eleuterio Gorostiza, 2023, "Ethnobotanical Research and Language Documentation of Nahuatl", https://hdl.handle.net/11272.1/AB2/EEHKAK, Abacus Data Network, V1
Abstract Introduction Ethnobotanical Research and Language Documentation of Nahuatl consists of approximately 190 hours of field recordings collected in the Sierra Nororiental and Sierra Norte regions of Puebla, Mexico. The corpus contains audio and video recordings of native Nah...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Plain Text - 1.0 MB - MD5: 1b0fab82f54632e467849b8b7c6a33d8
Documentation
File manifest
Jun 21, 2023
Greenberg, Craig; Sadjadi, Omid; Singer, Elliot; Walker, Kevin; Jones, Karen; Caruso, Christopher; Wright, Jonathan; Strassel, Stephanie, 2023, "2019 NIST Speaker Recognition Evaluation Test Set -- CTS Challenge", https://hdl.handle.net/11272.1/AB2/JEG5RH, Abacus Data Network, V1
Abstract Introduction 2019 NIST Speaker Recognition Evaluation Test Set -- CTS Challenge was developed by the Linguistic Data Consortium (LDC) and NIST (National Institute of Standards and Technology). It contains approximately 635 hours of Tunisian Arabic telephone recordings fo...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Plain Text - 1.8 MB - MD5: d7cc32ed1d9b3e539c5d034ca9bbbce0
Documentation
File manifest
Jun 20, 2023
Ma, Xiaoyi, 2023, "Hong Kong Parallel Text", https://hdl.handle.net/11272.1/AB2/MX5PAM, Abacus Data Network, V1
Abstract Introduction Hong Kong Parallel Text was developed by the Linguistic Data Consortium (LDC) and contains data from three sub-corpora, namely Hong Kong Hansards Parallel Text, Hong Kong Laws Parallel Text and Hong Kong News Parallel Text. Hong Kong Hansards Parallel Text c...
Jun 20, 2023 - Hong Kong Parallel Text
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Jun 20, 2023 - Hong Kong Parallel Text
Optical Disc Image - 892.0 MB - MD5: 92564836f8154cc50213236027e132db
Data
ISO disc image containing all documentation and data
Jun 20, 2023 - Hong Kong Parallel Text
Plain Text - 426.5 KB - MD5: 0a73b5ffb0e049f9fb8066fdbb0fb253
Documentation
File manifest
Jun 20, 2023
NIST Multimodal Information Group, 2023, "NIST 2008 Open Machine Translation (OpenMT) Evaluation", https://hdl.handle.net/11272.1/AB2/YEK10L, Abacus Data Network, V1
Abstract Introduction NIST 2008 Open Machine Translation (OpenMT) Evaluation, Linguistic Data Consortium (LDC) catalog number LDC2010T21 and isbn 1-58563-567-7, is a package containing source data, reference translations and scoring software used in the NIST 2008 OpenMT evaluatio...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 12.5 MB - MD5: beffe229827332e4f336992eec57343d
Data
ISO disc image containing all documentation and data
Plain Text - 1.9 KB - MD5: 7c47ef45b197497dba2f445f9b724d23
Documentation
File manifest
Jun 20, 2023
NIST Multimodal Information Group, 2023, "NIST 2006 Open Machine Translation (OpenMT) Evaluation", https://hdl.handle.net/11272.1/AB2/6UBB7S, Abacus Data Network, V1
Abstract Introduction NIST 2006 Open Machine Translation (OpenMT) Evaluation, Linguistic Data Consortium (LDC) catalog number LDC2010T17 and isbn 1-58563-562-6, is a package containing source data, reference translations and scoring software used in the NIST 2006 OpenMT evaluatio...
Optical Disc Image - 9.9 MB - MD5: 9d6db09b93b456655d013f3dde666495
Data
ISO disc image containing all documentation and data
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Plain Text - 1.8 KB - MD5: b642f8eebfd68a0afd9c3454370e15cb
Documentation
File manifest
Jun 20, 2023
NIST Multimodal Information Group, 2023, "NIST 2003 Open Machine Translation (OpenMT) Evaluation", https://hdl.handle.net/11272.1/AB2/ZH4VPY, Abacus Data Network, V1
Abstract Introduction NIST 2003 Open Machine Translation (OpenMT) Evaluation is a package containing source data, reference translations, and scoring software used in the NIST 2003 OpenMT evaluation. It is designed to help evaluate the effectiveness of machine translation systems...
Optical Disc Image - 1.6 KB - MD5: 1a62a1c98a9ffd1a02e6766575b0c059
Documentation
File manifest
Optical Disc Image - 4.5 MB - MD5: 8b6935042994e51eca5dc3ed6868ee81
Data
ISO disc image containing all documentation and data
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Jun 16, 2023
NIST Multimodal Information Group, 2023, "NIST 2002 Open Machine Translation (OpenMT) Evaluation", https://hdl.handle.net/11272.1/AB2/AO1F7Z, Abacus Data Network, V1
Abstract Introduction NIST 2002 Open Machine Translation (OpenMT) Evaluation is a package containing source data, reference translations, and scoring software used in the NIST 2002 OpenMT evaluation. It is designed to help evaluate the effectiveness of machine translation systems...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 4.5 MB - MD5: 72ef750de760f633cf88461c2b5e1d45
Data
ISO disc image containing all documentation and data
Plain Text - 1.4 KB - MD5: 6ec3b67024258149c67eedce85069301
Documentation
File manifest
Jun 16, 2023
Ma, Xiaoyi, 2023, "Chinese News Translation Text Part 1", https://hdl.handle.net/11272.1/AB2/1AHIZ3, Abacus Data Network, V1
Abstract Introduction Chinese News Translation Text Part 1 was developed by the Linguistic Data Consortium (LDC) and contains approximately 474,000 characters of Chinese text and corresponding English translations, totalling approximately 285,000 words. All the stories in this co...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 13.0 MB - MD5: a43b361d8bb9ca4bf01e48b085337dbc
Data
ISO disc image containing all documentation and data
Plain Text - 103.6 KB - MD5: 7f9fcbcad1f0afe3e45cc61c2e20b780
Documentation
File manifest
Jun 16, 2023
Ma, Xiaoyi, 2023, "Multiple-Translation Chinese (MTC) Part 3", https://hdl.handle.net/11272.1/AB2/NYIMDR, Abacus Data Network, V1
Abstract Introduction Multiple-Translation Chinese (MTC) Part 3 was produced by Linguistic Data Consortium (LDC) catalog number LDC2004T07 and ISBN 1-58563-289-9. To support the development of automatic means for evaluating translation quality, the LDC was sponsored to solicit fo...
Optical Disc Image - 3.9 MB - MD5: ffc745681b5d88c7dd71ce683533cd03
Data
ISO disc image containing all documentation and data
Plain Text - 26.7 KB - MD5: 4f33dbe7df8355ce20e580d48ed09b02
Documentation
LDC2004T07_File_Manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Jun 16, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Zulu Representative Language Pack", https://hdl.handle.net/11272.1/AB2/TYSP2P, Abacus Data Network, V1
Abstract Introduction LORELEI Zulu Representative Language Pack consists of Zulu monolingual text, Zulu-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI program. The LOREL...
Plain Text - 3.7 MB - MD5: 36e3d6994c4b78d958c5a45d9b28436e
Documentation
File manifest
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.

Contact Abacus Data Network Support

Abacus Data Network Support

Please fill this out to prove you are not a robot.

+ =