Skip to main content
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

51 to 100 of 1,837 Results
Plain Text - 6.2 KB - MD5: 2f3261430557b99ed8a12573d86338be
Documentation
File manifest
Apr 1, 2025
Linguistic Data Consortium; Appen Pty Ltd., 2025, "ASpIRE Development and Development Test Sets", https://hdl.handle.net/11272.1/AB2/YS9IIX, Abacus Data Network, V1
Abstract Introduction ASpIRE Development and Development Test Sets was developed for the Automatic Speech recognition In Reverberant Environments (ASpIRE) Challenge sponsored by IARPA (the Intelligent Advanced Research Projects Activity). It contains approximately 226 hours of En...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 24.4 GB - MD5: fca9878ab5c9d98464e1bd6bec0d5435
Data
ISO disc image containing all documentation and data
Plain Text - 98.4 KB - MD5: b9de63006195b1b5641856ddb213cb62
Documentation
File manifest
Mar 28, 2025
Asatiani, Sandro; Bills, Aric; Brunckhorst, Rachael; Chouder, Sarra; Corey, Cassian; Dubinski, Eyal; Ellis, Corinna; Gibby, Paul; Kalkhitashvili, Tamar; Kazi, Michael; Tong, Audrey; Lam, Julie; Le, Hanh; Malyska, Nicolas; Marcucci, Giorgia; Marvi, Sarah; McConnell, Sara; Melot, Jennifer; Mensch, Alyssa; Morrison, Michelle; Paget, Shelley; Richardson, Frederick; Roberts, Annette; Rubino, Carl; Samushia, Lela, 2025, "MATERIAL Georgian-English Language Pack", https://hdl.handle.net/11272.1/AB2/H5DHYO, Abacus Data Network, V1
Abstract Introduction MATERIAL Georgian-English Language Pack was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) MATERIAL (Machine Translation for English Retrieval of Information in Any Language) program. It contains approximately 79 hours of...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 7.3 GB - MD5: 55a96021cfe613fd7ac9b749559b34ff
Data
ISO disc image containing all documentation and data
Plain Text - 179.5 KB - MD5: a7c4c0a94b82c01e519fa1d2bcb3c6ce
Documentation
File manifest
Mar 28, 2025
Bills, Aric; Chouder, Sarra; Corey, Cassian; Davoodian, Marjan; Dubinski, Eyal; Ellis, Corinna; Farnam, Reza; Gibby, Paul; Hartwig, Luke; Kalnins, Dagmara; Kazi, Michael; Lam, Julie; Le, Hanh; Malyska, Nicolas; Marvi, Sarah; McConnell, Sara; Melot, Jennifer; Mensch, Alyssa; Moore, Alex; Morrison, Michelle; Paget, Shelley; Richardson, Frederick; Roberts, Annette; Rubino, Carl; Moaddel, Marjan Sadeghi, 2025, "MATERIAL Farsi-English Language Pack", https://hdl.handle.net/11272.1/AB2/WLFTJ6, Abacus Data Network, V1
Abstract Introduction MATERIAL Farsi-English Language Pack was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) MATERIAL (Machine Translation for English Retrieval of Information in Any Language) program. It contains approximately 61 hours of Fa...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 3.3 GB - MD5: b8e8336243ace95d41c46f3c35e5b547
Data
ISO disc image containing all documentation and data
Plain Text - 205.6 KB - MD5: b4378d6ad3042377216dfa2ff2452655
Documentation
File manifest
Mar 28, 2025
Abdi, Zeinab; Ali, Zahra; Bills, Aric; Bishop, Judith; Boyle, Anne; Chouder, Sarra; Clair, Nathaniel; Conners, Tom; Corey, Cassian; Dubinski, Eyal; Ellis, Corinna; Fernando, Jess; Gibby, Paul; Abdi, Farah H; Hammond, Simon; Hubert, Maxime; Kaiser-Schatzlein, Alice; Kazi, Michael; Lam, Julie; Lazar, Rosie; Le, Hanh; Levot, Michael; Malyska, Nicolas; Melot, Jennifer; Mensch, Alyssa; Omar, Abdulkadir Arale; Paget, Shelley; Richardson, Frederick; Rubino, Carl; Samko, Bern; Sanders, Gregory; Soh, Stephanie; Strahan, Tania E.; Taylor, Jonathan; Thompson, Brian; Tong, Audrey; Tong, Richard; Yelle, Julie; Yu, Jennifer; Zavorin, Ilya, 2025, "MATERIAL Somali-English Language Pack", https://hdl.handle.net/11272.1/AB2/2FKSLF, Abacus Data Network, V1
Abstract Introduction MATERIAL Somali-English Language Pack was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) MATERIAL (Machine Translation for English Retrieval of Information in Any Language) program. It contains approximately 80 hours of S...
Optical Disc Image - 13.2 GB - MD5: 5c836b7ce164720bd2e458c0b01efe42
Data
ISO disc image containing all documentation and data
Plain Text - 255.0 KB - MD5: 78473307c057e7700462583fa62a93a6
Documentation
File manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Mar 28, 2025
Bills, Aric; Bishop, Judith; Boyle, Anne; Chouder, Sarra; Clair, Nathaniel; Conners, Tom; Corey, Cassian; Cronin, Kristina; Dubinski, Eyal; Ellis, Corinna; Gibby, Paul; Hammond, Simon; Hidalgo, Guia; Kaiser-Schatzlein, Alice; Kalnins, Dagmara; Kazi, Michael; Lam, Julie; Lazar, Rosie; Le, Hanh; Malyska, Nicolas; Medel, Olivia; Melot, Jennifer; Mensch, Alyssa; Moore, Alex; Morrison, Michelle; Paget, Shelley; Raymer, Alston; Richardson, Fred; Ridgway, Hristina; Roberts, Annette; Rubino, Carl; Saw, Kenneth; Shen, Sinney; Soh, Stephanie; Taylor, Jonathan; Thompson, Brian; Tong, Audrey; Tong, Richard; Williams, Mariana; Yelle, Julie; Yu, Jennifer; Zavora, Yoanna; Zavorin, Ilya, 2025, "MATERIAL Bulgarian-English Language Pack", https://hdl.handle.net/11272.1/AB2/WCU3PV, Abacus Data Network, V1
Abstract Introduction MATERIAL Bulgarian-English Language Pack was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) MATERIAL (Machine Translation for English Retrieval of Information in Any Language) program. It contains approximately 78 hours o...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 4.3 GB - MD5: f0b360083e6774a09e770861404dacbc
Data
ISO disc image containing all documentation and data
Plain Text - 207.5 KB - MD5: 817e7b010077eafe4034af43dcd3a6cc
Documentation
File manifest
Feb 3, 2025
Hernández Mena, Carlos Daniel; Örnólfsson, Gunnar Thor; Gudnason, Jon, 2025, "Samrómur Synthetic", https://hdl.handle.net/11272.1/AB2/DZUB82, Abacus Data Network, V1
Abstract Introduction Samrómur Synthetic was developed by the Language and Voice Lab, Reykjavik University and contains 72 hours of Icelandic synthetic speech, transcripts and metadata. Data Source sentences were extracted from the Samrómur platform, comprised of texts and transc...
Feb 3, 2025 - Samrómur Synthetic
Optical Disc Image - 5.8 GB - MD5: 0814bfa634ed5125e7ce700a8376870c
Data
ISO disc image containing all documentation and data
Feb 3, 2025 - Samrómur Synthetic
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Feb 3, 2025 - Samrómur Synthetic
Plain Text - 4.2 MB - MD5: 1ac66bdef68bd84484dfbcd53943d248
Documentation
File manifest
Feb 3, 2025
Hernández Mena, Carlos Daniel; Simonsen, Annika; Gudnason, Jon, 2025, "Ravnursson Faroese Speech and Transcripts", https://hdl.handle.net/11272.1/AB2/OBXEAK, Abacus Data Network, V1
Abstract Introduction Ravnursson Faroese Speech and Transcripts contains 109 hours of Faroese prompted speech from 433 speakers (249 female, 184 male), corresponding transcripts and speaker metadata. It is an extract from the Basic Language Resource Kit 1.0 (BLARK 1.0) developed...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 6.0 GB - MD5: 4a4e83330846a634ff04f4d0640fae68
Data
ISO disc image containing all documentation and data
Plain Text - 4.3 MB - MD5: 601359e08922f5dfc3aeee4d3ce57962
Documentation
File manifest
Feb 3, 2025
Alrashoudi, Norah; AlKhalifa, Hend; Alotaibi, Yousef Ajami, 2025, "L2-KSU Native and Non-Native Arabic Speech", https://hdl.handle.net/11272.1/AB2/N7YZP8, Abacus Data Network, V1
Abstract Introduction L2-KSU Native and Non-Native Arabic Speech was developed by King Saud University (KSU) and contains approximately six hours of Modern Standard Arabic read speech from 80 subjects, along with transcripts and speaker metadata. Data The speech data was collecte...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 703.9 MB - MD5: 2058c58d3064b4990ffeda2ccd6a843b
Data
ISO disc image containing all documentation and data
Plain Text - 348.0 KB - MD5: f6ecaed4930423f5a5423f3aa255a38c
Documentation
File manifest
Feb 3, 2025
Maamouri, Mohamed; Graff, David, 2025, "Iraqi Arabic - English Lexical Database", https://hdl.handle.net/11272.1/AB2/EUPXQD, Abacus Data Network, V1
Abstract Introduction Iraqi Arabic - English Lexical Database was developed by the Linguistic Data Consortium (LDC). It contains six interrelated tables presenting over 67,000 Iraqi Arabic words as orthographic forms in Arabic script and pronunciation forms in International Phone...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 6.1 MB - MD5: 5047ba657ed838ab7ed28361cf99a52a
Data
ISO disc image containing all documentation and data
Plain Text - 568 B - MD5: 7ec975313a646a6c4df31a6bb250fe96
Documentation
File manifest
Jan 21, 2025
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2025, "LORELEI Yoruba Representative Language Pack", https://hdl.handle.net/11272.1/AB2/ATPB58, Abacus Data Network, V1
Abstract Introduction LORELEI Yoruba Representative Language Pack (LDC2024T10) consists of Yoruba monolingual text, Yoruba-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI progr...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 460.8 MB - MD5: fdf8c1f55ca588f02ff3959fb038479f
Data
ISO disc image containing all documentation and data
Plain Text - 1.1 MB - MD5: d43196767d4cbedec9f8e50b1db4d57d
Documentation
File manifest
Jan 21, 2025
Hennig, Leonhard; Thomas, Philippe; Möller, Sebastian, 2025, "MultiTACRED", https://hdl.handle.net/11272.1/AB2/GIEQ7J, Abacus Data Network, V1
Abstract Introduction MultiTACRED was developed by the German Research Center for Artificial Intelligence (DFKI) Speech and Language Technology Lab and is a machine translation of TAC Relation Extraction Dataset (LDC2018T24) (TACRED) into twelve languages with projected entity an...
Jan 21, 2025 - MultiTACRED
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Jan 21, 2025 - MultiTACRED
Optical Disc Image - 756.2 MB - MD5: a0fe09e9df6275339122a385da5dfd16
Data
ISO disc image containing all documentation and data
Jan 21, 2025 - MultiTACRED
Plain Text - 2.9 KB - MD5: c116a5c547fa51b981286727f18d8ad1
Documentation
File manifest
Jan 21, 2025
Das, Debopam; Egg, Markus, 2025, "RST Continuity Corpus", https://hdl.handle.net/11272.1/AB2/YSIB2J, Abacus Data Network, V1
Abstract Introduction RST Continuity Corpus was developed at Åbo Akademi University and Humboldt-Universität zu Berlin and contains annotations for continuity dimensions added to RST Discourse Treebank (LDC2002T07). RST Discourse Treebank is a collection of English news texts fro...
Jan 21, 2025 - RST Continuity Corpus
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Jan 21, 2025 - RST Continuity Corpus
Optical Disc Image - 12.4 MB - MD5: c0f5e35cf7c7b61b86c391835c97ce11
Data
ISO disc image containing all documentation and data
Jan 21, 2025 - RST Continuity Corpus
Plain Text - 82.7 KB - MD5: 79de47849d089724428899455e0270ee
Documentation
File manifest
Oct 25, 2024
Larson, Brian N., 2024, "First-Year Law Students' Court Memoranda", https://hdl.handle.net/11272.1/AB2/CC9MT6, Abacus Data Network, V1
Abstract Introduction First-Year Law Students' Court Memoranda consists of 197 English law student writing samples of legal briefs annotated for certain characteristics along with accompanying survey responses by the student writers. The briefs were created in a law school writin...
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.

Contact Abacus Data Network Support

Abacus Data Network Support

Please fill this out to prove you are not a robot.

+ =