51 to 100 of 1,837 Results
Plain Text - 6.2 KB -
MD5: 2f3261430557b99ed8a12573d86338be
File manifest |
Apr 1, 2025
Linguistic Data Consortium; Appen Pty Ltd., 2025, "ASpIRE Development and Development Test Sets", https://hdl.handle.net/11272.1/AB2/YS9IIX, Abacus Data Network, V1
Abstract Introduction ASpIRE Development and Development Test Sets was developed for the Automatic Speech recognition In Reverberant Environments (ASpIRE) Challenge sponsored by IARPA (the Intelligent Advanced Research Projects Activity). It contains approximately 226 hours of En... |
Apr 1, 2025 -
ASpIRE Development and Development Test Sets
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Apr 1, 2025 -
ASpIRE Development and Development Test Sets
Optical Disc Image - 24.4 GB -
MD5: fca9878ab5c9d98464e1bd6bec0d5435
ISO disc image containing all documentation and data |
Apr 1, 2025 -
ASpIRE Development and Development Test Sets
Plain Text - 98.4 KB -
MD5: b9de63006195b1b5641856ddb213cb62
File manifest |
Mar 28, 2025
Asatiani, Sandro; Bills, Aric; Brunckhorst, Rachael; Chouder, Sarra; Corey, Cassian; Dubinski, Eyal; Ellis, Corinna; Gibby, Paul; Kalkhitashvili, Tamar; Kazi, Michael; Tong, Audrey; Lam, Julie; Le, Hanh; Malyska, Nicolas; Marcucci, Giorgia; Marvi, Sarah; McConnell, Sara; Melot, Jennifer; Mensch, Alyssa; Morrison, Michelle; Paget, Shelley; Richardson, Frederick; Roberts, Annette; Rubino, Carl; Samushia, Lela, 2025, "MATERIAL Georgian-English Language Pack", https://hdl.handle.net/11272.1/AB2/H5DHYO, Abacus Data Network, V1
Abstract Introduction MATERIAL Georgian-English Language Pack was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) MATERIAL (Machine Translation for English Retrieval of Information in Any Language) program. It contains approximately 79 hours of... |
Mar 28, 2025 -
MATERIAL Georgian-English Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Mar 28, 2025 -
MATERIAL Georgian-English Language Pack
Optical Disc Image - 7.3 GB -
MD5: 55a96021cfe613fd7ac9b749559b34ff
ISO disc image containing all documentation and data |
Mar 28, 2025 -
MATERIAL Georgian-English Language Pack
Plain Text - 179.5 KB -
MD5: a7c4c0a94b82c01e519fa1d2bcb3c6ce
File manifest |
Mar 28, 2025
Bills, Aric; Chouder, Sarra; Corey, Cassian; Davoodian, Marjan; Dubinski, Eyal; Ellis, Corinna; Farnam, Reza; Gibby, Paul; Hartwig, Luke; Kalnins, Dagmara; Kazi, Michael; Lam, Julie; Le, Hanh; Malyska, Nicolas; Marvi, Sarah; McConnell, Sara; Melot, Jennifer; Mensch, Alyssa; Moore, Alex; Morrison, Michelle; Paget, Shelley; Richardson, Frederick; Roberts, Annette; Rubino, Carl; Moaddel, Marjan Sadeghi, 2025, "MATERIAL Farsi-English Language Pack", https://hdl.handle.net/11272.1/AB2/WLFTJ6, Abacus Data Network, V1
Abstract Introduction MATERIAL Farsi-English Language Pack was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) MATERIAL (Machine Translation for English Retrieval of Information in Any Language) program. It contains approximately 61 hours of Fa... |
Mar 28, 2025 -
MATERIAL Farsi-English Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Mar 28, 2025 -
MATERIAL Farsi-English Language Pack
Optical Disc Image - 3.3 GB -
MD5: b8e8336243ace95d41c46f3c35e5b547
ISO disc image containing all documentation and data |
Mar 28, 2025 -
MATERIAL Farsi-English Language Pack
Plain Text - 205.6 KB -
MD5: b4378d6ad3042377216dfa2ff2452655
File manifest |
Mar 28, 2025
Abdi, Zeinab; Ali, Zahra; Bills, Aric; Bishop, Judith; Boyle, Anne; Chouder, Sarra; Clair, Nathaniel; Conners, Tom; Corey, Cassian; Dubinski, Eyal; Ellis, Corinna; Fernando, Jess; Gibby, Paul; Abdi, Farah H; Hammond, Simon; Hubert, Maxime; Kaiser-Schatzlein, Alice; Kazi, Michael; Lam, Julie; Lazar, Rosie; Le, Hanh; Levot, Michael; Malyska, Nicolas; Melot, Jennifer; Mensch, Alyssa; Omar, Abdulkadir Arale; Paget, Shelley; Richardson, Frederick; Rubino, Carl; Samko, Bern; Sanders, Gregory; Soh, Stephanie; Strahan, Tania E.; Taylor, Jonathan; Thompson, Brian; Tong, Audrey; Tong, Richard; Yelle, Julie; Yu, Jennifer; Zavorin, Ilya, 2025, "MATERIAL Somali-English Language Pack", https://hdl.handle.net/11272.1/AB2/2FKSLF, Abacus Data Network, V1
Abstract Introduction MATERIAL Somali-English Language Pack was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) MATERIAL (Machine Translation for English Retrieval of Information in Any Language) program. It contains approximately 80 hours of S... |
Mar 28, 2025 -
MATERIAL Somali-English Language Pack
Optical Disc Image - 13.2 GB -
MD5: 5c836b7ce164720bd2e458c0b01efe42
ISO disc image containing all documentation and data |
Mar 28, 2025 -
MATERIAL Somali-English Language Pack
Plain Text - 255.0 KB -
MD5: 78473307c057e7700462583fa62a93a6
File manifest |
Mar 28, 2025 -
MATERIAL Somali-English Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Mar 28, 2025
Bills, Aric; Bishop, Judith; Boyle, Anne; Chouder, Sarra; Clair, Nathaniel; Conners, Tom; Corey, Cassian; Cronin, Kristina; Dubinski, Eyal; Ellis, Corinna; Gibby, Paul; Hammond, Simon; Hidalgo, Guia; Kaiser-Schatzlein, Alice; Kalnins, Dagmara; Kazi, Michael; Lam, Julie; Lazar, Rosie; Le, Hanh; Malyska, Nicolas; Medel, Olivia; Melot, Jennifer; Mensch, Alyssa; Moore, Alex; Morrison, Michelle; Paget, Shelley; Raymer, Alston; Richardson, Fred; Ridgway, Hristina; Roberts, Annette; Rubino, Carl; Saw, Kenneth; Shen, Sinney; Soh, Stephanie; Taylor, Jonathan; Thompson, Brian; Tong, Audrey; Tong, Richard; Williams, Mariana; Yelle, Julie; Yu, Jennifer; Zavora, Yoanna; Zavorin, Ilya, 2025, "MATERIAL Bulgarian-English Language Pack", https://hdl.handle.net/11272.1/AB2/WCU3PV, Abacus Data Network, V1
Abstract Introduction MATERIAL Bulgarian-English Language Pack was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) MATERIAL (Machine Translation for English Retrieval of Information in Any Language) program. It contains approximately 78 hours o... |
Mar 28, 2025 -
MATERIAL Bulgarian-English Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Mar 28, 2025 -
MATERIAL Bulgarian-English Language Pack
Optical Disc Image - 4.3 GB -
MD5: f0b360083e6774a09e770861404dacbc
ISO disc image containing all documentation and data |
Mar 28, 2025 -
MATERIAL Bulgarian-English Language Pack
Plain Text - 207.5 KB -
MD5: 817e7b010077eafe4034af43dcd3a6cc
File manifest |
Feb 3, 2025
Hernández Mena, Carlos Daniel; Örnólfsson, Gunnar Thor; Gudnason, Jon, 2025, "Samrómur Synthetic", https://hdl.handle.net/11272.1/AB2/DZUB82, Abacus Data Network, V1
Abstract Introduction Samrómur Synthetic was developed by the Language and Voice Lab, Reykjavik University and contains 72 hours of Icelandic synthetic speech, transcripts and metadata. Data Source sentences were extracted from the Samrómur platform, comprised of texts and transc... |
Feb 3, 2025 -
Samrómur Synthetic
Optical Disc Image - 5.8 GB -
MD5: 0814bfa634ed5125e7ce700a8376870c
ISO disc image containing all documentation and data |
Feb 3, 2025 -
Samrómur Synthetic
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Feb 3, 2025 -
Samrómur Synthetic
Plain Text - 4.2 MB -
MD5: 1ac66bdef68bd84484dfbcd53943d248
File manifest |
Feb 3, 2025
Hernández Mena, Carlos Daniel; Simonsen, Annika; Gudnason, Jon, 2025, "Ravnursson Faroese Speech and Transcripts", https://hdl.handle.net/11272.1/AB2/OBXEAK, Abacus Data Network, V1
Abstract Introduction Ravnursson Faroese Speech and Transcripts contains 109 hours of Faroese prompted speech from 433 speakers (249 female, 184 male), corresponding transcripts and speaker metadata. It is an extract from the Basic Language Resource Kit 1.0 (BLARK 1.0) developed... |
Feb 3, 2025 -
Ravnursson Faroese Speech and Transcripts
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Feb 3, 2025 -
Ravnursson Faroese Speech and Transcripts
Optical Disc Image - 6.0 GB -
MD5: 4a4e83330846a634ff04f4d0640fae68
ISO disc image containing all documentation and data |
Feb 3, 2025 -
Ravnursson Faroese Speech and Transcripts
Plain Text - 4.3 MB -
MD5: 601359e08922f5dfc3aeee4d3ce57962
File manifest |
Feb 3, 2025
Alrashoudi, Norah; AlKhalifa, Hend; Alotaibi, Yousef Ajami, 2025, "L2-KSU Native and Non-Native Arabic Speech", https://hdl.handle.net/11272.1/AB2/N7YZP8, Abacus Data Network, V1
Abstract Introduction L2-KSU Native and Non-Native Arabic Speech was developed by King Saud University (KSU) and contains approximately six hours of Modern Standard Arabic read speech from 80 subjects, along with transcripts and speaker metadata. Data The speech data was collecte... |
Feb 3, 2025 -
L2-KSU Native and Non-Native Arabic Speech
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Feb 3, 2025 -
L2-KSU Native and Non-Native Arabic Speech
Optical Disc Image - 703.9 MB -
MD5: 2058c58d3064b4990ffeda2ccd6a843b
ISO disc image containing all documentation and data |
Feb 3, 2025 -
L2-KSU Native and Non-Native Arabic Speech
Plain Text - 348.0 KB -
MD5: f6ecaed4930423f5a5423f3aa255a38c
File manifest |
Feb 3, 2025
Maamouri, Mohamed; Graff, David, 2025, "Iraqi Arabic - English Lexical Database", https://hdl.handle.net/11272.1/AB2/EUPXQD, Abacus Data Network, V1
Abstract Introduction Iraqi Arabic - English Lexical Database was developed by the Linguistic Data Consortium (LDC). It contains six interrelated tables presenting over 67,000 Iraqi Arabic words as orthographic forms in Arabic script and pronunciation forms in International Phone... |
Feb 3, 2025 -
Iraqi Arabic - English Lexical Database
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Feb 3, 2025 -
Iraqi Arabic - English Lexical Database
Optical Disc Image - 6.1 MB -
MD5: 5047ba657ed838ab7ed28361cf99a52a
ISO disc image containing all documentation and data |
Feb 3, 2025 -
Iraqi Arabic - English Lexical Database
Plain Text - 568 B -
MD5: 7ec975313a646a6c4df31a6bb250fe96
File manifest |
Jan 21, 2025
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2025, "LORELEI Yoruba Representative Language Pack", https://hdl.handle.net/11272.1/AB2/ATPB58, Abacus Data Network, V1
Abstract Introduction LORELEI Yoruba Representative Language Pack (LDC2024T10) consists of Yoruba monolingual text, Yoruba-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI progr... |
Jan 21, 2025 -
LORELEI Yoruba Representative Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 21, 2025 -
LORELEI Yoruba Representative Language Pack
Optical Disc Image - 460.8 MB -
MD5: fdf8c1f55ca588f02ff3959fb038479f
ISO disc image containing all documentation and data |
Jan 21, 2025 -
LORELEI Yoruba Representative Language Pack
Plain Text - 1.1 MB -
MD5: d43196767d4cbedec9f8e50b1db4d57d
File manifest |
Jan 21, 2025
Hennig, Leonhard; Thomas, Philippe; Möller, Sebastian, 2025, "MultiTACRED", https://hdl.handle.net/11272.1/AB2/GIEQ7J, Abacus Data Network, V1
Abstract Introduction MultiTACRED was developed by the German Research Center for Artificial Intelligence (DFKI) Speech and Language Technology Lab and is a machine translation of TAC Relation Extraction Dataset (LDC2018T24) (TACRED) into twelve languages with projected entity an... |
Jan 21, 2025 -
MultiTACRED
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 21, 2025 -
MultiTACRED
Optical Disc Image - 756.2 MB -
MD5: a0fe09e9df6275339122a385da5dfd16
ISO disc image containing all documentation and data |
Jan 21, 2025 -
MultiTACRED
Plain Text - 2.9 KB -
MD5: c116a5c547fa51b981286727f18d8ad1
File manifest |
Jan 21, 2025
Das, Debopam; Egg, Markus, 2025, "RST Continuity Corpus", https://hdl.handle.net/11272.1/AB2/YSIB2J, Abacus Data Network, V1
Abstract Introduction RST Continuity Corpus was developed at Åbo Akademi University and Humboldt-Universität zu Berlin and contains annotations for continuity dimensions added to RST Discourse Treebank (LDC2002T07). RST Discourse Treebank is a collection of English news texts fro... |
Jan 21, 2025 -
RST Continuity Corpus
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 21, 2025 -
RST Continuity Corpus
Optical Disc Image - 12.4 MB -
MD5: c0f5e35cf7c7b61b86c391835c97ce11
ISO disc image containing all documentation and data |
Jan 21, 2025 -
RST Continuity Corpus
Plain Text - 82.7 KB -
MD5: 79de47849d089724428899455e0270ee
File manifest |
Oct 25, 2024
Larson, Brian N., 2024, "First-Year Law Students' Court Memoranda", https://hdl.handle.net/11272.1/AB2/CC9MT6, Abacus Data Network, V1
Abstract Introduction First-Year Law Students' Court Memoranda consists of 197 English law student writing samples of legal briefs annotated for certain characteristics along with accompanying survey responses by the student writers. The briefs were created in a law school writin... |