Skip to main content
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

251 to 300 of 1,819 Results
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Adobe PDF - 31.2 KB - MD5: 100c549ff1bb48ed76f05d01f6342eb3
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (PDF)
Aug 29, 2023
Li, Xuansong; Strassel, Stephanie; Jones, Karen; Antonishek, Brian; Fiscus, Jonathan G., 2022, "HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/SXVGS7, Abacus Data Network, V2
Abstract Introduction HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 3,800 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and rel...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Adobe PDF - 31.2 KB - MD5: 100c549ff1bb48ed76f05d01f6342eb3
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (PDF)
Aug 29, 2023
Mahmoud, Sabri; Ahmad, Irfan; Al-Khatib, Wasfi; Alshayeb, Mohammad; Parvez, Mohammad; Märgner, Volker; Fink, Gernot, 2015, "KHATT: Handwritten Arabic Text", https://hdl.handle.net/11272.1/AB2/PL0DHA, Abacus Data Network, V2
Introduction KHATT: Handwritten Arabic Text was developed by King Fahd University of Petroleum & Minerals, Technical University of Dortmund and Braunschweig University of Technology. It is comprised of scanned Arabic handwriting from 1,000 distinct male and female writers represe...
Markdown Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Aug 25, 2023
Alwan, Abeer; Lulich, Steven; Sommers, Mitchell, 2015, "The Subglottal Resonances Database", https://hdl.handle.net/11272.1/AB2/R82KKG, Abacus Data Network, V2
Introduction The Subglottal Resonances Database was developed by Washington University and University of California Los Angeles and consists of 45 hours of simultaneous microphone and subglottal accelerometer recordings of 25 adult male and 25 adult female speakers of American En...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Aug 25, 2023
Walker, Kevin; Ma, Xiaoyi; Graff, David; Strassel, Stephanie; Sessa, Stephanie; Jones, Karen, 2015, "RATS Speech Activity Detection", https://hdl.handle.net/11272.1/AB2/1UISJ7, Abacus Data Network, V2
Introduction RATS Speech Activity Detection was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 3,000 hours of Levantine Arabic, English, Farsi, Pashto, and Urdu conversational telephone speech with automatic and manual annotation of speech seg...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Adobe PDF - 31.2 KB - MD5: 100c549ff1bb48ed76f05d01f6342eb3
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (PDF)
Aug 18, 2023
Hernández Mena, Carlos Daniel; Gatt, Albert; Borg, Claudia; DeMarco, Andrea; van der Plas, Lonneke, 2023, "MASRI Synthetic", https://hdl.handle.net/11272.1/AB2/WBPJBV, Abacus Data Network, V1
Abstract Introduction MASRI (Maltese Automatic Speech Recognition I) Synthetic was developed by the MASRI team at the University of Malta and consists of approximately 99 hours of synthesized Maltese speech. Data Source sentences were extracted from the Maltese Language Resource...
Aug 18, 2023 - MASRI Synthetic
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Aug 18, 2023 - MASRI Synthetic
Optical Disc Image - 6.3 GB - MD5: ad9774df27abe949208102786e6ecdd8
Data
ISO disc image containing all documentation and data
Aug 18, 2023 - MASRI Synthetic
Plain Text - 3.6 MB - MD5: 33c068416a07aa174a0e8b1808fe71ca
Documentation
File manifest
Aug 18, 2023
Pradhan, Sameer; Cole, Ronald Allan; Ward, Wayne, 2023, "MyST Children's Conversational Speech", https://hdl.handle.net/11272.1/AB2/QUHJRW, Abacus Data Network, V1
Abstract Introduction MyST (My Science Tutor) Children's Conversational Speech was developed by Boulder Learning Inc. It is comprised of approximately 470 hours of English speech from 1371 students in grades 3-5 conversing with a virtual science tutor in eight areas of science in...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
nstructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Adobe PDF - 31.2 KB - MD5: 100c549ff1bb48ed76f05d01f6342eb3
Documentation
nstructions on how to access LDC data via UBC's Teamshare service
Plain Text - 27.8 MB - MD5: ff2292703964e157e7aae535e38a4325
Documentation
File manifest
Aug 17, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Indonesian Representative Language Pack", https://hdl.handle.net/11272.1/AB2/JLEISQ, Abacus Data Network, V1
Abstract Introduction LORELEI Indonesian Representative Language Pack consists of Indonesian monolingual text, Indonesian-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI...
Optical Disc Image - 752.3 MB - MD5: 550498c694e21c55b95327a1da4d321f
Data
ISO disc image containing all documentation and data
Plain Text - 1.4 MB - MD5: 6bbaa195a7dd66dfbb8ab168c24849c8
Documentation
File manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Aug 17, 2023
Helgadóttir, Inga Rún; Kjaran, Róbert; Nikulásdóttir, Anna Björk; Gudnason, Jon, 2023, "Althingi Parliamentary Speech", https://hdl.handle.net/11272.1/AB2/NIG304, Abacus Data Network, V1
Abstract Introduction Althingi Parliamentary Speech consists of approximately 542 hours of recorded speech from Althingi, the Icelandic Parliament, along with corresponding transcripts, a pronunciation dictionary and two language models. Speeches date from 2005-2016. This dataset...
Optical Disc Image - 3.7 GB - MD5: c0450e614559604b8d555b1879b3821e
Data
ISO disc image containing all documentation and data: disc 2
Optical Disc Image - 3.6 GB - MD5: 53207cea2d17dccd92b49b7763dd0950
Data
ISO disc image containing all documentation and data: disc 3
Optical Disc Image - 3.8 GB - MD5: 8e00d365a65f0f6ad331038f0fc2e5e2
Data
ISO disc image containing all documentation and data: disc 4
Optical Disc Image - 3.6 GB - MD5: 4798ae805e8a78236f73ce45866eacc9
Data
ISO disc image containing all documentation and data: disc 1
Plain Text - 62.4 KB - MD5: a715ed80dcf3822fb4bbf0cb2a50ce32
Documentation
File manifest for disc 1
Plain Text - 62.4 KB - MD5: 3504cb4ba53bfc32f39e5536b638c388
Documentation
File manifest for disc 2
Plain Text - 67.2 KB - MD5: a7304a047da2e68cdd02fc6e599282d6
Documentation
File manifest for disc 3
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 4.2 GB - MD5: afe68e0ffc1d8e3f3daa34c0de081098
Data
ISO disc image containing all documentation and data: disc 5
Plain Text - 62.0 KB - MD5: 5d081a1f8f3f50959914fb18c3ec966a
Documentation
File manifest for disc 5
Plain Text - 67.2 KB - MD5: ea5185fb493f26622820bb219e082366
Documentation
File manifest for disc 4
Aug 17, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Thai Representative Language Pack", https://hdl.handle.net/11272.1/AB2/GCBMNV, Abacus Data Network, V1
Abstract Introduction LORELEI Thai Representative Language Pack (LDC2023T08) consists of Thai monolingual text, Thai-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI progr...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disk images
Optical Disc Image - 4.2 GB - MD5: 685824c186da00dc8991e268ecbdfa20
Data
ISO disc image containing all documentation and data
Plain Text - 3.0 MB - MD5: 78fba2c5955c2c1299ba72c61520c5f5
Documentation
File manifest
Aug 17, 2023
Brandschain, Linda; Walker, Kevin; Graff, David, 2023, "Mixer 7 Spanish Speech", https://hdl.handle.net/11272.1/AB2/CYMBUE, Abacus Data Network, V1
Abstract Introduction Mixer 7 Spanish Speech (LDC2023S04) was developed by the Linguistic Data Consortium (LDC) and contains 9,600 hours of audio recordings of interviews, transcript readings and conversational telephone speech involving 191 distinct native Spanish speakers. This...
Aug 17, 2023 - Mixer 7 Spanish Speech
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service
Aug 17, 2023 - Mixer 7 Spanish Speech
Plain Text - 4.3 MB - MD5: 2f2fddbbd5babfdfe7fc76b53262e4ee
Documentation
File manifest
Aug 17, 2023 - Mixer 7 Spanish Speech
Adobe PDF - 31.2 KB - MD5: 100c549ff1bb48ed76f05d01f6342eb3
Documentation
Instructions on how to access LDC data via UBC's Teamshare service
Aug 17, 2023
Maamouri, Mohamed; Graff, David, 2023, "Moroccan Arabic - English Lexical Database", https://hdl.handle.net/11272.1/AB2/E8N63E, Abacus Data Network, V1
Abstract Introduction Moroccan Arabic - English Lexical Database was developed by the Linguistic Data Consortium (LDC). It is comprised of a set of five interrelated tables presenting each Moroccan Arabic word as an orthographic form in Arabic script and a pronunciation form in I...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 4.4 MB - MD5: 53dbb297788cf86ef9e6eec6a6f4b465
Data
ISO disc image containing all documentation and data
Plain Text - 537 B - MD5: 9fca6c2dd41cf2df3711f7a063098afd
Documentation
File manifest
Aug 17, 2023
Hernández Mena, Carlos Daniel; Borsky, Michal; Mollberg, David; Guðmundsson, Smári Freyr; Hedström, Staffan; Pálsson, Ragnar; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Magnusdottir, Eydis Huld; Þórhallsdóttir, Ragnheiður; Gudnason, Jon, 2023, "Samrómur Children Icelandic Speech 1.0", https://hdl.handle.net/11272.1/AB2/LKGTIU, Abacus Data Network, V1
Abstract Introduction Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (childre...
Plain Text - 2.9 MB - MD5: 6a5d27caa13f8a6c13544e0393a652a4
Documentation
File manifest for disc i
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.

Contact Abacus Data Network Support

Abacus Data Network Support

Please fill this out to prove you are not a robot.

+ =