Skip to main content
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

501 to 550 of 1,855 Results
Nov 30, 2022
Greenberg, Craig; Sadjadi, Omid; Reynolds, Douglas; Singer, Elliot; Graff, David, 2022, "2017 NIST Language Recognition Evaluation Training and Development Sets", https://hdl.handle.net/11272.1/AB2/K7LOKJ, Abacus Data Network, V1
Abstract Introduction 2017 NIST Language Recognition Evaluation Training and Development Sets contains training and development material for the 2017 NIST Language Recognition Evaluation. It consists of approximately 2,100 hours of conversational telephone speech, broadcast conve...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Adobe PDF - 31.6 KB - MD5: 2c59ff6b57152c7861b50daebf2aef07
Documentation
Instructions on how to access LDC data via UBC's Teamshare service
Plain Text - 1003.2 KB - MD5: 73b9ff71647df18cb1aed150d169823f
Documentation
File manifest
Nov 29, 2022
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2022, "LORELEI Bengali Representative Language Pack", https://hdl.handle.net/11272.1/AB2/IG4DBS, Abacus Data Network, V1
Abstract Introduction LORELEI Bengali Representative Language Pack consists of Bengali monolingual text, Bengali-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LO...
Optical Disc Image - 822.3 MB - MD5: bd46a7b80e6c846d953b46e50aa87af8
Data
ISO disc image containing all documentation and data - disc 2
Optical Disc Image - 3.7 GB - MD5: 00eabaf0eb9d6c77aa4194ce099d2712
Data
ISO disc image containing all documentation and data - disc 1
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Plain Text - 268.2 KB - MD5: 920ca1530ff22cb9d3dd1cfcb8a53973
Documentation
File manifest for disc 1
Plain Text - 2.2 MB - MD5: 194fe633c82e9626d4aa34315dd34f5d
Documentation
File manifest for disc 2
Nov 29, 2022
Lau, Mingfei; Zhong, Muhan; Lau, Chaak-ming; Su, Jian; Chan, Henry; Cheung, Bing, 2022, "Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon", https://hdl.handle.net/11272.1/AB2/URBMXM, Abacus Data Network, V1
Abstract Introduction Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon was developed by the Cantonese Computational Linguistics Infrastructure Working Group. It contains approximately 130,000 Cantonese character, word, and phrase entries paired with their corresponding rom...
Optical Disc Image - 3.9 MB - MD5: 5ab9b5e4c14a5ef90ae493a3adbdb6da
Data
ISO disc image containing all documentation and data
Plain Text - 281 B - MD5: 086b2d5d70e2d91a3940da8aea1ef1e9
Documentation
File manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Gulf Arabic Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/SCSMSJ, Abacus Data Network, V1
Abstract Introduction Gulf Arabic Conversational Telephone Speech is a database developed by Appen Pty Ltd., Sydney, Australia and contains roughly 2,800 min of spontaneous telephone conversations in Colloquial Gulf Arabic. This corpus was collected and transcribed in 2004 by App...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 2.7 GB - MD5: 74a5b1d30b5a37117abbc3141c87b996
Data
ISO disc image containing all documentation and data
Plain Text - 21.5 KB - MD5: dd299ec6cd761783ecf00392ff376798
Documentation
File manifest
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Iraqi Arabic Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/YBQF3Y, Abacus Data Network, V1
Abstract Introduction Iraqi Arabic Conversational Telephone Speech was developed by Appen Pty Ltd, Sydney, Australia and contains roughly 3000 mins of speech from Iraqi Arabic speakers taking part in spontaneous telephone conversations in Colloquial Iraqi Arabic. This corpus was...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 1.4 GB - MD5: 2910699cad8323e76ec4dab61e0a9dc2
Data
ISO disc image containing all documentation and data
Plain Text - 15.7 KB - MD5: 143566b5f737bd54156225939f7804c4
Documentation
File manifest
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Gulf Arabic Conversational Telephone Speech, Transcripts", https://hdl.handle.net/11272.1/AB2/ZLBR2M, Abacus Data Network, V1
Abstract Introduction Gulf Arabic Conversational Telephone Speech, Transcripts is a database developed by Appen Pty Ltd., Sydney, Australia and contains transcripts of roughly 2,800 min of spontaneous telephone conversations in Colloquial Gulf Arabic. A total of 976 conversation...
Optical Disc Image - 11.6 MB - MD5: 8df326a775c9b9f020728893fd83d980
Data
ISO disc image containing all documentation and data
Plain Text - 24.5 KB - MD5: adbe699a71b244abde4429b990bbbd48
Documentation
File manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Iraqi Arabic Conversational Telephone Speech, Transcripts", https://hdl.handle.net/11272.1/AB2/ELQDGO, Abacus Data Network, V1
Abstract Introduction Iraqi Arabic Conversational Telephone Speech, Transcripts was developed by Appen Pty Ltd, Sydney, Australia and contains transcripts for roughly 3000 mins of speech from Iraqi Arabic speakers taking part in spontaneous telephone conversations in Colloquial I...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 5.2 MB - MD5: 65366589f881db2a48db7f807fe942f5
Data
ISO disc image containing all documentation and data
Plain Text - 14.6 KB - MD5: a6d8ad828d13d6eae10baf4003330713
Documentation
File manifest
Oct 13, 2022
Glenn, Meghan; Lee, Haejoong; Strassel, Stephanie; Maeda, Kazuaki, 2022, "GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 1", https://hdl.handle.net/11272.1/AB2/MZSDMN, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 1 was developed by the Linguistic Data Consortium (LDC) and contains transcriptions of approximately 123 hours of Arabic broadcast conversation speech collected in 2006 and 2007 by LDC, MediaNet, Tu...
Optical Disc Image - 15.2 MB - MD5: a414c394ade107b69e05fcc9e67ea417
Data
ISO disc image containing all documentation and data
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Plain Text - 10.2 KB - MD5: a17d486bc3c336ed7db29fe84e07cdb9
Documentation
File manifest
Oct 12, 2022
Alsulaiman, Mansour; Muhammad, Ghulam; Abdelkader, Bencherif Mohamed; Mahmood, Awais; Ali, Zulfiqar, 2022, "King Saud University Arabic Speech Database", https://hdl.handle.net/11272.1/AB2/4YVL4A, Abacus Data Network, V1
Abstract Introduction King Saud University Arabic Speech Database was developed by Speech Group (SG) at King Saud University and contains 590 hours of recorded Arabic speech from 269 male and female speakers. The utterances include read and spontaneous speech. The recordings were...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Adobe PDF - 31.2 KB - MD5: 100c549ff1bb48ed76f05d01f6342eb3
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (PDF)
Plain Text - 8.8 MB - MD5: 2bfd5cbae2879cafada79a4890653fea
Documentation
File manifest
Oct 12, 2022
Walker, Kevin; Caruso, Christopher; Maeda, Kazuaki; DiPersio, Denise; Strassel, Stephanie, 2022, "GALE Phase 2 Arabic Broadcast Conversation Speech Part 1", https://hdl.handle.net/11272.1/AB2/GGD0CB, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Speech Part 1 was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 123 hours of Arabic broadcast conversation speech collected in 2006 and 2007 by LDC as part of the DARPA GALE (Gl...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 6.7 GB - MD5: 48c0677831c51bb5c437170eb2ba2265
Data
ISO disc image containing all documentation and data
Plain Text - 7.8 KB - MD5: 0b47ed1cb6a6881a291bcd1ed7ed64c4
Documentation
File manifest
Oct 12, 2022
Cieri, Christopher; Zhan, Juhong; Jiang, Yue; Liberman, Mark; Yuan, Jiahong; Chen, Yiya; Scharenborg, Odette, 2022, "Xi'an Guanzhong Object Naming", https://hdl.handle.net/11272.1/AB2/D2DBLV, Abacus Data Network, V1
Abstract Introduction Xi'an Guanzhong Object Naming is comprised of approximately 15 hours of audio recordings from speakers of the Guanzhong dialect of Mandarin Chinese living in or near Xi'an in Shaangxi Province (China) naming objects that appeared in colored line drawings. Th...
Optical Disc Image - 799.8 MB - MD5: dea90d62fe4089357b226db72fb6ced4
Data
ISO disc image containing all documentation and data
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Plain Text - 1.8 MB - MD5: 8d091c2b3f692f322fd28fd1dd620b0f
Documentation
File manifest
Sep 20, 2022
Li, Xuansong; Strassel, Stephanie; Jones, Karen; Antonishek, Brian; Fiscus, Jonathan G., 2022, "HAVIC MED Novel 2 Test -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/GNUQ1A, Abacus Data Network, V1
Abstract Introduction HAVIC MED Novel 2 Test -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 6,200 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and rel...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Adobe PDF - 31.2 KB - MD5: 100c549ff1bb48ed76f05d01f6342eb3
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (PDF)
Aug 9, 2022
Carvalho, Vitor R.; Kiran, Yigit; Borthwick, Andrew, 2022, "American English Nickname Collection", https://hdl.handle.net/11272.1/AB2/JR1WG6, Abacus Data Network, V1
Abstract Introduction American English Nickname Collection was developed by Intelius, Inc. and is a compilation of American English nicknames to given name mappings based on information in US government records, public web profiles and financial and property reports. This corpus...
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.

Contact Abacus Data Network Support

Abacus Data Network Support

Please fill this out to prove you are not a robot.

+ =