451 to 500 of 1,819 Results
Jan 24, 2023 -
Global TIMIT Thai
Optical Disc Image - 1000.5 MB -
MD5: a663545c1bf2f5931b1a69a4e81ca87e
ISO disc image containing all documentation and data |
Jan 24, 2023 -
Global TIMIT Thai
Plain Text - 2.3 MB -
MD5: d624a00ad258c3b468330a0c0be2597c
File manifest |
Dec 8, 2022
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2022, "Third DIHARD Challenge Development", https://hdl.handle.net/11272.1/AB2/UY5O0X, Abacus Data Network, V1
Abstract Introduction Third DIHARD Challenge Development was developed by Linguistic Data Consortium (LDC) and contains approximately 34 hours of English and Chinese speech data along with corresponding annotations used in support of the Third DIHARD Challenge. The DIHARD Challen... |
Dec 8, 2022 -
Third DIHARD Challenge Development
Optical Disc Image - 1.8 GB -
MD5: d1d6b5bf72286297f4732b488e90c79b
ISO disc image containing all documentation and data |
Dec 8, 2022 -
Third DIHARD Challenge Development
Plain Text - 47.7 KB -
MD5: 4146720f0b80f973181d252c38635c30
File manifest |
Dec 8, 2022 -
Third DIHARD Challenge Development
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disk images |
Dec 8, 2022
Bies, Ann; Mott, Justin; Warner, Colin; Kulick, Seth, 2022, "BOLT English Translation Treebank - Egyptian Arabic SMS/Chat", https://hdl.handle.net/11272.1/AB2/SPCYLS, Abacus Data Network, V1
Abstract Introduction BOLT English Translation Treebank - Egyptian Arabic SMS/Chat was developed by the Linguistic Data Consortium (LDC) and consists of SMS and chat text data translated from Egyptian Arabic to English and annotated for part-of-speech and syntactic structure. The... |
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disk images |
Optical Disc Image - 52.7 MB -
MD5: 63848a47ddeb6b84b5a2052a5d9d5393
ISO disc image containing all documentation and data |
Plain Text - 144.2 KB -
MD5: 8ce13bc0db258f5a51ef13ed54bca7f8
File manifest |
Nov 30, 2022
Byrne, William; Knodt, Eva; Bernstein, Jared; Emami, Farzhad, 2022, "Hispanic-English Database", https://hdl.handle.net/11272.1/AB2/IIJZCH, Abacus Data Network, V1
Abstract Introduction Hispanic-English Database contains approximately 30 hours of English and Spanish conversational and read speech with transcripts (24 hours) and metadata collected from 22 non-native English speakers between 1996 and 1998. The corpus was developed by Entropic... |
Nov 30, 2022 -
Hispanic-English Database
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Nov 30, 2022 -
Hispanic-English Database
Optical Disc Image - 2.9 GB -
MD5: 6579259e47fa887ceab07ab59c736534
ISO disc image containing all documentation and data |
Nov 30, 2022 -
Hispanic-English Database
Plain Text - 176.6 KB -
MD5: ac03ea2e19f3524202a077bd24727b19
File manifest |
Nov 30, 2022
Greenberg, Craig; Sadjadi, Omid; Reynolds, Douglas; Singer, Elliot; Graff, David, 2022, "2017 NIST Language Recognition Evaluation Training and Development Sets", https://hdl.handle.net/11272.1/AB2/K7LOKJ, Abacus Data Network, V1
Abstract Introduction 2017 NIST Language Recognition Evaluation Training and Development Sets contains training and development material for the 2017 NIST Language Recognition Evaluation. It consists of approximately 2,100 hours of conversational telephone speech, broadcast conve... |
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Adobe PDF - 31.6 KB -
MD5: 2c59ff6b57152c7861b50daebf2aef07
Instructions on how to access LDC data via UBC's Teamshare service |
Plain Text - 1003.2 KB -
MD5: 73b9ff71647df18cb1aed150d169823f
File manifest |
Nov 29, 2022
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2022, "LORELEI Bengali Representative Language Pack", https://hdl.handle.net/11272.1/AB2/IG4DBS, Abacus Data Network, V1
Abstract Introduction LORELEI Bengali Representative Language Pack consists of Bengali monolingual text, Bengali-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LO... |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Optical Disc Image - 822.3 MB -
MD5: bd46a7b80e6c846d953b46e50aa87af8
ISO disc image containing all documentation and data - disc 2 |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Plain Text - 268.2 KB -
MD5: 920ca1530ff22cb9d3dd1cfcb8a53973
File manifest for disc 1 |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Plain Text - 2.2 MB -
MD5: 194fe633c82e9626d4aa34315dd34f5d
File manifest for disc 2 |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Optical Disc Image - 3.7 GB -
MD5: 00eabaf0eb9d6c77aa4194ce099d2712
ISO disc image containing all documentation and data - disc 1 |
Nov 29, 2022
Lau, Mingfei; Zhong, Muhan; Lau, Chaak-ming; Su, Jian; Chan, Henry; Cheung, Bing, 2022, "Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon", https://hdl.handle.net/11272.1/AB2/URBMXM, Abacus Data Network, V1
Abstract Introduction Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon was developed by the Cantonese Computational Linguistics Infrastructure Working Group. It contains approximately 130,000 Cantonese character, word, and phrase entries paired with their corresponding rom... |
Nov 29, 2022 -
Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Nov 29, 2022 -
Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon
Optical Disc Image - 3.9 MB -
MD5: 5ab9b5e4c14a5ef90ae493a3adbdb6da
ISO disc image containing all documentation and data |
Nov 29, 2022 -
Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon
Plain Text - 281 B -
MD5: 086b2d5d70e2d91a3940da8aea1ef1e9
File manifest |
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Gulf Arabic Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/SCSMSJ, Abacus Data Network, V1
Abstract Introduction Gulf Arabic Conversational Telephone Speech is a database developed by Appen Pty Ltd., Sydney, Australia and contains roughly 2,800 min of spontaneous telephone conversations in Colloquial Gulf Arabic. This corpus was collected and transcribed in 2004 by App... |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech
Optical Disc Image - 2.7 GB -
MD5: 74a5b1d30b5a37117abbc3141c87b996
ISO disc image containing all documentation and data |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech
Plain Text - 21.5 KB -
MD5: dd299ec6cd761783ecf00392ff376798
File manifest |
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Iraqi Arabic Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/YBQF3Y, Abacus Data Network, V1
Abstract Introduction Iraqi Arabic Conversational Telephone Speech was developed by Appen Pty Ltd, Sydney, Australia and contains roughly 3000 mins of speech from Iraqi Arabic speakers taking part in spontaneous telephone conversations in Colloquial Iraqi Arabic. This corpus was... |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech
Optical Disc Image - 1.4 GB -
MD5: 2910699cad8323e76ec4dab61e0a9dc2
ISO disc image containing all documentation and data |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech
Plain Text - 15.7 KB -
MD5: 143566b5f737bd54156225939f7804c4
File manifest |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Gulf Arabic Conversational Telephone Speech, Transcripts", https://hdl.handle.net/11272.1/AB2/ZLBR2M, Abacus Data Network, V1
Abstract Introduction Gulf Arabic Conversational Telephone Speech, Transcripts is a database developed by Appen Pty Ltd., Sydney, Australia and contains transcripts of roughly 2,800 min of spontaneous telephone conversations in Colloquial Gulf Arabic. A total of 976 conversation... |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech, Transcripts
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech, Transcripts
Optical Disc Image - 11.6 MB -
MD5: 8df326a775c9b9f020728893fd83d980
ISO disc image containing all documentation and data |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech, Transcripts
Plain Text - 24.5 KB -
MD5: adbe699a71b244abde4429b990bbbd48
File manifest |
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Iraqi Arabic Conversational Telephone Speech, Transcripts", https://hdl.handle.net/11272.1/AB2/ELQDGO, Abacus Data Network, V1
Abstract Introduction Iraqi Arabic Conversational Telephone Speech, Transcripts was developed by Appen Pty Ltd, Sydney, Australia and contains transcripts for roughly 3000 mins of speech from Iraqi Arabic speakers taking part in spontaneous telephone conversations in Colloquial I... |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech, Transcripts
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech, Transcripts
Optical Disc Image - 5.2 MB -
MD5: 65366589f881db2a48db7f807fe942f5
ISO disc image containing all documentation and data |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech, Transcripts
Plain Text - 14.6 KB -
MD5: a6d8ad828d13d6eae10baf4003330713
File manifest |
Oct 13, 2022
Glenn, Meghan; Lee, Haejoong; Strassel, Stephanie; Maeda, Kazuaki, 2022, "GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 1", https://hdl.handle.net/11272.1/AB2/MZSDMN, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 1 was developed by the Linguistic Data Consortium (LDC) and contains transcriptions of approximately 123 hours of Arabic broadcast conversation speech collected in 2006 and 2007 by LDC, MediaNet, Tu... |
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Optical Disc Image - 15.2 MB -
MD5: a414c394ade107b69e05fcc9e67ea417
ISO disc image containing all documentation and data |
Plain Text - 10.2 KB -
MD5: a17d486bc3c336ed7db29fe84e07cdb9
File manifest |
Oct 12, 2022
Alsulaiman, Mansour; Muhammad, Ghulam; Abdelkader, Bencherif Mohamed; Mahmood, Awais; Ali, Zulfiqar, 2022, "King Saud University Arabic Speech Database", https://hdl.handle.net/11272.1/AB2/4YVL4A, Abacus Data Network, V1
Abstract Introduction King Saud University Arabic Speech Database was developed by Speech Group (SG) at King Saud University and contains 590 hours of recorded Arabic speech from 269 male and female speakers. The utterances include read and spontaneous speech. The recordings were... |
Oct 12, 2022 -
King Saud University Arabic Speech Database
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |