Nov 30, 2022
Greenberg, Craig; Sadjadi, Omid; Reynolds, Douglas; Singer, Elliot; Graff, David, 2022, "2017 NIST Language Recognition Evaluation Training and Development Sets", https://hdl.handle.net/11272.1/AB2/K7LOKJ, Abacus Data Network, V1
Abstract Introduction 2017 NIST Language Recognition Evaluation Training and Development Sets contains training and development material for the 2017 NIST Language Recognition Evaluation. It consists of approximately 2,100 hours of conversational telephone speech, broadcast conve... |
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Adobe PDF - 31.6 KB -
MD5: 2c59ff6b57152c7861b50daebf2aef07
Instructions on how to access LDC data via UBC's Teamshare service |
Plain Text - 1003.2 KB -
MD5: 73b9ff71647df18cb1aed150d169823f
File manifest |
Nov 29, 2022
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2022, "LORELEI Bengali Representative Language Pack", https://hdl.handle.net/11272.1/AB2/IG4DBS, Abacus Data Network, V1
Abstract Introduction LORELEI Bengali Representative Language Pack consists of Bengali monolingual text, Bengali-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LO... |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Optical Disc Image - 822.3 MB -
MD5: bd46a7b80e6c846d953b46e50aa87af8
ISO disc image containing all documentation and data - disc 2 |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Optical Disc Image - 3.7 GB -
MD5: 00eabaf0eb9d6c77aa4194ce099d2712
ISO disc image containing all documentation and data - disc 1 |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Plain Text - 268.2 KB -
MD5: 920ca1530ff22cb9d3dd1cfcb8a53973
File manifest for disc 1 |
Nov 29, 2022 -
LORELEI Bengali Representative Language Pack
Plain Text - 2.2 MB -
MD5: 194fe633c82e9626d4aa34315dd34f5d
File manifest for disc 2 |
Nov 29, 2022
Lau, Mingfei; Zhong, Muhan; Lau, Chaak-ming; Su, Jian; Chan, Henry; Cheung, Bing, 2022, "Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon", https://hdl.handle.net/11272.1/AB2/URBMXM, Abacus Data Network, V1
Abstract Introduction Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon was developed by the Cantonese Computational Linguistics Infrastructure Working Group. It contains approximately 130,000 Cantonese character, word, and phrase entries paired with their corresponding rom... |
Nov 29, 2022 -
Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon
Optical Disc Image - 3.9 MB -
MD5: 5ab9b5e4c14a5ef90ae493a3adbdb6da
ISO disc image containing all documentation and data |
Nov 29, 2022 -
Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon
Plain Text - 281 B -
MD5: 086b2d5d70e2d91a3940da8aea1ef1e9
File manifest |
Nov 29, 2022 -
Rime-Cantonese: A Normalized Cantonese Jyutping Lexicon
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Gulf Arabic Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/SCSMSJ, Abacus Data Network, V1
Abstract Introduction Gulf Arabic Conversational Telephone Speech is a database developed by Appen Pty Ltd., Sydney, Australia and contains roughly 2,800 min of spontaneous telephone conversations in Colloquial Gulf Arabic. This corpus was collected and transcribed in 2004 by App... |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech
Optical Disc Image - 2.7 GB -
MD5: 74a5b1d30b5a37117abbc3141c87b996
ISO disc image containing all documentation and data |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech
Plain Text - 21.5 KB -
MD5: dd299ec6cd761783ecf00392ff376798
File manifest |
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Iraqi Arabic Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/YBQF3Y, Abacus Data Network, V1
Abstract Introduction Iraqi Arabic Conversational Telephone Speech was developed by Appen Pty Ltd, Sydney, Australia and contains roughly 3000 mins of speech from Iraqi Arabic speakers taking part in spontaneous telephone conversations in Colloquial Iraqi Arabic. This corpus was... |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech
Optical Disc Image - 1.4 GB -
MD5: 2910699cad8323e76ec4dab61e0a9dc2
ISO disc image containing all documentation and data |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech
Plain Text - 15.7 KB -
MD5: 143566b5f737bd54156225939f7804c4
File manifest |
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Gulf Arabic Conversational Telephone Speech, Transcripts", https://hdl.handle.net/11272.1/AB2/ZLBR2M, Abacus Data Network, V1
Abstract Introduction Gulf Arabic Conversational Telephone Speech, Transcripts is a database developed by Appen Pty Ltd., Sydney, Australia and contains transcripts of roughly 2,800 min of spontaneous telephone conversations in Colloquial Gulf Arabic. A total of 976 conversation... |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech, Transcripts
Optical Disc Image - 11.6 MB -
MD5: 8df326a775c9b9f020728893fd83d980
ISO disc image containing all documentation and data |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech, Transcripts
Plain Text - 24.5 KB -
MD5: adbe699a71b244abde4429b990bbbd48
File manifest |
Oct 13, 2022 -
Gulf Arabic Conversational Telephone Speech, Transcripts
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 13, 2022
Appen Pty Ltd. Sydney, Australia, 2022, "Iraqi Arabic Conversational Telephone Speech, Transcripts", https://hdl.handle.net/11272.1/AB2/ELQDGO, Abacus Data Network, V1
Abstract Introduction Iraqi Arabic Conversational Telephone Speech, Transcripts was developed by Appen Pty Ltd, Sydney, Australia and contains transcripts for roughly 3000 mins of speech from Iraqi Arabic speakers taking part in spontaneous telephone conversations in Colloquial I... |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech, Transcripts
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech, Transcripts
Optical Disc Image - 5.2 MB -
MD5: 65366589f881db2a48db7f807fe942f5
ISO disc image containing all documentation and data |
Oct 13, 2022 -
Iraqi Arabic Conversational Telephone Speech, Transcripts
Plain Text - 14.6 KB -
MD5: a6d8ad828d13d6eae10baf4003330713
File manifest |
Oct 13, 2022
Glenn, Meghan; Lee, Haejoong; Strassel, Stephanie; Maeda, Kazuaki, 2022, "GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 1", https://hdl.handle.net/11272.1/AB2/MZSDMN, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 1 was developed by the Linguistic Data Consortium (LDC) and contains transcriptions of approximately 123 hours of Arabic broadcast conversation speech collected in 2006 and 2007 by LDC, MediaNet, Tu... |
Optical Disc Image - 15.2 MB -
MD5: a414c394ade107b69e05fcc9e67ea417
ISO disc image containing all documentation and data |
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Plain Text - 10.2 KB -
MD5: a17d486bc3c336ed7db29fe84e07cdb9
File manifest |
Oct 12, 2022
Alsulaiman, Mansour; Muhammad, Ghulam; Abdelkader, Bencherif Mohamed; Mahmood, Awais; Ali, Zulfiqar, 2022, "King Saud University Arabic Speech Database", https://hdl.handle.net/11272.1/AB2/4YVL4A, Abacus Data Network, V1
Abstract Introduction King Saud University Arabic Speech Database was developed by Speech Group (SG) at King Saud University and contains 590 hours of recorded Arabic speech from 269 male and female speakers. The utterances include read and spontaneous speech. The recordings were... |
Oct 12, 2022 -
King Saud University Arabic Speech Database
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Oct 12, 2022 -
King Saud University Arabic Speech Database
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Oct 12, 2022 -
King Saud University Arabic Speech Database
Plain Text - 8.8 MB -
MD5: 2bfd5cbae2879cafada79a4890653fea
File manifest |
Oct 12, 2022
Walker, Kevin; Caruso, Christopher; Maeda, Kazuaki; DiPersio, Denise; Strassel, Stephanie, 2022, "GALE Phase 2 Arabic Broadcast Conversation Speech Part 1", https://hdl.handle.net/11272.1/AB2/GGD0CB, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Speech Part 1 was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 123 hours of Arabic broadcast conversation speech collected in 2006 and 2007 by LDC as part of the DARPA GALE (Gl... |
Oct 12, 2022 -
GALE Phase 2 Arabic Broadcast Conversation Speech Part 1
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 12, 2022 -
GALE Phase 2 Arabic Broadcast Conversation Speech Part 1
Optical Disc Image - 6.7 GB -
MD5: 48c0677831c51bb5c437170eb2ba2265
ISO disc image containing all documentation and data |
Oct 12, 2022 -
GALE Phase 2 Arabic Broadcast Conversation Speech Part 1
Plain Text - 7.8 KB -
MD5: 0b47ed1cb6a6881a291bcd1ed7ed64c4
File manifest |
Oct 12, 2022
Cieri, Christopher; Zhan, Juhong; Jiang, Yue; Liberman, Mark; Yuan, Jiahong; Chen, Yiya; Scharenborg, Odette, 2022, "Xi'an Guanzhong Object Naming", https://hdl.handle.net/11272.1/AB2/D2DBLV, Abacus Data Network, V1
Abstract Introduction Xi'an Guanzhong Object Naming is comprised of approximately 15 hours of audio recordings from speakers of the Guanzhong dialect of Mandarin Chinese living in or near Xi'an in Shaanxi Province (China) naming objects that appeared in colored line drawings. Th... |
Oct 12, 2022 -
Xi'an Guanzhong Object Naming
Optical Disc Image - 799.8 MB -
MD5: dea90d62fe4089357b226db72fb6ced4
ISO disc image containing all documentation and data |
Oct 12, 2022 -
Xi'an Guanzhong Object Naming
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Oct 12, 2022 -
Xi'an Guanzhong Object Naming
Plain Text - 1.8 MB -
MD5: 8d091c2b3f692f322fd28fd1dd620b0f
File manifest |
Sep 20, 2022
Li, Xuansong; Strassel, Stephanie; Jones, Karen; Antonishek, Brian; Fiscus, Jonathan G., 2022, "HAVIC MED Novel 2 Test -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/GNUQ1A, Abacus Data Network, V1
Abstract Introduction HAVIC MED Novel 2 Test -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 6,200 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and rel... |
Sep 20, 2022 -
HAVIC MED Novel 2 Test -- Videos, Metadata and Annotation
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Sep 20, 2022 -
HAVIC MED Novel 2 Test -- Videos, Metadata and Annotation
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 9, 2022
Carvalho, Vitor R.; Kiran, Yigit; Borthwick, Andrew, 2022, "American English Nickname Collection", https://hdl.handle.net/11272.1/AB2/JR1WG6, Abacus Data Network, V1
Abstract Introduction American English Nickname Collection was developed by Intelius, Inc. and is a compilation of American English nicknames to given name mappings based on information in US government records, public web profiles and financial and property reports. This corpus... |