401 to 450 of 1,819 Results
Plain Text - 7.9 KB -
MD5: 8d5bbb9fb2a182ba4fa4f49084c8d739
File manifest for disc 5 |
Mar 17, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Tagalog Representative Language Pack", https://hdl.handle.net/11272.1/AB2/IALRRN, Abacus Data Network, V1
Abstract Introduction LORELEI Tagalog Representative Language Pack consists of Tagalog monolingual text, Tagalog-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LO... |
Mar 17, 2023 -
LORELEI Tagalog Representative Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Mar 17, 2023 -
LORELEI Tagalog Representative Language Pack
Optical Disc Image - 423.2 MB -
MD5: 28506483cf236683fa465901edd2a4ab
ISO disc image containing all documentation and data |
Mar 17, 2023 -
LORELEI Tagalog Representative Language Pack
Plain Text - 1.0 MB -
MD5: afd5f51c5372625ac712e02be0c17b4f
File manifest |
Mar 17, 2023
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Swahili Representative Language Pack", https://hdl.handle.net/11272.1/AB2/RPNXXU, Abacus Data Network, V1
Abstract Introduction LORELEI Swahili Representative Language Pack consists of Swahili monolingual text, Swahili-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LO... |
Mar 17, 2023 -
LORELEI Swahili Representative Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Mar 17, 2023 -
LORELEI Swahili Representative Language Pack
Optical Disc Image - 467.1 MB -
MD5: 07a901ab7074796e16e742725e01fd89
ISO disc image containing all documentation and data |
Mar 17, 2023 -
LORELEI Swahili Representative Language Pack
Plain Text - 1.1 MB -
MD5: 7393a9caae5552abc6b1254e35b5f598
File manifest |
Feb 14, 2023
Chay, Kevin; Elizalde, Cecilia; Ziemski, Michal, 2023, "United Nations Proceedings Speech", https://hdl.handle.net/11272.1/AB2/3LTQ01, Abacus Data Network, V1
Abstract Introduction United Nations Proceedings Speech was developed by the United Nations (UN) and contains approximately 8,500 hours of recorded proceedings in the six official UN languages, Arabic, Chinese, English, French, Russian and Spanish. The data was recorded in 2009-2... |
Feb 14, 2023 -
United Nations Proceedings Speech
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Feb 14, 2023 -
United Nations Proceedings Speech
Plain Text - 2.8 MB -
MD5: 28f8d1318d7a797659c0c005f754dbe4
File manifest |
Jan 26, 2023
Arrigo, Michael; Strassel, Stephanie; Caruso, Christopher, 2023, "CAMIO Transcription Languages", https://hdl.handle.net/11272.1/AB2/IEJLCN, Abacus Data Network, V1
Abstract Introduction CAMIO Transcription Languages was developed by the Linguistic Data Consortium and contains nearly 70,000 images of machine printed text with corresponding annotations and transcripts in the following 13 languages: Arabic, Chinese, English, Farsi, Hindi, Japa... |
Jan 26, 2023 -
CAMIO Transcription Languages
Optical Disc Image - 3.1 GB -
MD5: eecf370251324a271b774ab8a7312675
ISO disc image containing all documentation and data: disc 2 |
Jan 26, 2023 -
CAMIO Transcription Languages
Optical Disc Image - 3.0 GB -
MD5: 981c7054891ebc1e0ce7668a5fd9548d
ISO disc image containing all documentation and data: disc 3 |
Jan 26, 2023 -
CAMIO Transcription Languages
Optical Disc Image - 3.4 GB -
MD5: 7f9cc20c811d09899076206259b7b6f5
ISO disc image containing all documentation and data: disc 1 |
Jan 26, 2023 -
CAMIO Transcription Languages
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 26, 2023 -
CAMIO Transcription Languages
Plain Text - 649.6 KB -
MD5: 637881930fedbd2e52c05492d6463d07
File manifest for disc 2 |
Jan 26, 2023 -
CAMIO Transcription Languages
Plain Text - 977.9 KB -
MD5: 5052178f81c7db6d97b1d8617dc26493
File manifest for disc 1 |
Jan 26, 2023 -
CAMIO Transcription Languages
Plain Text - 485.0 KB -
MD5: f5c55b619ab2b2abc123df89e206fda0
File manifest for disc 3 |
Jan 25, 2023
Gadalla, Hassan; Kilany, Hanaa; Arram, Howaida; Yacoub, Ashraf; El-Habashi, Alaa; Shalaby, Amr; Karins, Krisjanis; Rowson, Everett; MacIntyre, Robert; Kingsbury, Paul; Graff, David; McLemore, Cynthia, 2023, "CALLHOME Egyptian Arabic Transcripts", https://hdl.handle.net/11272.1/AB2/Y03PCU, Abacus Data Network, V1
Abstract Introduction The text component of the CALLHOME Egyptian Arabic package includes transcripts and documentation files. The transcripts cover a contiguous five or ten minute segment taken from 120 unscripted telephone conversations between native speakers of Egyptian Collo... |
Jan 25, 2023 -
CALLHOME Egyptian Arabic Transcripts
Optical Disc Image - 4.9 MB -
MD5: 19b247ab10d33888309acbc7a81b1cbb
ISO disc image containing all documentation and data |
Jan 25, 2023 -
CALLHOME Egyptian Arabic Transcripts
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 25, 2023 -
CALLHOME Egyptian Arabic Transcripts
Plain Text - 10.9 KB -
MD5: b8bb81e8025e7f8e3bb55a0a96b14339
File manifest |
Jan 25, 2023
Canavan, Alexandra; Zipperlen, George; Graff, David, 2023, "CALLHOME Egyptian Arabic Speech", https://hdl.handle.net/11272.1/AB2/J3CPAE, Abacus Data Network, V1
Abstract Introduction The CALLHOME Egyptian Arabic corpus of telephone speech consists of 120 unscripted telephone conversations between native speakers of Egyptian Colloquial Arabic (ECA), the spoken variety of Arabic found in Egypt. The dialect of ECA that this dictionary repre... |
Jan 25, 2023 -
CALLHOME Egyptian Arabic Speech
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 25, 2023 -
CALLHOME Egyptian Arabic Speech
Optical Disc Image - 1.7 GB -
MD5: 4165ae8e9a599b693e2b5c2e030ee3c6
ISO disc image containing all documentation and data |
Jan 25, 2023 -
CALLHOME Egyptian Arabic Speech
Plain Text - 5.6 KB -
MD5: 88f0d40fe2cf9635919e0a3c67c2e9c4
File manifest |
Jan 25, 2023
Glenn, Meghan; Lee, Haejoong; Strassel, Stephanie; Maeda, Kazuaki, 2023, "GALE Phase 2 Arabic Broadcast News Transcripts Part 1", https://hdl.handle.net/11272.1/AB2/YPCAIR, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast News Transcripts Part 1 was developed by the Linguistic Data Consortium (LDC) and contains transcriptions of approximately 165 hours of Arabic broadcast news speech collected in 2006 and 2007 by LDC, MediaNet, Tunis, Tunisia and... |
Jan 25, 2023 -
GALE Phase 2 Arabic Broadcast News Transcripts Part 1
Optical Disc Image - 17.9 MB -
MD5: e10919985266b9b9bd78e844b62f4685
ISO disc image containing all documentation and data |
Jan 25, 2023 -
GALE Phase 2 Arabic Broadcast News Transcripts Part 1
Plain Text - 15.1 KB -
MD5: 6e57e80c05588c7b8ed6c7cb59dc7765
File manifest |
Jan 25, 2023 -
GALE Phase 2 Arabic Broadcast News Transcripts Part 1
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 25, 2023
Walker, Kevin; Caruso, Christopher; Maeda, Kazuaki; DiPersio, Denise; Strassel, Stephanie, 2023, "GALE Phase 2 Arabic Broadcast News Speech Part 1", https://hdl.handle.net/11272.1/AB2/CXPTR7, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast News Speech Part 1 was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 165 hours of Arabic broadcast news speech collected in 2006 and 2007 by LDC, MediaNet, Tunis, Tunisia and MTC, Rabat, Mor... |
Jan 25, 2023 -
GALE Phase 2 Arabic Broadcast News Speech Part 1
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 25, 2023 -
GALE Phase 2 Arabic Broadcast News Speech Part 1
Optical Disc Image - 9.4 GB -
MD5: cf19678b3196b32d331ffedfe8d032a9
ISO disc image containing all documentation and data |
Jan 25, 2023 -
GALE Phase 2 Arabic Broadcast News Speech Part 1
Plain Text - 13.4 KB -
MD5: ddcbd23ec76492e400aebc7044031925
File manifest |
Jan 25, 2023
Glenn, Meghan; Lee, Haejoong; Strassel, Stephanie; Maeda, Kazuaki, 2023, "GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 2", https://hdl.handle.net/11272.1/AB2/CS2DU6, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 2 was developed by the Linguistic Data Consortium (LDC) and contains transcriptions of approximately 128 hours of Arabic broadcast conversation speech collected in 2007 by LDC, MediaNet, Tunis, Tuni... |
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Optical Disc Image - 16.3 MB -
MD5: eb0f27494445322bf3c819c4de4e7c85
ISO disc image containing all documentation and data |
Plain Text - 10.7 KB -
MD5: e1926ee94d6996562281ca251b534c49
File manifest |
Jan 25, 2023
Walker, Kevin; Caruso, Christopher; Maeda, Kazuaki; DiPersio, Denise; Strassel, Stephanie, 2023, "GALE Phase 2 Arabic Broadcast Conversation Speech Part 2", https://hdl.handle.net/11272.1/AB2/AJ2CAE, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Speech Part 2 was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 128 hours of Arabic broadcast conversation speech collected in 2007 by LDC, MediaNet, Tunis, Tunisia and MTC, Rab... |
Jan 25, 2023 -
GALE Phase 2 Arabic Broadcast Conversation Speech Part 2
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 25, 2023 -
GALE Phase 2 Arabic Broadcast Conversation Speech Part 2
Optical Disc Image - 6.9 GB -
MD5: 685440bf91ce20eb623ef4bdd312b069
ISO disc image containing all documentation and data |
Jan 25, 2023 -
GALE Phase 2 Arabic Broadcast Conversation Speech Part 2
Plain Text - 9.5 KB -
MD5: 5b739a3547066810f06a53c3a537870c
File manifest |
Jan 24, 2023
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2023, "Third DIHARD Challenge Evaluation", https://hdl.handle.net/11272.1/AB2/VQPCKU, Abacus Data Network, V1
Abstract Introduction Third DIHARD Challenge Evaluation was developed by the Linguistic Data Consortium (LDC) and contains approximately 33 hours of English and Chinese speech data along with corresponding annotations used in support of the Third DIHARD Challenge. The DIHARD Chal... |
Jan 24, 2023 -
Third DIHARD Challenge Evaluation
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disk images |
Jan 24, 2023 -
Third DIHARD Challenge Evaluation
Optical Disc Image - 1.7 GB -
MD5: 97f9090d3ed77d0083cf0784a47a72ce
ISO disc image containing all documentation and data |
Jan 24, 2023 -
Third DIHARD Challenge Evaluation
Plain Text - 49.6 KB -
MD5: 6cb764f255018f419b1e920c2c783eca
File manifest |
Jan 24, 2023
Liberman, Mark; Yuan, Jiahong; Cieri, Christopher; Wright, Jonathan, 2023, "Global TIMIT Thai", https://hdl.handle.net/11272.1/AB2/JY8T3N, Abacus Data Network, V1
Abstract Introduction Global TIMIT Thai was developed by the Linguistic Data Consortium and consists of approximately 12 hours of read speech and time-aligned transcripts in Standard Thai. The Global TIMIT project aimed to create a series of corpora in a variety of languages with... |
Jan 24, 2023 -
Global TIMIT Thai
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disk images |