Linguistic Data Consortium

451 to 500 of 1,855 Results

LDC2022T07_d3.iso Jan 26, 2023 - CAMIO Transcription Languages Optical Disc Image - 3.0 GB - MD5: 981c7054891ebc1e0ce7668a5fd9548d Data ISO disc image containing all documentation and data: disc 3
LDC2022T07_d1.iso Jan 26, 2023 - CAMIO Transcription Languages Optical Disc Image - 3.4 GB - MD5: 7f9cc20c811d09899076206259b7b6f5 Data ISO disc image containing all documentation and data: disc 1
LDC2022T07_d2_File_Manifest.txt Jan 26, 2023 - CAMIO Transcription Languages Plain Text - 649.6 KB - MD5: 637881930fedbd2e52c05492d6463d07 Documentation File manifest for disc 2
LDC2022T07_d1_File_Manifest.txt Jan 26, 2023 - CAMIO Transcription Languages Plain Text - 977.9 KB - MD5: 5052178f81c7db6d97b1d8617dc26493 Documentation File manifest for disc 1
Working_with_ISO_Images.txt Jan 26, 2023 - CAMIO Transcription Languages Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC2022T07_d3_File_Manifest.txt Jan 26, 2023 - CAMIO Transcription Languages Plain Text - 485.0 KB - MD5: f5c55b619ab2b2abc123df89e206fda0 Documentation File manifest for disc 3
CALLHOME Egyptian Arabic Transcripts Jan 25, 2023 Gadalla, Hassan; Kilany, Hanaa; Arram, Howaida; Yacoub, Ashraf; El-Habashi, Alaa; Shalaby, Amr; Karins, Krisjanis; Rowson, Everett; MacIntyre, Robert; Kingsbury, Paul; Graff, David; McLemore, Cynthia, 2023, "CALLHOME Egyptian Arabic Transcripts", https://hdl.handle.net/11272.1/AB2/Y03PCU, Abacus Data Network, V1 Abstract Introduction The text component of the CALLHOME Egyptian Arabic package includes transcripts and documentation files. The transcripts cover a contiguous five or ten minute segment taken from 120 unscripted telephone conversations between native speakers of Egyptian Collo...
Working_with_ISO_Images.txt Jan 25, 2023 - CALLHOME Egyptian Arabic Transcripts Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC97T19.iso Jan 25, 2023 - CALLHOME Egyptian Arabic Transcripts Optical Disc Image - 4.9 MB - MD5: 19b247ab10d33888309acbc7a81b1cbb Data ISO disc image containing all documentation and data
LDC97T19_File_Manifest.txt Jan 25, 2023 - CALLHOME Egyptian Arabic Transcripts Plain Text - 10.9 KB - MD5: b8bb81e8025e7f8e3bb55a0a96b14339 Documentation File manifest
CALLHOME Egyptian Arabic Speech Jan 25, 2023 Canavan, Alexandra; Zipperlen, George; Graff, David, 2023, "CALLHOME Egyptian Arabic Speech", https://hdl.handle.net/11272.1/AB2/J3CPAE, Abacus Data Network, V1 Abstract Introduction The CALLHOME Egyptian Arabic corpus of telephone speech consists of 120 unscripted telephone conversations between native speakers of Egyptian Colloquial Arabic (ECA), the spoken variety of Arabic found in Egypt. The dialect of ECA that this dictionary repre...
Working_with_ISO_Images.txt Jan 25, 2023 - CALLHOME Egyptian Arabic Speech Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC97S45.iso Jan 25, 2023 - CALLHOME Egyptian Arabic Speech Optical Disc Image - 1.7 GB - MD5: 4165ae8e9a599b693e2b5c2e030ee3c6 Data ISO disc image containing all documentation and data
LDC97S45_File_Manifest.txt Jan 25, 2023 - CALLHOME Egyptian Arabic Speech Plain Text - 5.6 KB - MD5: 88f0d40fe2cf9635919e0a3c67c2e9c4 Documentation File manifest
GALE Phase 2 Arabic Broadcast News Transcripts Part 1 Jan 25, 2023 Glenn, Meghan; Lee, Haejoong; Strassel, Stephanie; Maeda, Kazuaki, 2023, "GALE Phase 2 Arabic Broadcast News Transcripts Part 1", https://hdl.handle.net/11272.1/AB2/YPCAIR, Abacus Data Network, V1 Abstract Introduction GALE Phase 2 Arabic Broadcast News Transcripts Part 1 was developed by the Linguistic Data Consortium (LDC) and contains transcriptions of approximately 165 hours of Arabic broadcast news speech collected in 2006 and 2007 by LDC, MediaNet, Tunis, Tunisia and...
Working_with_ISO_Images.txt Jan 25, 2023 - GALE Phase 2 Arabic Broadcast News Transcripts Part 1 Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC2014T17.iso Jan 25, 2023 - GALE Phase 2 Arabic Broadcast News Transcripts Part 1 Optical Disc Image - 17.9 MB - MD5: e10919985266b9b9bd78e844b62f4685 Data ISO disc image containing all documentation and data
LDC2014T17_File_Manifest.txt Jan 25, 2023 - GALE Phase 2 Arabic Broadcast News Transcripts Part 1 Plain Text - 15.1 KB - MD5: 6e57e80c05588c7b8ed6c7cb59dc7765 Documentation File manifest
GALE Phase 2 Arabic Broadcast News Speech Part 1 Jan 25, 2023 Walker, Kevin; Caruso, Christopher; Maeda, Kazuaki; DiPersio, Denise; Strassel, Stephanie, 2023, "GALE Phase 2 Arabic Broadcast News Speech Part 1", https://hdl.handle.net/11272.1/AB2/CXPTR7, Abacus Data Network, V1 Abstract Introduction GALE Phase 2 Arabic Broadcast News Speech Part 1 was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 165 hours of Arabic broadcast news speech collected in 2006 and 2007 by LDC, MediaNet, Tunis, Tunisia and MTC, Rabat, Mor...
Working_with_ISO_Images.txt Jan 25, 2023 - GALE Phase 2 Arabic Broadcast News Speech Part 1 Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC2014S07.iso Jan 25, 2023 - GALE Phase 2 Arabic Broadcast News Speech Part 1 Optical Disc Image - 9.4 GB - MD5: cf19678b3196b32d331ffedfe8d032a9 Data ISO disc image containing all documentation and data
LDC2014S07_File_Manifest.txt Jan 25, 2023 - GALE Phase 2 Arabic Broadcast News Speech Part 1 Plain Text - 13.4 KB - MD5: ddcbd23ec76492e400aebc7044031925 Documentation File manifest
GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 2 Jan 25, 2023 Glenn, Meghan; Lee, Haejoong; Strassel, Stephanie; Maeda, Kazuaki, 2023, "GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 2", https://hdl.handle.net/11272.1/AB2/CS2DU6, Abacus Data Network, V1 Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 2 was developed by the Linguistic Data Consortium (LDC) and contains transcriptions of approximately 128 hours of Arabic broadcast conversation speech collected in 2007 by LDC, MediaNet, Tunis, Tuni...
LDC2013T17.iso Jan 25, 2023 - GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 2 Optical Disc Image - 16.3 MB - MD5: eb0f27494445322bf3c819c4de4e7c85 Data ISO disc image containing all documentation and data
LDC2013T17_File_Manifest.txt Jan 25, 2023 - GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 2 Plain Text - 10.7 KB - MD5: e1926ee94d6996562281ca251b534c49 Documentation File manifest
Working_with_ISO_Images.txt Jan 25, 2023 - GALE Phase 2 Arabic Broadcast Conversation Transcripts Part 2 Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
GALE Phase 2 Arabic Broadcast Conversation Speech Part 2 Jan 25, 2023 Walker, Kevin; Caruso, Christopher; Maeda, Kazuaki; DiPersio, Denise; Strassel, Stephanie, 2023, "GALE Phase 2 Arabic Broadcast Conversation Speech Part 2", https://hdl.handle.net/11272.1/AB2/AJ2CAE, Abacus Data Network, V1 Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Speech Part 2 was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 128 hours of Arabic broadcast conversation speech collected in 2007 by LDC, MediaNet, Tunis, Tunisia and MTC, Rab...
Working_with_ISO_Images.txt Jan 25, 2023 - GALE Phase 2 Arabic Broadcast Conversation Speech Part 2 Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC2013S07.iso Jan 25, 2023 - GALE Phase 2 Arabic Broadcast Conversation Speech Part 2 Optical Disc Image - 6.9 GB - MD5: 685440bf91ce20eb623ef4bdd312b069 Data ISO disc image containing all documentation and data
LDC2013S07_File_Manifest.txt Jan 25, 2023 - GALE Phase 2 Arabic Broadcast Conversation Speech Part 2 Plain Text - 9.5 KB - MD5: 5b739a3547066810f06a53c3a537870c Documentation File manifest
Third DIHARD Challenge Evaluation Jan 24, 2023 Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2023, "Third DIHARD Challenge Evaluation", https://hdl.handle.net/11272.1/AB2/VQPCKU, Abacus Data Network, V1 Abstract Introduction Third DIHARD Challenge Evaluation was developed by the Linguistic Data Consortium (LDC) and contains approximately 33 hours of English and Chinese speech data along with corresponding annotations used in support of the Third DIHARD Challenge. The DIHARD Chal...
Working_with_ISO_Images.txt Jan 24, 2023 - Third DIHARD Challenge Evaluation Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC2022S14.iso Jan 24, 2023 - Third DIHARD Challenge Evaluation Optical Disc Image - 1.7 GB - MD5: 97f9090d3ed77d0083cf0784a47a72ce Data ISO disc image containing all documentation and data
LDC2022S14_File_Manifest.txt Jan 24, 2023 - Third DIHARD Challenge Evaluation Plain Text - 49.6 KB - MD5: 6cb764f255018f419b1e920c2c783eca Documentation File manifest
Global TIMIT Thai Jan 24, 2023 Liberman, Mark; Yuan, Jiahong; Cieri, Christopher; Wright, Jonathan, 2023, "Global TIMIT Thai", https://hdl.handle.net/11272.1/AB2/JY8T3N, Abacus Data Network, V1 Abstract Introduction Global TIMIT Thai was developed by the Linguistic Data Consortium and consists of approximately 12 hours of read speech and time-aligned transcripts in Standard Thai. The Global TIMIT project aimed to create a series of corpora in a variety of languages with...
LDC2022S13_File_Manifest.txt Jan 24, 2023 - Global TIMIT Thai Plain Text - 2.3 MB - MD5: d624a00ad258c3b468330a0c0be2597c Documentation File manifest
Working_with_ISO_Images.txt Jan 24, 2023 - Global TIMIT Thai Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC2022S13.iso Jan 24, 2023 - Global TIMIT Thai Optical Disc Image - 1000.5 MB - MD5: a663545c1bf2f5931b1a69a4e81ca87e Data ISO disc image containing all documentation and data
Third DIHARD Challenge Development Dec 8, 2022 Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2022, "Third DIHARD Challenge Development", https://hdl.handle.net/11272.1/AB2/UY5O0X, Abacus Data Network, V1 Abstract Introduction Third DIHARD Challenge Development was developed by the Linguistic Data Consortium (LDC) and contains approximately 34 hours of English and Chinese speech data along with corresponding annotations used in support of the Third DIHARD Challenge. The DIHARD Challen...
Working_with_ISO_Images.txt Dec 8, 2022 - Third DIHARD Challenge Development Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC2022S12.iso Dec 8, 2022 - Third DIHARD Challenge Development Optical Disc Image - 1.8 GB - MD5: d1d6b5bf72286297f4732b488e90c79b Data ISO disc image containing all documentation and data
LDC2022S12_File_Manifest.txt Dec 8, 2022 - Third DIHARD Challenge Development Plain Text - 47.7 KB - MD5: 4146720f0b80f973181d252c38635c30 Documentation File manifest
BOLT English Translation Treebank - Egyptian Arabic SMS/Chat Dec 8, 2022 Bies, Ann; Mott, Justin; Warner, Colin; Kulick, Seth, 2022, "BOLT English Translation Treebank - Egyptian Arabic SMS/Chat", https://hdl.handle.net/11272.1/AB2/SPCYLS, Abacus Data Network, V1 Abstract Introduction BOLT English Translation Treebank - Egyptian Arabic SMS/Chat was developed by the Linguistic Data Consortium (LDC) and consists of SMS and chat text data translated from Egyptian Arabic to English and annotated for part-of-speech and syntactic structure. The...
LDC2022T06.iso Dec 8, 2022 - BOLT English Translation Treebank - Egyptian Arabic SMS/Chat Optical Disc Image - 52.7 MB - MD5: 63848a47ddeb6b84b5a2052a5d9d5393 Data ISO disc image containing all documentation and data
LDC2022T06_File_Manifest.txt Dec 8, 2022 - BOLT English Translation Treebank - Egyptian Arabic SMS/Chat Plain Text - 144.2 KB - MD5: 8ce13bc0db258f5a51ef13ed54bca7f8 Documentation File manifest
Working_with_ISO_Images.txt Dec 8, 2022 - BOLT English Translation Treebank - Egyptian Arabic SMS/Chat Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
Hispanic-English Database Nov 30, 2022 Byrne, William; Knodt, Eva; Bernstein, Jared; Emami, Farzhad, 2022, "Hispanic-English Database", https://hdl.handle.net/11272.1/AB2/IIJZCH, Abacus Data Network, V1 Abstract Introduction Hispanic-English Database contains approximately 30 hours of English and Spanish conversational and read speech with transcripts (24 hours) and metadata collected from 22 non-native English speakers between 1996 and 1998. The corpus was developed by Entropic...
Working_with_ISO_Images.txt Nov 30, 2022 - Hispanic-English Database Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea Documentation Working with ISO disc images
LDC2014S05.iso Nov 30, 2022 - Hispanic-English Database Optical Disc Image - 2.9 GB - MD5: 6579259e47fa887ceab07ab59c736534 Data ISO disc image containing all documentation and data
LDC0214S05_File_Manifest.txt Nov 30, 2022 - Hispanic-English Database Plain Text - 176.6 KB - MD5: ac03ea2e19f3524202a077bd24727b19 Documentation File manifest
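
Each file entry above carries an MD5 checksum, which can be used to confirm that a downloaded copy is intact. The following is a minimal sketch of such a check, not a tool provided by the repository: the function names `md5_of_file` and `matches_listing` are hypothetical, and it assumes the file has already been downloaded to a local path.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use flat even for multi-gigabyte ISO images.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected_md5):
    """Compare a local file's MD5 with the checksum shown in the listing.

    The expected value is normalized (whitespace stripped, lowercased) so a
    checksum pasted from the page compares cleanly.
    """
    return md5_of_file(path) == expected_md5.strip().lower()
```

For instance, assuming a local copy saved as `Working_with_ISO_Images.txt`, calling `matches_listing("Working_with_ISO_Images.txt", "4d4231d07ac669e105f71e602457efea")` should return `True` if the download matches the listed checksum. MD5 is adequate for detecting accidental corruption in transfer; it is not a defense against deliberate tampering.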
