151 to 200 of 1,819 Results
May 13, 2024 -
LoReHLT Hausa Representative Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
May 13, 2024 -
LoReHLT Hausa Representative Language Pack
Optical Disc Image - 431.2 MB -
MD5: 3650d33f85fdb65527af9f4a0a3b60ca
ISO disc image containing all documentation and data |
May 13, 2024 -
LoReHLT Hausa Representative Language Pack
Plain Text - 1.1 MB -
MD5: db0a44d7773f2ac8a184b27c3fd39f49
File manifest |
May 13, 2024
Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher, 2024, "AIDA Scenario 2 Practice Topic Source Data", https://hdl.handle.net/11272.1/AB2/TXAWUL, Abacus Data Network, V1
Abstract Introduction AIDA Scenario 2 Practice Topic Source Data was developed by the Linguistic Data Consortium (LDC) and is comprised of 1500 root documents, including text, image, and video, from English, Russian, and Spanish web sources. The DARPA AIDA (Active Interpretation... |
May 13, 2024 -
AIDA Scenario 2 Practice Topic Source Data
Optical Disc Image - 3.5 GB -
MD5: 8bc1532ee5be0a300404dcd5bd69e5ca
ISO disc image containing all documentation and data: disc 1 |
May 13, 2024 -
AIDA Scenario 2 Practice Topic Source Data
Plain Text - 3.3 KB -
MD5: 335f5fb8014b991ab0d5a3b336d01368
File manifest for disc 1 |
May 13, 2024 -
AIDA Scenario 2 Practice Topic Source Data
Optical Disc Image - 3.6 GB -
MD5: 4e0720e6e03c9cfdf0fe69ff2c64e2b8
ISO disc image containing all documentation and data: disc 2 |
May 13, 2024 -
AIDA Scenario 2 Practice Topic Source Data
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
May 13, 2024 -
AIDA Scenario 2 Practice Topic Source Data
Plain Text - 202 B -
MD5: 4fd8be00f9c64cfccdb5dce748d59248
File manifest for disc 2 |
May 13, 2024
Walker, Kevin; Graff, David; Ma, Xiaoyi; Strassel, Stephanie; Jones, Karen, 2024, "RATS Low Speech Density", https://hdl.handle.net/11272.1/AB2/CXVUXZ, Abacus Data Network, V1
Abstract Introduction RATS Low Speech Density was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 87 hours of English, Levantine Arabic, Farsi, Pashto and Urdu speech and non-speech samples. The recordings were assembled by concatenating a rand... |
May 13, 2024 -
RATS Low Speech Density
Plain Text - 3.1 KB -
MD5: 891064c78a8e46a2f9922b793aafa160
Instructions on how to access LDC data via UBC's Teamshare service (Markdown / ASCII text) |
May 13, 2024 -
RATS Low Speech Density
Adobe PDF - 31.2 KB -
MD5: 2a043207829f9ab259df770590941165
Instructions on how to access LDC data via UBC's Teamshare service |
May 13, 2024 -
RATS Low Speech Density
Plain Text - 1.5 MB -
MD5: c7ca3b492d190a4b84794c5fba5b7397
File manifest |
Mar 28, 2024
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2024, "LORELEI Farsi Representative Language Pack", https://hdl.handle.net/11272.1/AB2/UMEVGY, Abacus Data Network, V1
Abstract Introduction LORELEI Farsi Representative Language Pack consists of Farsi monolingual text, Farsi-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LORELEI... |
Mar 28, 2024 -
LORELEI Farsi Representative Language Pack
Optical Disc Image - 1.4 GB -
MD5: 394e63852f73db2b526ff061caea6b95
ISO disc image containing all documentation and data: disc 1 |
Mar 28, 2024 -
LORELEI Farsi Representative Language Pack
Optical Disc Image - 3.8 GB -
MD5: 9d0e70b9884794395863b5d76b8450c7
ISO disc image containing all documentation and data: disc 2 |
Mar 28, 2024 -
LORELEI Farsi Representative Language Pack
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Mar 28, 2024 -
LORELEI Farsi Representative Language Pack
Plain Text - 1.3 MB -
MD5: 9fc12758e227789ec9a2fa73df97ace0
File manifest for disc 1 |
Mar 28, 2024 -
LORELEI Farsi Representative Language Pack
Plain Text - 7.9 KB -
MD5: df4c2df9c2f3e89def948c9a11ef8d64
File manifest for disc 2 |
Mar 28, 2024
Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher, 2024, "AIDA Scenario 1 Practice Topic Annotation", https://hdl.handle.net/11272.1/AB2/XPPJWR, Abacus Data Network, V1
Abstract Introduction AIDA Scenario 1 Practice Topic Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of annotations for 212 English, Russian and Ukrainian web documents (text, image and video) from AIDA Scenario 1 Practice Topic Source Data (LDC2... |
Mar 28, 2024 -
AIDA Scenario 1 Practice Topic Annotation
Optical Disc Image - 15.8 MB -
MD5: b02f4b9710f884286ab7371badb30473
ISO disc image containing all documentation and data |
Mar 28, 2024 -
AIDA Scenario 1 Practice Topic Annotation
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Mar 28, 2024 -
AIDA Scenario 1 Practice Topic Annotation
Plain Text - 1.7 KB -
MD5: cfa2d8d59634669de02a7245fb035926
File manifest |
Mar 28, 2024
Delgado, Dana; Walker, Kevin; Strassel, Stephanie; Graff, David; Caruso, Christopher, 2024, "KASET - Kurmanji and Sorani Kurdish Speech and Transcripts", https://hdl.handle.net/11272.1/AB2/ODAGYC, Abacus Data Network, V1
Abstract Introduction KASET - Kurmanji and Sorani Kurdish Speech and Transcripts was developed by the Linguistic Data Consortium (LDC) and consists of approximately 147 hours of telephone conversations (289 recordings) and broadcast news (410 recordings) in two Kurdish dialects:... |
Mar 28, 2024 -
KASET - Kurmanji and Sorani Kurdish Speech and Transcripts
Optical Disc Image - 3.9 GB -
MD5: 8dbcb695af5c099e0e8be49770353f6b
ISO disc image containing all documentation and data: disc 1 |
Mar 28, 2024 -
KASET - Kurmanji and Sorani Kurdish Speech and Transcripts
Optical Disc Image - 3.9 GB -
MD5: 20c538456cb5d6ebca5df2a9ff6d1a46
ISO disc image containing all documentation and data: disc 2 |
Mar 28, 2024 -
KASET - Kurmanji and Sorani Kurdish Speech and Transcripts
Plain Text - 83.7 KB -
MD5: 63387280fb584ace7dc0db273497efc6
File manifest for disc 1 |
Mar 28, 2024 -
KASET - Kurmanji and Sorani Kurdish Speech and Transcripts
Plain Text - 28.8 KB -
MD5: bc0d4014d15f924fe5eea1832ec497c7
File manifest for disc 2 |
Mar 28, 2024 -
KASET - Kurmanji and Sorani Kurdish Speech and Transcripts
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Jan 11, 2024
Tracey, Jennifer; Strassel, Stephanie; Arrigo, Michael, 2024, "TAC KBP Belief and Sentiment - Comprehensive Training and Evaluation Data 2016-2017", https://hdl.handle.net/11272.1/AB2/OM2WHS, Abacus Data Network, V1
Abstract Introduction TAC KBP Belief and Sentiment - Comprehensive Training and Evaluation Data 2016-2017 (LDC2023T13) was developed by the Linguistic Data Consortium and contains training and evaluation data produced in support of the 2016 and 2017 TAC KBP Belief and Sentiment (... |
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Optical Disc Image - 179.1 MB -
MD5: 45c92370531e0e9a28b4788fc04b3372
ISO disc image containing all documentation and data |
Plain Text - 377.7 KB -
MD5: 7abf969f3d826ebb2b644db030574440
File manifest |
Jan 11, 2024
Belhadj, Mourad; Bendellali, Ilham; Lakhdari, Elalia, 2024, "Kasdi-Merbah (University) Emotional Database in Arabic Speech", https://hdl.handle.net/11272.1/AB2/Y4LDPA, Abacus Data Network, V1
Abstract Introduction Kasdi-Merbah Emotional Database in Arabic Speech was developed by the University of Kasdi Merbah Ouargla. The corpus contains two hours of Modern Standard Arabic prompted speech from 500 speakers (254 female, 246 male) representing 5,000 utterances. Data Spe... |
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Optical Disc Image - 276.9 MB -
MD5: ecab757c314945dcaef8b5980e5493fa
ISO disc image containing all documentation and data |
Plain Text - 259.9 KB -
MD5: ad5c1e60429ad3a3919ac5ff93bb955c
File manifest |
Dec 5, 2023
Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher, 2023, "AIDA Scenario 1 and 2 Reference Knowledge Base", https://hdl.handle.net/11272.1/AB2/YTF9AB, Abacus Data Network, V1
Abstract Introduction AIDA Scenario 1 and 2 Reference Knowledge Base was developed by the Linguistic Data Consortium (LDC) and contains the English knowledge base (KB) used for all AIDA entity linking annotation in Scenario 1 (Russia-Ukraine Relations) and Scenario 2 (Crisis in V... |
Dec 5, 2023 -
AIDA Scenario 1 and 2 Reference Knowledge Base
Optical Disc Image - 2.9 GB -
MD5: 2b0f3788f02fb0519e32d4a40c02bbdf
ISO disc image containing all documentation and data |
Dec 5, 2023 -
AIDA Scenario 1 and 2 Reference Knowledge Base
Plain Text - 1.3 KB -
MD5: 4d4231d07ac669e105f71e602457efea
Working with ISO disc images |
Dec 5, 2023 -
AIDA Scenario 1 and 2 Reference Knowledge Base
Plain Text - 386 B -
MD5: 92afa9a294b40b308bc79f2193e070e1
File manifest |
Dec 5, 2023
Graff, David; Jones, Karen; Strassel, Stephanie; Walker, Kevin, 2023, "REMIX Telephone Collection", https://hdl.handle.net/11272.1/AB2/VJPGYX, Abacus Data Network, V1
Abstract Introduction REMIX Telephone Collection was developed by the Linguistic Data Consortium (LDC) and contains 320 hours of English conversational telephone speech from 358 speakers who had completed all tasks in one of the previous LDC Mixer collections, specifically, Mixer... |
Dec 5, 2023 -
REMIX Telephone Collection
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Dec 5, 2023 -
REMIX Telephone Collection
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service |
Dec 5, 2023 -
REMIX Telephone Collection
Plain Text - 87.4 KB -
MD5: a1771008191a169b89367cc448c04bc7
File manifest |
Dec 5, 2023
Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher, 2023, "AIDA Scenario 1 Practice Topic Source Data", https://hdl.handle.net/11272.1/AB2/M4QWGV, Abacus Data Network, V1
Abstract Introduction AIDA Scenario 1 Practice Topic Source Data was developed by the Linguistic Data Consortium (LDC) and is comprised of 1511 documents (text, image, and video) from English, Russian, and Ukrainian web sources. The DARPA AIDA (Active Interpretation of Disparate... |
Dec 5, 2023 -
AIDA Scenario 1 Practice Topic Source Data
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Dec 5, 2023 -
AIDA Scenario 1 Practice Topic Source Data
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service |
Dec 5, 2023 -
AIDA Scenario 1 Practice Topic Source Data
Plain Text - 5.8 KB -
MD5: e47ef6568323a9007b0aaf70802df97a
File manifest |
Oct 17, 2023
Miller, David; Walker, Kevin; Graff, David; Canavan, Alexandra, 2023, "CALLFRIEND Russian Text", https://hdl.handle.net/11272.1/AB2/BNFFSZ, Abacus Data Network, V1
Abstract Introduction CALLFRIEND Russian Text (LDC2023T09) was developed by the Linguistic Data Consortium and consists of transcripts for approximately 48 hours of telephone conversations (100 recordings) between native Russian speakers. The calls were recorded in 1999 as part o... |