501 to 550 of 1,819 Results
Adobe PDF - 31.2 KB - MD5: 100c549ff1bb48ed76f05d01f6342eb3
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (PDF)
Plain Text - 8.8 MB - MD5: 2bfd5cbae2879cafada79a4890653fea
Documentation
File manifest
Oct 12, 2022
Walker, Kevin; Caruso, Christopher; Maeda, Kazuaki; DiPersio, Denise; Strassel, Stephanie, 2022, "GALE Phase 2 Arabic Broadcast Conversation Speech Part 1", https://hdl.handle.net/11272.1/AB2/GGD0CB, Abacus Data Network, V1
Abstract Introduction GALE Phase 2 Arabic Broadcast Conversation Speech Part 1 was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 123 hours of Arabic broadcast conversation speech collected in 2006 and 2007 by LDC as part of the DARPA GALE (Gl...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 6.7 GB - MD5: 48c0677831c51bb5c437170eb2ba2265
Data
ISO disc image containing all documentation and data
Plain Text - 7.8 KB - MD5: 0b47ed1cb6a6881a291bcd1ed7ed64c4
Documentation
File manifest
Oct 12, 2022
Cieri, Christopher; Zhan, Juhong; Jiang, Yue; Liberman, Mark; Yuan, Jiahong; Chen, Yiya; Scharenborg, Odette, 2022, "Xi'an Guanzhong Object Naming", https://hdl.handle.net/11272.1/AB2/D2DBLV, Abacus Data Network, V1
Abstract Introduction Xi'an Guanzhong Object Naming is comprised of approximately 15 hours of audio recordings from speakers of the Guanzhong dialect of Mandarin Chinese living in or near Xi'an in Shaanxi Province (China) naming objects that appeared in colored line drawings. Th...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 799.8 MB - MD5: dea90d62fe4089357b226db72fb6ced4
Data
ISO disc image containing all documentation and data
Plain Text - 1.8 MB - MD5: 8d091c2b3f692f322fd28fd1dd620b0f
Documentation
File manifest
Sep 20, 2022
Li, Xuansong; Strassel, Stephanie; Jones, Karen; Antonishek, Brian; Fiscus, Jonathan G., 2022, "HAVIC MED Novel 2 Test -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/GNUQ1A, Abacus Data Network, V1
Abstract Introduction HAVIC MED Novel 2 Test -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 6,200 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and rel...
Plain Text - 3.1 KB - MD5: 1b8a8741370964dcfff1eeec66e4b151
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text)
Adobe PDF - 31.2 KB - MD5: 100c549ff1bb48ed76f05d01f6342eb3
Documentation
Instructions on how to access LDC data via UBC's Teamshare service (PDF)
Aug 9, 2022
Carvalho, Vitor R.; Kiran, Yigit; Borthwick, Andrew, 2022, "American English Nickname Collection", https://hdl.handle.net/11272.1/AB2/JR1WG6, Abacus Data Network, V1
Abstract Introduction American English Nickname Collection was developed by Intelius, Inc. and is a compilation of American English nicknames to given name mappings based on information in US government records, public web profiles and financial and property reports. This corpus...
Unknown - 3.5 MB - MD5: 8eb2a06d7e8b21d51d089074d379346e
Data
Zip file containing all documentation and data
Plain Text - 211 B - MD5: 123df9946bba1a955035b76db7daf026
Documentation
File manifest
Aug 9, 2022
Ahmed, Abdelhamid M.; Myhill, Debra; Abdollahzadeh, Esmaeel; McCallum, Lee; Zaghouani, Wajdi; Rezk, Lameya; Jrad, Anissa; Zhang, Xiao, 2022, "Qatari Corpus of Argumentative Writing", https://hdl.handle.net/11272.1/AB2/F2P2EY, Abacus Data Network, V1
Abstract Introduction Qatari Corpus of Argumentative Writing was developed by Qatar University, University of Exeter and Hamad Bin Khalifa University and is comprised of approximately 200,000 tokens of Arabic and English writing by undergraduate students (159 female, 36 male) alo...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 14.4 MB - MD5: 0a141cc2f0ab0e2af68681fcbf62c260
Data
ISO disc image containing all documentation and data
Plain Text - 35.5 KB - MD5: e52651303564b944463ad297c8f4e439
Documentation
File manifest
Jul 7, 2022
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2022, "Second DIHARD Challenge Evaluation - Eleven Sources", https://hdl.handle.net/11272.1/AB2/ML7KD5, Abacus Data Network, V1
Abstract Introduction Second DIHARD Challenge Evaluation - Eleven Sources was developed by the Linguistic Data Consortium (LDC) and contains approximately 20 hours of English and Chinese speech data along with corresponding annotations used in support of the Second DIHARD Challen...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Description
Working with ISO disc images
Optical Disc Image - 1.2 GB - MD5: 4704e3e192e584d8964e8db38d579c19
Data
ISO disc image containing all documentation and data
Plain Text - 25.7 KB - MD5: 618e0098c0915c1552f73fed6db4b191
Description
File manifest
Jul 7, 2022
Lewis, Gwyneth; van Rijn, Pol; Gwilliams, Laura; Larrouy-Maestri, Pauline; Poeppel, David; Ghitza, Oded, 2022, "NUBUC", https://hdl.handle.net/11272.1/AB2/IUFKIG, Abacus Data Network, V1
Abstract Introduction NUBUC (NyU-BU contextually controlled stories Corpus) was developed by New York University, Max Planck Institute for Empirical Aesthetics and Boston University. It contains approximately three hours of English read speech from eight stories focused on lingui...
Optical Disc Image - 319.6 MB - MD5: 5128aac29b6d280f32aa198bb2a6b1c7
Data
ISO disc image containing all documentation and data
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Plain Text - 327.3 KB - MD5: 2d1d796c2cba087129d07536c8613b0b
Documentation
File manifest
Jun 10, 2022
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2022, "LORELEI Wolof Representative Language Pack", https://hdl.handle.net/11272.1/AB2/1M9HI6, Abacus Data Network, V1
Abstract Introduction LORELEI Wolof Representative Language Pack consists of Wolof monolingual text, Wolof-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium for the DARPA LORELEI program. The LORELEI...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working_with_ISO_Images
Optical Disc Image - 124.5 MB - MD5: 92a90f481973b7790458e205f66530cd
Data
ISO disc image containing all documentation and data
Plain Text - 213.4 KB - MD5: f38c50035a54aa7964ab76b6c6bd1529
Documentation
File manifest
Mar 31, 2022
Alsaif, Amal; Alyahya, Tasniem; Alotibi, Madawi; Almuzaini, Huda; Alqahtani, Abeer, 2022, "AttImam", https://hdl.handle.net/11272.1/AB2/9FBCBG, Abacus Data Network, V1
Abstract Introduction AttImam was developed by Al-Imam Mohammad Ibn Saud Islamic University and consists of approximately 2,000 attribution relations applied to Arabic newswire text from Arabic Treebank: Part 1 v 4.1 (LDC2010T13). Attribution refers to the process of reporting or...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Optical Disc Image - 9.1 MB - MD5: ae945c46e92b6f7724621e2932e42d65
Data
ISO disc image containing all documentation and data
Plain Text - 20.9 KB - MD5: a1993dcd8d15129b1501e0a8988b6147
Documentation
File manifest
Mar 18, 2022
Andrus, Tony; Bills, Aric; Corris, Miriam; Dubinski, Eyal; Fiscus, Jonathan G.; Gillies, Breanna; Harper, Mary; Hazen, T. J.; Hefright, Brook; Jarrett, Amy; Le, Hanh; Ray, Jessica; Rytting, Anton; Silber, Ronnie; Shen, Wade; Tzoukermann, Evelyne, 2022, "IARPA Babel Vietnamese Language Pack IARPA-babel107b-v0.7", https://hdl.handle.net/11272.1/AB2/WJGWAP, Abacus Data Network, V1
Abstract Introduction IARPA Babel Vietnamese Language Pack IARPA-babel107b-v0.7 was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 201 hours of Vietnamese conversational and scripted telephone speech co...
Optical Disc Image - 324.7 MB - MD5: 7b5abcf7a3e32450b0f10d985ffca495
Data
ISO disc image containing all documentation and data: disc 3
Optical Disc Image - 7.9 GB - MD5: 54162b5157f2a6913bd2314b202bca7e
Data
ISO disc image containing all documentation and data: disc 2
Optical Disc Image - 3.3 GB - MD5: cc7ebcde0d879f334c403d672e9d4ac5
Data
ISO disc image containing all documentation and data: disc 1
Plain Text - 1.5 MB - MD5: f59731b7ec4f58614843af21a9fecad3
Documentation
File manifest for disc 1
Plain Text - 100.7 KB - MD5: 70810c4a070e6ba727660f1d08bbf902
Documentation
File manifest for disc 2
Plain Text - 1.4 KB - MD5: 5ad2d2452764a056fa326c7267fe8aa9
Documentation
File manifest for disc 3
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Mar 18, 2022
Bills, Aric; Conners, Thomas; Corris, Miriam; David, Anne; Dubinski, Eyal; Fiscus, Jonathan G.; Gann, Ketty; Harper, Mary; Kazi, Michael; Malyska, Nicolas; Melot, Jennifer; Ray, Jessica; Rytting, Anton; Zawaydeh, Bushra, 2022, "IARPA Babel Dholuo Language Pack IARPA-babel403b-v1.0b", https://hdl.handle.net/11272.1/AB2/HSAU9N, Abacus Data Network, V1
Abstract Introduction IARPA Babel Dholuo Language Pack IARPA-babel403b-v1.0b was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 204 hours of Dholuo conversational and scripted telephone speech collected...
Optical Disc Image - 4.4 GB - MD5: 9b886d11aab911cfd11c07ac433bbcf6
Data
ISO disc image containing all documentation and data: disc 1
Optical Disc Image - 7.0 GB - MD5: 985a21fbe6dc67f1822f4ebd189be133
Data
ISO disc image containing all documentation and data: disc 2
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
Working with ISO disc images
Plain Text - 1.3 MB - MD5: 6e7222fc89a00222ea17ebced1e7a8c0
Documentation
File manifest for disc 1
Plain Text - 67.4 KB - MD5: 8ed6f764894d308e5bf45fa213e67faa
Documentation
File manifest for disc 2