Skip to main content
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

801 to 850 of 1,819 Results
Sep 29, 2021
Andresen, Jess; Bills, Aric; Conners, Thomas; Dubinski, Eyal; Fiscus, Jonathan G.; Harper, Mary; Kozlov, Kirill; Malyska, Nicolas; Melot, Jennifer; Morrison, Michelle; Phillips, Josh; Ray, Jessica; Rytting, Anton; Shen, Wade; Silber, Ronnie; Tzoukermann, Evelyne; Wong, Jamie, 2021, "IARPA Babel Swahili Language Pack IARPA-babel202b-v1.0d", https://hdl.handle.net/11272.1/AB2/TNSSDU, Abacus Data Network, V2
Abstract Introduction IARPA Babel Swahili Language Pack IARPA-babel202b-v1.0d was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 350 hours of Swahili conversational and scripted telephone speech collect...
Plain Text - 66.4 KB - MD5: 48f06f0ceeb937ca98861398b0581cbb
Documentation
File manifest for disc 2
Plain Text - 1.0 MB - MD5: fefe9639929457bdc162ffd2f9c20d90
Documentation
File manifest for disc 1
Sep 29, 2021
Tracey, Jennifer; Graff, David; Strassel, Stephanie; Arrigo, Michael; Wright, Jonathan; Bies, Ann, 2021, "LORELEI Oromo Incident Language Pack", https://hdl.handle.net/11272.1/AB2/EH7NXF, Abacus Data Network, V1
Abstract Introduction LORELEI Oromo Incident Language Pack was developed by the Linguistic Data Consortium and is comprised of approximately 3.9 million words of Oromo monolingual text, 25,000 words of English monolingual text, 135,000 words of parallel and comparable Oromo-Engli...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 166.5 MB - MD5: e2b57f987640f6888fbc1ec1ef677c79
Data
ISO disc image including all documentation and data
Plain Text - 436.5 KB - MD5: 8c6a015fb4fcbfbf84981fdcb2671bc2
Documentation
File manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 7.0 GB - MD5: 00515e887a6ce47e9a4fb3254ed9b15a
Data
ISO disc image including all documentation and data: disc 2
Optical Disc Image - 4.3 GB - MD5: 31f85d5a517ec8c450e537061574703f
Data
ISO disc image including all documentation and data: disc 1
Sep 3, 2021
Neergaard, Karl David; Xu, Hongzhi; Huang, Chu-Ren, 2021, "Database of Word Level Statistics - Mandarin", https://hdl.handle.net/11272.1/AB2/VJDPA0, Abacus Data Network, V1
Abstract Introduction Database of Word Level Statistics - Mandarin was developed by The Hong Kong Polytechnic University. It provides lexical characteristics of a descriptive and statistical nature for words and nonwords of Mandarin Chinese. It is designed for researchers particu...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 279.4 MB - MD5: bc033c1519a63f05d509aee06c1bf5b2
Data
ISO disc image including all documentation and data
Plain Text - 1.4 KB - MD5: 44fce89c1a2390a7b64f0c8222088855
Documentation
File manifest
Sep 3, 2021
Knight, Kevin; Badarau, Bianca; Baranescu, Laura; Bonial, Claire; Bardocz, Madalina; Griffitt, Kira; Hermjakob, Ulf; Marcu, Daniel; Palmer, Martha; O'Gorman, Tim; Schneider, Nathan, 2021, "Abstract Meaning Representation (AMR) Annotation Release 3.0", https://hdl.handle.net/11272.1/AB2/82CVJF, Abacus Data Network, V1
Abstract Introduction Abstract Meaning Representation (AMR) Annotation Release 3.0 was developed by the Linguistic Data Consortium (LDC), SDL/Language Weaver, Inc., the University of Colorado's Computational Language and Educational Research group and the Information Sciences Ins...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 263.5 MB - MD5: e2ffa11c9d6bbb3a183cfa1b5679183b
Data
ISO disc image including all documentation and data
Plain Text - 309.8 KB - MD5: 18700a77dc8421f185bd209d3a582f4f
Documentation
File manifest
Sep 3, 2021
Sluyter-Gaethje, Henny; Bourgonje, Peter; Stede, Manfred, 2021, "Penn Discourse Treebank Version 2.0 - German Translation", https://hdl.handle.net/11272.1/AB2/1AXWBN, Abacus Data Network, V1
Abstract Introduction Penn Discourse Treebank Version 2.0 - German Translation was developed at the University of Potsdam's Applied Computational Linguistics group and consists of approximately one million tokens derived from Penn Discourse Treebank Version 2.0 (LDC2008T05). This...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 110.7 MB - MD5: c64b172f3b16a3c9bcd0ad3a5985f548
Data
ISO disc image including all documentation and data
Plain Text - 402 B - MD5: 1b3ee9f19976a27d4dc11d43b5bc1551
Documentation
File manifest
Sep 3, 2021
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2021, "TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010", https://hdl.handle.net/11272.1/AB2/VAZOSD, Abacus Data Network, V1
Abstract Introduction TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010 was developed by the Linguistic Data Consortium and contains training and evaluation data produced in support of the 2010 TAC KBP Surprise Slot Filling track, the only y...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 2.4 MB - MD5: eeef8698d6282fb7a9e2cd45f23ea691
Data
ISO disc image including all documentation and data
Plain Text - 7.5 KB - MD5: 5bc4cd812c6b2118a94e246039efa1ee
Documentation
File manifest
Sep 3, 2021
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2021, "TAC KBP English Sentiment Slot Filling -- Comprehensive Training and Evaluation Data 2013-2014", https://hdl.handle.net/11272.1/AB2/MRZALN, Abacus Data Network, V1
Abstract Introduction TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010 was developed by the Linguistic Data Consortium and contains training and evaluation data produced in support of the 2013 and 2014 TAC KBP Sentiment Slot Filling tracks....
Optical Disc Image - 6.7 MB - MD5: ca74664d38a50ac97c892eb5b40b6c23
Data
ISO disc image including all documentation and data
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Plain Text - 46.6 KB - MD5: 40f20b324c900c1e96c63724058747f0
Documentation
File manifest
Sep 3, 2021
Daza, Angel; Frank, Anette, 2021, "X-SRL: Parallel Cross-lingual Semantic Role Labeling", https://hdl.handle.net/11272.1/AB2/DNOJP9, Abacus Data Network, V1
Abstract Introduction X-SRL: Parallel Cross-lingual Semantic Role Labeling was developed by Heidelberg University, Department of Computational Linguistics and the Leibniz Institute for the German Language (IDS). It consists of approximately three million words of German, French a...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 187.7 MB - MD5: bb20fcbcdcc91f337cd6abf1da2ac7e8
Data
ISO disc image including all documentation and data
Plain Text - 1.4 KB - MD5: df77a0ce35a7b2680597ceff5eb176bb
Documentation
File manifest
Sep 3, 2021
Arase, Yuki; Tsujii, Junichi, 2021, "ESPADA", https://hdl.handle.net/11272.1/AB2/ANSK9Z, Abacus Data Network, V1
Abstract Introduction ESPADA (Extended Syntactic Phrase Alignment DAtaset) consists of annotated parse trees and alignment on English sentential paraphrases extracted from machine translation evaluation corpora. It extends SPADE (LDC2018T09) by adding new annotated data for train...
Sep 3, 2021 - ESPADA
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Sep 3, 2021 - ESPADA
Optical Disc Image - 37.8 MB - MD5: 492cc273177c347c7b0dce46317401b8
Data
ISO disc image including all documentation and data
Sep 3, 2021 - ESPADA
Plain Text - 138.7 KB - MD5: 8632c4fd65fa44c7b53778b162bc6a9b
Documentation
File manifest
Sep 3, 2021
Tracey, Jennifer; Delgado, Dana; Chen, Song; Strassel, Stephanie, 2021, "BOLT Chinese SMS/Chat Parallel Training Data", https://hdl.handle.net/11272.1/AB2/O3JTA9, Abacus Data Network, V1
Abstract Introduction BOLT Chinese SMS/Chat Parallel Training Data was developed by the Linguistic Data Consortium and consists of approximately 1.8 million tokens of Chinese SMS/Chat data collected for the DARPA BOLT program along with their corresponding English translations Th...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 123.1 MB - MD5: fee8120c3058cf7520c61b373c3d7fcf
Data
ISO disc image including all documentation and data
Plain Text - 657.5 KB - MD5: 1445dc08c56a8ad9adbdda7777138eb5
Documentation
File manifest
Sep 3, 2021
Li, Bin; Xiao, Liming; Liu, Yihuan; Wen, Yuan; Song, Li; Chun, Jayeol; Feng, Minxuan; Zhou, Junsheng; Qu, Weiguang; Xue, Nianwen, 2021, "Chinese Abstract Meaning Representation 2.0", https://hdl.handle.net/11272.1/AB2/LVQEZJ, Abacus Data Network, V1
Abstract Introduction Chinese Abstract Meaning Representation (CAMR) 2.0 was developed by Brandeis University and Nanjing Normal University and is comprised of semantic representations of a set of approximately 20,000 Chinese sentences from Chinese Treebank (CTB) 8.0 (LDC2013T21)...
Optical Disc Image - 74.1 MB - MD5: 9ee5119e2feec0341f2784e1d223b269
Data
ISO disc image including all documentation and data
Plain Text - 764 B - MD5: bb6b1b6a756a7a77ca9bbab7763fe253
Documentation
File manifest
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Sep 3, 2021
Agarwal, Nitin; Francini, Michelle; Kappler, Michelle; Micciulla, Linnea; Pradhan, Sameer; Ramshaw, Lance, 2021, "BOLT Egyptian Arabic Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/DXWM3B, Abacus Data Network, V1
Abstract Introduction BOLT Egyptian Arabic Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech was developed by Raytheon BBN Technologies and consists of co-reference annotation on Egyptian Arabic discussion forum (DF), SMS/Chat and conversational tele...
Plain Text - 1.3 KB - MD5: 4d4231d07ac669e105f71e602457efea
Documentation
How to work with ISO disc images
Optical Disc Image - 14.3 MB - MD5: 14b71777c64773eda3d06a8c4318a689
Data
ISO disc image including all documentation and data
Plain Text - 72.5 KB - MD5: b5b960eb1d365df8580674ac4c0e7c6f
Documentation
File manifest
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.

Contact Abacus Data Network Support

Abacus Data Network Support

Please fill this out to prove you are not a robot.

+ =