Skip to main content
Metrics
146,327 Downloads
The Abacus Data Network is a data repository collaboration involving Libraries at Simon Fraser University (SFU), the University of British Columbia (UBC), the University of Northern British Columbia (UNBC) and the University of Victoria (UVic).
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

1 to 50 of 2,249 Results
Oct 20, 2021 - DMTI Spatial
DMTI Spatial Inc., 2021, "CanMap Address Points, v2021.3", https://hdl.handle.net/11272.1/AB2/HOWGP8, Abacus Data Network, V1
CanMap Address Points are unique and discrete representations of civic address assignments across Canada. It is the ultimate in answering the question of “where” and an anchor for a single source of accuracy in your mission-critical data. When building your location intelligence...
Oct 20, 2021 - DMTI Spatial
DMTI Spatial Inc., 2019, "CanMap Address Points, v2019.2", https://hdl.handle.net/11272.1/AB2/RZ99DD, Abacus Data Network, V1
CanMap Address Points are unique and discrete representations of civic address assignments across Canada. It is the ultimate in answering the question of “where” and an anchor for a single source of accuracy in your mission-critical data. When building your location intelligence...
Oct 20, 2021 - DMTI Spatial
DMTI Spatial Inc., 2018, "CanMap Address Points, v2018.3", https://hdl.handle.net/11272.1/AB2/MDCZXG, Abacus Data Network, V1
CanMap Address Points are unique and discrete representations of civic address assignments across Canada. It is the ultimate in answering the question of “where” and an anchor for a single source of accuracy in your mission-critical data. When building your location intelligence...
Oct 20, 2021 - DMTI Spatial
DMTI Spatial Inc., 2017, "CanMap Address Points, v2017.4", https://hdl.handle.net/11272.1/AB2/GBIUJJ, Abacus Data Network, V1
CanMap Address Points are unique and discrete representations of civic address assignments across Canada. It is the ultimate in answering the question of “where” and an anchor for a single source of accuracy in your mission-critical data. When building your location intelligence...
Oct 19, 2021 - Abacus open data
University of British Columbia. Campus and Community Planning, 2021, "[University of British Columbia Point Grey Campus Lidar], 2005", https://hdl.handle.net/11272.1/AB2/GTPDZF, Abacus Data Network, V1
University of British Columbia Point Grey campus lidar survey. This survey does not cover the entire campus; it consists mostly of shoreline areas.
Oct 14, 2021 - Linguistic Data Consortium
Mena, Carlos Daniel Hernández; Ruiz, Iván Vladimir Meza, 2021, "Wikipedia Spanish Speech and Transcripts", https://hdl.handle.net/11272.1/AB2/L05NFF, Abacus Data Network, V1
Abstract Introduction Wikipedia Spanish Speech and Transcripts consists of approximately 25 hours of Spanish read speech and transcripts. The read text was taken from the Spanish version of WikiProject Spoken Wikipedia, referred to as Wikipedia Grabada. The transcripts were devel...
Oct 14, 2021 - Linguistic Data Consortium
Tracey, Jennifer; Delgado, Dana; Chen, Song; Strassel, Stephanie, 2021, "BOLT Egyptian Arabic SMS/Chat Parallel Training Data", https://hdl.handle.net/11272.1/AB2/WXML9A, Abacus Data Network, V1
Abstract Introduction BOLT Egyptian Arabic SMS/Chat Parallel Training Data was developed by the Linguistic Data Consortium (LDC) and consists of approximately 723,000 tokens of Egyptian Arabic SMS/Chat data collected for the DARPA BOLT program along with their corresponding Engli...
Oct 14, 2021 - Linguistic Data Consortium
Alsheddi, Abeer, 2021, "Classical Arabic Dictionary", https://hdl.handle.net/11272.1/AB2/FQ7PIS, Abacus Data Network, V1
Abstract Introduction Classical Arabic Dictionary consists of approximately one hundred million words of Arabic collected from texts dating between 431 and 1104 CE, principally books and essays, along with word occurrences, source documents and related metadata. Data The dictiona...
Oct 14, 2021 - DMTI Spatial
DMTI Spatial Inc., 2020, "CanMap Address Points, v2020.4", https://hdl.handle.net/11272.1/AB2/HL7BV7, Abacus Data Network, V1
CanMap Address Points are unique and discrete representations of civic address assignments across Canada. It is the ultimate in answering the question of “where” and an anchor for a single source of accuracy in your mission-critical data. When building your location intelligence...
Oct 14, 2021 - DMTI Spatial
DMTI Spatial Inc., 2021, "CanMap Postal Code Suite, v2020.3", https://hdl.handle.net/11272.1/AB2/MPQ1LE, Abacus Data Network, V1
The CanMap Postal Code Suite is comprised of the following postal products: The CanMap Postal Code File - Multiple Enhanced Postal Code (MEP) product is a precision-based point file representing over 1 million postal codes across Canada. The Multiple Enhanced Postal Code product...
Oct 8, 2021 - Statistics Canada - DLI
Statistics Canada, 2021, "Postal Code Conversion File, August 2021 Postal Codes, 2021", https://hdl.handle.net/11272.1/AB2/HJPB6W, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2016 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Oct 8, 2021 - Statistics Canada - DLI
Statistics Canada, 2021, "Postal Codes by Federal Ridings File (PCFRF) 2013 Representation Order, August 2021 Postal Codes, 2021", https://hdl.handle.net/11272.1/AB2/GI4245, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2016 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Oct 8, 2021 - Statistics Canada Open License
Statistics Canada, 2021, "Labour Force Survey, 2021", https://hdl.handle.net/11272.1/AB2/HP9TEK, Abacus Data Network, V10, UNF:6:NqXGK/jtmf6Lv+nZnPUABg== [fileUNF]
LFS data are used to produce the well-known unemployment rate as well as other standard labour market indicators such as the employment rate and the participation rate. The LFS also provides employment estimates by industry, occupation, public and private sector, hours worked and...
Oct 7, 2021 - Abacus open data
University of British Columbia. Campus and Community Planning, 2021, "[Orthophotos, University of British Columbia Point Grey Campus], 2021", https://hdl.handle.net/11272.1/AB2/R731P3, Abacus Data Network, V1
Orthorectified aerial imagery of the UBC Vancouver campus, 2021
Oct 6, 2021 - Abacus open data
University of British Columbia. Campus and Community Planning, 2021, "[University of British Columbia Point Grey Campus Lidar], 2021", https://hdl.handle.net/11272.1/AB2/Y5KQNB, Abacus Data Network, V1
University of British Columbia Point Grey campus lidar survey. Includes Pacific Spirit Park and Musqueam Reserve No.2 (southeast of UBC Campus).
Oct 1, 2021 - Linguistic Data Consortium
Bills, Aric; Conners, Thomas; David, Anne; Dubinski, Eyal; Fiscus, Jonathan G.; Gann, Ketty; Harper, Mary; Kazi, Michael; Lim, Lynn-Li; Malyska, Nicolas; Melot, Jennifer; Ray, Jessica; Rytting, Anton; Shen, Sinney; Smith, Rosanna, 2021, "IARPA Babel Mongolian Language Pack IARPA-babel401b-v2.0b", https://hdl.handle.net/11272.1/AB2/IFBL6A, Abacus Data Network, V1
Abstract Introduction IARPA Babel Mongolian Language Pack IARPA-babel401b-v2.0b was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 204 hours of Halh Mongolian conversational and scripted telephone speec...
Sep 29, 2021 - Linguistic Data Consortium
Andresen, Jess; Bills, Aric; Conners, Thomas; Dubinski, Eyal; Fiscus, Jonathan G.; Harper, Mary; Kozlov, Kirill; Malyska, Nicolas; Melot, Jennifer; Morrison, Michelle; Phillips, Josh; Ray, Jessica; Rytting, Anton; Shen, Wade; Silber, Ronnie; Tzoukermann, Evelyne; Wong, Jamie, 2021, "IARPA Babel Swahili Language Pack IARPA-babel202b-v1.0d", https://hdl.handle.net/11272.1/AB2/TNSSDU, Abacus Data Network, V2
Abstract Introduction IARPA Babel Swahili Language Pack IARPA-babel202b-v1.0d was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 350 hours of Swahili conversational and scripted telephone speech collect...
Sep 29, 2021 - Linguistic Data Consortium
Tracey, Jennifer; Graff, David; Strassel, Stephanie; Arrigo, Michael; Wright, Jonathan; Bies, Ann, 2021, "LORELEI Oromo Incident Language Pack", https://hdl.handle.net/11272.1/AB2/EH7NXF, Abacus Data Network, V1
Abstract Introduction LORELEI Oromo Incident Language Pack was developed by the Linguistic Data Consortium and is comprised of approximately 3.9 million words of Oromo monolingual text, 25,000 words of English monolingual text, 135,000 words of parallel and comparable Oromo-Engli...
Sep 24, 2021 - Statistics Canada Open License
Statistics Canada, 2021, "Survey of Financial Security, 2019", https://hdl.handle.net/11272.1/AB2/B8A8ZH, Abacus Data Network, V1, UNF:6:+jkZpvTireJsb9/nPMFK0A== [fileUNF]
The purpose of the survey is to collect information from a sample of Canadian households on their assets, debts, employment, income and education. The SFS provides a comprehensive picture of the financial health of Canadians. Information is collected on the value of all major fin...
Sep 3, 2021 - Linguistic Data Consortium
Neergaard, Karl David; Xu, Hongzhi; Huang, Chu-Ren, 2021, "Database of Word Level Statistics - Mandarin", https://hdl.handle.net/11272.1/AB2/VJDPA0, Abacus Data Network, V1
Abstract Introduction Database of Word Level Statistics - Mandarin was developed by The Hong Kong Polytechnic University. It provides lexical characteristics of a descriptive and statistical nature for words and nonwords of Mandarin Chinese. It is designed for researchers particu...
Sep 3, 2021 - Linguistic Data Consortium
Knight, Kevin; Badarau, Bianca; Baranescu, Laura; Bonial, Claire; Bardocz, Madalina; Griffitt, Kira; Hermjakob, Ulf; Marcu, Daniel; Palmer, Martha; O'Gorman, Tim; Schneider, Nathan, 2021, "Abstract Meaning Representation (AMR) Annotation Release 3.0", https://hdl.handle.net/11272.1/AB2/82CVJF, Abacus Data Network, V1
Abstract Introduction Abstract Meaning Representation (AMR) Annotation Release 3.0 was developed by the Linguistic Data Consortium (LDC), SDL/Language Weaver, Inc., the University of Colorado's Computational Language and Educational Research group and the Information Sciences Ins...
Sep 3, 2021 - Linguistic Data Consortium
Sluyter-Gaethje, Henny; Bourgonje, Peter; Stede, Manfred, 2021, "Penn Discourse Treebank Version 2.0 - German Translation", https://hdl.handle.net/11272.1/AB2/1AXWBN, Abacus Data Network, V1
Abstract Introduction Penn Discourse Treebank Version 2.0 - German Translation was developed at the University of Potsdam's Applied Computational Linguistics group and consists of approximately one million tokens derived from Penn Discourse Treebank Version 2.0 (LDC2008T05). This...
Sep 3, 2021 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2021, "TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010", https://hdl.handle.net/11272.1/AB2/VAZOSD, Abacus Data Network, V1
Abstract Introduction TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010 was developed by the Linguistic Data Consortium and contains training and evaluation data produced in support of the 2010 TAC KBP Surprise Slot Filling track, the only y...
Sep 3, 2021 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2021, "TAC KBP English Sentiment Slot Filling -- Comprehensive Training and Evaluation Data 2013-2014", https://hdl.handle.net/11272.1/AB2/MRZALN, Abacus Data Network, V1
Abstract Introduction TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010 was developed by the Linguistic Data Consortium and contains training and evaluation data produced in support of the 2013 and 2014 TAC KBP Sentiment Slot Filling tracks....
Sep 3, 2021 - Linguistic Data Consortium
Daza, Angel; Frank, Anette, 2021, "X-SRL: Parallel Cross-lingual Semantic Role Labeling", https://hdl.handle.net/11272.1/AB2/DNOJP9, Abacus Data Network, V1
Abstract Introduction X-SRL: Parallel Cross-lingual Semantic Role Labeling was developed by Heidelberg University, Department of Computational Linguistics and the Leibniz Institute for the German Language (IDS). It consists of approximately three million words of German, French a...
Sep 3, 2021 - Linguistic Data Consortium
Arase, Yuki; Tsujii, Junichi, 2021, "ESPADA", https://hdl.handle.net/11272.1/AB2/ANSK9Z, Abacus Data Network, V1
Abstract Introduction ESPADA (Extended Syntactic Phrase Alignment DAtaset) consists of annotated parse trees and alignment on English sentential paraphrases extracted from machine translation evaluation corpora. It extends SPADE (LDC2018T09) by adding new annotated data for train...
Sep 3, 2021 - Linguistic Data Consortium
Tracey, Jennifer; Delgado, Dana; Chen, Song; Strassel, Stephanie, 2021, "BOLT Chinese SMS/Chat Parallel Training Data", https://hdl.handle.net/11272.1/AB2/O3JTA9, Abacus Data Network, V1
Abstract Introduction BOLT Chinese SMS/Chat Parallel Training Data was developed by the Linguistic Data Consortium and consists of approximately 1.8 million tokens of Chinese SMS/Chat data collected for the DARPA BOLT program along with their corresponding English translations Th...
Sep 3, 2021 - Linguistic Data Consortium
Li, Bin; Xiao, Liming; Liu, Yihuan; Wen, Yuan; Song, Li; Chun, Jayeol; Feng, Minxuan; Zhou, Junsheng; Qu, Weiguang; Xue, Nianwen, 2021, "Chinese Abstract Meaning Representation 2.0", https://hdl.handle.net/11272.1/AB2/LVQEZJ, Abacus Data Network, V1
Abstract Introduction Chinese Abstract Meaning Representation (CAMR) 2.0 was developed by Brandeis University and Nanjing Normal University and is comprised of semantic representations of a set of approximately 20,000 Chinese sentences from Chinese Treebank (CTB) 8.0 (LDC2013T21)...
Sep 3, 2021 - Linguistic Data Consortium
Agarwal, Nitin; Francini, Michelle; Kappler, Michelle; Micciulla, Linnea; Pradhan, Sameer; Ramshaw, Lance, 2021, "BOLT Egyptian Arabic Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/DXWM3B, Abacus Data Network, V1
Abstract Introduction BOLT Egyptian Arabic Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech was developed by Raytheon BBN Technologies and consists of co-reference annotation on Egyptian Arabic discussion forum (DF), SMS/Chat and conversational tele...
Sep 2, 2021 - Linguistic Data Consortium
Mena, Carlos Daniel Hernández, 2021, "LibriVox Spanish", https://hdl.handle.net/11272.1/AB2/AHBO1C, Abacus Data Network, V1
Abstract Introduction LibriVox Spanish consists of approximately 73 hours of Spanish read speech and transcripts. The audio data was taken from Spanish audiobooks developed by LibriVox, a non-profit project that creates audiobooks from public domain works. The transcripts were de...
Sep 2, 2021 - Linguistic Data Consortium
Ding, Hongwei; Liao, Sishi; Zhan, Yuqing; Yuan, Jiahong; Liberman, Mark, 2021, "Global TIMIT Mandarin Chinese", https://hdl.handle.net/11272.1/AB2/2CCXH8, Abacus Data Network, V1
Abstract Introduction Global TIMIT Mandarin Chinese was developed by the Linguistic Data Consortium and Shanghai Jiao Tong University and consists of approximately five hours of read speech and transcripts in Mandarin Chinese. The Global TIMIT project aimed to create a series of...
Sep 2, 2021 - Linguistic Data Consortium
Beijing Magic Data Technology Co., 2021, "Magic Data Chinese Mandarin Conversational Speech", https://hdl.handle.net/11272.1/AB2/M4T1CO, Abacus Data Network, V1
Abstract Introduction Magic Data Chinese Mandarin Conversational Speech was developed by Beijing Magic Data Technology Co., Ltd. and consists of approximately 10 hours of Mandarin conversational speech from 60 speakers. Each conversation was recorded on multiple devices and is pr...
Sep 2, 2021 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2021, "TAC KBP Entity Discovery and Linking - Comprehensive Evaluation Data 2016-2017", https://hdl.handle.net/11272.1/AB2/DAW97M, Abacus Data Network, V1
Abstract Introduction TAC KBP Entity Discovery and Linking - Comprehensive Evaluation Data 2016-2017 was developed by the Linguistic Data Consortium (LDC) and contains training and evaluation data produced in support of the TAC KBP Entity Discovery and Linking (EDL) tasks in 2016...
Sep 2, 2021 - Linguistic Data Consortium
Maamouri, Mohamed; Bies, Ann; Kulick, Seth; Krouna, Sondos; Tabassi, Dalila; Ciul, Michael, 2021, "BOLT Egyptian Arabic Treebank - Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/D9JRBV, Abacus Data Network, V1
Abstract Introduction BOLT Egyptian Arabic Treebank - Conversational Telephone Speech was developed by the Linguistic Data Consortium (LDC) and consists of Egyptian Arabic conversational telephone speech data with part-of-speech annotation, morphology, gloss and syntactic tree an...
Sep 2, 2021 - Linguistic Data Consortium
Agarwal, Nitin; Francini, Michelle; Kappler, Michelle; Micciulla, Linnea; Pradhan, Sameer; Ramshaw, Lance, 2021, "BOLT Chinese Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech", https://hdl.handle.net/11272.1/AB2/LVUADW, Abacus Data Network, V1
Abstract Introduction BOLT Chinese Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech was developed by Raytheon BBN Technologies and consists of co-reference annotation on Chinese discussion forum (DF), SMS/Chat and conversational telephone speech (CT...
Sep 2, 2021 - Linguistic Data Consortium
Li, Xuansong; Grimes, Stephen; Strassel, Stephanie, 2021, "BOLT Egyptian Arabic-English Word Alignment -- SMS/Chat Training", https://hdl.handle.net/11272.1/AB2/XACS3U, Abacus Data Network, V1
Abstract Introduction BOLT Egyptian Arabic-English Word Alignment -- SMS/Chat Training was developed by the Linguistic Data Consortium (LDC) and consists of 349,414 words of Egyptian Arabic and English parallel text enhanced with linguistic tags to indicate word relations. The DA...
Sep 1, 2021 - UBC Library restricted data
BC Assessment, 2020, "BC Assessment Data Advice and Inventory Extracts", https://hdl.handle.net/11272.1/AB2/LAPUAB, Abacus Data Network, V4
The Data Advice product from BC Assessment (BCA) provides value assessments and sales information for properties in British Columbia. Two types of data files are available to UBC researchers: REVD (Revised Roll): annual report including property information and valuation for all...
Aug 9, 2021 - Statistics Canada - DLI
Statistics Canada, 2021, "Social Policy Simulation Database and Model (SPSD/M), Version 28.1, database year 2016", https://hdl.handle.net/11272.1/AB2/DYQKAN, Abacus Data Network, V2
The Social Policy Simulation Database and Model (SPSD/M) is a micro computer-based product designed to assist those interested in analyzing the financial interactions of governments and individuals in Canada. It can help one to assess the cost implications or income redistributiv...
Jul 28, 2021 - Statistics Canada Open License
Statistics Canada, 2021, "Impacts of COVID-19 on Health Care Workers: Infection Prevention and Control (ICHCWIPC), 2020", https://hdl.handle.net/11272.1/AB2/DPKFWF, Abacus Data Network, V1, UNF:6:TP2nyryjP1yuiVw8LZEHQQ== [fileUNF]
The purpose of this crowdsource questionnaire is to understand the impacts of COVID-19 on Canadian health care workers, with particular focus on access to personal protective equipment (PPE) and infection prevention and control (IPC) measures in the workplace.
Jul 28, 2021 - Statistics Canada Open License
Statistics Canada, 2021, "Canadian Tobacco and Nicotine Survey (CTNS), 2020", https://hdl.handle.net/11272.1/AB2/UYC0Z8, Abacus Data Network, V1, UNF:6:m2tlmNojiECi8+hoIaIN9g== [fileUNF]
The information collected in this survey will be used to fill important data gaps related to vaping, cannabis, and tobacco usage. The data will inform policy and provide a current snapshot of use across Canada. Until 2017, Statistics Canada conducted the Canadian Tobacco, Alcohol...
Jul 27, 2021 - Statistics Canada Open License
Statistics Canada, 2021, "Visitor Travel Survey (VTS), 2019", https://hdl.handle.net/11272.1/AB2/FTV0R0, Abacus Data Network, V1, UNF:6:N8yQgLGtUGIFipPa6B9trQ== [fileUNF]
The Visitor Travel Survey (VTS) is conducted by Statistics Canada to meet the requirements of the Balance of Payments of the Canadian System of National Accounts (BOP). Prior to 2018, information about visitors to Canada was collected through the International Travel Survey (ITS)...
Jun 28, 2021 - Statistics Canada - DLI
Statistics Canada, 2021, "Postal Codes by Federal Ridings File (PCFRF) 2013 Representation Order, May 2021 Postal Codes, 2021", https://hdl.handle.net/11272.1/AB2/PEUBGS, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2016 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Jun 28, 2021 - Statistics Canada - DLI
Statistics Canada, 2021, "Postal Code Conversion File, May 2021 Postal Codes, 2021", https://hdl.handle.net/11272.1/AB2/9J8YUH, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2016 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Jun 14, 2021 - Linguistic Data Consortium
Brandschain, Linda; Walker, Kevin; Graff, David; Cieri, Christopher; Neely, Abby; Mirghafori, Nikki; Peskin, Barbara; Godfrey, Jack; Strassel, Stephanie; Goodman, Fred; Doddington, George R.; King, Mike, 2021, "Mixer 4 and 5 Speech", https://hdl.handle.net/11272.1/AB2/LU0TQ8, Abacus Data Network, V1
Abstract Introduction Mixer 4 and 5 Speech was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 14,185 hours of audio recordings of conversational telephone speech, interviews, elicitation exercises and transcript readings involving 616 distinct...
Jun 11, 2021 - Linguistic Data Consortium
Bills, Aric; Conners, Thomas; David, Anne; Dubinski, Eyal; Fiscus, Jonathan G.; Gann, Ketty; Harper, Mary; Kazi, Michael; Le, Hanh; Malyska, Nicolas; Melot, Jennifer; Phillips, Josh; Ray, Jessica; Roomi, Bergul; Rytting, Anton; Strahan, Tania E., 2019, "IARPA Babel Amharic Language Pack IARPA-babel307b-v1.0b", https://hdl.handle.net/11272.1/AB2/U1H3H7, Abacus Data Network, V1
Abstract Introduction IARPA Babel Amharic Language Pack IARPA-babel307b-v1.0b was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 204 hours of Amharic conversational and scripted telephone speech collect...
Jun 9, 2021 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2020, "TAC KBP English Event Argument - Training and Evaluation Data 2014-2015", https://hdl.handle.net/11272.1/AB2/TTCGFJ, Abacus Data Network, V1
Abstract Introduction TAC KBP English Event Argument - Training and Evaluation Data 2014-2015 was developed by the Linguistic Data Consortium (LDC) and contains training and evaluation data produced in support of the 2014 TAC KBP English Event Argument Extraction Pilot and Evalua...
Jun 9, 2021 - Linguistic Data Consortium
Simpson, Heather; Strassel, Stephanie; Wright, Jonathan; Griffitt, Kira, 2020, "Machine Reading Phase 1 IC Training Data", https://hdl.handle.net/11272.1/AB2/7GZ3YJ, Abacus Data Network, V1
Abstract Introduction Machine Reading Phase 1 IC Training Data was developed by the Linguistic Data Consortium and contains 248 English source documents and 116 standoff annotation files used in the DARPA (Defense Advanced Research Projects Agency) Machine Reading program. The Ma...
Jun 9, 2021 - Linguistic Data Consortium
Li, Xuansong; Grimes, Stephen; Strassel, Stephanie, 2020, "BOLT Egyptian Arabic-English Word Alignment -- Conversational Telephone Speech Training", https://hdl.handle.net/11272.1/AB2/ZZOGLK, Abacus Data Network, V1
Abstract Introduction BOLT Egyptian Arabic-English Word Alignment -- Conversational Telephone Speech Training was developed by the Linguistic Data Consortium (LDC) and consists of 153,171 words of Egyptian Arabic and English parallel text enhanced with linguistic tags to indicate...
Jun 9, 2021 - Linguistic Data Consortium
Li, Bin; Yin, Siqi; Xu, Jie; Song, Li; Feng, Minxuan, 2020, "Chinese CogBank", https://hdl.handle.net/11272.1/AB2/XQKHRG, Abacus Data Network, V1
Abstract Introduction Chinese CogBank is a database of cognitive properties of Chinese words intended for use in metaphor understanding and generation. It consists of 232,497 "word-property" pairs, which are comprised of 83,104 words and 100,195 properties. Each "word-property" t...
Jun 9, 2021 - Linguistic Data Consortium
Bies, Ann; Mott, Justin; Warner, Colin; Kulick, Seth, 2021, "BOLT English Treebank - SMS/Chat", https://hdl.handle.net/11272.1/AB2/TMECTL, Abacus Data Network, V1
Abstract Introduction BOLT English Treebank - SMS/Chat was developed by the Linguistic Data Consortium (LDC) and consists of English SMS and text chat data with part-of-speech and syntactic structure annotation. The DARPA BOLT (Broad Operational Language Translation) program deve...
Add Data

Log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.

Contact Abacus Data Network Support

Abacus Data Network Support

Please fill this out to prove you are not a robot.

+ =