The Abacus Data Network is a data repository collaboration involving Libraries at Simon Fraser University (SFU), the University of British Columbia (UBC), the University of Northern British Columbia (UNBC) and the University of Victoria (UVic).
551 to 600 of 2,566 Results
Jun 10, 2020 - Statistics Canada Open License
Statistics Canada, 2020, "Crowdsourcing: Impacts of COVID-19 on Canadians Public Use Microdata File, [2020]", https://hdl.handle.net/11272.1/AB2/SMGRHJ, Abacus Data Network, V1, UNF:6:h6oHsTBXTkM6wAomSRkNCQ== [fileUNF]
This public use microdata file includes information from the first crowdsource questionnaire that collected information on Canadians' behaviours and concerns relating to the COVID-19 pandemic, specifically regarding health, finances and employment. The collection series collects...
May 12, 2020 - Statistics Canada Open License
Statistics Canada, 2020, "Canadian Perspectives Survey Series 1: Impacts of COVID-19 Public Use Microdata File", https://hdl.handle.net/11272.1/AB2/NL02DJ, Abacus Data Network, V1, UNF:6:QdW14C8pmzPVGxEAvm9+Bg== [fileUNF]
The Canadian Perspectives Survey Series (CPSS) is a set of short, online surveys beginning in March 2020 that will be used to collect information on the knowledge and behaviours of residents of the 10 Canadian provinces. All surveys in the series will be asked of Statistics Canad...
Mar 31, 2020 - Statistics Canada Open License
Statistics Canada, 2020, "Canadian Tobacco and Nicotine Survey (CTNS), 2019", https://hdl.handle.net/11272.1/AB2/CB55TS, Abacus Data Network, V1, UNF:6:QUXiZ8wStee12zvmx67Ymg== [fileUNF]
The information collected in this survey will be used to fill important data gaps related to vaping, cannabis, and tobacco usage. The data will inform policy and provide a current snapshot of use across Canada. Until 2017, Statistics Canada conducted the Canadian Tobacco, Alcohol...
Mar 9, 2020 - Statistics Canada Open License
Statistics Canada, 2020, "National Travel Survey 2018", https://hdl.handle.net/11272.1/AB2/NTBL54, Abacus Data Network, V1, UNF:6:XvjntEuihniQadoLFet4BQ== [fileUNF]
The National Travel Survey (NTS) was developed to fully replace the Travel Survey of Residents of Canada (TSRC record number 3810) and replace the Canadian resident component of the International Travel Survey (ITS record number 3152). The National Travel Survey collects informat...
Mar 6, 2020 - Statistics Canada - DLI
Statistics Canada, 2020, "Postal Code Conversion File Plus (PCCF+) Version 7C, November 2019 Postal Codes", https://hdl.handle.net/11272.1/AB2/UH97QR, Abacus Data Network, V1
The Postal Code Conversion File Plus (PCCF+) is a SAS control program and set of associated datasets derived from the Postal Code Conversion File (PCCF), a Postal Code population weight file, the Geographic Attribute File, Health Region boundary files, and other supplementary dat...
Feb 26, 2020 - Statistics Canada Open License
Statistics Canada, 2020, "International Travel Survey, 2017", https://hdl.handle.net/11272.1/AB2/DCOYO6, Abacus Data Network, V1, UNF:6:oRY+4lNN9sKd25rHGuXiJg== [fileUNF]
The electronic questionnaires (e-questionnaires) and Air Exit Survey (AES) are components of the International Travel Survey (ITS) together with the Frontier Counts (record number 5005). It is an ongoing survey conducted by Statistics Canada since 1972 to meet the requirements of...
Feb 13, 2020 - Statistics Canada Open License
Statistics Canada, 2020, "National Graduates Survey - Public Use Microdata File, 2015 (time of graduation), 2018 (time of interview)", https://hdl.handle.net/11272.1/AB2/G5MVZW, Abacus Data Network, V1
Data from this survey will be used to better understand the experiences and outcomes of graduates, and to improve government programs. The survey is designed to collect details on topics such as: i) the extent to which graduates of postsecondary programs have been successful in o...
Dec 17, 2019 - Linguistic Data Consortium
Gallardo, Laura Fernández, 2019, "Nautilus Speaker Characterization", https://hdl.handle.net/11272.1/AB2/JR6VMZ, Abacus Data Network, V1
Nautilus Speaker Characterization was developed at the Technical University of Berlin and is comprised of approximately 155 hours of conversational speech from 300 German speakers aged 18 to 35 years (126 males and 174 females) with no marked dialect or accent, recorded in an aco...
Nov 19, 2019 - Statistics Canada Open License
Statistics Canada, 2019, "Employment Insurance Coverage Survey, 2018", https://hdl.handle.net/11272.1/AB2/4QZKCP, Abacus Data Network, V1, UNF:6:FuCRKIsH5sqxDhRhq6E4aA== [fileUNF]
The main purpose of this survey is to study the coverage of the employment insurance program. It provides a meaningful picture of who does or does not have access to EI benefits among the jobless and those in a situation of underemployment. The Employment Insurance Coverage Surve...
Nov 15, 2019 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2019, "TAC KBP Cold Start - Comprehensive Evaluation Data 2012-2017", https://hdl.handle.net/11272.1/AB2/KQWRTL, Abacus Data Network, V1
TAC KBP Cold Start - Comprehensive Evaluation Data 2012-2017 was developed by the Linguistic Data Consortium (LDC) and contains Chinese, English and Spanish data produced in support of the TAC KBP Cold Start evaluation track conducted from 2012 to 2017. This includes source docum...
Nov 15, 2019 - Linguistic Data Consortium
Tracey, Jennifer; Arrigo, Michael; Strassel, Stephanie, 2019, "DEFT English Committed Belief Annotation", https://hdl.handle.net/11272.1/AB2/WY5NZN, Abacus Data Network, V1
DEFT English Committed Belief Annotation was developed by the Linguistic Data Consortium (LDC) and consists of approximately 950,000 words of English discussion forum text annotated for “committed belief,” which marks the level of commitment displayed by the author to the truth o...
Nov 15, 2019 - Linguistic Data Consortium
Canavan, Alexandra; Zipperlen, George; Bartlett, John, 2019, "CALLFRIEND American English-Non-Southern Dialect Second Edition", https://hdl.handle.net/11272.1/AB2/OBLYDI, Abacus Data Network, V1
CALLFRIEND American English-Non-Southern Dialect Second Edition was developed by the Linguistic Data Consortium (LDC) and consists of approximately 26 hours of unscripted telephone conversations between native speakers of non-Southern dialects of American English. This second edi...
Oct 25, 2019 - Statistics Canada Open License
Statistics Canada, 2019, "Censuses of Canada, 1665-1871", https://hdl.handle.net/11272.1/AB2/DSMK3Y, Abacus Data Network, V1
Censuses of Canada contain 343 tables on the social and economic conditions in Canada from the earliest settlements to 1871.
Oct 16, 2019 - Statistics Canada - DLI
Statistics Canada, 2019, "Postal Code Conversion File, August 2019 Postal Codes, 2019", https://hdl.handle.net/11272.1/AB2/LEX9D7, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography (presently 2016 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Oct 15, 2019 - Linguistic Data Consortium
Szwelnik, Tomasz; Kawalec, Jacek; Gutowska, Dorota, 2019, "Polish Speech Database", https://hdl.handle.net/11272.1/AB2/GNGZEI, Abacus Data Network, V1
Polish Speech Database was developed by VoiceLab. It consists of 263,424 utterances of Polish speech data from 200 speakers, totaling approximately 280 hours, and corresponding transcripts. Data collection was performed in Poland. Speakers were asked to record themselves for at l...
Oct 15, 2019 - Linguistic Data Consortium
Greenberg, Craig; Sadjadi, Omid; Kheyrkhah, Timothee; Jones, Karen; Walker, Kevin; Strassel, Stephanie; Graff, David, 2019, "2016 NIST Speaker Recognition Evaluation Test Set", https://hdl.handle.net/11272.1/AB2/WJ2G5L, Abacus Data Network, V1
2016 NIST Speaker Recognition Evaluation Test Set was developed by the Linguistic Data Consortium (LDC) and NIST (National Institute of Standards and Technology). It contains approximately 340 hours of short segments of Tagalog, Cantonese, Cebuano and Mandarin telephone speech us...
Oct 15, 2019 - Linguistic Data Consortium
Bies, Ann; Mott, Justin; Warner, Colin; Kulick, Seth, 2019, "BOLT English Treebank - Discussion Forum", https://hdl.handle.net/11272.1/AB2/9OA0DB, Abacus Data Network, V1
BOLT English Treebank - Discussion Forum was developed by the Linguistic Data Consortium (LDC) and consists of English web discussion forum data with part-of-speech and syntactic structure annotations. The DARPA BOLT (Broad Operational Language Translation) program developed mach...
Sep 24, 2019 - DMTI Spatial
DMTI Spatial Inc., 2019, "CanMap Content Suite, v2019.3", https://hdl.handle.net/11272.1/AB2/PCTBFN, Abacus Data Network, V1
CanMap Content Suite contains over 100 unique and rich content layers. Each layer has a unique file and layer name with associated definitions, descriptions, attribution and metadata. All layers, with a few exceptions, are vector data consisting of polygon, polyline, or point geo...
Sep 24, 2019 - Statistics Canada Open License
Statistics Canada, 2019, "Canadian Income Survey (CIS), 2017", https://hdl.handle.net/11272.1/AB2/HFWKLV, Abacus Data Network, V1, UNF:6:Uk15LUlXxyTqBC8SteMCjA== [fileUNF]
The primary objective of the Canadian Income Survey (CIS) is to provide information on the income and income sources of Canadians, along with their individual and household characteristics. The data collected in the CIS is combined with Labour Force Survey (LFS, record number 370...
Sep 16, 2019 - Linguistic Data Consortium
Canavan, Alexandra; Zipperlen, George; Bartlett, John, 2019, "CALLFRIEND Canadian French Second Edition", https://hdl.handle.net/11272.1/AB2/PPNHVC, Abacus Data Network, V1
CALLFRIEND Canadian French Second Edition was developed by the Linguistic Data Consortium (LDC) and consists of approximately 26 hours of unscripted telephone conversations between native speakers of Canadian French. This second edition updates the audio files to wav format, simp...
Sep 16, 2019 - Linguistic Data Consortium
Li, Xuansong; Grimes, Stephen; Strassel, Stephanie, 2019, "BOLT Chinese-English Word Alignment and Tagging -- SMS/Chat Training", https://hdl.handle.net/11272.1/AB2/TJO8RI, Abacus Data Network, V1
BOLT Chinese-English Word Alignment and Tagging – SMS/Chat Training was developed by the Linguistic Data Consortium (LDC) and consists of 388,027 words of Chinese and English parallel text enhanced with linguistic tags to indicate word relations. The DARPA BOLT (Broad Operational...
Sep 16, 2019 - Linguistic Data Consortium
Simpson, Heather; Strassel, Stephanie; Wright, Jonathan; Griffitt, Kira, 2019, "Machine Reading Phase 1 NFL Scoring Training Data", https://hdl.handle.net/11272.1/AB2/AZSUUC, Abacus Data Network, V1
Machine Reading Phase 1 NFL Scoring Training Data was developed by the Linguistic Data Consortium (LDC) and contains 110 US NFL (National Football League) scoring source documents and 110 standoff annotation files used in the DARPA (Defense Advanced Research Projects Agency) Mach...
Sep 13, 2019 - DMTI Spatial
DMTI Spatial Inc., 2019, "CanMap Postal Code Suite, v2019.3", https://hdl.handle.net/11272.1/AB2/4LXJQS, Abacus Data Network, V1
The CanMap Postal Code Suite is comprised of the following postal products: The CanMap Postal Code File - Multiple Enhanced Postal Code (MEP) product is a precision-based point file representing over 1 million postal codes across Canada. The Multiple Enhanced Postal Code product...
Aug 15, 2019 - Linguistic Data Consortium
Jones, Karen; Graff, David; Walker, Kevin; Strassel, Stephanie, 2019, "Multi-Language Conversational Telephone Speech 2011 -- East Asian", https://hdl.handle.net/11272.1/AB2/3MKZES, Abacus Data Network, V1
Multi-Language Conversational Telephone Speech 2011 – East Asian was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 19 hours of telephone speech in two distinct languages of East Asia: Thai and Lao. The data were collected primarily to support...
Aug 15, 2019 - Linguistic Data Consortium
Adams, Nikki; Bills, Aric; Conners, Thomas; David, Anne; Dubinski, Eyal; Fiscus, Jonathan G.; Gann, Ketty; Harper, Mary; Kaiser-Schatzlein, Alice; Kazi, Michael; Malyska, Nicolas; Melot, Jennifer; Onaka, Akiko; Paget, Shelley; Ray, Jessica; Richardson, Fred; Rytting, Anton; Shen, Sinney, 2019, "IARPA Babel Igbo Language Pack IARPA-babel306b-v2.0c", https://hdl.handle.net/11272.1/AB2/39RDNJ, Abacus Data Network, V1
IARPA Babel Igbo Language Pack IARPA-babel306b-v2.0c was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 207 hours of Igbo conversational and scripted telephone speech collected in 2014 and 2015 along wit...
Aug 15, 2019 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2019, "TAC KBP Evaluation Source Corpora 2016-2017", https://hdl.handle.net/11272.1/AB2/JDNLHX, Abacus Data Network, V1
TAC KBP Evaluation Source Corpora 2016-2017 was developed by the Linguistic Data Consortium (LDC) and contains the 180,003 Chinese, English and Spanish source documents used in support of all TAC KBP evaluation tracks conducted in 2016 and 2017. Text Analysis Conference (TAC) is...
Aug 15, 2019 - Linguistic Data Consortium
Mohammadi, Ariana Negar, 2019, "Corpus of Conversational Persian Transcripts", https://hdl.handle.net/11272.1/AB2/VPL800, Abacus Data Network, V1
Corpus of Conversational Persian Transcripts consists of transcripts from approximately 20 hours of naturally occurring informal conversations in the Tehrani dialect of Iranian Persian. The corresponding speech is not included in this release. This corpus is extracted from 1...
Aug 2, 2019 - Statistics Canada Open License
Statistics Canada, 2019, "Provincial Symmetric Input-Output Tables, 2015", https://hdl.handle.net/11272.1/AB2/UPWB8O, Abacus Data Network, V1
The Industry Accounts Division of Statistics Canada publishes annual provincial supply and use tables. While these industry-by-product tables closely reflect actual economic transactions, certain analytical and modeling purposes require symmetric industry-by-industry in...
Jul 19, 2019 - Linguistic Data Consortium
Linguistic Data Consortium, 2019, "First DIHARD Challenge Development - Eight Sources", https://hdl.handle.net/11272.1/AB2/XA6BRY, Abacus Data Network, V1
First DIHARD Challenge Development - Eight Sources was developed by the Linguistic Data Consortium (LDC) and contains approximately 17 hours of English and Chinese speech data along with corresponding annotations used in support of the First DIHARD Challenge. The First DIHARD Cha...
Jul 15, 2019 - Linguistic Data Consortium
Linguistic Data Consortium, 2019, "First DIHARD Challenge Evaluation - Nine Sources", https://hdl.handle.net/11272.1/AB2/HGTUHY, Abacus Data Network, V1
First DIHARD Challenge Evaluation - Nine Sources was developed by the Linguistic Data Consortium (LDC) and contains approximately 18 hours of English and Chinese speech data along with corresponding annotations used in support of the First DIHARD Challenge. The First DIHARD Chall...
Jul 15, 2019 - Linguistic Data Consortium
Chamberlain, Jon; Paun, Silviu; Yu, Juntao; Kruschwitz, Udo; Poesio, Massimo, 2019, "Phrase Detectives Corpus Version 2", https://hdl.handle.net/11272.1/AB2/6GWBA8, Abacus Data Network, V1
Phrase Detectives Corpus Version 2 was developed by the School of Computer Science and Electronic Engineering at the University of Essex and consists of approximately 407,000 tokens across 537 documents anaphorically-annotated by the Phrase Detectives Game, an online interactive...
Jul 15, 2019 - Linguistic Data Consortium
Qin, Xiaoyi; Liu, Xinzhong; Cai, Zexin; Li, Ming, 2019, "The DKU-JNU-EMA Electromagnetic Articulography Database", https://hdl.handle.net/11272.1/AB2/D9PQFH, Abacus Data Network, V1
The DKU-JNU-EMA Electromagnetic Articulography Database was developed by Duke Kunshan University and Jinan University and contains approximately 10 hours of articulography and speech data in Mandarin, Cantonese, Hakka, and Teochew Chinese from two to seven native speakers for eac...
Jul 15, 2019 - Linguistic Data Consortium
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2019, "First DIHARD Challenge Evaluation - SEEDLingS", https://hdl.handle.net/11272.1/AB2/XH4KVV, Abacus Data Network, V1
First DIHARD Challenge Evaluation - SEEDLingS was developed by Duke University and the Linguistic Data Consortium (LDC) and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the First DIHARD Challenge. Th...
Jun 28, 2019 - Statistics Canada Open License
Statistics Canada, 2019, "2016 Census Public Use Microdata File (PUMF): Hierarchical file", https://hdl.handle.net/11272.1/AB2/PYYXXR, Abacus Data Network, V1
The 2016 Census public use microdata file (PUMF) on households contains 140,720 private households with a total of 343,330 individual records, representing 1% of the population in private households in private occupied dwellings in Canada. These records were drawn from a sample o...
Jun 17, 2019 - Linguistic Data Consortium
Tracey, Jennifer; Arrigo, Michael; Strassel, Stephanie, 2019, "DEFT Spanish Committed Belief Annotation", https://hdl.handle.net/11272.1/AB2/HWOJGE, Abacus Data Network, V1
DEFT Spanish Committed Belief Annotation was developed by the Linguistic Data Consortium (LDC) and consists of approximately 67,000 tokens of Spanish discussion forum text annotated for "committed belief," which marks the level of commitment displayed by the author to the truth o...
Jun 17, 2019 - Linguistic Data Consortium
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2019, "First DIHARD Challenge Development - SEEDLingS", https://hdl.handle.net/11272.1/AB2/KXC76R, Abacus Data Network, V1
First DIHARD Challenge Development - SEEDLingS was developed by Duke University and the Linguistic Data Consortium (LDC) and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the First DIHARD Challenge. T...
Jun 17, 2019 - Linguistic Data Consortium
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William; Hajič, Jan; Oard, Douglas; Olsson, J. Scott; Picheny, Michael; Psutka, Josef, 2019, "USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition", https://hdl.handle.net/11272.1/AB2/SGOMWO, Abacus Data Network, V1
USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition, LDC Catalog Number LDC2019S11 and ISBN 1-58563-889-7, was developed by IBM as part of the MALACH (Multilingual Access to Large Spoken ArCHives) Project. This edition augments USC-SFI MALACH Interviews...
May 15, 2019 - Linguistic Data Consortium
Mena, Carlos Daniel Hernández, 2019, "CIEMPIESS Experimentation", https://hdl.handle.net/11272.1/AB2/DUUYQV, Abacus Data Network, V1
CIEMPIESS (Corpus de Investigación en Español de México del Posgrado de Ingeniería Eléctrica y Servicio Social) Experimentation was developed by the social service program "Desarrollo de Tecnologías del Habla" of the "Facultad de Ingeniería" (FI) at the National Autonomous Univer...
May 15, 2019 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2019, "TAC KBP Chinese Regular Slot Filling - Comprehensive Training and Evaluation Data 2014", https://hdl.handle.net/11272.1/AB2/ZZMOPP, Abacus Data Network, V1
TAC KBP Chinese Regular Slot Filling - Comprehensive Training and Evaluation Data 2014 was developed by the Linguistic Data Consortium (LDC) and contains training and evaluation data produced in support of the TAC KBP Chinese Regular Slot Filling evaluation track conducted in 201...
May 15, 2019 - Linguistic Data Consortium
Jones, Karen; Graff, David; Walker, Kevin; Strassel, Stephanie, 2019, "Multi-Language Conversational Telephone Speech 2011 -- English Group", https://hdl.handle.net/11272.1/AB2/ACDWDL, Abacus Data Network, V1
Multi-Language Conversational Telephone Speech 2011 – English Group was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 18 hours of telephone speech in two general varieties of English: American and South Asian. The data were collected primaril...
Apr 15, 2019 - Linguistic Data Consortium
Li, Xuansong; Peterson, Katherine; Grimes, Stephen; Strassel, Stephanie, 2019, "BOLT Egyptian-English Word Alignment -- Discussion Forum Training", https://hdl.handle.net/11272.1/AB2/AR1QCS, Abacus Data Network, V1
BOLT Egyptian-English Word Alignment – Discussion Forum Training was developed by the Linguistic Data Consortium (LDC) and consists of 400,448 words of Egyptian Arabic and English parallel text enhanced with linguistic tags to indicate word relations. The DARPA BOLT (Broad Operat...
Apr 15, 2019 - Linguistic Data Consortium
Li, Bin; Wen, Yuan; Song, Li; Dai, Rubing; Qu, Weiguang; Xue, Nianwen, 2019, "Chinese Abstract Meaning Representation 1.0", https://hdl.handle.net/11272.1/AB2/TT5KRI, Abacus Data Network, V1
Chinese Abstract Meaning Representation was developed by Brandeis University and Nanjing Normal University and is comprised of semantic representations of a set of Chinese sentences from Chinese Treebank 8.0 (LDC2013T21). Abstract Meaning Representation (AMR) captures "who is doi...
Mar 15, 2019 - Linguistic Data Consortium
Prasad, Rashmi; Webber, Bonnie; Lee, Alan; Joshi, Aravind, 2019, "Penn Discourse Treebank Version 3.0", https://hdl.handle.net/11272.1/AB2/SUU9CB, Abacus Data Network, V1
Penn Discourse Treebank (PDTB) Version 3.0 is the third release in the Penn Discourse Treebank project, the goal of which is to annotate the Wall Street Journal (WSJ) section of Treebank-2 (LDC95T7) with discourse relations. Penn Discourse Treebank Version 2 (LDC2008T05) contains...
Mar 15, 2019 - Linguistic Data Consortium
Canavan, Alexandra; Zipperlen, George; Bartlett, John, 2019, "CALLFRIEND Egyptian Arabic Second Edition", https://hdl.handle.net/11272.1/AB2/4LCUFC, Abacus Data Network, V1
CALLFRIEND Egyptian Arabic Second Edition was developed by the Linguistic Data Consortium (LDC) and consists of approximately 25 hours of unscripted telephone conversations between native speakers of Egyptian Arabic. This second edition updates the audio files to wav format, simp...
Mar 15, 2019 - Linguistic Data Consortium
Tracey, Jennifer; Strassel, Stephanie; Kuster, Neil, 2019, "VAST Chinese Speech and Transcripts", https://hdl.handle.net/11272.1/AB2/OE8XTX, Abacus Data Network, V1
VAST Chinese Speech and Transcripts was developed by the Linguistic Data Consortium (LDC) for the VAST (Video Annotation for Speech Technologies) project and is comprised of approximately 29 hours of Mandarin Chinese audio extracted from amateur video content harvested from the w...
Feb 15, 2019 - Linguistic Data Consortium
Tracey, Jennifer; Arrigo, Michael; Kuster, Neil; Strassel, Stephanie, 2019, "DEFT Chinese Committed Belief Annotation", https://hdl.handle.net/11272.1/AB2/EGZOQ9, Abacus Data Network, V1
DEFT Chinese Committed Belief Annotation was developed by the Linguistic Data Consortium (LDC) and consists of approximately 83,000 tokens of Chinese discussion forum text annotated for “committed belief,” which marks the level of commitment displayed by the author to the truth o...
Feb 15, 2019 - Linguistic Data Consortium
Upadhyay, Shyam; Hakkani-Tur, Dilek; Tur, Gokhan; Rastogi, Abhinav, 2019, "Multilingual ATIS", https://hdl.handle.net/11272.1/AB2/AGMWIU, Abacus Data Network, V1
Multilingual ATIS was developed by Google Inc. and consists of 5,871 utterances from ATIS2 (LDC93S5), ATIS3 Training Data (LDC94S19), and ATIS3 Test Data (LDC95S26) annotated and translated into Hindi and Turkish. The ATIS (Air Travel Information Services) collection was develope...
Feb 15, 2019 - Linguistic Data Consortium
Jones, Karen; Graff, David; Walker, Kevin; Strassel, Stephanie, 2019, "Multi-Language Conversational Telephone Speech 2011 -- Arabic Group", https://hdl.handle.net/11272.1/AB2/A5UT97, Abacus Data Network, V1
Multi-Language Conversational Telephone Speech 2011 – Arabic Group was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 117 hours of telephone speech in distinct dialects of colloquial Arabic: Iraqi, Levantine and Maghrebi. The data were collect...
Jan 15, 2019 - Linguistic Data Consortium
Richey, Colleen; D'Angelo, Cynthia; Alozie, Nonye; Bratt, Harry; Shriberg, Elizabeth, 2019, "SRI Speech-Based Collaborative Learning Corpus", https://hdl.handle.net/11272.1/AB2/YJWBEU, Abacus Data Network, V1
SRI Speech-Based Collaborative Learning Corpus was developed by SRI International and is comprised of approximately 120 hours of English speech from 134 US middle school students working collaboratively. The data set also contains orthographic transcriptions, manual annotation of...
Jan 15, 2019 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2019, "TAC KBP Entity Discovery and Linking - Comprehensive Training and Evaluation Data 2014-2015", https://hdl.handle.net/11272.1/AB2/LCPM63, Abacus Data Network, V1
TAC KBP Entity Discovery and Linking - Comprehensive Training and Evaluation Data 2014-2015 was developed by the Linguistic Data Consortium (LDC) and contains training and evaluation data produced in support of the TAC KBP Entity Discovery and Linking (EDL) tasks in 2014 and 2015...