Jun 17, 2019 - Linguistic Data Consortium
Tracey, Jennifer; Arrigo, Michael; Strassel, Stephanie, 2019, "DEFT Spanish Committed Belief Annotation", https://hdl.handle.net/11272.1/AB2/HWOJGE, Abacus Data Network, V1
DEFT Spanish Committed Belief Annotation was developed by the Linguistic Data Consortium (LDC) and consists of approximately 67,000 tokens of Spanish discussion forum text annotated for "committed belief," which marks the level of commitment displayed by the author to the truth o... |
Jun 17, 2019 - Linguistic Data Consortium
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2019, "First DIHARD Challenge Development - SEEDLingS", https://hdl.handle.net/11272.1/AB2/KXC76R, Abacus Data Network, V1
First DIHARD Challenge Development - SEEDLingS was developed by Duke University and the Linguistic Data Consortium (LDC) and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the First DIHARD Challenge. T... |
Jun 17, 2019 - Linguistic Data Consortium
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William; Hajič, Jan; Oard, Douglas; Olsson, J. Scott; Picheny, Michael; Psutka, Josef, 2019, "USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition", https://hdl.handle.net/11272.1/AB2/SGOMWO, Abacus Data Network, V1
USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition, LDC Catalog Number LDC2019S11 and ISBN 1-58563-889-7, was developed by IBM as part of the MALACH (Multilingual Access to Large Spoken ArCHives) Project. This edition augments USC-SFI MALACH Interviews... |
May 15, 2019 - Linguistic Data Consortium
Hernández Mena, Carlos Daniel, 2019, "CIEMPIESS Experimentation", https://hdl.handle.net/11272.1/AB2/DUUYQV, Abacus Data Network, V1
CIEMPIESS (Corpus de Investigación en Español de México del Posgrado de Ingeniería Eléctrica y Servicio Social) Experimentation was developed by the social service program "Desarrollo de Tecnologías del Habla" of the "Facultad de Ingeniería" (FI) at the National Autonomous Univer... |
May 15, 2019 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2019, "TAC KBP Chinese Regular Slot Filling - Comprehensive Training and Evaluation Data 2014", https://hdl.handle.net/11272.1/AB2/ZZMOPP, Abacus Data Network, V1
TAC KBP Chinese Regular Slot Filling - Comprehensive Training and Evaluation Data 2014 was developed by the Linguistic Data Consortium (LDC) and contains training and evaluation data produced in support of the TAC KBP Chinese Regular Slot Filling evaluation track conducted in 201... |
May 15, 2019 - Linguistic Data Consortium
Jones, Karen; Graff, David; Walker, Kevin; Strassel, Stephanie, 2019, "Multi-Language Conversational Telephone Speech 2011 -- English Group", https://hdl.handle.net/11272.1/AB2/ACDWDL, Abacus Data Network, V1
Multi-Language Conversational Telephone Speech 2011 – English Group was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 18 hours of telephone speech in two general varieties of English: American and South Asian. The data were collected primaril... |
Apr 15, 2019 - Linguistic Data Consortium
Li, Xuansong; Peterson, Katherine; Grimes, Stephen; Strassel, Stephanie, 2019, "BOLT Egyptian-English Word Alignment -- Discussion Forum Training", https://hdl.handle.net/11272.1/AB2/AR1QCS, Abacus Data Network, V1
BOLT Egyptian-English Word Alignment – Discussion Forum Training was developed by the Linguistic Data Consortium (LDC) and consists of 400,448 words of Egyptian Arabic and English parallel text enhanced with linguistic tags to indicate word relations. The DARPA BOLT (Broad Operat... |
Apr 15, 2019 - Linguistic Data Consortium
Li, Bin; Wen, Yuan; Song, Li; Dai, Rubing; Qu, Weiguang; Xue, Nianwen, 2019, "Chinese Abstract Meaning Representation 1.0", https://hdl.handle.net/11272.1/AB2/TT5KRI, Abacus Data Network, V1
Chinese Abstract Meaning Representation was developed by Brandeis University and Nanjing Normal University and is comprised of semantic representations of a set of Chinese sentences from Chinese Treebank 8.0 (LDC2013T21). Abstract Meaning Representation (AMR) captures "who is doi... |
Mar 15, 2019 - Linguistic Data Consortium
Prasad, Rashmi; Webber, Bonnie; Lee, Alan; Joshi, Aravind, 2019, "Penn Discourse Treebank Version 3.0", https://hdl.handle.net/11272.1/AB2/SUU9CB, Abacus Data Network, V1
Penn Discourse Treebank (PDTB) Version 3.0 is the third release in the Penn Discourse Treebank project, the goal of which is to annotate the Wall Street Journal (WSJ) section of Treebank-2 (LDC95T7) with discourse relations. Penn Discourse Treebank Version 2 (LDC2008T05) contains... |
Mar 15, 2019 - Linguistic Data Consortium
Canavan, Alexandra; Zipperlen, George; Bartlett, John, 2019, "CALLFRIEND Egyptian Arabic Second Edition", https://hdl.handle.net/11272.1/AB2/4LCUFC, Abacus Data Network, V1
CALLFRIEND Egyptian Arabic Second Edition was developed by the Linguistic Data Consortium (LDC) and consists of approximately 25 hours of unscripted telephone conversations between native speakers of Egyptian Arabic. This second edition updates the audio files to wav format, simp... |
Mar 15, 2019 - Linguistic Data Consortium
Tracey, Jennifer; Strassel, Stephanie; Kuster, Neil, 2019, "VAST Chinese Speech and Transcripts", https://hdl.handle.net/11272.1/AB2/OE8XTX, Abacus Data Network, V1
VAST Chinese Speech and Transcripts was developed by the Linguistic Data Consortium (LDC) for the VAST (Video Annotation for Speech Technologies) project and is comprised of approximately 29 hours of Mandarin Chinese audio extracted from amateur video content harvested from the w... |
Feb 15, 2019 - Linguistic Data Consortium
Tracey, Jennifer; Arrigo, Michael; Kuster, Neil; Strassel, Stephanie, 2019, "DEFT Chinese Committed Belief Annotation", https://hdl.handle.net/11272.1/AB2/EGZOQ9, Abacus Data Network, V1
DEFT Chinese Committed Belief Annotation was developed by the Linguistic Data Consortium (LDC) and consists of approximately 83,000 tokens of Chinese discussion forum text annotated for “committed belief,” which marks the level of commitment displayed by the author to the truth o... |
Feb 15, 2019 - Linguistic Data Consortium
Upadhyay, Shyam; Hakkani-Tur, Dilek; Tur, Gokhan; Rastogi, Abhinav, 2019, "Multilingual ATIS", https://hdl.handle.net/11272.1/AB2/AGMWIU, Abacus Data Network, V1
Multilingual ATIS was developed by Google Inc. and consists of 5,871 utterances from ATIS2 (LDC93S5), ATIS3 Training Data (LDC94S19), and ATIS3 Test Data (LDC95S26) annotated and translated into Hindi and Turkish. The ATIS (Air Travel Information Services) collection was develope... |
Feb 15, 2019 - Linguistic Data Consortium
Jones, Karen; Graff, David; Walker, Kevin; Strassel, Stephanie, 2019, "Multi-Language Conversational Telephone Speech 2011 -- Arabic Group", https://hdl.handle.net/11272.1/AB2/A5UT97, Abacus Data Network, V1
Multi-Language Conversational Telephone Speech 2011 – Arabic Group was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 117 hours of telephone speech in distinct dialects of colloquial Arabic: Iraqi, Levantine and Maghrebi. The data were collect... |
Jan 15, 2019 - Linguistic Data Consortium
Richey, Colleen; D'Angelo, Cynthia; Alozie, Nonye; Bratt, Harry; Shriberg, Elizabeth, 2019, "SRI Speech-Based Collaborative Learning Corpus", https://hdl.handle.net/11272.1/AB2/YJWBEU, Abacus Data Network, V1
SRI Speech-Based Collaborative Learning Corpus was developed by SRI International and is comprised of approximately 120 hours of English speech from 134 US middle school students working collaboratively. The data set also contains orthographic transcriptions, manual annotation of... |
Jan 15, 2019 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2019, "TAC KBP Entity Discovery and Linking - Comprehensive Training and Evaluation Data 2014-2015", https://hdl.handle.net/11272.1/AB2/LCPM63, Abacus Data Network, V1
TAC KBP Entity Discovery and Linking - Comprehensive Training and Evaluation Data 2014-2015 was developed by the Linguistic Data Consortium (LDC) and contains training and evaluation data produced in support of the TAC KBP Entity Discovery and Linking (EDL) tasks in 2014 and 2015... |
Jan 15, 2019 - Linguistic Data Consortium
Song, Zhiyi; Tracey, Jennifer; Walker, Christopher; Strassel, Stephanie, 2019, "BOLT Arabic Discussion Forum Parallel Training Data", https://hdl.handle.net/11272.1/AB2/CZR6SG, Abacus Data Network, V1
BOLT Arabic Discussion Forum Parallel Training Data was developed by the Linguistic Data Consortium (LDC) and consists of 1,169,599 tokens of Egyptian Arabic discussion forum data collected for the DARPA BOLT program along with their corresponding English translations. The BOLT (... |
Jan 8, 2019 - Statistics Canada Open License
Statistics Canada, 2019, "National Symmetric Input-Output Tables, 2015", https://hdl.handle.net/11272.1/AB2/BIGOX8, Abacus Data Network, V1
The symmetric industry by industry input-output tables show inter-industry transactions, that is, all purchases of an industry from all other industries as well as expenditures on imports and the components of value added such as wages and gross operating surplus. Similarly, the... |
Jan 1, 2019 - Statistics Canada - DLI
Statistics Canada, 2019, "Postal Code Conversion File Plus (PCCF+) Version 7B, November 2018 Postal Codes", https://hdl.handle.net/11272.1/AB2/EX6D5M, Abacus Data Network, V1
The Postal Code Conversion File Plus (PCCF+) is a SAS control program and set of associated datasets derived from the Postal Code Conversion File (PCCF), a Postal Code population weight file, the Geographic Attribute File, Health Region boundary files, and other supplementary dat... |
Jan 1, 2019 - Statistics Canada - DLI
Statistics Canada, 2019, "Social Policy Simulation Database and Model (SPSD/M), 1997 to 2025 (Version 27.1, database year 2015)", https://hdl.handle.net/11272.1/AB2/PWTUHG, Abacus Data Network, V1
The Social Policy Simulation Database and Model (SPSD/M) is a tool designed to assist those interested in analyzing the financial interactions of governments and individuals in Canada. It can help one to assess the cost implications or income redistributive effects of changes in... |
Jan 1, 2019 - Statistics Canada Open License
Statistics Canada, 2019, "Survey of Household Spending, 2017", https://hdl.handle.net/11272.1/AB2/96DRT3, Abacus Data Network, V1, UNF:6:Wepwt/hoTIAJmn6XbECWnA== [fileUNF]
The SHS primarily collects detailed information on household expenditures. It also collects information about the annual income of household members (from personal income tax data), demographic characteristics of the household, certain dwelling characteristics (e.g., type, age an... |
Jan 1, 2019 - Statistics Canada Open License
Statistics Canada, 2019, "Estimates of interprovincial migrants by province or territory of origin and destination, annual [2002-2016]", https://hdl.handle.net/11272.1/AB2/X5UHGM, Abacus Data Network, V1
Updated estimates for inter-jurisdictional employees in Canada for each jurisdiction, disaggregated by total count, T4 earnings, North American Industry Classification System, age and gender. Inter-jurisdictional employees are identified as individuals who maintain their permanen... |
Dec 17, 2018 - Linguistic Data Consortium
Linguistic Data Consortium, 2018, "HUB5 Mandarin Telephone Speech and Transcripts Second Edition", https://hdl.handle.net/11272.1/AB2/2JAJJE, Abacus Data Network, V1
HUB5 Mandarin Telephone Speech and Transcripts Second Edition was developed by the Linguistic Data Consortium (LDC) in support of US government projects for language recognition and Large Vocabulary Conversational Speech Recognition (LVCSR). The first edition was released by LDC... |
Dec 15, 2018 - Linguistic Data Consortium
Zhong, Victor; Zhang, Yuhao; Chen, Danqi; Angeli, Gabor; Manning, Christopher, 2018, "TAC Relation Extraction Dataset", https://hdl.handle.net/11272.1/AB2/SOYGGB, Abacus Data Network, V1
TAC Relation Extraction Dataset (TACRED) was developed by The Stanford NLP Group and is a large-scale relation extraction dataset with 106,264 examples built over English newswire and web text used in the NIST TAC KBP English slot filling evaluations during the period 2009-2014.... |
Dec 7, 2018 - Statistics Canada Open License
Statistics Canada, 2018, "National and Provincial Multipliers, 2014", https://hdl.handle.net/11272.1/AB2/F6I1EB, Abacus Data Network, V1
The input-output multipliers are derived from the supply and use tables. They are used to assess the effects on the economy of an exogenous change in final demand for the output of a given industry. They provide a measure of the interdependence between an industry and the rest of... |
Nov 26, 2018 - Statistics Canada Open License
Statistics Canada, 2018, "Canadian Business Counts, 2017", https://hdl.handle.net/11272.1/AB2/LL0HPS, Abacus Data Network, V1
Canadian business counts—previously called Canadian business patterns—provide counts of active businesses by industry classification and employment-size categories for Canada and the provinces and territories. Canadian business counts are based on the same criteria that were used... |
Nov 22, 2018 - Statistics Canada Open License
Statistics Canada, 2018, "Employment Insurance Coverage Survey, 2017", https://hdl.handle.net/11272.1/AB2/LQPMUE, Abacus Data Network, V1, UNF:6:A2MB5iDYZ52F6Rn7qAXG0A== [fileUNF]
The main purpose of this survey is to study the coverage of the employment insurance program. It provides a meaningful picture of who does or does not have access to EI benefits among the jobless and those in a situation of underemployment. The Employment Insurance Coverage Surve... |
Nov 22, 2018 - Statistics Canada Open License
Statistics Canada, 2018, "Entrepreneurship Indicators Database, 2015", https://hdl.handle.net/11272.1/AB2/0O8MWD, Abacus Data Network, V1
The Entrepreneurship Indicators Database is based on existing administrative data sources. The entrepreneurship data provides government researchers and academics, who have a strategic interest in promoting growth of businesses, with integrated data to facilitate analysis and pro... |
Nov 21, 2018 - Statistics Canada Open License
Statistics Canada. Special Surveys Division., 2018, "Canadian Tobacco, Alcohol and Drugs Survey (CTADS), 2017", https://hdl.handle.net/11272.1/AB2/NOCQG4, Abacus Data Network, V1, UNF:6:6KB4qQDrK9XLoS1djNHYSA== [fileUNF]
The major objectives of the survey are to: measure the frequency of cigarette smoking, as well as the amount smoked, gain insight into behaviors related to smoking, measure the prevalence and frequency of alcohol use, and measure the prevalence of drug use and the extent of harm... |
Nov 15, 2018 - Linguistic Data Consortium
Bills, Aric; Conners, Thomas; David, Anne; Dubinski, Eyal; Fiscus, Jonathan G.; Hammond, Simon; Harper, Mary; Kaiser-Schatzlein, Alice; Melot, Jennifer; Paget, Shelley; Ray, Jessica; Rytting, Anton; Shen, Sinney; Shen, Wade; Silber, Ronnie; Tzoukermann, Evelyne, 2018, "IARPA Babel Telugu Language Pack IARPA-babel303b-v1.0a", https://hdl.handle.net/11272.1/AB2/OTDPUV, Abacus Data Network, V1
IARPA Babel Telugu Language Pack IARPA-babel303b-v1.0a was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 201 hours of Telugu conversational and scripted telephone speech collected in 2013... |
Nov 15, 2018 - Linguistic Data Consortium
Maamouri, Mohamed; Bies, Ann; Kulick, Seth; Krouna, Sondos; Tabassi, Dalila; Ciul, Michael, 2018, "BOLT Egyptian Arabic Treebank - Discussion Forum", https://hdl.handle.net/11272.1/AB2/CAA0JW, Abacus Data Network, V1
BOLT Egyptian Arabic Treebank – Discussion Forum was developed by the Linguistic Data Consortium (LDC) and consists of Egyptian Arabic web discussion forum data with part-of-speech annotation, morphology, gloss and syntactic tree annotation. The DARPA BOLT (Broad Operational Lang... |
Nov 15, 2018 - Linguistic Data Consortium
Maciel, Alexandre M. A.; Rodrigues, Rodrigo L.; Barbosa, Danilo S., 2018, "Avatar Education Portuguese", https://hdl.handle.net/11272.1/AB2/BSQ4NP, Abacus Data Network, V1
Avatar Education Portuguese was developed by the University of Pernambuco and consists of approximately 80 minutes of Brazilian Portuguese microphone speech with phonetic and orthographic transcriptions. The data was developed for Avatar Education, an animated virtual assistant d... |
Oct 30, 2018 - DMTI Spatial
DMTI Spatial Inc., 2018, "CanMap Content Suite, v2018.3", https://hdl.handle.net/11272.1/AB2/BH0MGI, Abacus Data Network, V1
CanMap Content Suite contains over 100 unique and rich content layers. Each layer has a unique file and layer name with associated definitions, descriptions, attribution and metadata. All layers, with a few exceptions, are vector data consisting of polygon, polyline, or point geo... |
Oct 15, 2018 - Linguistic Data Consortium
Ellis, Joe; Getman, Jeremy; Strassel, Stephanie, 2018, "TAC KBP English Regular Slot Filling - Comprehensive Training and Evaluation Data 2009-2014", https://hdl.handle.net/11272.1/AB2/B3R0J4, Abacus Data Network, V1
TAC KBP English Regular Slot Filling - Comprehensive Training and Evaluation Data 2009-2014 was developed by the Linguistic Data Consortium (LDC) and contains training and evaluation data produced in support of the TAC KBP Slot Filling evaluation track conducted from 2009 to 2014... |
Oct 10, 2018 - Statistics Canada - DLI
Statistics Canada, 2018, "Postal Code Conversion File, August 2018 Postal Codes, 2018", https://hdl.handle.net/11272.1/AB2/COK0H7, Abacus Data Network, V1
The Postal Code Conversion File (PCCF) is a digital file which provides a correspondence between the Canada Post Corporation (CPC) six-character postal code and Statistics Canada’s standard geographic areas for which census data and other statistics are produced. Through the link... |
Oct 10, 2018 - Statistics Canada - DLI
Statistics Canada, 2018, "Postal Codes by Federal Ridings File (PCFRF), 2013 Representation Order, August 2018 Postal Codes, 2018", https://hdl.handle.net/11272.1/AB2/LDXVG3, Abacus Data Network, V1
The Postal Codes by Federal Ridings File (PCFRF) is a digital file which provides a link between the six-character postal code and Canada’s federal electoral districts (which are also known as federal ridings). Elections Canada defines a federal electoral district (FED) as any pla... |
Sep 20, 2018 - Statistics Canada Open License
Statistics Canada, 2018, "Canadian Income Survey (CIS), 2016", https://hdl.handle.net/11272.1/AB2/Y6FHKO, Abacus Data Network, V1, UNF:6:C0ePEA8UKnCFagJ6bI7RRw== [fileUNF]
The primary objective of the Canadian Income Survey (CIS) is to provide information on the income and income sources of Canadians, along with their individual and household characteristics. The data collected in the CIS is combined with Labour Force Survey (LFS, record number 370... |
Sep 17, 2018 - Linguistic Data Consortium
Bills, Aric; Conners, Thomas; David, Anne; Dubinski, Eyal; Fiscus, Jonathan G.; Harper, Mary; Hefright, Brook; Kozlov, Kirill; Melot, Jennifer; Ray, Jessica; Rytting, Anton; Phillips, Josh; Walter, Marle; Shen, Wade; Silber, Ronnie; Tzoukermann, Evelyne, 2018, "IARPA Babel Kazakh Language Pack IARPA-babel302b-v1.0a", https://hdl.handle.net/11272.1/AB2/KGA4ZX, Abacus Data Network, V1
IARPA Babel Kazakh Language Pack IARPA-babel302b-v1.0a was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 203 hours of Kazakh conversational and scripted telephone speech collected in 2013... |
Sep 17, 2018 - Linguistic Data Consortium
Morris, Amanda; Strassel, Stephanie; Li, Xuansong; Antonishek, Brian; Fiscus, Jonathan G., 2018, "HAVIC MED Event E051-E060 -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/XNNWD1, Abacus Data Network, V1
HAVIC MED Event E051-E060 – Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 53 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and related techn... |
Sep 17, 2018 - Linguistic Data Consortium
Jones, Karen; Graff, David; Walker, Kevin; Strassel, Stephanie, 2018, "Multi-Language Conversational Telephone Speech 2011 -- Spanish", https://hdl.handle.net/11272.1/AB2/9Q4DIQ, Abacus Data Network, V1
Multi-Language Conversational Telephone Speech 2011 – Spanish was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 23 hours of telephone speech in Spanish. The data were collected primarily to support research and technology evaluat... |
Sep 17, 2018 - Linguistic Data Consortium
Griffitt, Kira; Strassel, Stephanie, 2018, "BOLT Information Retrieval Comprehensive Training and Evaluation", https://hdl.handle.net/11272.1/AB2/EDRQLG, Abacus Data Network, V1
BOLT Information Retrieval Comprehensive Training and Evaluation was developed by the Linguistic Data Consortium (LDC) and consists of all data produced in support of the Information Retrieval (IR) task within the DARPA Broad Operational Language Translation (BOLT) P... |
Sep 1, 2018 - Statistics Canada Open License
Statistics Canada, 2018, "Canadian Income Survey (CIS), 2015", https://hdl.handle.net/11272.1/AB2/VYZKG9, Abacus Data Network, V1, UNF:6:fBy+mgfVu8ybi7sC7vijQA== [fileUNF]
The primary objective of the Canadian Income Survey (CIS) is to provide information on the income and income sources of Canadians, along with their individual and household characteristics. The data collected in the CIS is combined with Labour Force Survey (LFS, record number 370... |
Aug 15, 2018 - Linguistic Data Consortium
Hernández Mena, Carlos Daniel, 2018, "CIEMPIESS Balance", https://hdl.handle.net/11272.1/AB2/JWRYUR, Abacus Data Network, V1
CIEMPIESS (Corpus de Investigación en Español de México del Posgrado de Ingeniería Eléctrica y Servicio Social) Balance was developed by the Development of Speech Technologies program at the School of Engineering at the National Autonomous University of Mexico (UNAM) and consists... |
Aug 15, 2018 - Linguistic Data Consortium
Greenberg, Craig; Martin, Alvin; Graff, David; Walker, Kevin; Jones, Karen; Strassel, Stephanie, 2018, "2011 NIST Language Recognition Evaluation Test Set", https://hdl.handle.net/11272.1/AB2/0ZCWPS, Abacus Data Network, V1
2011 NIST Language Recognition Evaluation Test Set contains selected training data and the evaluation test set for the 2011 NIST Language Recognition Evaluation. It consists of approximately 204 hours of conversational telephone speech and broadcast audio collected by the Linguis... |
Aug 15, 2018 - Linguistic Data Consortium
Song, Zhiyi; Fore, Dana; Strassel, Stephanie; Lee, Haejoong; Wright, Jonathan, 2018, "BOLT English SMS/Chat", https://hdl.handle.net/11272.1/AB2/RNIGFD, Abacus Data Network, V1
BOLT English SMS/Chat was developed by the Linguistic Data Consortium (LDC) and consists of naturally-occurring Short Message Service (SMS) and Chat (CHT) data collected through data donations and live collection involving native speakers of English. The corpus contains 18,429 co... |
Aug 3, 2018 - Statistics Canada - DLI
Statistics Canada, 2018, "Postal Code Conversion File Plus (PCCF+) Version 7A, June 2017 Postal Codes", https://hdl.handle.net/11272.1/AB2/T7SNNA, Abacus Data Network, V1
The Postal Code Conversion File Plus (PCCF+) is a SAS© control program and set of associated datasets derived from the Postal Code Conversion File (PCCF), a Postal Code population weight file, the Geographic Attribute File, Health Region boundary files, and other supplementary da... |
Jul 30, 2018 - Statistics Canada Open License
Statistics Canada, 2018, "Travel Survey of Residents of Canada, 2017", https://hdl.handle.net/11272.1/AB2/0MR9LN, Abacus Data Network, V1, UNF:6:62Y1q9jwIVMm5qee2tQyzw== [fileUNF]
Since the beginning of 2005, the Travel Survey of Residents of Canada (TSRC) has been conducted to measure domestic travel in Canada. It replaces the Canadian Travel Survey (CTS). Featuring several definitional changes and a new questionnaire, this survey provides estimates of do... |
Jul 27, 2018 - Statistics Canada Open License
Statistics Canada, 2018, "International Travel Survey, 2016", https://hdl.handle.net/11272.1/AB2/EEDVV9, Abacus Data Network, V1, UNF:6:R46HtApKEP6hObkj9OEpoA== [fileUNF]
The Mail-back and E-Questionnaires and Air Exit Survey (AES) are components of the International Travel Survey Program (ITS) together with the Frontier Counts (record number 5005, see the “Additional documentation” link that follows the “Statistical activity” section). It is an o... |
Jul 18, 2018 - Linguistic Data Consortium
Bills, Aric; Conners, Thomas; Corris, Miriam; David, Anne; Dubinski, Eyal; Fiscus, Jonathan G.; Harper, Mary; Kaiser-Schatzlein, Alice; Melot, Jennifer; Paget, Shelley; Ray, Jessica; Rytting, Anton; Shen, Wade; Silber, Ronnie; Tzoukermann, Evelyne; Viswanath, Arun, 2018, "IARPA Babel Tamil Language Pack IARPA-babel204b-v1.1b", https://hdl.handle.net/11272.1/AB2/8245NT, Abacus Data Network, V1
IARPA Babel Tamil Language Pack IARPA-babel204b-v1.1b was developed by Appen for the IARPA (Intelligence Advanced Research Projects Activity) Babel program. It contains approximately 350 hours of Tamil conversational and scripted telephone speech collected in 2012 an... |
Jul 16, 2018 - Linguistic Data Consortium
Linguistic Data Consortium, 2018, "CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition", https://hdl.handle.net/11272.1/AB2/88OSWL, Abacus Data Network, V1
CALLFRIEND Mandarin Chinese-Mainland Dialect Second Edition was developed by the Linguistic Data Consortium (LDC) and consists of approximately 24 hours of unscripted telephone conversations between native speakers of the Mandarin Chinese dialect spoken in mainland China. This se... |