Jan 14, 2025 - Statistics Canada Open License
Statistics Canada, 2023, "Canadian Housing Survey, 2021", https://hdl.handle.net/11272.1/AB2/SBIZM2, Abacus Data Network, V2, UNF:6:sAGoOg7xa6Bk6pEXmMeOuQ== [fileUNF]
The Canadian Housing Survey (CHS) provides information on how Canadians feel about their housing and how housing affects them. Information is collected on core housing need, dwelling characteristics and housing tenure, perceptions on economic hardship from housing costs, dwelling... |
Aug 2, 2024 - Statistics Canada Open License
Statistics Canada, 2023, "General Social Survey Cycle 35: Social Identity, 2020", https://hdl.handle.net/11272.1/AB2/R7HAAF, Abacus Data Network, V3, UNF:6:WZvJCkiAs1DIgMbjjqaTIg== [fileUNF]
The main objective of the General Social Survey on Social Identity is to provide an overall picture of Canadians' identification, attachment, belonging and pride in their social and cultural environment. The key components of the survey include the following topics: social networ... |
Jun 27, 2024 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Code Conversion File Plus (PCCF+) Version 8A1, December 2022 Postal Codes", https://hdl.handle.net/11272.1/AB2/FPEURY, Abacus Data Network, V2
Overview The PCCF+ is a SAS control program and set of associated datasets derived from the PCCF, a 2021 postal code population weight file, the Geographic Attribute File, Health Region boundary files, and other supplementary data. PCCF+ automatically assigns a range of Statistic... |
Jan 31, 2024 - Statistics Canada Open License
Statistics Canada, 2023, "2021 Census Public Use Microdata File (PUMF) Individuals File", https://hdl.handle.net/11272.1/AB2/1WTDOP, Abacus Data Network, V4, UNF:6:4DaSfnDtAbe1lAYDJvWbag== [fileUNF]
The 2021 Census public use microdata file (PUMF) on individuals contains 980,868 records, representing 2.7% of the Canadian population. These records were drawn from a sample of one quarter of the Canadian population (sample data from questionnaire 2A-L). The 2021 PUMF contains 1... |
Jan 16, 2024 - Statistics Canada - DLI
Statistics Canada, 2023, "Social Policy Simulation Database and Model (SPSD/M), Version 30, database year 2018", https://hdl.handle.net/11272.1/AB2/URLQQA, Abacus Data Network, V3
The SPSD/M is a static microsimulation model designed to analyse financial interactions between governments and individuals in Canada. It can compute taxes paid to and cash transfers received from government. It is comprised of a database, a series of tax/transfer algorithms and... |
Jan 5, 2024 - Statistics Canada Open License
Statistics Canada, 2023, "Labour Force Survey, 2023", https://hdl.handle.net/11272.1/AB2/IJU1QK, Abacus Data Network, V12, UNF:6:AG5ZHxqjLpXkfg21OD3/IA== [fileUNF]
LFS data are used to produce the well-known unemployment rate as well as other standard labour market indicators such as the employment rate and the participation rate. The LFS also provides employment estimates by industry, occupation, public and private sector, hours worked and... |
Dec 5, 2023 - Linguistic Data Consortium
Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher, 2023, "AIDA Scenario 1 and 2 Reference Knowledge Base", https://hdl.handle.net/11272.1/AB2/YTF9AB, Abacus Data Network, V1
Abstract Introduction AIDA Scenario 1 and 2 Reference Knowledge Base was developed by the Linguistic Data Consortium (LDC) and contains the English knowledge base (KB) used for all AIDA entity linking annotation in Scenario 1 (Russia-Ukraine Relations) and Scenario 2 (Crisis in V... |
Dec 5, 2023 - Linguistic Data Consortium
Graff, David; Jones, Karen; Strassel, Stephanie; Walker, Kevin, 2023, "REMIX Telephone Collection", https://hdl.handle.net/11272.1/AB2/VJPGYX, Abacus Data Network, V1
Abstract Introduction REMIX Telephone Collection was developed by the Linguistic Data Consortium (LDC) and contains 320 hours of English conversational telephone speech from 358 speakers who had completed all tasks in one of the previous LDC Mixer collections, specifically, Mixer... |
Dec 5, 2023 - Linguistic Data Consortium
Tracey, Jennifer; Strassel, Stephanie; Getman, Jeremy; Bies, Ann; Griffitt, Kira; Graff, David; Caruso, Christopher, 2023, "AIDA Scenario 1 Practice Topic Source Data", https://hdl.handle.net/11272.1/AB2/M4QWGV, Abacus Data Network, V1
Abstract Introduction AIDA Scenario 1 Practice Topic Source Data was developed by the Linguistic Data Consortium (LDC) and is comprised of 1511 documents (text, image, and video) from English, Russian, and Ukrainian web sources. The DARPA AIDA (Active Interpretation of Disparate... |
Nov 21, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "Provincial Symmetric Input-Output Tables, 2020", https://hdl.handle.net/11272.1/AB2/OLIUIL, Abacus Data Network, V1
The Industry Accounts Division of Statistics Canada publishes annual provincial supply and use tables. While these industry-by-product tables closely reflect actual economic transactions, certain analytical and modeling purposes require symmetric industry-by-industry in... |
Nov 21, 2023 - DMTI Spatial
DMTI Spatial Inc., 2023, "CanMap Content Suite, v2023.3", https://hdl.handle.net/11272.1/AB2/KIBZCV, Abacus Data Network, V1
CanMap Content Suite contains over 100 unique and rich content layers. Each layer has a unique file and layer name with associated definitions, descriptions, attribution and metadata. All layers, with a few exceptions, are vector data consisting of polygon, polyline, or point geo... |
Oct 31, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Codes by Federal Ridings File (PCFRF) 2013 Representation Order, September 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/RKXIRY, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin... |
Oct 31, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Codes by Federal Ridings File (PCFRF) 2013 Representation Order, June 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/IGPZPC, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin... |
Oct 31, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Codes by Federal Ridings File (PCFRF) 2013 Representation Order, March 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/OP7TU4, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin... |
Oct 30, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Code Conversion File, September 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/PGWFU5, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin... |
Oct 30, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Code Conversion File, June 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/TPXGYR, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin... |
Oct 30, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Code Conversion File, March 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/ETUHV2, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin... |
Oct 30, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "Canadian Tobacco and Nicotine Survey (CTNS), 2022", https://hdl.handle.net/11272.1/AB2/PWWFK3, Abacus Data Network, V1, UNF:6:kteRE6QsXKzonyDVAqRz/Q== [fileUNF]
The information collected in this survey will be used to fill important data gaps related to vaping, cannabis, and tobacco usage. The data will inform policy and provide a current snapshot of use across Canada. Until 2017, Statistics Canada conducted the Canadian Tobacco, Alcohol... |
Oct 24, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "Canadian Tobacco and Nicotine Survey (CTNS), 2021", https://hdl.handle.net/11272.1/AB2/YOLZ1M, Abacus Data Network, V1, UNF:6:l5CjFBAeehXcIm89HK6dpA== [fileUNF]
The information collected in this survey will be used to fill important data gaps related to vaping, cannabis, and tobacco usage. The data will inform policy and provide a current snapshot of use across Canada. Until 2017, Statistics Canada conducted the Canadian Tobacco, Alcohol... |
Oct 17, 2023 - Linguistic Data Consortium
Miller, David; Walker, Kevin; Graff, David; Canavan, Alexandra, 2023, "CALLFRIEND Russian Text", https://hdl.handle.net/11272.1/AB2/BNFFSZ, Abacus Data Network, V1
Abstract Introduction CALLFRIEND Russian Text (LDC2023T09) was developed by the Linguistic Data Consortium and consists of transcripts for approximately 48 hours of telephone conversations (100 recordings) between native Russian speakers. The calls were recorded in 1999 as part o... |
Oct 17, 2023 - Linguistic Data Consortium
Delgado, Dana; Jones, Karen; Walker, Kevin; Strassel, Stephanie; Caruso, Christopher; Graff, David, 2023, "2019 OpenSAT Public Safety Communications Simulation", https://hdl.handle.net/11272.1/AB2/BOXO5O, Abacus Data Network, V1
Abstract Introduction 2019 OpenSAT Public Safety Communications Simulation was developed by the Linguistic Data Consortium (LDC) and contains approximately 141 hours of speech recordings and transcripts used in the National Institute of Standards and Technology (NIST)... |
Oct 16, 2023 - Linguistic Data Consortium
Miller, David; Walker, Kevin; Graff, David; Canavan, Alexandra, 2023, "CALLFRIEND Russian Speech", https://hdl.handle.net/11272.1/AB2/NGRVVO, Abacus Data Network, V1
Abstract Introduction CALLFRIEND Russian Speech (LDC2023S08) was developed by the Linguistic Data Consortium (LDC) and consists of approximately 48 hours of telephone conversations (100 recordings) between native speakers of Russian. The calls were recorded in 1999 as part of the... |
Oct 11, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "National Graduates Survey - Public Use Microdata File, 2015 (time of graduation), 2018 (time of interview)", https://hdl.handle.net/11272.1/AB2/OHTEHG, Abacus Data Network, V1, UNF:6:lPN9wkgqJdE1ZSZ+xXtGXA== [fileUNF]
Data from this survey will be used to better understand the experiences and outcomes of graduates, and to improve government programs. The survey is designed to collect details on topics such as: i) the extent to which graduates of postsecondary programs have been successful in o... |
Oct 10, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "General Social Survey Cycle 34: Canadians' Safety (Victimization), 2019", https://hdl.handle.net/11272.1/AB2/TY08CB, Abacus Data Network, V1, UNF:6:8LsnPtiVJZxig7nMRWYrvg== [fileUNF]
The main objective of the GSS on Canadians' Safety is to better understand how Canadians perceive crime and the justice system and to capture information on their experiences of victimization. This survey is the only national survey of self-reported victimization and is collected... |
Sep 28, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "Canadian Income Survey, 2020", https://hdl.handle.net/11272.1/AB2/I6BDAC, Abacus Data Network, V2, UNF:6:EFjvtjGOa6N2Mj23BZ4j/Q== [fileUNF]
The primary objective of the Canadian Income Survey (CIS) is to provide information on the income and income sources of Canadians, along with their individual and household characteristics. The data collected in the CIS is combined with Labour Force Survey (LFS, record number 370... |
Sep 12, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "2021 Census Geographic Attribute File", https://hdl.handle.net/11272.1/AB2/BXLPEP, Abacus Data Network, V1
The 2021 Geographic Attribute File contains all the 2021 Census DBs and their selected attributes, such as standard geographic areas’ unique identifiers (UIDs), DGUIDs, population and dwelling counts, land area, 2021 Census incompletely enumerated Indian reserves and Indian settl... |
Aug 18, 2023 - Linguistic Data Consortium
Hernández Mena, Carlos Daniel; Gatt, Albert; Borg, Claudia; DeMarco, Andrea; van der Plas, Lonneke, 2023, "MASRI Synthetic", https://hdl.handle.net/11272.1/AB2/WBPJBV, Abacus Data Network, V1
Abstract Introduction MASRI (Maltese Automatic Speech Recognition I) Synthetic was developed by the MASRI team at the University of Malta and consists of approximately 99 hours of synthesized Maltese speech. Data Source sentences were extracted from the Maltese Language Resource... |
Aug 18, 2023 - Linguistic Data Consortium
Pradhan, Sameer; Cole, Ronald Allan; Ward, Wayne, 2023, "MyST Children's Conversational Speech", https://hdl.handle.net/11272.1/AB2/QUHJRW, Abacus Data Network, V1
Abstract Introduction MyST (My Science Tutor) Children's Conversational Speech was developed by Boulder Learning Inc. It is comprised of approximately 470 hours of English speech from 1371 students in grades 3-5 conversing with a virtual science tutor in eight areas of science in... |
Aug 17, 2023 - Linguistic Data Consortium
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Indonesian Representative Language Pack", https://hdl.handle.net/11272.1/AB2/JLEISQ, Abacus Data Network, V1
Abstract Introduction LORELEI Indonesian Representative Language Pack consists of Indonesian monolingual text, Indonesian-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI... |
Aug 17, 2023 - Linguistic Data Consortium
Helgadóttir, Inga Rún; Kjaran, Róbert; Nikulásdóttir, Anna Björk; Gudnason, Jon, 2023, "Althingi Parliamentary Speech", https://hdl.handle.net/11272.1/AB2/NIG304, Abacus Data Network, V1
Abstract Introduction Althingi Parliamentary Speech consists of approximately 542 hours of recorded speech from Althingi, the Icelandic Parliament, along with corresponding transcripts, a pronunciation dictionary and two language models. Speeches date from 2005-2016. This dataset... |
Aug 17, 2023 - Linguistic Data Consortium
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Thai Representative Language Pack", https://hdl.handle.net/11272.1/AB2/GCBMNV, Abacus Data Network, V1
Abstract Introduction LORELEI Thai Representative Language Pack (LDC2023T08) consists of Thai monolingual text, Thai-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI progr... |
Aug 17, 2023 - Linguistic Data Consortium
Brandschain, Linda; Walker, Kevin; Graff, David, 2023, "Mixer 7 Spanish Speech", https://hdl.handle.net/11272.1/AB2/CYMBUE, Abacus Data Network, V1
Abstract Introduction Mixer 7 Spanish Speech (LDC2023S04) was developed by the Linguistic Data Consortium (LDC) and contains 9,600 hours of audio recordings of interviews, transcript readings and conversational telephone speech involving 191 distinct native Spanish speakers. This... |
Aug 17, 2023 - Linguistic Data Consortium
Maamouri, Mohamed; Graff, David, 2023, "Moroccan Arabic - English Lexical Database", https://hdl.handle.net/11272.1/AB2/E8N63E, Abacus Data Network, V1
Abstract Introduction Moroccan Arabic - English Lexical Database was developed by the Linguistic Data Consortium (LDC). It is comprised of a set of five interrelated tables presenting each Moroccan Arabic word as an orthographic form in Arabic script and a pronunciation form in I... |
Aug 17, 2023 - Linguistic Data Consortium
Hernández Mena, Carlos Daniel; Borsky, Michal; Mollberg, David; Guðmundsson, Smári Freyr; Hedström, Staffan; Pálsson, Ragnar; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Magnusdottir, Eydis Huld; Þórhallsdóttir, Ragnheiður; Gudnason, Jon, 2023, "Samrómur Children Icelandic Speech 1.0", https://hdl.handle.net/11272.1/AB2/LKGTIU, Abacus Data Network, V1
Abstract Introduction Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (childre... |
Aug 17, 2023 - Linguistic Data Consortium
Mollberg, David; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Steingrimsson, Steinthor; Magnusdottir, Eydis Huld; Fong, Judy; Borsky, Michal; Gudnason, Jon, 2023, "Samrómur Icelandic Speech 1.0", https://hdl.handle.net/11272.1/AB2/JXQH5C, Abacus Data Network, V1
Abstract Introduction Samrómur Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 145 hours of Icelandic prompted speech from 8,392 speakers representing 100,... |
Aug 17, 2023 - Linguistic Data Consortium
Sen Bhattacharya, Basabdatta; Subramanian, Aiswarya; Chatterjee, Purbayan; Dey, Sounak, 2023, "Spoken Digits in Hindi and Indian English", https://hdl.handle.net/11272.1/AB2/VQQK0O, Abacus Data Network, V1
Abstract Introduction Spoken Digits in Hindi and Indian English was developed by the Birla Institute of Technology and Science Pilani. It contains approximately two hours of speech comprised of spoken digits from one to ten in Hindi and English with regional accents from across I... |
Aug 17, 2023 - Linguistic Data Consortium
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2023, "Second DIHARD Challenge Development - SEEDLingS", https://hdl.handle.net/11272.1/AB2/PKMDCL, Abacus Data Network, V1
Abstract Introduction Second DIHARD Challenge Development - SEEDLingS was developed by Duke University and LDC and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the Second DIHARD Challenge. This relea... |
Aug 17, 2023 - Linguistic Data Consortium
Hirschberg, Julia; Gravano, Agustin; Benus, Stefan; Ward, Gregory; German, Elisa Sneed, 2023, "Columbia Games Corpus", https://hdl.handle.net/11272.1/AB2/TGPSBO, Abacus Data Network, V1
Abstract Introduction Columbia Games Corpus was developed by the Spoken Language Group, Columbia University and the Department of Linguistics, Northwestern University. It consists of approximately 10 hours of spontaneous English conversation along with corresponding orthographic... |
Jul 24, 2023 - Linguistic Data Consortium
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2023, "Second DIHARD Challenge Evaluation - SEEDLingS", https://hdl.handle.net/11272.1/AB2/CXOTQ3, Abacus Data Network, V1
Abstract Introduction Second DIHARD Challenge Evaluation - SEEDLingS was developed by Duke University and the Linguistic Data Consortium (LDC) and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the Sec... |
Jul 24, 2023 - Linguistic Data Consortium
Amith, Jonathan D.; Alcántara, Amelia Domínguez; Osollo, Hermelindo Salazar; Castañeda, Ceferino Salgado; Salgado, Eleuterio Gorostiza, 2023, "Ethnobotanical Research and Language Documentation of Nahuatl", https://hdl.handle.net/11272.1/AB2/EEHKAK, Abacus Data Network, V1
Abstract Introduction Ethnobotanical Research and Language Documentation of Nahuatl consists of approximately 190 hours of field recordings collected in the Sierra Nororiental and Sierra Norte regions of Puebla, Mexico. The corpus contains audio and video recordings of native Nah... |
Jun 21, 2023 - Linguistic Data Consortium
Greenberg, Craig; Sadjadi, Omid; Singer, Elliot; Walker, Kevin; Jones, Karen; Caruso, Christopher; Wright, Jonathan; Strassel, Stephanie, 2023, "2019 NIST Speaker Recognition Evaluation Test Set -- CTS Challenge", https://hdl.handle.net/11272.1/AB2/JEG5RH, Abacus Data Network, V1
Abstract Introduction 2019 NIST Speaker Recognition Evaluation Test Set -- CTS Challenge was developed by the Linguistic Data Consortium (LDC) and NIST (National Institute of Standards and Technology). It contains approximately 635 hours of Tunisian Arabic telephone recordings fo... |
Jun 21, 2023 - DMTI Spatial
DMTI Spatial Inc., 2023, "CanMap Content Suite, v2022.3", https://hdl.handle.net/11272.1/AB2/K4JVA5, Abacus Data Network, V2
CanMap Content Suite contains over 100 unique and rich content layers. Each layer has a unique file and layer name with associated definitions, descriptions, attribution and metadata. All layers, with a few exceptions, are vector data consisting of polygon, polyline, or point geo... |
Jun 20, 2023 - Linguistic Data Consortium
Ma, Xiaoyi, 2023, "Hong Kong Parallel Text", https://hdl.handle.net/11272.1/AB2/MX5PAM, Abacus Data Network, V1
Abstract Introduction Hong Kong Parallel Text was developed by the Linguistic Data Consortium (LDC) and contains data from three sub-corpora, namely Hong Kong Hansards Parallel Text, Hong Kong Laws Parallel Text and Hong Kong News Parallel Text. Hong Kong Hansards Parallel Text c... |
Jun 20, 2023 - Linguistic Data Consortium
NIST Multimodal Information Group, 2023, "NIST 2008 Open Machine Translation (OpenMT) Evaluation", https://hdl.handle.net/11272.1/AB2/YEK10L, Abacus Data Network, V1
Abstract Introduction NIST 2008 Open Machine Translation (OpenMT) Evaluation, Linguistic Data Consortium (LDC) catalog number LDC2010T21 and ISBN 1-58563-567-7, is a package containing source data, reference translations and scoring software used in the NIST 2008 OpenMT evaluatio... |
Jun 20, 2023 - Linguistic Data Consortium
NIST Multimodal Information Group, 2023, "NIST 2006 Open Machine Translation (OpenMT) Evaluation", https://hdl.handle.net/11272.1/AB2/6UBB7S, Abacus Data Network, V1
Abstract Introduction NIST 2006 Open Machine Translation (OpenMT) Evaluation, Linguistic Data Consortium (LDC) catalog number LDC2010T17 and ISBN 1-58563-562-6, is a package containing source data, reference translations and scoring software used in the NIST 2006 OpenMT evaluatio... |
Jun 20, 2023 - Linguistic Data Consortium
NIST Multimodal Information Group, 2023, "NIST 2003 Open Machine Translation (OpenMT) Evaluation", https://hdl.handle.net/11272.1/AB2/ZH4VPY, Abacus Data Network, V1
Abstract Introduction NIST 2003 Open Machine Translation (OpenMT) Evaluation is a package containing source data, reference translations, and scoring software used in the NIST 2003 OpenMT evaluation. It is designed to help evaluate the effectiveness of machine translation systems... |
Jun 16, 2023 - Linguistic Data Consortium
NIST Multimodal Information Group, 2023, "NIST 2002 Open Machine Translation (OpenMT) Evaluation", https://hdl.handle.net/11272.1/AB2/AO1F7Z, Abacus Data Network, V1
Abstract Introduction NIST 2002 Open Machine Translation (OpenMT) Evaluation is a package containing source data, reference translations, and scoring software used in the NIST 2002 OpenMT evaluation. It is designed to help evaluate the effectiveness of machine translation systems... |
Jun 16, 2023 - Linguistic Data Consortium
Ma, Xiaoyi, 2023, "Chinese News Translation Text Part 1", https://hdl.handle.net/11272.1/AB2/1AHIZ3, Abacus Data Network, V1
Abstract Introduction Chinese News Translation Text Part 1 was developed by the Linguistic Data Consortium (LDC) and contains approximately 474,000 characters of Chinese text and corresponding English translations, totalling approximately 285,000 words. All the stories in this co... |
Jun 16, 2023 - Linguistic Data Consortium
Ma, Xiaoyi, 2023, "Multiple-Translation Chinese (MTC) Part 3", https://hdl.handle.net/11272.1/AB2/NYIMDR, Abacus Data Network, V1
Abstract Introduction Multiple-Translation Chinese (MTC) Part 3 was produced by the Linguistic Data Consortium (LDC), catalog number LDC2004T07 and ISBN 1-58563-289-9. To support the development of automatic means for evaluating translation quality, the LDC was sponsored to solicit fo... |
Jun 16, 2023 - Linguistic Data Consortium
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Zulu Representative Language Pack", https://hdl.handle.net/11272.1/AB2/TYSP2P, Abacus Data Network, V1
Abstract Introduction LORELEI Zulu Representative Language Pack consists of Zulu monolingual text, Zulu-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI program. The LOREL... |