Metrics
701,771 Downloads
The Abacus Data Network is a data repository collaboration involving Libraries at Simon Fraser University (SFU), the University of British Columbia (UBC), the University of Northern British Columbia (UNBC) and the University of Victoria (UVic).
151 to 200 of 2,582 Results
Nov 21, 2023 - DMTI Spatial
DMTI Spatial Inc., 2023, "CanMap Content Suite, v2023.3", https://hdl.handle.net/11272.1/AB2/KIBZCV, Abacus Data Network, V1
CanMap Content Suite contains over 100 unique and rich content layers. Each layer has a unique file and layer name with associated definitions, descriptions, attribution and metadata. All layers, with a few exceptions, are vector data consisting of polygon, polyline, or point geo...
Nov 7, 2023 - Statistics Canada Open License
Statistics Canada, 2009, "General Social Survey, Cycle 13: Victimization, 1999", https://hdl.handle.net/11272.1/AB2/XPUOAA, Abacus Data Network, V2
This package is designed to enable interested users to access and manipulate the microdata file for the thirteenth cycle of the General Social Survey (GSS). It contains information on the objectives, methodology, and estimation procedures, as well as guidelines for releasing esti...
Oct 31, 2023 - Restricted data
BC Assessment, 2020, "BC Assessment Data Advice and Inventory Extracts, 2016-2022", https://hdl.handle.net/11272.1/AB2/LAPUAB, Abacus Data Network, V6
The Data Advice product from BC Assessment (BCA) provides value assessments and sales information for properties in British Columbia. Two types of data files are available to UBC researchers: REVD (Revised Roll): annual report including property information and valuation for all...
Oct 31, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Codes by Federal Ridings File (PCFRF) 2013 Representation Order, September 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/RKXIRY, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Oct 31, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Codes by Federal Ridings File (PCFRF) 2013 Representation Order, June 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/IGPZPC, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Oct 31, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Codes by Federal Ridings File (PCFRF) 2013 Representation Order, March 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/OP7TU4, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Oct 30, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Code Conversion File, September 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/PGWFU5, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Oct 30, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Code Conversion File, June 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/TPXGYR, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Oct 30, 2023 - Statistics Canada - DLI
Statistics Canada, 2023, "Postal Code Conversion File, March 2023 Postal Codes, 2023", https://hdl.handle.net/11272.1/AB2/ETUHV2, Abacus Data Network, V1
The Postal Code Project is responsible for linking the approximately 900,000 single postal codes in Canada to Statistics Canada’s Census dissemination geography, (presently 2021 Census geography). This process is performed by using data provided by Canada Post Corporation and lin...
Oct 30, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "Canadian Tobacco and Nicotine Survey (CTNS), 2022", https://hdl.handle.net/11272.1/AB2/PWWFK3, Abacus Data Network, V1, UNF:6:kteRE6QsXKzonyDVAqRz/Q== [fileUNF]
The information collected in this survey will be used to fill important data gaps related to vaping, cannabis, and tobacco usage. The data will inform policy and provide a current snapshot of use across Canada. Until 2017, Statistics Canada conducted the Canadian Tobacco, Alcohol...
Oct 24, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "Canadian Tobacco and Nicotine Survey (CTNS), 2021", https://hdl.handle.net/11272.1/AB2/YOLZ1M, Abacus Data Network, V1, UNF:6:l5CjFBAeehXcIm89HK6dpA== [fileUNF]
The information collected in this survey will be used to fill important data gaps related to vaping, cannabis, and tobacco usage. The data will inform policy and provide a current snapshot of use across Canada. Until 2017, Statistics Canada conducted the Canadian Tobacco, Alcohol...
Oct 17, 2023 - Linguistic Data Consortium
Miller, David; Walker, Kevin; Graff, David; Canavan, Alexandra, 2023, "CALLFRIEND Russian Text", https://hdl.handle.net/11272.1/AB2/BNFFSZ, Abacus Data Network, V1
Abstract Introduction CALLFRIEND Russian Text (LDC2023T09) was developed by the Linguistic Data Consortium and consists of transcripts for approximately 48 hours of telephone conversations (100 recordings) between native Russian speakers. The calls were recorded in 1999 as part o...
Oct 17, 2023 - Linguistic Data Consortium
Delgado, Dana; Jones, Karen; Walker, Kevin; Strassel, Stephanie; Caruso, Christopher; Graff, David, 2023, "2019 OpenSAT Public Safety Communications Simulation", https://hdl.handle.net/11272.1/AB2/BOXO5O, Abacus Data Network, V1
Abstract Introduction 2019 OpenSAT Public Safety Communications Simulation was developed by the Linguistic Data Consortium (LDC) and contains approximately 141 hours of speech recordings and transcripts used in the National Institute of Standards and Technology (NIST)...
Oct 16, 2023 - Linguistic Data Consortium
Miller, David; Walker, Kevin; Graff, David; Canavan, Alexandra, 2023, "CALLFRIEND Russian Speech", https://hdl.handle.net/11272.1/AB2/NGRVVO, Abacus Data Network, V1
Abstract Introduction CALLFRIEND Russian Speech (LDC2023S08) was developed by the Linguistic Data Consortium (LDC) and consists of approximately 48 hours of telephone conversations (100 recordings) between native speakers of Russian. The calls were recorded in 1999 as part of the...
Oct 11, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "National Graduates Survey - Public Use Microdata File, 2015 (time of graduation), 2018 (time of interview)", https://hdl.handle.net/11272.1/AB2/OHTEHG, Abacus Data Network, V1, UNF:6:lPN9wkgqJdE1ZSZ+xXtGXA== [fileUNF]
Data from this survey will be used to better understand the experiences and outcomes of graduates, and to improve government programs. The survey is designed to collect details on topics such as: i) the extent to which graduates of postsecondary programs have been successful in o...
Oct 10, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "General Social Survey Cycle 34: Canadians' Safety (Victimization), 2019", https://hdl.handle.net/11272.1/AB2/TY08CB, Abacus Data Network, V1, UNF:6:8LsnPtiVJZxig7nMRWYrvg== [fileUNF]
The main objective of the GSS on Canadians' Safety is to better understand how Canadians perceive crime and the justice system and to capture information on their experiences of victimization. This survey is the only national survey of self-reported victimization and is collected...
Sep 28, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "Canadian Income Survey, 2020", https://hdl.handle.net/11272.1/AB2/I6BDAC, Abacus Data Network, V2, UNF:6:EFjvtjGOa6N2Mj23BZ4j/Q== [fileUNF]
The primary objective of the Canadian Income Survey (CIS) is to provide information on the income and income sources of Canadians, along with their individual and household characteristics. The data collected in the CIS is combined with Labour Force Survey (LFS, record number 370...
Sep 12, 2023 - Statistics Canada Open License
Statistics Canada, 2023, "2021 Census Geographic Attribute File", https://hdl.handle.net/11272.1/AB2/BXLPEP, Abacus Data Network, V1
The 2021 Geographic Attribute File contains all the 2021 Census DBs and their selected attributes, such as standard geographic areas’ unique identifiers (UIDs), DGUIDs, population and dwelling counts, land area, 2021 Census incompletely enumerated Indian reserves and Indian settl...
Aug 29, 2023 - Linguistic Data Consortium
Luqman, Hamzah; Mahmoud, Sabri; Awaida, Sameh, 2016, "KAFD: Arabic Font Database", https://hdl.handle.net/11272.1/AB2/A0JPYM, Abacus Data Network, V2
Introduction KAFD: Arabic Font Database was developed by King Fahd University of Petroleum & Minerals and Qassim University. It is comprised of approximately 2.5 million scanned Arabic printed pages in a variety of fonts, sizes and resolutions along with corresponding transcripts...
Aug 29, 2023 - Linguistic Data Consortium
Abdulaziz, Azhar; Kepuska, Veton, 2017, "Noisy TIMIT Speech", https://hdl.handle.net/11272.1/AB2/FFFXT2, Abacus Data Network, V2
Introduction Noisy TIMIT Speech was developed by the Florida Institute of Technology and contains approximately 322 hours of speech from the TIMIT Acoustic-Phonetic Continuous Speech Corpus (LDC93S1) modified with different additive noise levels. Only the audio has been modified;...
Aug 29, 2023 - Linguistic Data Consortium
Chen, Gang; Neubauer, Juergen; Garellek, Marc; Samlan, Robin; Gerratt, Bruce R.; Kreiman, Jody; Alwan, Abeer, 2017, "UCLA High-Speed Laryngeal Video and Audio", https://hdl.handle.net/11272.1/AB2/OWLHMG, Abacus Data Network, V2
UCLA High-Speed Laryngeal Video and Audio was developed by UCLA Speech Processing and Auditory Perception Laboratory and is comprised of high-speed laryngeal video recordings of the vocal folds and synchronized audio recordings from nine subjects collected between April 2012 and...
Aug 29, 2023 - Linguistic Data Consortium
Vincent, Emmanuel; Barker, Jon; Watanabe, Shinji; Le Roux, Jonathan; Nesta, Francesco; Matassoni, Marco, 2017, "CHiME2 WSJ0", https://hdl.handle.net/11272.1/AB2/IUB8PD, Abacus Data Network, V2
CHiME2 WSJ0 was developed as part of The 2nd CHiME Speech Separation and Recognition Challenge and contains approximately 166 hours of English speech from a noisy living room environment. The CHiME Challenges focus on distant-microphone automatic speech recognition (ASR) in real-...
Aug 29, 2023 - Linguistic Data Consortium
Tracey, Jennifer; Lee, Haejoong; Strassel, Stephanie, 2017, "BOLT English Discussion Forums", https://hdl.handle.net/11272.1/AB2/VDFID2, Abacus Data Network, V2
BOLT English Discussion Forums was developed by the Linguistic Data Consortium (LDC) and consists of 830,440 discussion forum threads in English harvested from the Internet using a combination of manual and automatic processes. The DARPA BOLT (Broad Operational Language Translati...
Aug 29, 2023 - Linguistic Data Consortium
Tracey, Jennifer; Lee, Haejoong; Strassel, Stephanie; Ismael, Safa, 2018, "BOLT Arabic Discussion Forums", https://hdl.handle.net/11272.1/AB2/DP4INP, Abacus Data Network, V2
BOLT Arabic Discussion Forums was developed by the Linguistic Data Consortium (LDC) and consists of 813,080 discussion forum threads in Egyptian Arabic harvested from the Internet using a combination of manual and automatic processes. The DARPA BOLT (Broad Operational Language Tr...
Aug 29, 2023 - Linguistic Data Consortium
Ferraro, Francis; Thomas, Max; Wolfe, Travis; R. Gormley, Matthew; Harman, Craig; Van Durme, Benjamin, 2018, "Concretely Annotated New York Times", https://hdl.handle.net/11272.1/AB2/VA98GM, Abacus Data Network, V2
Introduction Concretely Annotated New York Times was developed by Johns Hopkins University’s Human Language Technology Center of Excellence. It adds multiple kinds and instances of automatically-generated syntactic, semantic and coreference annotations to The New York Times Annot...
Aug 29, 2023 - Linguistic Data Consortium
Ferraro, Francis; Thomas, Max; Gormley, Matthew R.; Wolfe, Travis; Harman, Craig; Van Durme, Benjamin, 2018, "Concretely Annotated English Gigaword", https://hdl.handle.net/11272.1/AB2/NQCDFR, Abacus Data Network, V2
Concretely Annotated English Gigaword was developed by Johns Hopkins University’s Human Language Technology Center of Excellence (JHU). It adds multiple kinds and instances of automatically-generated syntactic, semantic and coreference annotations to English Gigaword Fifth Editio...
Aug 29, 2023 - Linguistic Data Consortium
Morris, Amanda; Strassel, Stephanie; Li, Xuansong; Antonishek, Brian; Fiscus, Jonathan G., 2019, "HAVIC MED Progress Test -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/QYTBMD, Abacus Data Network, V2
HAVIC MED Progress Test – Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 3,650 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and related technologies, LDC...
Aug 29, 2023 - Linguistic Data Consortium
Greenberg, Craig; Martin, Alvin; Graff, David; Brandschain, Linda; Walker, Kevin, 2017, "2010 NIST Speaker Recognition Evaluation Test Set", https://hdl.handle.net/11272.1/AB2/2CPM3O, Abacus Data Network, V2
Introduction 2010 NIST Speaker Recognition Evaluation Test Set was developed by the Linguistic Data Consortium (LDC) and NIST (National Institute of Standards and Technology). It contains 2,255 hours of American English telephone speech and speech recorded over a microphone chann...
Aug 29, 2023 - Linguistic Data Consortium
Barker, Jon; Marxer, Ricard; Vincent, Emmanuel; Watanabe, Shinji, 2017, "CHiME3", https://hdl.handle.net/11272.1/AB2/HGHM4U, Abacus Data Network, V2
Introduction CHiME3 was developed as part of The 3rd CHiME Speech Separation and Recognition Challenge and contains approximately 342 hours of English speech and transcripts from noisy environments and 50 hours of noisy environment audio. The CHiME Challenges focus on distant-mic...
Aug 29, 2023 - Linguistic Data Consortium
Bu, Hui, 2018, "AISHELL-1", https://hdl.handle.net/11272.1/AB2/2WMDTT, Abacus Data Network, V2
AISHELL-1 was developed by Beijing Shell Shell Technology Co., Ltd. It contains approximately 520 hours of Chinese Mandarin speech from 400 speakers recorded simultaneously on three different devices with associated transcripts. The goal of the collection was to support speech re...
Aug 29, 2023 - Linguistic Data Consortium
Brandschain, Linda; Walker, Kevin; Graff, David; Cieri, Christopher; Neely, Abby; Mirghafori, Nikki; Peskin, Barbara; Godfrey, Jack; Strassel, Stephanie; Goodman, Fred; Doddington, George R.; King, Mike, 2021, "Mixer 4 and 5 Speech", https://hdl.handle.net/11272.1/AB2/LU0TQ8, Abacus Data Network, V2
Abstract Introduction Mixer 4 and 5 Speech was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 14,185 hours of audio recordings of conversational telephone speech, interviews, elicitation exercises and transcript readings involving 616 distinct...
Aug 29, 2023 - Linguistic Data Consortium
Graff, David; Ma, Xiaoyi; Strassel, Stephanie; Walker, Kevin; Jones, Karen, 2021, "RATS Speaker Identification", https://hdl.handle.net/11272.1/AB2/BZYHPS, Abacus Data Network, V2
Abstract Introduction RATS Speaker Identification was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 1,900 hours of Levantine Arabic, Farsi, Dari, Pashto and Urdu conversational telephone speech with annotations of speech segments. The audio w...
Aug 29, 2023 - Linguistic Data Consortium
Morris, Amanda; Strassel, Stephanie; Li, Xuansong; Antonishek, Brian; Fiscus, Jonathan G., 2022, "HAVIC MED Training Data -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/TQLGAR, Abacus Data Network, V2
Abstract Introduction HAVIC MED Training Data -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 2,100 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and re...
Aug 29, 2023 - Linguistic Data Consortium
Li, Xuansong; Strassel, Stephanie; Jones, Karen; Antonishek, Brian; Fiscus, Jonathan G., 2022, "HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/SXVGS7, Abacus Data Network, V2
Abstract Introduction HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 3,800 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and rel...
Aug 29, 2023 - Linguistic Data Consortium
Mahmoud, Sabri; Ahmad, Irfan; Al-Khatib, Wasfi; Alshayeb, Mohammad; Parvez, Mohammad; Märgner, Volker; Fink, Gernot, 2015, "KHATT: Handwritten Arabic Text", https://hdl.handle.net/11272.1/AB2/PL0DHA, Abacus Data Network, V2
Introduction KHATT: Handwritten Arabic Text was developed by King Fahd University of Petroleum & Minerals, Technical University of Dortmund and Braunschweig University of Technology. It is comprised of scanned Arabic handwriting from 1,000 distinct male and female writers represe...
Aug 25, 2023 - Linguistic Data Consortium
Alwan, Abeer; Lulich, Steven; Sommers, Mitchell, 2015, "The Subglottal Resonances Database", https://hdl.handle.net/11272.1/AB2/R82KKG, Abacus Data Network, V2
Introduction The Subglottal Resonances Database was developed by Washington University and University of California Los Angeles and consists of 45 hours of simultaneous microphone and subglottal accelerometer recordings of 25 adult male and 25 adult female speakers of American En...
Aug 25, 2023 - Linguistic Data Consortium
Walker, Kevin; Ma, Xiaoyi; Graff, David; Strassel, Stephanie; Sessa, Stephanie; Jones, Karen, 2015, "RATS Speech Activity Detection", https://hdl.handle.net/11272.1/AB2/1UISJ7, Abacus Data Network, V2
Introduction RATS Speech Activity Detection was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 3,000 hours of Levantine Arabic, English, Farsi, Pashto, and Urdu conversational telephone speech with automatic and manual annotation of speech seg...
Aug 18, 2023 - Linguistic Data Consortium
Hernández Mena, Carlos Daniel; Gatt, Albert; Borg, Claudia; DeMarco, Andrea; van der Plas, Lonneke, 2023, "MASRI Synthetic", https://hdl.handle.net/11272.1/AB2/WBPJBV, Abacus Data Network, V1
Abstract Introduction MASRI (Maltese Automatic Speech Recognition I) Synthetic was developed by the MASRI team at the University of Malta and consists of approximately 99 hours of synthesized Maltese speech. Data Source sentences were extracted from the Maltese Language Resource...
Aug 18, 2023 - Linguistic Data Consortium
Pradhan, Sameer; Cole, Ronald Allan; Ward, Wayne, 2023, "MyST Children's Conversational Speech", https://hdl.handle.net/11272.1/AB2/QUHJRW, Abacus Data Network, V1
Abstract Introduction MyST (My Science Tutor) Children's Conversational Speech was developed by Boulder Learning Inc. It is comprised of approximately 470 hours of English speech from 1371 students in grades 3-5 conversing with a virtual science tutor in eight areas of science in...
Aug 17, 2023 - Linguistic Data Consortium
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Indonesian Representative Language Pack", https://hdl.handle.net/11272.1/AB2/JLEISQ, Abacus Data Network, V1
Abstract Introduction LORELEI Indonesian Representative Language Pack consists of Indonesian monolingual text, Indonesian-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI...
Aug 17, 2023 - Linguistic Data Consortium
Helgadóttir, Inga Rún; Kjaran, Róbert; Nikulásdóttir, Anna Björk; Gudnason, Jon, 2023, "Althingi Parliamentary Speech", https://hdl.handle.net/11272.1/AB2/NIG304, Abacus Data Network, V1
Abstract Introduction Althingi Parliamentary Speech consists of approximately 542 hours of recorded speech from Althingi, the Icelandic Parliament, along with corresponding transcripts, a pronunciation dictionary and two language models. Speeches date from 2005-2016. This dataset...
Aug 17, 2023 - Linguistic Data Consortium
Tracey, Jennifer; Strassel, Stephanie; Graff, David; Wright, Jonathan; Chen, Song; Ryant, Neville; Kulick, Seth; Griffitt, Kira; Delgado, Dana; Arrigo, Michael, 2023, "LORELEI Thai Representative Language Pack", https://hdl.handle.net/11272.1/AB2/GCBMNV, Abacus Data Network, V1
Abstract Introduction LORELEI Thai Representative Language Pack (LDC2023T08) consists of Thai monolingual text, Thai-English parallel text, annotations, supplemental resources and related software tools developed by the Linguistic Data Consortium (LDC) for the DARPA LORELEI progr...
Aug 17, 2023 - Linguistic Data Consortium
Brandschain, Linda; Walker, Kevin; Graff, David, 2023, "Mixer 7 Spanish Speech", https://hdl.handle.net/11272.1/AB2/CYMBUE, Abacus Data Network, V1
Abstract Introduction Mixer 7 Spanish Speech (LDC2023S04) was developed by the Linguistic Data Consortium (LDC) and contains 9,600 hours of audio recordings of interviews, transcript readings and conversational telephone speech involving 191 distinct native Spanish speakers. This...
Aug 17, 2023 - Linguistic Data Consortium
Maamouri, Mohamed; Graff, David, 2023, "Moroccan Arabic - English Lexical Database", https://hdl.handle.net/11272.1/AB2/E8N63E, Abacus Data Network, V1
Abstract Introduction Moroccan Arabic - English Lexical Database was developed by the Linguistic Data Consortium (LDC). It is comprised of a set of five interrelated tables presenting each Moroccan Arabic word as an orthographic form in Arabic script and a pronunciation form in I...
Aug 17, 2023 - Linguistic Data Consortium
Hernández Mena, Carlos Daniel; Borsky, Michal; Mollberg, David; Guðmundsson, Smári Freyr; Hedström, Staffan; Pálsson, Ragnar; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Magnusdottir, Eydis Huld; Þórhallsdóttir, Ragnheiður; Gudnason, Jon, 2023, "Samrómur Children Icelandic Speech 1.0", https://hdl.handle.net/11272.1/AB2/LKGTIU, Abacus Data Network, V1
Abstract Introduction Samrómur Children Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 131 hours of Icelandic prompted speech from 3,175 speakers (childre...
Aug 17, 2023 - Linguistic Data Consortium
Mollberg, David; Jónsson, Ólafur Helgi; Þorsteinsdóttir, Sunneva; Guðmundsdóttir, Jóhanna Vigdís; Steingrimsson, Steinthor; Magnusdottir, Eydis Huld; Fong, Judy; Borsky, Michal; Gudnason, Jon, 2023, "Samrómur Icelandic Speech 1.0", https://hdl.handle.net/11272.1/AB2/JXQH5C, Abacus Data Network, V1
Abstract Introduction Samrómur Icelandic Speech 1.0 was developed by the Language and Voice Lab, Reykjavik University in cooperation with Almannarómur, Center for Language Technology. The corpus contains 145 hours of Icelandic prompted speech from 8,392 speakers representing 100,...
Aug 17, 2023 - Linguistic Data Consortium
Sen Bhattacharya, Basabdatta; Subramanian, Aiswarya; Chatterjee, Purbayan; Dey, Sounak, 2023, "Spoken Digits in Hindi and Indian English", https://hdl.handle.net/11272.1/AB2/VQQK0O, Abacus Data Network, V1
Abstract Introduction Spoken Digits in Hindi and Indian English was developed by the Birla Institute of Technology and Science Pilani. It contains approximately two hours of speech comprised of spoken digits from one to ten in Hindi and English with regional accents from across I...
Aug 17, 2023 - Linguistic Data Consortium
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2023, "Second DIHARD Challenge Development - SEEDLingS", https://hdl.handle.net/11272.1/AB2/PKMDCL, Abacus Data Network, V1
Abstract Introduction Second DIHARD Challenge Development - SEEDLingS was developed by Duke University and LDC and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the Second DIHARD Challenge. This relea...
Aug 17, 2023 - Linguistic Data Consortium
Hirschberg, Julia; Gravano, Agustin; Benus, Stefan; Ward, Gregory; German, Elisa Sneed, 2023, "Columbia Games Corpus", https://hdl.handle.net/11272.1/AB2/TGPSBO, Abacus Data Network, V1
Abstract Introduction Columbia Games Corpus was developed by the Spoken Language Group, Columbia University and the Department of Linguistics, Northwestern University. It consists of approximately 10 hours of spontaneous English conversation along with corresponding orthographic...
Jul 24, 2023 - Linguistic Data Consortium
Ryant, Neville; Liberman, Mark; Fiumara, James; Cieri, Christopher, 2023, "Second DIHARD Challenge Evaluation - SEEDLingS", https://hdl.handle.net/11272.1/AB2/CXOTQ3, Abacus Data Network, V1
Abstract Introduction Second DIHARD Challenge Evaluation - SEEDLingS was developed by Duke University and the Linguistic Data Consortium (LDC) and contains approximately two hours of English child language recordings along with corresponding annotations used in support of the Sec...