251 to 300 of 1,855 Results
Aug 29, 2023 -
Noisy TIMIT Speech
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023
Chen, Gang; Neubauer, Juergen; Garellek, Marc; Samlan, Robin; Gerratt, Bruce R.; Kreiman, Jody; Alwan, Abeer, 2017, "UCLA High-Speed Laryngeal Video and Audio", https://hdl.handle.net/11272.1/AB2/OWLHMG, Abacus Data Network, V2
UCLA High-Speed Laryngeal Video and Audio was developed by UCLA Speech Processing and Auditory Perception Laboratory and is comprised of high-speed laryngeal video recordings of the vocal folds and synchronized audio recordings from nine subjects collected between April 2012 and... |
Aug 29, 2023 -
UCLA High-Speed Laryngeal Video and Audio
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
UCLA High-Speed Laryngeal Video and Audio
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Vincent, Emmanuel; Barker, Jon; Watanabe, Shinji; Le Roux, Jonathan; Nesta, Francesco; Matassoni, Marco, 2017, "CHiME2 WSJ0", https://hdl.handle.net/11272.1/AB2/IUB8PD, Abacus Data Network, V2
CHiME2 WSJ0 was developed as part of The 2nd CHiME Speech Separation and Recognition Challenge and contains approximately 166 hours of English speech from a noisy living room environment. The CHiME Challenges focus on distant-microphone automatic speech recognition (ASR) in real-... |
Aug 29, 2023 -
CHiME2 WSJ0
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
CHiME2 WSJ0
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Tracey, Jennifer; Lee, Haejoong; Strassel, Stephanie, 2017, "BOLT English Discussion Forums", https://hdl.handle.net/11272.1/AB2/VDFID2, Abacus Data Network, V2
BOLT English Discussion Forums was developed by the Linguistic Data Consortium (LDC) and consists of 830,440 discussion forum threads in English harvested from the Internet using a combination of manual and automatic processes. The DARPA BOLT (Broad Operational Language Translati... |
Aug 29, 2023 -
BOLT English Discussion Forums
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
BOLT English Discussion Forums
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Tracey, Jennifer; Lee, Haejoong; Strassel, Stephanie; Ismael, Safa, 2018, "BOLT Arabic Discussion Forums", https://hdl.handle.net/11272.1/AB2/DP4INP, Abacus Data Network, V2
BOLT Arabic Discussion Forums was developed by the Linguistic Data Consortium (LDC) and consists of 813,080 discussion forum threads in Egyptian Arabic harvested from the Internet using a combination of manual and automatic processes. The DARPA BOLT (Broad Operational Language Tr... |
Aug 29, 2023 -
BOLT Arabic Discussion Forums
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
BOLT Arabic Discussion Forums
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Ferraro, Francis; Thomas, Max; Wolfe, Travis; R. Gormley, Matthew; Harman, Craig; Van Durme, Benjamin, 2018, "Concretely Annotated New York Times", https://hdl.handle.net/11272.1/AB2/VA98GM, Abacus Data Network, V2
Introduction Concretely Annotated New York Times was developed by Johns Hopkins University’s Human Language Technology Center of Excellence. It adds multiple kinds and instances of automatically-generated syntactic, semantic and coreference annotations to The New York Times Annot... |
Aug 29, 2023 -
Concretely Annotated New York Times
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023
Ferraro, Francis; Thomas, Max; Gormley, Matthew R.; Wolfe, Travis; Harman, Craig; Van Durme, Benjamin, 2018, "Concretely Annotated English Gigaword", https://hdl.handle.net/11272.1/AB2/NQCDFR, Abacus Data Network, V2
Concretely Annotated English Gigaword was developed by Johns Hopkins University’s Human Language Technology Center of Excellence (JHU). It adds multiple kinds and instances of automatically-generated syntactic, semantic and coreference annotations to English Gigaword Fifth Editio... |
Aug 29, 2023 -
Concretely Annotated English Gigaword
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
Concretely Annotated English Gigaword
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Morris, Amanda; Strassel, Stephanie; Li, Xuansong; Antonishek, Brian; Fiscus, Jonathan G., 2019, "HAVIC MED Progress Test -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/QYTBMD, Abacus Data Network, V2
HAVIC MED Progress Test – Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 3,650 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and related technologies, LDC... |
Aug 29, 2023 -
HAVIC MED Progress Test -- Videos, Metadata and Annotation
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
HAVIC MED Progress Test -- Videos, Metadata and Annotation
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Greenberg, Craig; Martin, Alvin; Graff, David; Brandschain, Linda; Walker, Kevin, 2017, "2010 NIST Speaker Recognition Evaluation Test Set", https://hdl.handle.net/11272.1/AB2/2CPM3O, Abacus Data Network, V2
Introduction 2010 NIST Speaker Recognition Evaluation Test Set was developed by the Linguistic Data Consortium (LDC) and NIST (National Institute of Standards and Technology). It contains 2,255 hours of American English telephone speech and speech recorded over a microphone chann... |
Aug 29, 2023 -
2010 NIST Speaker Recognition Evaluation Test Set
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
2010 NIST Speaker Recognition Evaluation Test Set
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Barker, Jon; Marxer, Ricard; Vincent, Emmanuel; Watanabe, Shinji, 2017, "CHiME3", https://hdl.handle.net/11272.1/AB2/HGHM4U, Abacus Data Network, V2
Introduction CHiME3 was developed as part of The 3rd CHiME Speech Separation and Recognition Challenge and contains approximately 342 hours of English speech and transcripts from noisy environments and 50 hours of noisy environment audio. The CHiME Challenges focus on distant-mic... |
Aug 29, 2023 -
CHiME3
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
CHiME3
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Bu, Hui, 2018, "AISHELL-1", https://hdl.handle.net/11272.1/AB2/2WMDTT, Abacus Data Network, V2
AISHELL-1 was developed by Beijing Shell Shell Technology Co., Ltd. It contains approximately 520 hours of Chinese Mandarin speech from 400 speakers recorded simultaneously on three different devices with associated transcripts. The goal of the collection was to support speech re... |
Aug 29, 2023 -
AISHELL-1
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
AISHELL-1
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Brandschain, Linda; Walker, Kevin; Graff, David; Cieri, Christopher; Neely, Abby; Mirghafori, Nikki; Peskin, Barbara; Godfrey, Jack; Strassel, Stephanie; Goodman, Fred; Doddington, George R.; King, Mike, 2021, "Mixer 4 and 5 Speech", https://hdl.handle.net/11272.1/AB2/LU0TQ8, Abacus Data Network, V2
Abstract Introduction Mixer 4 and 5 Speech was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 14,185 hours of audio recordings of conversational telephone speech, interviews, elicitation exercises and transcript readings involving 616 distinct... |
Aug 29, 2023 -
Mixer 4 and 5 Speech
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Graff, David; Ma, Xiaoyi; Strassel, Stephanie; Walker, Kevin; Jones, Karen, 2021, "RATS Speaker Identification", https://hdl.handle.net/11272.1/AB2/BZYHPS, Abacus Data Network, V2
Abstract Introduction RATS Speaker Identification was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 1,900 hours of Levantine Arabic, Farsi, Dari, Pashto and Urdu conversational telephone speech with annotations of speech segments. The audio w... |
Aug 29, 2023 -
RATS Speaker Identification
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
RATS Speaker Identification
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Morris, Amanda; Strassel, Stephanie; Li, Xuansong; Antonishek, Brian; Fiscus, Jonathan G., 2022, "HAVIC MED Training Data -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/TQLGAR, Abacus Data Network, V2
Abstract Introduction HAVIC MED Training Data -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 2,100 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and re... |
Aug 29, 2023 -
HAVIC MED Training Data -- Videos, Metadata and Annotation
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
HAVIC MED Training Data -- Videos, Metadata and Annotation
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Li, Xuansong; Strassel, Stephanie; Jones, Karen; Antonishek, Brian; Fiscus, Jonathan G., 2022, "HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation", https://hdl.handle.net/11272.1/AB2/SXVGS7, Abacus Data Network, V2
Abstract Introduction HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 3,800 hours of user-generated videos with annotation and metadata. To advance multimodal event detection and rel... |
Aug 29, 2023 -
HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 29, 2023 -
HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 29, 2023
Mahmoud, Sabri; Ahmad, Irfan; Al-Khatib, Wasfi; Alshayeb, Mohammad; Parvez, Mohammad; Märgner, Volker; Fink, Gernot, 2015, "KHATT: Handwritten Arabic Text", https://hdl.handle.net/11272.1/AB2/PL0DHA, Abacus Data Network, V2
Introduction KHATT: Handwritten Arabic Text was developed by King Fahd University of Petroleum & Minerals, Technical University of Dortmund and Braunschweig University of Technology. It is comprised of scanned Arabic handwriting from 1,000 distinct male and female writers represe... |
Aug 29, 2023 -
KHATT: Handwritten Arabic Text
Markdown Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 25, 2023
Alwan, Abeer; Lulich, Steven; Sommers, Mitchell, 2015, "The Subglottal Resonances Database", https://hdl.handle.net/11272.1/AB2/R82KKG, Abacus Data Network, V2
Introduction The Subglottal Resonances Database was developed by Washington University and University of California Los Angeles and consists of 45 hours of simultaneous microphone and subglottal accelerometer recordings of 25 adult male and 25 adult female speakers of American En... |
Aug 25, 2023 -
The Subglottal Resonances Database
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 25, 2023
Walker, Kevin; Ma, Xiaoyi; Graff, David; Strassel, Stephanie; Sessa, Stephanie; Jones, Karen, 2015, "RATS Speech Activity Detection", https://hdl.handle.net/11272.1/AB2/1UISJ7, Abacus Data Network, V2
Introduction RATS Speech Activity Detection was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 3,000 hours of Levantine Arabic, English, Farsi, Pashto, and Urdu conversational telephone speech with automatic and manual annotation of speech seg... |
Aug 25, 2023 -
RATS Speech Activity Detection
Plain Text - 3.1 KB -
MD5: 1b8a8741370964dcfff1eeec66e4b151
Instructions on how to access LDC data via UBC's Teamshare service (Markdown/ASCII text) |
Aug 25, 2023 -
RATS Speech Activity Detection
Adobe PDF - 31.2 KB -
MD5: 100c549ff1bb48ed76f05d01f6342eb3
Instructions on how to access LDC data via UBC's Teamshare service (PDF) |
Aug 18, 2023
Hernández Mena, Carlos Daniel; Gatt, Albert; Borg, Claudia; DeMarco, Andrea; van der Plas, Lonneke, 2023, "MASRI Synthetic", https://hdl.handle.net/11272.1/AB2/WBPJBV, Abacus Data Network, V1
Abstract Introduction MASRI (Maltese Automatic Speech Recognition I) Synthetic was developed by the MASRI team at the University of Malta and consists of approximately 99 hours of synthesized Maltese speech. Data Source sentences were extracted from the Maltese Language Resource... |
Aug 18, 2023 -
MASRI Synthetic
Optical Disc Image - 6.3 GB -
MD5: ad9774df27abe949208102786e6ecdd8
ISO disc image containing all documentation and data |