SAP: Speech Accessibility Project

Sponsor
University of Illinois at Urbana-Champaign (Other)
Overall Status
Recruiting
CT.gov ID
NCT05889260
Collaborator
LSVT Global (Other), Amazon.com Services LLC (Other), Apple Inc. (Industry), Google LLC. (Industry), Meta Platforms, Inc. (Other), Microsoft Corporation (Industry)

Study Details

Study Description

Brief Summary

The goal of the Speech Accessibility Project at the UIUC Beckman Institute (https://speechaccessibilityproject.beckman.illinois.edu) is to collect, annotate, and curate a shared database of speech samples from people with atypical speech, and to share this data set with researchers at other organizations. This two-year project plans to collect 1,200,000 speech samples from 2,000 people, each of whom will provide 600 samples. In Year 1, the initial focus will be people with Parkinson's disease. In Year 2, people with four more etiologies of interest will be recruited: Amyotrophic Lateral Sclerosis (ALS), Cerebral Palsy (CP), Down Syndrome (DS), and Stroke. UIUC will build an open-source software infrastructure to collect annotated speech samples and share these data in an appropriately secure fashion with researchers from our partner technology companies (and eventually other organizations as well) so that they can use the data to improve their automatic speech recognition algorithms. This project promotes diversity, equity, and inclusion by helping technology companies to fully support all types of speech. Having one centralized "collector" of speech samples is also more efficient and less burdensome for these specialized patient populations.

    Detailed Description

    The goal of our project is to collect 1,200,000 speech samples from 2,000 people with dysarthria. We expect to collect data from 400 people in each of five patient populations, and each person will provide 600 speech samples.

    (600 samples/person x 400 persons/etiology x 5 etiologies = 1,200,000 samples)
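
    The arithmetic above can be checked with a short sketch. The figures come from the study description; the constant and function names are illustrative only and are not part of the project's software.

```python
# Planned collection targets, taken from the study description.
SAMPLES_PER_PERSON = 600
PERSONS_PER_ETIOLOGY = 400
ETIOLOGIES = ["Parkinson's", "ALS", "CP", "DS", "Stroke"]


def planned_totals():
    """Return (total participants, total samples) implied by the targets."""
    participants = PERSONS_PER_ETIOLOGY * len(ETIOLOGIES)
    samples = SAMPLES_PER_PERSON * participants
    return participants, samples


participants, samples = planned_totals()
print(f"{participants:,} participants, {samples:,} samples")
# prints "2,000 participants, 1,200,000 samples"
```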

    Our schedule of research procedures is:
    1. February or March through August 2023: collection of speech samples from 400 people with Parkinson's disease.

    2. August 2023 through August 2024: collection of speech samples from 1,600 people with ALS, CP, DS, or Stroke.

    Data collection of speech samples in Year 1 will be a collaboration of UIUC and LSVT Global team members. Potential participants will be screened both with a questionnaire and by providing a short set of "quality control" speech samples. Participants who do not pass screening will be thanked for their interest. Participants who pass are eligible for the study and can complete the informed consent process and then begin contributing speech samples.

    Participants can make as many recordings as they wish at whatever time of day is convenient for them. They will be able to log in to the system at any time, 24/7.

    In Year 2, this procedure will be repeated with participants who have the other etiologies, with additional advocacy organizations as partners.

    Participants who are unable to read text from the computer screen will be offered the opportunity to record speech using a verbal-repetition protocol. To take part in this protocol, a participant must be accompanied by a caregiver who is also willing to be recorded. If a participant agrees to this protocol, the caregiver will read each prompt aloud to the participant. The participant will then repeat the words spoken by the caregiver, or respond to any question asked by the caregiver.

    Participants also have the option to provide additional data about themselves, such as their age, race and ethnicity, and the year of their diagnosis. These "metadata tags" are completely optional but are helpful for analysis.

    The collected speech samples will be stored securely in a custom database built by the UIUC Beckman Institute. All samples are stored with a unique participant ID code. All samples are annotated by our UIUC research team with technical information about the acoustic waveform and other information.

    The entire database of speech samples will be shared with our coalition partners (Amazon, Apple, Google, Meta, and Microsoft) and, after all data collection is complete, with other universities and companies that are willing to sign our data use agreement. Each partner has signed a data use agreement with UIUC that allows these deidentified data to be used for improvements in speech recognition technology and assures the privacy of participants and confidentiality of data.

    Study Design

    Study Type:
    Observational [Patient Registry]
    Anticipated Enrollment:
    2000 participants
    Observational Model:
    Ecologic or Community
    Time Perspective:
    Cross-Sectional
    Official Title:
    People With Speech Disabilities Contributing Speech Samples for Improved Accessibility of Speech-Enabled Devices
    Actual Study Start Date:
    Mar 15, 2023
    Anticipated Primary Completion Date:
    May 31, 2024
    Anticipated Study Completion Date:
    May 31, 2024

    Outcome Measures

    Primary Outcome Measures

    1. Recorded Speech [3-7 hours, self-paced]

      Each participant records 600 sentences: 480 read sentences, and 120 spontaneous sentences recorded in response to 30 prompts.
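
    As a rough check of these figures, the sketch below recomputes the per-participant total and the average pace implied by the 3-7 hour time frame. The even split of spontaneous sentences across prompts is an assumption made here for illustration, not something the record states, and all names are hypothetical.

```python
# Figures taken from the outcome measure; names are illustrative.
READ_SENTENCES = 480
SPONTANEOUS_SENTENCES = 120
SPONTANEOUS_PROMPTS = 30
SESSION_HOURS = (3, 7)  # stated self-paced time frame, in hours


def per_participant_breakdown():
    """Return (total sentences, sentences per prompt, seconds per sentence)."""
    total = READ_SENTENCES + SPONTANEOUS_SENTENCES
    # Assumption: spontaneous sentences are spread evenly across prompts.
    per_prompt = SPONTANEOUS_SENTENCES / SPONTANEOUS_PROMPTS
    # Average seconds per sentence at the low and high end of the time frame.
    seconds = tuple(hours * 3600 / total for hours in SESSION_HOURS)
    return total, per_prompt, seconds


total, per_prompt, seconds = per_participant_breakdown()
print(total, per_prompt, seconds)
```

    Under these assumptions, the time frame works out to an average of roughly 18 to 42 seconds per sentence, including any pauses between recordings.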

    Eligibility Criteria

    Criteria

    Ages Eligible for Study:
    18 Years and Older
    Sexes Eligible for Study:
    All
    Accepts Healthy Volunteers:
    No
    Inclusion Criteria:
    • Adult (age >= 18 years)

    • Self-reported diagnosis of Parkinson's Disease, ALS, CP, DS, or Stroke

    • Reads and speaks English in the form of complete sentences

    • Has a valid email address

    • Ability to access web browser to participate in study

    Exclusion Criteria:
    • Is a resident of the State of Washington, Texas, or Illinois (because these states have privacy laws that would not allow us to collect 'voice prints')

    • Fails quality control screening of initial speech samples because of poor data quality (e.g., a poor-quality recording environment, or speech that is "too typical" to be of interest for further collection)
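
    The inclusion and exclusion criteria above can be expressed as a simple screening check. This is a minimal sketch under stated assumptions: the `Applicant` fields and the `ineligibility_reasons` function are hypothetical names introduced for illustration and do not describe the project's actual screening software.

```python
from dataclasses import dataclass

# Values taken from the criteria listed above.
ELIGIBLE_DIAGNOSES = {"Parkinson's Disease", "ALS", "CP", "DS", "Stroke"}
EXCLUDED_STATES = {"Washington", "Texas", "Illinois"}
MINIMUM_AGE = 18


@dataclass
class Applicant:
    """Hypothetical record of one applicant's screening answers."""
    age: int
    diagnosis: str                 # self-reported
    reads_and_speaks_english: bool
    has_email: bool
    has_web_browser: bool
    state_of_residence: str
    passed_quality_control: bool   # result of the initial speech samples


def ineligibility_reasons(a: Applicant) -> list[str]:
    """Return the criteria the applicant fails; an empty list means eligible."""
    reasons = []
    if a.age < MINIMUM_AGE:
        reasons.append("under 18")
    if a.diagnosis not in ELIGIBLE_DIAGNOSES:
        reasons.append("diagnosis not in study")
    if not a.reads_and_speaks_english:
        reasons.append("does not read and speak English in complete sentences")
    if not a.has_email:
        reasons.append("no valid email address")
    if not a.has_web_browser:
        reasons.append("no web browser access")
    if a.state_of_residence in EXCLUDED_STATES:
        reasons.append("resident of an excluded state")
    if not a.passed_quality_control:
        reasons.append("failed quality control screening")
    return reasons


applicant = Applicant(70, "ALS", True, True, True, "Colorado", True)
print(ineligibility_reasons(applicant) or "eligible")
```

    Returning the list of failed criteria, rather than a single yes/no flag, keeps each criterion visible and makes the check easy to compare against the lists above.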

    Contacts and Locations

    Locations

    No.  Site                                         City    State     Country        Postal Code
    1    LSVT Global                                  Denver  Colorado  United States  80204
    2    University of Illinois at Urbana-Champaign   Urbana  Illinois  United States  61801

    Sponsors and Collaborators

    • University of Illinois at Urbana-Champaign
    • LSVT Global
    • Amazon.com Services LLC
    • Apple Inc.
    • Google LLC.
    • Meta Platforms, Inc.
    • Microsoft Corporation

    Investigators

    • Principal Investigator: Mark A Hasegawa-Johnson, Ph.D., University of Illinois at Urbana-Champaign

    More Information

    Publications

    None provided.
    Responsible Party:
    University of Illinois at Urbana-Champaign
    ClinicalTrials.gov Identifier:
    NCT05889260
    Other Study ID Numbers:
    • 23183
    First Posted:
    Jun 5, 2023
    Last Update Posted:
    Jun 5, 2023
    Last Verified:
    May 1, 2023
    Individual Participant Data (IPD) Sharing Statement:
    Yes
    Plan to Share IPD:
    Yes
    Studies a U.S. FDA-regulated Drug Product:
    No
    Studies a U.S. FDA-regulated Device Product:
    No

    Study Results

    No Results Posted as of Jun 5, 2023