Based on STM32 of CNN Speech Keyword Command Recognition System
Author: KUANG Wenbo, LUO Weiping
Affiliation:

1. School of Mechanical Engineering and Automation, Wuhan Textile University, Wuhan 430200, China
2. Hubei Key Laboratory of Digital Textile Equipment, Wuhan 430200, China

    Abstract:

    Speech recognition is an active research topic in artificial intelligence, but speech recognition models typically run only on large servers or dedicated chips. This paper presents a keyword speech recognition system based on a neural network and a conventional STM32 chip. To cope with the limited Flash and ROM resources of the STM32 MCU, the deployment of the speech recognition model is optimized to meet the requirements of keyword recognition. First, MFCC (Mel Frequency Cepstral Coefficient) features are extracted from the audio captured by the sensors and fed into a CNN (Convolutional Neural Network) for deep feature extraction. The resulting features are then passed to a fully connected layer, which classifies and predicts the speech keyword. Deployed on the STM32F429, the model achieves an accuracy of 90.58%, only 0.91 percentage points below the 91.49% obtained when running on a computer.
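
    The MFCC front end described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the sample rate, frame length, hop size, filter count, and coefficient count below are assumptions chosen for illustration, and the function names (`power_spectrum`, `mel_filterbank`, `dct2`, `mfcc`) are hypothetical. The sketch frames the signal, applies a Hamming window, computes a naive DFT power spectrum, pools it through triangular mel filters, takes the logarithm, and applies a DCT-II.

```python
import math

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Naive DFT power spectrum (bins 0..N/2) of one Hamming-windowed frame."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        spec.append((re * re + im * im) / n)
    return spec

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose edges are evenly spaced on the mel scale."""
    n_bins = n_fft // 2 + 1
    mel_max = hz_to_mel(sample_rate / 2)
    edges_hz = [mel_to_hz(mel_max * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * f / sample_rate)) for f in edges_hz]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = bins[m - 1], bins[m], bins[m + 1]
        row = [0.0] * n_bins
        for k in range(lo, mid):          # rising edge of the triangle
            row[k] = (k - lo) / (mid - lo)
        for k in range(mid, hi):          # falling edge of the triangle
            row[k] = (hi - k) / (hi - mid)
        bank.append(row)
    return bank

def dct2(values, n_out):
    """Unnormalized DCT-II, keeping the first n_out coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n) for i, v in enumerate(values))
            for k in range(n_out)]

def mfcc(signal, sample_rate=16000, frame_len=256, hop=128, n_filters=10, n_coeffs=6):
    """Return one list of n_coeffs cepstral coefficients per frame (assumed settings)."""
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    features = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        spec = power_spectrum(signal[start:start + frame_len])
        log_energies = [math.log(sum(w * p for w, p in zip(row, spec)) + 1e-10)
                        for row in bank]
        features.append(dct2(log_energies, n_coeffs))
    return features

# Usage: a 440 Hz test tone, 1024 samples at the assumed 16 kHz sample rate.
tone = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(1024)]
feats = mfcc(tone)
print(len(feats), len(feats[0]))  # 7 frames, 6 coefficients each
```

    The resulting frames-by-coefficients matrix is the kind of two-dimensional input a CNN can treat like an image. On a microcontroller the naive DFT would normally be replaced by an FFT routine.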

Get Citation

KUANG Wenbo, LUO Weiping. Based on STM32 of CNN Speech Keyword Command Recognition System[J]. Instrumentation, 2023, (1): 17-22

History
  • Online: October 20, 2023
License
  • Copyright (c) 2023 by the authors. This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.