Voice commands aren't just for virtual assistants like Google Assistant or Alexa. They can significantly enhance mobile and desktop applications, adding both functionality and a fun user experience. Integrating voice commands or voice search is surprisingly straightforward. This article demonstrates building a voice-controlled book search application using the Web Speech API within a React framework.
The complete code is available on GitHub, and a working demo is provided at the end.
Key Concepts:
- Leveraging the Web Speech API to enable voice search in React applications, improving user interaction.
- Building a basic React component (using Create React App), then integrating the Web Speech API for speech recognition.
- Implementing continuous speech recognition within the React component lifecycle, managing states with React hooks.
- Developing a custom React hook (
useVoice
) to encapsulate and reuse voice recognition logic. - Extending functionality by integrating a book search feature using another custom hook (
useBookFetch
), interacting with an external API (Open Library) for data retrieval based on voice input.
Web Speech API Introduction:
The Web Speech API has limited browser support. Ensure you're using a compatible browser (check MDN for up-to-date compatibility information).
A simple example of using the Web Speech API:
const SpeechRecognition = webkitSpeechRecognition; const speech = new SpeechRecognition(); speech.onresult = (event) => { console.log(event); }; speech.start();
This code instantiates SpeechRecognition
, adds an onresult
event listener to handle speech transcription, and starts listening. The browser will request microphone access. After speech, onresult
provides the transcribed text.
The onresult
event delivers a SpeechRecognitionEvent
object containing a results
array. The first element of this array holds the transcribed text.
This basic code can run in Chrome DevTools or a JavaScript file. Let's integrate this into a React application.
Using Web Speech in React:
Create a new React project:
npx create-react-app book-voice-search cd book-voice-search npm start
Replace the default App.js
with the following, which incorporates the Web Speech API:
// App.js import React, { useState, useEffect } from "react"; import "./index.css"; import Mic from "./microphone-black-shape.svg"; // Import your microphone image let speech; if (window.webkitSpeechRecognition) { const SpeechRecognition = webkitSpeechRecognition; speech = new SpeechRecognition(); speech.continuous = true; // Enable continuous listening } else { speech = null; } const App = () => { const [isListening, setIsListening] = useState(false); const [text, setText] = useState(""); const listen = () => { setIsListening(!isListening); if (isListening) { speech.stop(); } else { speech.start(); } }; useEffect(() => { if (!speech) return; speech.onresult = (event) => { setText(event.results[event.results.length - 1][0].transcript); }; }, []); // ... (rest of the component remains the same) }; export default App;
This enhanced component manages listening state (isListening
), stores the transcribed text (text
), and handles the microphone click event (listen
). The useEffect
hook sets up the onresult
listener.
Reusable Custom React Voice Hook:
To improve code reusability, create a custom hook useVoice.js
:
const SpeechRecognition = webkitSpeechRecognition; const speech = new SpeechRecognition(); speech.onresult = (event) => { console.log(event); }; speech.start();
This hook encapsulates the voice recognition logic. Now, update App.js
to use this hook:
npx create-react-app book-voice-search cd book-voice-search npm start
This simplifies App.js
and promotes code reuse.
Book Voice Search Functionality:
Create another custom hook useBookFetch.js
to handle the book search:
// App.js import React, { useState, useEffect } from "react"; import "./index.css"; import Mic from "./microphone-black-shape.svg"; // Import your microphone image let speech; if (window.webkitSpeechRecognition) { const SpeechRecognition = webkitSpeechRecognition; speech = new SpeechRecognition(); speech.continuous = true; // Enable continuous listening } else { speech = null; } const App = () => { const [isListening, setIsListening] = useState(false); const [text, setText] = useState(""); const listen = () => { setIsListening(!isListening); if (isListening) { speech.stop(); } else { speech.start(); } }; useEffect(() => { if (!speech) return; speech.onresult = (event) => { setText(event.results[event.results.length - 1][0].transcript); }; }, []); // ... (rest of the component remains the same) }; export default App;
This hook fetches book data from Open Library based on the author's name. Finally, integrate this into App.js
to display the search results:
// useVoice.js import { useState, useEffect } from 'react'; // ... (SpeechRecognition setup remains the same) const useVoice = () => { // ... (state and listen function remain the same) useEffect(() => { // ... (onresult event listener remains the same) }, []); return { text, isListening, listen, voiceSupported: speech !== null }; }; export { useVoice };
This completes the voice-controlled book search application.
Demo:
[Insert CodeSandbox or similar demo link here]
Conclusion:
This example showcases the power and simplicity of the Web Speech API for adding voice interaction to React applications. Remember browser compatibility and potential accuracy limitations. The full code is available on GitHub.
FAQs (moved to the end for better flow): (These would follow the conclusion in the original format) The FAQs section from the original input can be included here, slightly rephrased for better clarity and flow within this revised article.
The above is the detailed content of Adding Voice Search to a React Application. For more information, please follow other related articles on the PHP Chinese website!

Choosing Python or JavaScript should be based on career development, learning curve and ecosystem: 1) Career development: Python is suitable for data science and back-end development, while JavaScript is suitable for front-end and full-stack development. 2) Learning curve: Python syntax is concise and suitable for beginners; JavaScript syntax is flexible. 3) Ecosystem: Python has rich scientific computing libraries, and JavaScript has a powerful front-end framework.

The power of the JavaScript framework lies in simplifying development, improving user experience and application performance. When choosing a framework, consider: 1. Project size and complexity, 2. Team experience, 3. Ecosystem and community support.

Introduction I know you may find it strange, what exactly does JavaScript, C and browser have to do? They seem to be unrelated, but in fact, they play a very important role in modern web development. Today we will discuss the close connection between these three. Through this article, you will learn how JavaScript runs in the browser, the role of C in the browser engine, and how they work together to drive rendering and interaction of web pages. We all know the relationship between JavaScript and browser. JavaScript is the core language of front-end development. It runs directly in the browser, making web pages vivid and interesting. Have you ever wondered why JavaScr

Node.js excels at efficient I/O, largely thanks to streams. Streams process data incrementally, avoiding memory overload—ideal for large files, network tasks, and real-time applications. Combining streams with TypeScript's type safety creates a powe

The differences in performance and efficiency between Python and JavaScript are mainly reflected in: 1) As an interpreted language, Python runs slowly but has high development efficiency and is suitable for rapid prototype development; 2) JavaScript is limited to single thread in the browser, but multi-threading and asynchronous I/O can be used to improve performance in Node.js, and both have advantages in actual projects.

JavaScript originated in 1995 and was created by Brandon Ike, and realized the language into C. 1.C language provides high performance and system-level programming capabilities for JavaScript. 2. JavaScript's memory management and performance optimization rely on C language. 3. The cross-platform feature of C language helps JavaScript run efficiently on different operating systems.

JavaScript runs in browsers and Node.js environments and relies on the JavaScript engine to parse and execute code. 1) Generate abstract syntax tree (AST) in the parsing stage; 2) convert AST into bytecode or machine code in the compilation stage; 3) execute the compiled code in the execution stage.

The future trends of Python and JavaScript include: 1. Python will consolidate its position in the fields of scientific computing and AI, 2. JavaScript will promote the development of web technology, 3. Cross-platform development will become a hot topic, and 4. Performance optimization will be the focus. Both will continue to expand application scenarios in their respective fields and make more breakthroughs in performance.


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

SublimeText3 Chinese version
Chinese version, very easy to use

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

Safe Exam Browser
Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

SecLists
SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.
