Nothing launches ‘Essential Voice’ for smarter speech-to-text transcription

London-based consumer technology brand Nothing has announced the release of Essential Voice, a new software feature designed to enhance the speech-to-text experience on its smartphones. The feature is an extension of the company’s existing Essential AI toolset, aimed at shifting device communication toward a voice-first interface.

Addressing the Limitations of Typing and Dictation

The primary goal of Essential Voice is to bridge the efficiency gap between typing and speaking. While average smartphone users type at approximately 36 words per minute, speaking allows for a much faster pace of around 150 words per minute.

Although voice notes and traditional dictation tools exist, both present distinct challenges. Voice notes can be inconvenient to listen to in public spaces and difficult to scan for key information.

Standard dictation, on the other hand, often produces raw, fragmented text that includes stutters and filler words. Essential Voice processes spoken input to generate clean, formatted text in real-time, aiming to combine the speed of speaking with the utility of written text.

How It Works and Core Features

The feature is integrated directly into the device’s keyboard and can also be activated via a long-press on the Essential Key, allowing users to dictate without leaving their current application.

Key functionalities include:

  • Auto-Correction and Formatting: Moving beyond verbatim transcription, the software automatically removes filler words (like “ums” and “uhs”) and tidies up sentence structure. Users can also dictate formatting instructions to organize their speech into steps, lists, or bullet points.
  • Personal Mappings: Users can create a library of custom voice shortcuts over time. Dictating a specific trigger phrase can prompt the software to automatically output specific spellings, inserted templates, or linked addresses (e.g., ensuring “nothing OS” is automatically capitalized as “Nothing OS”).
  • Multilingual Support and Translation: Essential Voice supports over 100 languages with automatic language detection, including specific regional variants like Simplified Chinese or Latin American Spanish. Additionally, it functions as a translation agent, allowing a user to speak in one language and have the text transcribed and translated into another in real-time.
Privacy and Data Handling

Regarding user privacy, Nothing states that Essential Voice does not listen in the background and only activates upon user initiation. Once triggered, the audio recording is encrypted and sent to Nothing’s servers for processing. The generated text is then returned to the device, and the company confirms that audio data is not stored on its servers after the process is complete.

Availability and Roadmap

Essential Voice is currently available on the Nothing Phone (3). The update is scheduled to roll out to the Phone (4a) Pro later this month, followed by the Phone (4a) in early May.

Looking forward, Nothing plans to introduce context awareness to the feature, enabling the software to adapt its formatting and tone based on the application in use—such as distinguishing between a casual text message and a formal work email. The company also noted intentions to eventually expand Essential Voice across its broader ecosystem of smart products.


Srivatsan Sridhar: Srivatsan Sridhar is a Mobile Technology Enthusiast who is passionate about Mobile phones and Mobile apps. He uses the phones he reviews as his main phone. You can follow him on Twitter and Instagram
Related Post