Digital Nirvana’s transcription experience spans more than two decades across industries such as medical, finance, as well as media, and entertainment. Our clientele includes early-stage publicly listed companies, Fortune 500 enterprises, and well-known Film and Broadcast Studios around the globe. Digital Nirvana pioneered verbatim transcripts and summarization of earning events in India.
Digital Nirvana ensures transcripts are of the highest quality, accurately correlating video and audio content, and delivering on time, even with short turnaround times. With skilled and versatile editors we produce the highest-quality transcriptions in record time through multitasking, time management, active listening, attention to detail, a vast research database, and fast-but-accurate typing skills. Our diverse global workforce also enables us to comprehend even minute cultural sensibility in speech and produce logical transcripts.
Since 2005, the adoption of Automatic Speech Recognition (ASR) engines has made Digital Nirvana capable of delivering a high-quality transcript, direct from a live performance. A pioneer in this space, we service transcription across many verticals, including financial, medical, legal, technology, broadcast, and traditional M&E. .
Our high-profile clientele has been using our transcription services to deliver business and market news for about two decades.
These services can be utilized via REST APIs or Digital Nirvana’s Web Portal with various options data transfer, including APIs, SFTPs, S3, GC storage, etc.
In addition to the 20+ years in transcription experience, Digital Nirvana has been innovative in terms of technology adoption, helping operations scale. The asynchronous (non-live) module of ASR is used to create a preliminary transcript transferred to users via our Trance application for further edits. ASR output with word-by-word confidence ratings makes the process incredibly efficient and enables users to increase production velocity.
Since 2016, Digital Nirvana has been providing live streaming transcription services to leading financial institutions. Digital Nirvana implements a synchronous (live streaming) ASR model, which can create language-specific models for a single customer, leading to large-scale improvements in output quality. Using this methodology, we have seen quality improvements of at least four percentage points with every update! The Speech to Text (STT) engines can do auto ML-based training and supervised training based on the corrected data feedback.
Our automatic speech recognition engines can do specific and concentrated training on data from a particular customer, region, or industry segment. Once trained, these models prove to generate high-quality automated transcripts in comparison to generic models. We also offer to correct automated transcripts as a service where high-quality transcripts are returned to the customer and feedback to the ASR engines. Moreover, if daily news feeds are provided to the ASR engine, it will automatically update the language model using the text documents. Periodic updates to the acoustic models are done. Customization and client-specific models can also be developed and maintained.
The current latency ranges around 25 seconds, and Digital Nirvana is striving hard to get the latency down to milliseconds so that this solution can be utilized for live captioning. Leading companies in the financial space use this service to stream transcripts of earnings and corporate conference calls conducted by Fortune 500 companies to their Terminal in real-time. The service can scale up to 400 hours of live transcription in a day during the earnings release season in the financial industry and scale down to fewer hours during other days in a calendar quarter.