API AI Transcription & Speech Summary Service
What is assemblyai.com?
AssemblyAI.com is a website that provides AI models for transcribing and understanding speech. Its API gives access to models for tasks such as speech recognition, speaker detection, and speech summarization, which the company describes as research-driven, production-ready, scalable, and secure. A no-code Playground lets users try the API on YouTube links, audio files, or video files. AssemblyAI has raised more than $63 million from investors including Insight Partners, Accel, and Y Combinator.
How much does it cost to use assemblyai.com?
AssemblyAI offers three pricing plans: Core Transcription, Audio Intelligence, and Enterprise. The Core Transcription plan costs $0.00025 per second of audio and includes speech recognition, speaker diarization, auto punctuation and casing, auto language detection, custom spelling, custom vocabulary, dual channel transcription, export of SRT or VTT caption files, filler word filtering, profanity filtering, word search, support for 49 audio and video file types, parallel processing of up to 32 audio files, and support for 12 languages.
For additional functionality, users can opt for the Audio Intelligence plan, which costs $0.000583 per second on top of the Core Transcription pricing. This plan includes features like summarization, content moderation, sentiment analysis, auto highlights, PII redaction, topic detection (IAB classification), entity detection, and auto chapters.
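The per-second rates above are easier to compare when converted to a cost per file or per hour. The following is a minimal sketch of that arithmetic, using only the two rates quoted in this article; the function and constant names are illustrative and not part of any AssemblyAI tooling, and actual billing may differ (rounding, minimums, or updated prices).

```python
# Rates quoted in this article, in US dollars per second of audio.
CORE_RATE_PER_SECOND = 0.00025
AUDIO_INTELLIGENCE_RATE_PER_SECOND = 0.000583  # charged on top of Core


def estimate_cost(duration_seconds, audio_intelligence=False):
    """Estimate the cost in dollars of processing one audio file.

    Hypothetical helper: it multiplies the duration by the quoted
    per-second rate and ignores any rounding or minimum charges.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    rate = CORE_RATE_PER_SECOND
    if audio_intelligence:
        rate += AUDIO_INTELLIGENCE_RATE_PER_SECOND
    return duration_seconds * rate


one_hour = 60 * 60  # seconds
print(f"Core Transcription only:   ${estimate_cost(one_hour):.2f} per hour")
print(f"Core + Audio Intelligence: ${estimate_cost(one_hour, True):.2f} per hour")
```

At these rates, one hour of audio comes to about $0.90 for Core Transcription alone and about $3.00 with Audio Intelligence added.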
The Enterprise plan is designed for businesses with high-volume requirements, specific support needs, or custom use cases. Pricing is not published; users are advised to contact the sales team directly. The plan offers increased concurrency for running AI at scale, collaboration with AssemblyAI engineers on custom integrations, dedicated support for troubleshooting, planning, and building, and custom pricing tailored to the user's use case.
Users can also sign up for a free account and use the API on a limited trial basis. A separate source lists AssemblyAI pricing as starting at $0.90 based on usage, without specifying the unit; the figure is consistent with the Core Transcription rate expressed per hour of audio ($0.00025 × 3,600 seconds = $0.90).
How do I sign up for assemblyai.com?
To create an account on AssemblyAI, visit their website and click the "Try the API" button at the top right corner. This takes you to a page where you can enter your email address to receive your unique API key. To obtain the API key, you must agree to AssemblyAI's Terms of Service.
AssemblyAI also offers sign-up for early access to its LeMUR API. LeMUR is a framework that applies Large Language Models (LLMs) to transcribed speech. If you are interested in this feature, you can complete a form on their website to request access.
What are the benefits of assemblyai.com?
AssemblyAI.com gives users access to AI models that transcribe and understand speech through a single API. Users can enable optional features per request, such as content moderation, sentiment analysis, PII redaction, key phrase identification, and speaker diarization. According to the company, the models draw on current AI research and are production-ready, scalable, and secure. Users can also join a growing community of AI developers, receive support from the AssemblyAI team, and sign up for early access to the LeMUR API, which applies large language models to transcribed speech.
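As a rough illustration of what enabling features on a transcription request can look like, the sketch below builds (but does not send) an HTTP request using only the Python standard library. The endpoint URL, the `authorization` header, and the option names `speaker_labels` and `sentiment_analysis` are assumptions based on AssemblyAI's public v2 API and should be checked against the current API reference; `build_transcript_request` is a hypothetical helper, and the audio URL and API key are placeholders.

```python
import json
import urllib.request

# Assumed endpoint; verify against AssemblyAI's current API reference.
TRANSCRIPT_ENDPOINT = "https://api.assemblyai.com/v2/transcript"


def build_transcript_request(api_key, audio_url, **features):
    """Build a POST request that asks for a transcript of audio_url.

    Extra keyword arguments are passed through as JSON options, for
    example speaker_labels=True (option names are assumptions).
    """
    payload = {"audio_url": audio_url, **features}
    return urllib.request.Request(
        TRANSCRIPT_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "authorization": api_key,
            "content-type": "application/json",
        },
        method="POST",
    )


request = build_transcript_request(
    api_key="YOUR_API_KEY",  # placeholder, not a real key
    audio_url="https://example.com/meeting.mp3",  # placeholder URL
    speaker_labels=True,
    sentiment_analysis=True,
)

# Inspect the request without sending it.
print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))
```

With a real API key, the request could be sent with `urllib.request.urlopen(request)`; transcription runs asynchronously, so the response would need to be polled for the finished transcript.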
What is the accuracy of assemblyai.com models?
The accuracy of AssemblyAI models depends on the characteristics and quality of the audio being processed. The company asserts that it offers the most accurate API on the market. Its benchmark report indicates that the v8 model architecture achieved an average Word Error Rate (WER) of 6.5% across various audio use cases, compared with 8.1% for Google Cloud Speech-to-Text and 9.3% for AWS Transcribe (lower is better). The report also states that the v8 model showed an 18.72% improvement in proper noun accuracy over the previous v7 model. AssemblyAI has since introduced a v9 model, which it claims is 11% more accurate than v8.
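For readers unfamiliar with the metric, WER is commonly defined as the number of word substitutions, deletions, and insertions needed to turn the transcript into the reference text, divided by the number of words in the reference. The sketch below implements that standard definition with a word-level edit distance. It is an illustration only: the text normalization here (lowercasing and whitespace splitting) is an assumption, and AssemblyAI's benchmark methodology may differ.

```python
def word_error_rate(reference, hypothesis):
    """Return WER = (substitutions + deletions + insertions) / reference words."""
    ref_words = reference.lower().split()
    hyp_words = hypothesis.lower().split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # previous_row[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    previous_row = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current_row = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current_row.append(
                min(
                    previous_row[j] + 1,  # deletion
                    current_row[j - 1] + 1,  # insertion
                    previous_row[j - 1] + substitution_cost,  # match or substitution
                )
            )
        previous_row = current_row
    return previous_row[-1] / len(ref_words)


reference = "the quick brown fox"
hypothesis = "the quick brown box"
print(f"WER: {word_error_rate(reference, hypothesis):.2%}")
```

In this example one of the four reference words is wrong, so the WER is 25%. A reported WER of 6.5% corresponds to roughly 6 or 7 word errors per 100 reference words.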