MindMap Gallery Voice recognition commercial solution
This is a mind map about a commercial solution for task speech recognition. The main content includes: text file content format:, providing text files according to the same file name as the voice file.
Edited at 2025-02-12 14:24:02In order to help everyone use DeepSeek more efficiently, a collection of DeepSeek guide mind map was specially compiled! This mind map summarizes the main contents: Yitu related links, DS profile analysis, comparison of DeepSeek and ChatGPT technology routes, DeepSeek and Qwen model deployment guide, how to make more money with DeepSeek, how to play DeepSeek, DeepSeek scientific research Application, how to import text from DeepSeek into MindMaster, the official recommendation of DeepSeek Wait, allowing you to quickly grasp the essence of AI interaction. Whether it is content creation, plan planning, code generation, or learning improvement, DeepSeek can help you achieve twice the result with half the effort!
This is a mind map about DeepSeek's 30 feeding-level instructions. The main contents include: professional field enhancement instructions, interaction enhancement instructions, content production instructions, decision support instructions, information processing instructions, and basic instructions.
This is a mind map about a commercial solution for task speech recognition. The main content includes: text file content format:, providing text files according to the same file name as the voice file.
In order to help everyone use DeepSeek more efficiently, a collection of DeepSeek guide mind map was specially compiled! This mind map summarizes the main contents: Yitu related links, DS profile analysis, comparison of DeepSeek and ChatGPT technology routes, DeepSeek and Qwen model deployment guide, how to make more money with DeepSeek, how to play DeepSeek, DeepSeek scientific research Application, how to import text from DeepSeek into MindMaster, the official recommendation of DeepSeek Wait, allowing you to quickly grasp the essence of AI interaction. Whether it is content creation, plan planning, code generation, or learning improvement, DeepSeek can help you achieve twice the result with half the effort!
This is a mind map about DeepSeek's 30 feeding-level instructions. The main contents include: professional field enhancement instructions, interaction enhancement instructions, content production instructions, decision support instructions, information processing instructions, and basic instructions.
This is a mind map about a commercial solution for task speech recognition. The main content includes: text file content format:, providing text files according to the same file name as the voice file.
Task
Provide text files according to the same file name as the voice file
【Solved】Providing voice files, you can realize recognition through web pages or API
Text file content format:
1) XXX Account Manager (voiceprint recognition corresponds to account manager ID), the visit lasts XX minutes
First call the voiceprint recognition interface to identify who the current speaker is. Output fixed format statements.
2) Among them
Number of keywords for Class A work XX times, accounting for XX% of this work
Class A working keywords are input as a dictionary, and then the number and proportion of all keywords appear in the recognition content are analyzed. B, C work keywords and so on. Output fixed format statements.
The number of keywords in Class B works XX times, accounting for XX% of this work
C-type work keywords XX times, accounting for XX% of this work
3) Keyword list analysis
Category A: AX1 (keyword): X times, accounting for X%;......; AX2 (keyword): XX times, accounting for X%
In Class A working keywords, analyze the number and proportion of each keyword appearing. B, C work keywords and so on. Output fixed format statements.
Category B: BX1 (keyword): XX times, accounting for X%;......; BX2 (keyword): XX times, accounting for X%
Category c: CX1 (keyword): XX times, accounting for X%;......; CX2 (keyword): XX times, accounting for X%
4) Text recognition record "Dialogue process)
XX (Account Manager): XXXXXXXXXX (Keywords are marked with sharp corners)
Use speech segmentation, speaker separation, speech recognition, voiceprint recognition and other technologies to output fixed format statements.
AX (not recognized):XXXXXXXX
xX (Account Manager): XXXXXXXXXX (Keyword Sharp-angle Bracket Identification)
BX(XX retailer-identified after later identification): XXXXXXXX
5) Pinyin recognition record "Dialogue process)
XX (Account Manager): XXXXXXXXXX (Keyword Sharp-angle Bracket Identification)
Use speech segmentation, speaker separation, acoustic model, speech model recognition, voiceprint recognition and other technologies to output fixed format statements.
AX (not recognized): XXXXXXX
XX (Account Manager): XXXXXXXXXX (Keyword Sharp-angle Bracket Identification)
Bx (XX retailer-identified after later identification): XXXXXXXX