The role of a meeting secretary often faces various challenges: taking notes is time-consuming since people speak faster than they type; meeting minutes are delayed (sometimes taking up to two days for a one-hour meeting); reviewing audio recordings to finalize the document is exhausting; and collaboration becomes difficult when multiple secretaries are involved in the same session.
Key Features
1. High Accuracy
The system achieves an impressive 95.2% accuracy rate and has won First Prize at the VLSP Competitions in both 2018 and 2019 for Vietnamese Speech Recognition. It supports a wide range of regional accents — Northern, Central, and Southern Vietnam.
2. Fast Processing Speed
For pre-recorded audio files, the transcription time is only one-tenth of the audio duration. In live online meetings, text is displayed almost instantly as participants speak.
3. Speaker Voice Identification
The system can learn and recognize a speaker’s voice with just 5 minutes of training data, achieving over 90% accuracy in speaker identification.
4. Regional Accent Support
V-IDT effectively recognizes voices from all three main Vietnamese regions:
- North: full regional coverage
- Central: Da Nang, Hue, and nearby areas with moderate accents
- South: Ho Chi Minh City and surrounding provinces
5. Multi-Secretary Collaboration
Multiple secretaries can work simultaneously without data conflict. All results can be merged into a single consolidated report, and an administrator can review and approve the final version before release.
6. Video and Subtitle Processing
The system automatically generates time-synced subtitles for videos and supports precise content extraction and editing.
7. Convenient Sharing
Users can share entire meeting minutes or specific excerpts via access links, ensuring smooth information exchange.
8. Mobile Application
The mobile app allows users to record audio, which is automatically synced to the server. Users can view, share, edit, and highlight key segments directly from their mobile devices.
9. Additional Features
- Real-time transcription display.
- Effective recognition in noisy environments.
- Remote voice recognition.
- Supports multiple audio formats (MP3, WAV, etc.)
- Automatic text normalization (names, dates, numbers).
- Rich Vietnamese vocabulary of up to 7,000 words.
Deployment Models
1. On-Premise
Deployed on the client’s internal server infrastructure.
- Ensures absolute data security
- Clients have full control over system management and operations
2. All-in-One Solution
A compact, turnkey solution ideal for organizations seeking quick deployment.
- Comes with pre-installed software on a dedicated server device
- Portable and easy to use — perfect for business trips
- Guarantees complete data protection
- Simply plug in, connect to the internal network, and start using (Plug & Play)
Many government agencies, ministries, and departments are already using the V-IDT solution.
For more information, please contact:
Email: info@idt.org.vn