Arabic Voice to Text — تحويل الصوت إلى نص بالعربية
Arabic poses challenges that no other language does for speech recognition: a formal written standard (الفصحى) that nobody speaks as a native tongue, six mutually distinct dialect families, and a script where letters physically connect and change shape depending on their position in a word. This page explains exactly how our tool handles all of it — and how to get the best results from your Arabic dictation.
العربية لغة فريدة من نوعها في التعرف الصوتي: لها معيار مكتوب رسمي لا أحد يتكلمه كلغة أم، وست مجموعات لهجية متمايزة، وخط يتصل فيه الحروف ويتغير شكلها حسب موقعها في الكلمة. هذه الصفحة تشرح بالتفصيل كيف تتعامل أداتنا مع كل هذا.
Arabic's Unique Challenge: Diglossia — ازدواجية اللغة
Arabic is the world's most prominent example of diglossia — the coexistence of a high prestige written form (Modern Standard Arabic / الفصحى) and regional spoken vernaculars (العامية) that differ from it grammatically, phonologically, and in vocabulary. No native speaker grows up speaking MSA at home; it is learned formally in school, like Latin was in medieval Europe.
This matters for voice recognition because speech recognition models are primarily trained on audio paired with written text — and written Arabic text is mostly MSA. When you speak Egyptian dialect (مصري) into a microphone, the model is matching your speech patterns against a corpus that is weighted toward formal broadcast Arabic. The mismatch explains why accuracy drops for colloquial speech.
Practical implication
If you need a transcription that you'll publish, present, or submit formally — dictate in MSA (الفصحى). If you're drafting a WhatsApp message or informal note — your dialect will work, but expect to correct more words, especially function words like "هيجي" (Egyptian) vs "راح يجي" (Gulf) for "he will come."
Dialect Accuracy by Region — دقة التحويل حسب اللهجة
Not all Arabic dialects perform equally. Here is an honest breakdown based on how well speech models handle each family, and why:
Egyptian (المصري) — Best Results
Egyptian Arabic is the dialect most speech models perform best on, for one reason: Egyptian media dominates the Arab world. Egyptian films, TV series, and YouTube channels have been producing subtitled/captioned content for decades, creating the largest dialect-specific training corpus. If you speak Egyptian, you'll see the fewest corrections needed.
Gulf (الخليجي) — Good Results
Gulf Arabic — spoken across Saudi Arabia, UAE, Kuwait, Qatar, Bahrain, and Oman — benefits from heavy investment in Arabic voice technology from regional tech companies. Distinctive sounds like the گ (gaf) in Kuwaiti/Saudi dialects can occasionally be misread, but overall accuracy is strong for everyday speech.
Levantine (الشامي) — Moderate Results
Levantine Arabic (Syria, Lebanon, Jordan, Palestine) is closer to MSA in phonology than Gulf or Maghrebi, which helps. The قاف (qaf) pronounced as a glottal stop (ء) in urban Levantine — e.g. Beirut — can confuse models. The bʼ- prefix for present tense (بيروح instead of يروح) may occasionally render incorrectly.
Maghrebi (المغربي) — Most Challenging
Moroccan Darija, Algerian Arabic, and Tunisian Arabic are phonologically furthest from MSA and heavily influenced by Tamazight (Berber) and French. Consonant clusters uncommon in MSA, French loanwords like "fhamti?" (from French "comprendre"), and distinct vowel reduction patterns mean accuracy suffers most here. For Maghrebi users, dictating in MSA gives significantly better results.
كيفية الاستخدام — How to Start
اختر "Arabic (العربية)" من قائمة اللغات — متصفح Chrome أو Edge يعطي أفضل دقة للعربية
Select Arabic from the language menu. Chrome or Edge on desktop gives best Arabic results.
انقر "Start 🎤" واسمح بالوصول للميكروفون
Click Start and allow microphone access when prompted.
للدقة القصوى: تكلم بالفصحى بسرعة طبيعية. للمراسلات اليومية: لهجتك تعمل جيداً
For maximum accuracy: speak MSA at a natural pace. For casual use: your dialect works well.
النص يظهر من اليمين إلى اليسار تلقائياً. انسخه مباشرة إلى Word أو واتساب أو أي تطبيق
Text renders RTL automatically. Copy directly to Word, WhatsApp, or any application.
Arabic Script Rendering — عرض الخط العربي
Arabic script has characteristics that make it fundamentally different from Latin transcription:
🔗 Connected Script & Letter Forms
Arabic letters have up to four forms depending on position: isolated (ب), initial (بـ), medial (ـبـ), final (ـب). The voice recognition engine outputs Unicode characters; correct rendering is handled by the browser. This works seamlessly — but if you paste transcribed Arabic into an old system that doesn't support Unicode properly, you may see disconnected letters. The fix is always at the destination, not the source.
⬅️➡️ Bidirectional Text (Bidi)
When your Arabic speech includes English brand names, numbers, or technical terms — which is extremely common in Gulf business Arabic and tech contexts — the output becomes bidirectional. "أستخدم ChatGPT لتلخيص ملفات PDF" mixes RTL Arabic with LTR English acronyms. Our tool uses the dir="rtl" attribute with Unicode bidi controls to render this correctly. In Word or Google Docs, you may need to set the paragraph direction to RTL manually.
ء Hamza Variants
Arabic has six hamza forms: ء أ إ ؤ ئ آ — each grammatically distinct but pronounced similarly. Voice recognition frequently normalises these to ا or ء. If your text will be formally published, check hamza placement manually. Common error: "مسألة" (mas'ala, an issue) may render as "مسالة" — visually similar but incorrect in formal Arabic. This is a limitation of all current Arabic ASR systems, not just ours.
🔢 Numerals: Eastern vs Western
Arabic uses two numeral systems: Western Arabic (0 1 2 3 — the ones used globally) and Eastern Arabic-Indic (٠ ١ ٢ ٣ — used in many Arab countries in printed text). When you dictate a number like "خمسة وعشرون" (25), output defaults to Western Arabic numerals (25) rather than (٢٥). This is correct for digital use but may not match your style guide if you're producing formal Arabic documents.
Code-Switching: When You Mix Arabic and English
Modern Arabic speakers — especially in the Gulf, Egypt, and Lebanon — routinely mix Arabic and English in the same sentence. This phenomenon, called code-switching, is particularly common in tech, business, and youth communication:
"أنا شغّال على deadline بكرة وعندي presentation للـmanagement"
"الـupdate نزل بس في bug في الـdashboard"
"حوّل الـfile لـPDF وابعته على الـWhatsApp"
The Web Speech API handles code-switching reasonably well for common English technical terms — it recognises that "deadline" and "presentation" are likely English words in an Arabic stream. However, less common English words may be transcribed phonetically in Arabic script (e.g. "ديدلاين"). If this happens, you can type the English word directly after the fact — the RTL/LTR switch in the text box is handled automatically.
Speaking Tips for Best Arabic Accuracy — نصائح للنطق الأمثل
✅ يحسّن الدقة
- • تكلم بالفصحى للمستندات الرسمية — الدقة تتحسن بنسبة 15-25%
- • أكمل الجملة كاملة قبل التوقف — لا تقطع في منتصف الجملة
- • نطق الحركات بوضوح (الفتحة والضمة والكسرة) يساعد في تمييز الكلمات المتشابهة
- • قُل "نقطة" أو "فاصلة" لإضافة علامات الترقيم
- • ابعد عن الأصوات الخلفية (صوت التلفزيون، ضجيج الشارع)
- • للأرقام: قُلها بالعربي الفصيح ("خمسة وعشرون") أو بالأرقام مباشرة
⚠️ مشاكل شائعة وحلولها
- • الهمزة مكتوبة غلط — راجعها يدوياً في المستندات الرسمية
- • كلمة إنجليزية مكتوبة بالعربي — اكتبها يدوياً بعد الإملاء
- • الأسماء الشخصية والأماكن — قد تحتاج تعديل خاصة غير المشهورة
- • اللهجة المغربية — تحدث بالفصحى للحصول على نتائج أفضل بكثير
- • الكلمات المشتركة بين لهجات مثل "شو/إيش/وش" — قد تُكتب بأشكال مختلفة
- • التشكيل — لا يظهر في الإخراج؛ أضفه يدوياً إن احتجت
Who Uses Arabic Voice to Text — من يستخدم الأداة
WhatsApp Power Users
The Arab world has among the highest WhatsApp usage globally. Egyptian and Gulf users dictate long messages rather than type — Arabic keyboard input on mobile is slow due to the script's complexity. Voice dictation to text is 3–4× faster than tapping on an Arabic soft keyboard.
University Students
Students at Egyptian, Saudi, and UAE universities use Arabic dictation for thesis drafts, research summaries, and lecture notes. Dictating in الفصحى produces ready-to-use academic Arabic — far faster than typing in a language many students are more comfortable speaking than writing.
Business Professionals (GCC)
Bilingual Gulf professionals drafting emails, meeting notes, and reports switch between Arabic and English constantly. Dictation handles code-switching naturally — speak the Arabic parts, type the technical English terms. Cuts document drafting time significantly.
Arabic Content Creators
YouTube Arabic is one of the fastest-growing content verticals. Creators use voice-to-text for video scripts, blog posts in Arabic, and social media captions. Speaking ideas aloud is faster than writing — then clean up the transcription rather than draft from scratch.
Legal & Medical Professionals
Arabic-speaking doctors and lawyers use dictation for case notes and documentation. For legal Arabic, MSA accuracy matters — dictate formally and review hamza placement carefully. Medical Arabic terminology (mostly from Greek/Latin roots) transcribes reliably in MSA.
Arabic Diaspora
Arabic speakers in Europe, North America, and Australia who grew up speaking dialect but need to write formal Arabic for family correspondence, official letters, or community work. Speaking in Arabic and getting MSA-formatted output bridges the spoken/written gap.
شاهد الأداة في العمل
تحويل عربي فعلي — فصحى ولهجات
سجّل مجانًا — 15 دقيقة هدية عند التسجيل
ابدأ الكتابة الصوتية →Arabic Voice Commands — أوامر الإملاء الصوتي
Say these words while dictating to add punctuation and formatting:
Punctuation / علامات الترقيم
| قُل / Say | يُضيف / Inserts |
| "نقطة" | . (full stop) |
| "فاصلة" | ، (Arabic comma) |
| "علامة استفهام" | ؟ (Arabic question mark) |
| "علامة تعجب" | ! |
| "نقطتان" | : |
| "قوس فتح / قوس إغلاق" | ( ) |
Formatting / التنسيق
| قُل / Say | يفعل / Action |
| "سطر جديد" | New line |
| "فقرة جديدة" | New paragraph |
| "امسح الكلمة الأخيرة" | Delete last word |
| "توقف" | Pauses recognition |
* Formatting commands depend on browser implementation. Chrome has best support.
Transcribe Arabic audio files — MP3, WAV, MP4
Upload recordings of Arabic lectures, meetings, or podcasts. Pro plan processes files up to 5 hours with timestamps. / ارفع ملفات MP3 أو WAV وحوّلها إلى نص عربي مع التوقيت.
الأسئلة الشائعة — Frequently Asked Questions
What is the difference between MSA and dialect accuracy, in numbers?
For Modern Standard Arabic (فصحى), you can expect word error rates of around 8–12% for clear speech in a quiet environment — meaning roughly 9 out of 10 words correct. For Egyptian dialect, expect 12–18% error rate. For Levantine and Gulf, 15–22%. For Maghrebi dialects (Moroccan Darija especially), error rates can reach 30–40% — at that point, MSA dictation followed by manual editing is faster. These figures are approximate and depend heavily on speaking clarity and background noise.
هل يعمل للإملاء الطبي والقانوني بالعربية؟
يعمل جيداً للإملاء الطبي والقانوني بشرط التحدث بالفصحى. المصطلحات الطبية ذات الأصول اللاتينية واليونانية (كالتهاب، الضغط، التشخيص) تُنسخ بدقة عالية. أما المصطلحات القانونية الفقهية المتخصصة كـ"الوصية" و"الحيازة" و"الاستئناف" فعادةً ما تُنسخ بشكل صحيح في الفصحى. راجع دائماً المستندات القانونية قبل توقيعها أو تقديمها.
Does the tool add tashkeel (harakat / vowel marks)?
No. Standard speech recognition output — across all providers, not just ours — produces unvocalised Arabic (بدون تشكيل), which is the norm for modern written Arabic. Adding tashkeel automatically would require a separate NLP layer and would introduce errors. If you need fully vocalised Arabic (for children's books, religious texts, or poetry), add the tashkeel manually after transcription using Word's Arabic tools or a dedicated Arabic text editor.
هل يعمل على الجوال في السعودية والإمارات ومصر؟
نعم. يعمل على Android (Chrome) وiPhone (Safari) في جميع الدول العربية. متصفح Chrome على Android يُعطي أفضل النتائج للعربية. على iPhone، Safari مدعوم لكن Chrome يبقى الأفضل. ميزة RTL تعمل تلقائياً على الجوال أيضاً — النص العربي يظهر بشكل صحيح على جميع الأجهزة.
I speak a mix of Arabic and English — will that work?
Yes, with limitations. The tool is set to Arabic recognition mode, so common English technical words (email, PDF, deadline, password, update) are usually rendered correctly in the Latin script within the Arabic text. Less common English words may be phonetically transcribed in Arabic characters — e.g. "ميتينج" for "meeting." Simply overtype these after you finish dictating. For heavy code-switching, some users prefer to dictate the Arabic portions and type the English terms.
هل يُحفظ صوتي أو يُرسل لجهة خارجية؟
لا. الأداة المجانية تستخدم Web Speech API المدمجة في المتصفح — الصوت يُعالَج على مستوى المتصفح مباشرة ولا يمر بخوادمنا. خطة Pro التي تستخدم رفع الملفات ترسل الملف لمعالجته على الخادم، لكن الملف يُحذف فور اكتمال التحويل. لا نحتفظ بأي تسجيل صوتي.
Can I use this to transcribe Quran recitation?
The tool can transcribe Quranic Arabic text, but with important caveats: Tajweed rules (elongation, assimilation, nasalisation) are performance features of recitation that can confuse speech recognition. The output will be the text without tashkeel or Tajweed markings. For Quran study apps or educational tools, a purpose-built Quran recognition system will outperform a general-purpose Arabic ASR tool. That said, for simple verification of what is being recited, the general accuracy is usable.
أدوات ذات صلة
ابدأ تحويل الصوت إلى نص الآن
فصحى أو عامية — بدون تسجيل، مجاني تماماً.
ابدأ الآن →يعمل في المتصفح — Chrome أو Edge أو Safari