Transliteration
Installation
# you can use a Conda environment
pip install --extra-index-url "https://oauth2accesstoken:$(gcloud auth print-access-token)@glsdk.gdplabs.id/gen-ai-internal/simple/" python-dotenv gllm-intl# you can use a Conda environment
pip install --extra-index-url "https://oauth2accesstoken:$(gcloud auth print-access-token)@glsdk.gdplabs.id/gen-ai-internal/simple/" python-dotenv gllm-intlFOR /F "tokens=*" %T IN ('gcloud auth print-access-token') DO pip install --extra-index-url "https://oauth2accesstoken:%T@glsdk.gdplabs.id/gen-ai-internal/simple/" python-dotenv gllm-intlWhat is Transliteration?
Examples:
Original
Script
Transliterated
Target Script
Why Transliterate Text?
1. Search & Indexing
2. URL Slugs & Identifiers
3. Data Integration
4. Display Fallbacks
5. Cross-Script Communication
Quick Start
Supported Scripts
Supported Scripts:
Script
Example
Description
Common Script Pairs:
Basic Transliteration
transliterate() - Main Function
transliterate() - Main FunctionSpecifying Source Script
Japanese Script Conversion
Unicode Characters Preserved
ASCII Conversion
to_ascii() - Convert Any Script to ASCII
to_ascii() - Convert Any Script to ASCIICase Preservation
How Case Preservation Works
Advanced Features
Reusable Transliterators
Thread Safety
Common Use Cases
Use Case 1: URL Slug Generation
Use Case 2: Search Index Creation
Use Case 3: Name Normalization
Use Case 4: Multi-Script Form Validation
Use Case 5: Display with Romanization
Use Case 6: File Name Sanitization
Best Practices
1. Specify Source Script for Accuracy
2. Use to_ascii() for System Compatibility
to_ascii() for System Compatibility3. Cache Transliterators for Performance
4. Combine with Normalization
5. Handle Mixed Scripts Gracefully
6. Test with Real Data
7. Document Transliteration Scheme
Last updated
Was this helpful?