DocuLingo

Automated Word-to-Word translation with full style preservation


DocuLingo is a translation engine for Word documents that uses OpenAI models while preserving the source document's styling, from bold runs to paragraph spacing. Written in Python 3.12+, it combines smart batching, retry logic, and a plug‑and‑play translator framework to support production‑ready localization workflows.
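
To make the plug‑and‑play translator idea concrete, here is a minimal sketch of an enum plus factory registry using only the standard library. Every name in it (`TranslatorKind`, `EchoTranslator`, `create_translator`, the registry) is hypothetical and chosen for illustration; it is not DocuLingo's actual API, and the echo backend merely stands in for a real provider.

```python
from __future__ import annotations

from enum import Enum
from typing import Callable, Protocol


class Translator(Protocol):
    """Interface that every backend is assumed to implement (hypothetical)."""

    def translate(self, texts: list[str], target_lang: str) -> list[str]: ...


class TranslatorKind(str, Enum):
    """Identifiers for available backends (hypothetical names)."""

    OPENAI = "openai"
    ECHO = "echo"  # stand-in backend so the sketch runs without network access


class EchoTranslator:
    """Toy backend: tags each text with the target language instead of translating."""

    def translate(self, texts: list[str], target_lang: str) -> list[str]:
        return [f"[{target_lang}] {text}" for text in texts]


# Registry mapping each kind to a zero-argument factory.
# A real OpenAI backend would be registered here in the same way.
_REGISTRY: dict[TranslatorKind, Callable[[], Translator]] = {
    TranslatorKind.ECHO: EchoTranslator,
}


def create_translator(kind: TranslatorKind | str) -> Translator:
    """Look up the backend for `kind` and build it."""
    kind = TranslatorKind(kind)  # accepts either the enum member or its string value
    try:
        factory = _REGISTRY[kind]
    except KeyError:
        raise NotImplementedError(f"No backend registered for {kind.value!r}") from None
    return factory()


translator = create_translator("echo")
print(translator.translate(["Hello"], "es"))
```

With this shape, supporting another provider means adding one enum member and one registry entry, while calling code keeps using `create_translator`.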


My Role in the Project

I am the project's creator and sole developer.


Technical Details and Interesting Aspects

Core Architecture

Area | Technology | Highlights
Settings | Pydantic Settings | Centralised env configuration (OPENAI_API_KEY)
Translators | Enum + factory pattern | Currently OpenAI; easily extendable to DeepL, Google, etc.
Word Processing | python‑docx | Handles single‑run and multi‑run paragraphs; preserves empty lines
Style Copier | Custom utilities | Copies font, bold, italic, underline, RGB colours, alignment, spacing, indentation, page breaks
API Client | OpenAI Chat (default gpt‑4o) | Smart segmentation (≤100 chunks), 5× auto‑retry, JSON validation
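
The API Client behaviour (segmentation, retries, JSON validation) can be sketched as follows. This is an illustration under stated assumptions, not the project's code: it reads "≤100 chunks" as at most 100 text segments per request and "5× auto‑retry" as up to five attempts per batch, and it takes a caller-supplied `send` function in place of the real OpenAI call. All function names here are hypothetical.

```python
from __future__ import annotations

import json
import time
from typing import Callable, Iterator

MAX_SEGMENTS = 100  # assumed meaning of "≤100 chunks": segments per request
MAX_ATTEMPTS = 5    # assumed meaning of "5× auto-retry": attempts per batch


def batched(items: list[str], size: int = MAX_SEGMENTS) -> Iterator[list[str]]:
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def parse_reply(raw: str, expected: int) -> list[str]:
    """Validate that the reply is a JSON array of `expected` strings."""
    data = json.loads(raw)  # raises json.JSONDecodeError (a ValueError) on bad JSON
    if not isinstance(data, list) or len(data) != expected:
        raise ValueError(f"expected a JSON array of {expected} items")
    if not all(isinstance(item, str) for item in data):
        raise ValueError("every item in the reply must be a string")
    return data


def translate_all(
    segments: list[str],
    send: Callable[[list[str]], str],  # hypothetical: sends one batch, returns raw JSON text
    base_delay: float = 0.0,           # seconds; grows linearly with the attempt number
) -> list[str]:
    """Translate all segments batch by batch, retrying batches whose reply is invalid."""
    translated: list[str] = []
    for batch in batched(segments):
        for attempt in range(1, MAX_ATTEMPTS + 1):
            try:
                translated.extend(parse_reply(send(batch), len(batch)))
                break  # batch accepted, move on to the next one
            except ValueError as exc:
                if attempt == MAX_ATTEMPTS:
                    raise RuntimeError(f"batch failed after {MAX_ATTEMPTS} attempts") from exc
                time.sleep(base_delay * attempt)
    return translated
```

Checking that the reply contains exactly as many strings as the batch is what lets each translation be written back to the paragraph or run it came from; a reply with a different count is treated as a failure and retried rather than risking misaligned text.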

Key Features


My Impact on the Project


Conclusion

DocuLingo demonstrates how thoughtful architecture and modern AI APIs can automate complex document translation while respecting author intent and styling. It reflects my commitment to building developer‑friendly tools that solve real productivity bottlenecks.


Leynier Gutiérrez González

All rights reserved © 2025
