regular expressions library Thread poster: Luna Media
|
My colleagues and I are trying to build a library of regular expressions to help us fix machine translation errors automatically. Before we get too deep into learning and building it, we were wondering if such a thing already exists out there. Google searches did not help locate a "library regular expressions for localization" and it would be a bit time-consuming to find examples throughout the forum, which I'm sure there are plenty. Thank you very much! | | |
Machine versus machine? | Jun 16, 2024 |
Maybe I've misunderstood, but it sounds like you want to improve MT by using another machine?
Automatically replace certain terms with others? Would that improve anything?
More details please.
A list of 'regular expressions" would be endless. | | |
Dan Lucas United Kingdom Local time: 04:01 Member (2014) Japanese to English Handcraft them | Jun 16, 2024 |
Luna Media wrote:
My colleagues and I are trying to build a library of regular expressions to help us fix machine translation errors automatically.
Personally, I don't think a library or any other kind of prebuilt collection will be of much use to you, because you will have to inspect each case individually. It will depend very much on the language pair, the content of the text, and so on.
Regexes are powerful tools, but there are lots of edge cases and they can be destructive if deployed carelessly or indiscriminately. I use them fairly frequently and can testify that is very easy to make mistakes that ruin large sections of your document. This potential for disaster may be the origin of the following well-known observation by Jamie Zawinski:
Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems. If you really want to do this, I would abandon the idea of a pre-built library and just start crafting one example at a time, and testing very carefully (in a sandbox) as you go. That way you will at least understand what you are doing. If you're new to regexes and I suggest looking at a tool like RegexBuddy.
Regards,
Dan | | |
Luna Media United States English to Spanish + ... TOPIC STARTER
Thank you for all the suggestions and input. | | |
There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact
site staff »
regular expressions library
LinguaCore | AI Translation at Your Fingertips
The underlying LLM technology of LinguaCore offers AI translations of unprecedented quality. Quick and simple. Add a human linguistic review at the end for expert-level quality at a fraction of the cost and time.
More info » |
|
TM-Town | Manage your TMs and Terms ... and boost your translation business
Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.
More info » |
|