Turning Bilingual Lexicography Upside Down: Improving Quality and Productivity with New Methods and Technology  <span class="so-article-trans-title" dir="auto"> Translated title: &apos;n Omwenteling in tweetalige leksikografie: Die verbetering van kwaliteit en produktiwiteit met nuwe metodes en tegnologie. Hierdie is &apos;n </span>

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

This is a report from the real world. It informs about the outcome of a project, which the author conducted during a months-long research stay at the Danish company Ordbogen where he integrated its research and development (R) team. The first part of the project was to test machine translation and find out to what extent it is usable in the compilation of bilingual lexicographical databases. The hypothesis was that the technology was not yet mature. But surprisingly, it turned out that the accuracy rate is already so high that it is worth considering how to implement it. The second part of the project aimed at further developing an idea formulated by Fuertes-Olivera et al (2018) on how to invert a dictionary without losing semantic content. The new vision is to compile a monolingual L2 database, bilingualize it to an L2-L1 database using machine translation, and then invert the relationship between L2 lemmata and L1 equivalents using the L1 definitions of the L2 lemmata as the axis. The third part of the project was to test this idea using a specially designed ad hoc program. The program automatically uploads relevant data from existing lexicographical databases, translates L2 definitions and example sentences into L1, suggests adequate L1 equivalents, and eventually inverts the relationship between the two languages. It worked, but the methodology still needs further refinement to be implementable on a large scale. The report concludes by listing some of the remaining challenges and defining the new role of the lexicographer in this type of project.

Translated abstract

verslag uit die praktyk Daar word verslag gelewer oor die uitkoms van 'n projek, wat die outeur gedurende 'n maandelange navorsingstydperk by die Deense maatskappy Ordbogen aangevoer het waartydens hy die navorsings- en ontwikkeling- (N)-span geïntegreer het. Die eerste deel van die projek het die toets van masjienvertaling behels en om die bruikbaarheid daarvan in die saam-stel van tweetalige leksikografiese databasisse te bepaal. Die hipotese was dat die tegnologie nog nie gevorderd genoeg was nie. Dit het egter, verrassend genoeg, geblyk dat die akkuraatheidsyfer reeds so hoog was dat dit die moeite werd was om die implementering daarvan te oorweeg. Die tweede deel van die projek het die verdere ontwikkeling van 'n idee, geformuleer deur Fuertes-Olivera et al. (2018) oor die omskakeling van 'n woordeboek sonder verlies van semantiese inhoud, ten doel gehad. Die nuwe visie is om 'n eentalige L2-databasis saam te stel, dan met behulp van masjienvertaling te omskep in 'n L2-L1-databasis, en daarna die verhouding tussen L2-lemmata en L1-ekwivalente om te skakel deur die L1-definisies van die L2-lemmata as die spil te gebruik. Die derde deel van die projek was die toets van hierdie idee met 'n spesiaal ontwerpte ad hoc-program. Hierdie program laai outomaties relevante data vanuit leksikografiese databasisse, vertaal L2-definisies en -voorbeeldsinne in L1, stel gepaste L1-ekwivalente voor, en skakel uiteindelik die verhouding tussen die twee tale om. Dit was geslaagd, maar die metodologie moet nog verder verfyn word voordat dit op groot skaal geïmplementeer kan word. Die verslag word afgesluit met die lys van sommige van die oorblywende uitdagings en met die definiëring van die nuwe rol van die leksikograaf in hierdie tipe projek.

Related collections

Most cited references 23

Record: found
Abstract: not found
Book: not found

Dictionary of Lexicography

R R K Hartmann, Gregory James (1998)

0 comments Cited 14 times – based on 0 reviews

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

L2 Writing Assistants and Context-Aware Dictionaries: New Challenges to Lexicography

Peter Sepstrup, Kasper Fisker, Sven Tarp (2017)

Dictionaries are increasingly integrated into other tools designed to assist the reading, writing and translation of texts. Write Assistant is a newly developed tool aimed at assisting people writing in a second language. It feeds on big data taken in from corpora and digital dictionaries. The paper discusses the philosophy behind the tool, the techniques applied, its empirical basis and functionality, as well as the extent to which it helps its users. It shows how the tool makes it possible to shorten and even skip some phases in the traditional information-search process and allows its user to maintain the focus on the message to be written without the need to consult external information resources. The paper shows how the underpinning technology gives birth to a new type of dictionary that is context-aware and provides a more personalised user service. But it also indicates that future dictionaries need to be conceptionally adapted to the specific tool in order to optimize the service. All this poses new challenges to lexicography.