19-04-2023 · Insight

Quant Chart: Lost in translation

Do you like watching old Asian movies from the 60s and 70s? Perhaps you’re a connoisseur of kung fu or monster movies, both popular genres from that era. If you are, and you live in a Western country, those movies will have been translated from their original Chinese or Japanese into a Western language, most likely English.

    Authors

  • Mike Chen - Head of Next Gen Research
  • Matthias Hanauer - Researcher
  • Nick Mutsaers - Researcher

If so, you may have noticed that when the actors speak, their mouths move for far longer than it takes to say the English translation, and you may even have wondered what you were missing. Of course, everyone knows that a lot of context and information – actors’ performance, accents, nuances and local culture references – gets lost in translation when a movie is dubbed. But have you ever wondered whether investment information also gets lost in translation?

Natural Language Processing (NLP), an application of artificial intelligence, is a popular tool that is revolutionizing quantitative finance and being applied to many types of texts. However, most NLP tools are developed for texts in English. Since English is not the only language spoken around the world [1], a popular approach to process non-English texts is to translate them into English, and then apply English NLP models to the translated texts.

In recent research, Robeco found that, just as in those old Asian movies, the translation-based approach described above also causes some information (alpha) to be lost in translation. When a local-language-based NLP model is applied to the local-language text, additional information (alpha) can be revealed and therefore harvested.

Take, for example, Chinese investment texts. The left-hand chart in Figure 1 shows the performance of factors built from Chinese and English-based NLP engines. The good news is that both are positive, so not all information is lost in translation. However, the right-hand chart in Figure 1 shows that only 50% of the top-quintile stocks from the Chinese NLP model would be classified in the top two quintiles under the English NLP model.

Figure 1: English translation versus Chinese original NLP output

Source: I/B/E/S, Refinitiv, Orbit Financial Technology, Robeco. The left panel of the figure displays the return spread between the top and bottom quintile portfolios based on the NLP sentiment score using the Chinese and the English language. The right panel of the figure displays the similarity in stock classification between the two signals. More specifically, it shows the percentage of top English NLP stocks classified in the corresponding quintiles based on the Chinese language. The investment universe consists of MSCI China A index constituents. The portfolios are equally weighted and rebalanced monthly. Both charts cover the sample period from January 2013 to December 2022.
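As an illustration of the portfolio construction described in the figure note, the sketch below computes a top-minus-bottom quintile return spread for equally weighted portfolios in a single month. It is a minimal sketch and not Robeco's implementation: the function names, the quintile-assignment rule and the ten-stock data set are all hypothetical.

```python
from statistics import mean


def quintile_labels(scores):
    """Assign each stock a quintile from 1 (lowest score) to 5 (highest score).

    Assumes at least five stocks, so that no quintile is empty.
    """
    ranked = sorted(scores, key=scores.get)  # ascending by sentiment score
    n = len(ranked)
    return {stock: (i * 5) // n + 1 for i, stock in enumerate(ranked)}


def quintile_spread(scores, next_month_returns):
    """Equally weighted return of the top quintile minus the bottom quintile."""
    labels = quintile_labels(scores)
    top = [next_month_returns[s] for s, q in labels.items() if q == 5]
    bottom = [next_month_returns[s] for s, q in labels.items() if q == 1]
    return mean(top) - mean(bottom)


# Hypothetical data for one month: sentiment scores and subsequent returns.
scores = {f"S{i}": i / 10 for i in range(10)}
returns = {"S0": -0.02, "S1": 0.00, "S2": 0.01, "S3": 0.00, "S4": 0.02,
           "S5": -0.01, "S6": 0.01, "S7": 0.02, "S8": 0.01, "S9": 0.03}

spread = quintile_spread(scores, returns)
print(f"Top-minus-bottom spread: {spread:.2%}")
```

With monthly rebalancing, this calculation would be repeated each month on fresh scores, and the monthly spreads would then be aggregated over the sample period.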

This imperfect overlap shows that the two models select different stocks. As with those old Asian movies from the 60s and 70s, information may be lost in translation. To fully grasp the nuances of a movie’s dialogue, it is worth watching the film in the original language, if possible. And to fully understand what is being communicated in an investment text, it may be worth reading the text in its original local language.
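The overlap comparison can be sketched in a similar way. The example below takes the top-quintile stocks under one signal and reports the fraction of them that fall in each quintile of a second signal. It is a sketch under stated assumptions: the function names and scores are hypothetical, and both score dictionaries are assumed to cover the same stocks.

```python
from collections import Counter


def quintile_labels(scores):
    """Assign each stock a quintile from 1 (lowest score) to 5 (highest score)."""
    ranked = sorted(scores, key=scores.get)  # ascending by sentiment score
    n = len(ranked)
    return {stock: (i * 5) // n + 1 for i, stock in enumerate(ranked)}


def top_quintile_overlap(scores_a, scores_b):
    """Fraction of signal A's top-quintile stocks in each quintile of signal B."""
    labels_a = quintile_labels(scores_a)
    labels_b = quintile_labels(scores_b)
    top_a = [s for s, q in labels_a.items() if q == 5]
    counts = Counter(labels_b[s] for s in top_a)
    return {q: counts.get(q, 0) / len(top_a) for q in range(5, 0, -1)}


# Hypothetical scores for ten stocks under a local-language and an English model.
local_scores = {f"S{i}": i / 10 for i in range(10)}
english_scores = {"S0": 0.1, "S1": 0.2, "S2": 0.3, "S3": 0.4, "S8": 0.5,
                  "S4": 0.6, "S5": 0.7, "S6": 0.8, "S7": 0.9, "S9": 1.0}

overlap = top_quintile_overlap(local_scores, english_scores)
for quintile, share in overlap.items():
    print(f"English quintile {quintile}: {share:.0%} of local top-quintile stocks")
```

If the two signals ranked stocks identically, the whole top quintile would land in quintile 5 of the other signal. Any weight in lower quintiles indicates that the models disagree on which stocks look most attractive.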

As technology advances, so do the opportunities for quantitative investors. By incorporating more data and leveraging advanced modelling techniques, we can develop deeper insights and enhance decision-making.

Footnote

1 English is spoken natively by only 400 million people around the world, or ~5% of the global population.

Important information This disclaimer applies to any documents and the verbal or written comments of any person in presentations or webinars on this website and taken together is referred to herein as the “Information”. The services to which the Information relate are NOT FOR RETAIL CLIENTS - The information contained in the Website is solely intended for professional investors, defined as investors which (1) qualify as professional clients within the meaning of the Markets in Financial Instruments Directive (MiFID), (2) have requested to be treated as professional clients within the meaning of the MiFID or (3) are authorized to receive such information under any other applicable laws and must not be relied or acted upon by any other persons. This Information does not constitute an offer to sell, or a solicitation of an offer to buy, any financial product, and may not be relied upon in connection with the purchase or sale of any financial product. You are cautioned against using this Information as the basis for making a decision to purchase any financial product. To the extent that you rely on the Information in connection with any investment decision, you do so at your own risk. The Information does not purport to be complete on any topic addressed. The Information may contain data or analysis prepared by third parties and no representation or warranty about the accuracy of such data or analysis is provided.
In all cases where historical performance is presented, please note that past performance is not a reliable indicator of future results and should not be relied upon as the basis for making an investment decision. Investors may not get back the amount originally invested. Neither Robeco Institutional Asset Management B.V. nor any of its affiliates guarantees the performance or the future returns of any investments. If the currency in which the past performance is displayed differs from the currency of the country in which you reside, then you should be aware that due to exchange rate fluctuations the performance shown may increase or decrease if converted into your local currency. Robeco Institutional Asset Management B.V. (“Robeco”) expressly prohibits any redistribution of the Information without the prior written consent of Robeco. The Information is not intended for distribution to, or use by, any person or entity in any jurisdiction or country where such distribution or use is contrary to law, rule or regulation. Certain information contained in the Information includes calculations or figures that have been prepared internally and have not been audited or verified by a third party. Use of different methods for preparing, calculating or presenting information may lead to different results. Robeco Institutional Asset Management UK Limited (“RIAM UK”) is authorised and regulated by the Financial Conduct Authority. RIAM UK, 30 Fenchurch Street, Part Level 8, London EC3M 3BD (FCA Reference No:1007814). The company is registered in England and Wales under Ref No. 15362605.

Robeco Institutional Asset Management B.V. is authorised as a manager of UCITS and AIFs by the Netherlands Authority for the Financial Markets and subject to limited regulation in the UK by the Financial Conduct Authority. Details about the extent of our regulation by the Financial Conduct Authority are available from us on request.