18 August 2020 | Manuel Augustin
In a previous article, I told you the story of how we fixed the problem of sentence tokenization in RTL languages. If that specific technical problem is all you were interested in, read no further! I’m not going into more detail about the technical solution. But there’s also a bigger lesson to be learned here, »
Read: "Splitting RTL texts into sentences, part 2"
18 August 2020 | Manuel Augustin
In this article, we’ll outline a solution to a problem we faced when splitting a text into sentences in RTL languages. These are languages written in a right-to-left script, such as Arabic, Farsi, Urdu, and Hebrew. You’ll learn about the technical implementation of sentence tokenization in the Yoast text analysis, and how we expanded »
Read: "Splitting RTL texts into sentences, part 1"