Second-Level Reference Label Generation Rules

ICANN has developed reference second-level Internationalized Domain Name (IDN) tables in a machine-readable format, known as Label Generation Rules (LGRs), that registry operators can consult when designing their own IDN tables. ICANN org will use these reference LGRs when reviewing IDN tables submitted for use with generic top-level domains (gTLDs).

The reference LGRs were developed following guidelines that have been reviewed by the community. Each LGR is provided below in XML format, along with a more readable HTML version. See the Overview and Summary document for further details about these LGRs.
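
To illustrate what "machine-readable" means in practice, the sketch below reads a code point repertoire from an LGR and checks whether every character of a label belongs to it. It assumes the XML follows the LGR format defined in RFC 7940 (namespace `urn:ietf:params:xml:ns:lgr-1.0`). The embedded `SAMPLE_LGR` document and the helper names `load_repertoire` and `label_in_repertoire` are hypothetical and are not taken from any ICANN reference LGR. Variants, multi-code-point sequences, and whole-label evaluation rules, which real LGRs can contain, are deliberately ignored here.

```python
import xml.etree.ElementTree as ET

# Namespace of the RFC 7940 LGR XML format.
NS = {"lgr": "urn:ietf:params:xml:ns:lgr-1.0"}

# Hypothetical, heavily simplified LGR used only for illustration.
SAMPLE_LGR = """\
<lgr xmlns="urn:ietf:params:xml:ns:lgr-1.0">
  <meta>
    <version>1</version>
    <language>und-Latn</language>
  </meta>
  <data>
    <char cp="002D"/>
    <range first-cp="0030" last-cp="0039"/>
    <range first-cp="0061" last-cp="007A"/>
    <char cp="00E9"/>
  </data>
</lgr>
"""


def load_repertoire(xml_text: str) -> set[int]:
    """Collect the single code points and ranges listed under <data>."""
    root = ET.fromstring(xml_text)
    data = root.find("lgr:data", NS)
    repertoire: set[int] = set()
    if data is None:
        return repertoire
    for char in data.findall("lgr:char", NS):
        cps = char.get("cp", "").split()
        if len(cps) == 1:  # sequences of several code points are skipped in this sketch
            repertoire.add(int(cps[0], 16))
    for rng in data.findall("lgr:range", NS):
        first = int(rng.get("first-cp"), 16)
        last = int(rng.get("last-cp"), 16)
        repertoire.update(range(first, last + 1))
    return repertoire


def label_in_repertoire(label: str, repertoire: set[int]) -> bool:
    """Return True if every character of the label is in the repertoire."""
    return all(ord(ch) in repertoire for ch in label)


if __name__ == "__main__":
    rep = load_repertoire(SAMPLE_LGR)
    print(label_in_repertoire("caf\u00e9", rep))    # e with acute is listed in the sample
    print(label_in_repertoire("na\u00efve", rep))   # i with diaeresis is not listed
```

A repertoire check like this is only a first step; a full evaluation of a label against an LGR would also apply its variant and whole-label rules.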

If you have questions or feedback regarding these reference LGRs, please send an email to [email protected].

Name | Language/Script | Date Finalized | LGR Documents | Additional Information
Arabic | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 13 January 2021 (HTML, XML). View public comment materials.
Arabic | Script | 22 April 2021 | HTML, XML | View public comment materials.
Bangla (Bengali) | Script | 15 December 2020 | HTML, XML | View public comment materials.
Belarusian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 19 December 2016 (HTML, XML). View public comment materials.
Bosnian (Cyrillic) | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Bosnian (Latin) | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Bulgarian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Chinese | Language | 15 December 2020 | HTML, XML | View public comment materials. The 10 October 2016 version (HTML, XML). View the public comment materials.
Danish | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Devanagari | Script | 15 December 2020 | HTML, XML | View public comment materials.
English | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Ethiopic | Script | 15 December 2020 | HTML, XML | View public comment materials.
Finnish | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
French | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Georgian | Script | 15 December 2020 | HTML, XML | View public comment materials.
German | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Gujarati | Script | 15 December 2020 | HTML, XML | View public comment materials.
Gurmukhi | Script | 15 December 2020 | HTML, XML | View public comment materials.
Hebrew | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 22 April 2021 (HTML, XML). View public comment materials.
Hebrew | Script | 22 April 2021 | HTML, XML | View public comment materials.
Hindi | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 15 December 2020 (HTML, XML). View public comment materials.
Hungarian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Icelandic | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Italian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Kannada | Script | 15 December 2020 | HTML, XML | View public comment materials.
Khmer | Script | 15 December 2020 | HTML, XML | View public comment materials.
Korean | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Lao | Script | 22 April 2021 | HTML, XML | Archive: Version 1, 15 December 2020 (HTML, XML). View public comment materials.
Latvian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Lithuanian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Macedonian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Malayalam | Script | 15 December 2020 | HTML, XML | View public comment materials.
Montenegrin | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Norwegian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Oriya | Script | 15 December 2020 | HTML, XML | View public comment materials.
Polish | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Portuguese | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Russian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Serbian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Sinhala | Script | 22 April 2021 | HTML, XML | View public comment materials.
Spanish | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Swedish | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 10 October 2016 (HTML, XML). View public comment materials.
Tamil | Script | 15 December 2020 | HTML, XML | View public comment materials.
Telugu | Script | 15 December 2020 | HTML, XML | View public comment materials.
Thai | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 15 December 2020 (HTML, XML). View public comment materials.
Ukrainian | Language | 18 May 2021 | HTML, XML | Archive: Version 1, 19 December 2016 (HTML, XML). View public comment materials.

Internationalized Domain Name (IDN)

IDNs are domain names that include characters used in the local representation of languages that are not written with the twenty-six letters of the basic Latin alphabet "a-z". An IDN can contain Latin letters with diacritical marks, as required by many European languages, or may consist of characters from non-Latin scripts such as Arabic or Chinese. Many languages also use digits other than the European "0-9". The basic Latin alphabet together with the European-Arabic digits are, for the purpose of domain names, termed "ASCII characters" (ASCII = American Standard Code for Information Interchange). These are also included in the broader range of "Unicode characters" that provides the basis for IDNs.

The "hostname rule" requires that all domain names of the type under consideration here are stored in the DNS using only the ASCII characters listed above, with the one further addition of the hyphen "-". The Unicode form of an IDN therefore requires special encoding before it is entered into the DNS.

The following terminology is used when distinguishing between these forms. A domain name consists of a series of "labels" (separated by "dots"). The ASCII form of an IDN label is termed an "A-label". All operations defined in the DNS protocol use A-labels exclusively. The Unicode form, which a user expects to be displayed, is termed a "U-label".

The difference may be illustrated with the Hindi word for "test", परीका, appearing here as a U-label would (in the Devanagari script). A special form of "ASCII compatible encoding" (abbreviated ACE) is applied to this to produce the corresponding A-label: xn--11b5bs1di.

A label that includes only ASCII letters, digits, and hyphens is termed an "LDH label". Although the definitions of A-labels and LDH labels overlap, a name consisting exclusively of LDH labels, such as "icann.org", is not an IDN.
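
The conversion between U-labels and A-labels described above can be tried out with a short sketch. It relies on Python's built-in `idna` codec, which implements the older IDNA 2003 rules, so it should be read as an illustration of the encoding step and not as a validator for current registry policy. The helper names `to_a_label`, `to_u_label`, and `is_ldh_label` are hypothetical.

```python
import string

# Characters permitted by the "hostname rule": ASCII letters, digits, hyphen.
LDH_CHARS = set(string.ascii_letters + string.digits + "-")


def to_a_label(u_label: str) -> str:
    """Encode a single label to its ASCII (A-label) form."""
    return u_label.encode("idna").decode("ascii")


def to_u_label(a_label: str) -> str:
    """Decode an A-label back to its Unicode (U-label) form."""
    return a_label.encode("ascii").decode("idna")


def is_ldh_label(label: str) -> bool:
    """Return True if the label uses only ASCII letters, digits, and hyphens."""
    return bool(label) and all(ch in LDH_CHARS for ch in label)


if __name__ == "__main__":
    # The example label from the text above, written with escapes
    # (U+092A U+0930 U+0940 U+0915 U+093E).
    hindi = "\u092a\u0930\u0940\u0915\u093e"
    a_label = to_a_label(hindi)
    print(a_label)                       # xn--11b5bs1di
    print(to_u_label(a_label) == hindi)  # True
    print(is_ldh_label(a_label))         # True: an A-label is made of LDH characters
    print(is_ldh_label(hindi))           # False
    print(to_a_label("icann"))           # icann (pure ASCII labels are unchanged)
```

The overlap mentioned in the text is visible here: the A-label passes the LDH check, yet only the `xn--` prefix marks it as the encoded form of an IDN label.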