Information technology - Universal Coded Character Set (UCS) (Adopted ISO/IEC 10646:2017, fifth edition, 2017-12, including adopted amendment 1:2019 and amendment 2:2019)
Standards development within the Information Technology sector is harmonized with international standards development. Through the CSA Technical Committee on Information Technology (TCIT), Canadians serve as the SCC Mirror Committee (SMC) on ISO/IEC Joint Technical Committee 1 on Information Technology (ISO/IEC JTC1) for the Standards Council of Canada (SCC), the ISO member body for Canada and sponsor of the Canadian National Committee of the IEC. Also, as a member of the International Telecommunication Union (ITU), Canada participates in the ITU Telecommunication Standardization Sector (ITU-T), formerly the International Telegraph and Telephone Consultative Committee.
This Standard supersedes CAN/CSA-ISO/IEC 10646:15 (adopted ISO/IEC 10646:2014). At the time of publication, ISO/IEC 10646:2017, ISO/IEC Amendment 1:2019 and ISO/IEC Amendment 2:2019 are available from ISO and IEC in English only. CSA Group will publish the French versions when they become available from ISO and IEC.
This Standard has been formally approved, without modification, by the Technical Committee and has been developed in compliance with Standards Council of Canada requirements for National Standards of Canada. It has been published as a National Standard of Canada by CSA Group.
This International Standard specifies the Universal Coded Character Set (UCS). It is applicable to the representation, transmission, interchange, processing, storage, input, and presentation of the written form of the languages of the world as well as of additional symbols.
This International Standard
• specifies the architecture of this International Standard
• defines terms used in this International Standard
• describes the general structure of the UCS codespace
• specifies the Basic Multilingual Plane (BMP) of the UCS
• specifies supplementary planes of the UCS: the Supplementary Multilingual Plane (SMP), the Supplementary Ideographic Plane (SIP), the Tertiary Ideographic Plane (TIP), and the Supplementary Special-purpose Plane (SSP)
• defines a set of graphic characters used in scripts and the written form of languages on a world-wide scale
• specifies the names for the graphic characters and format characters of the BMP, SMP, SIP, TIP, SSP and their coded representations within the UCS codespace
• specifies the coded representations for control characters and private use characters
• specifies three encoding forms of the UCS: UTF-8, UTF-16, and UTF-32
• specifies seven encoding schemes of the UCS: UTF-8, UTF-16, UTF-16BE, UTF-16LE, UTF-32, UTF-32BE, and UTF-32LE
• specifies the management of future additions to this coded character set.
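The difference between the three encoding forms and the seven encoding schemes listed above can be illustrated with a short sketch. It is not part of the Standard: the codec names (`utf-16-be` and so on) are those of Python's standard library, and the helper names `encode_all` and `code_units` are hypothetical. An encoding form yields a sequence of code units (8, 16, or 32 bits wide), whereas an encoding scheme additionally fixes how those units are serialized into bytes.

```python
# Illustrative sketch only. The codec names below are Python's, assumed here
# to correspond to the seven encoding schemes named in the list above.
SCHEMES = ["utf-8", "utf-16", "utf-16-be", "utf-16-le",
           "utf-32", "utf-32-be", "utf-32-le"]


def encode_all(text: str) -> dict[str, bytes]:
    """Serialize text to bytes under each of the seven encoding schemes."""
    return {name: text.encode(name) for name in SCHEMES}


def code_units(text: str, form: str) -> list[int]:
    """Return the code units of text in one encoding form.

    Code units are abstract integers, so byte order does not matter here;
    the big-endian codec is used only as a convenient way to obtain them.
    """
    width = {"utf-8": 1, "utf-16": 2, "utf-32": 4}[form]
    codec = form if width == 1 else form + "-be"
    data = text.encode(codec)
    return [int.from_bytes(data[i:i + width], "big")
            for i in range(0, len(data), width)]


if __name__ == "__main__":
    sample = "A\u00E9\U0001F600"  # U+0041, U+00E9, U+1F600
    for form in ("utf-8", "utf-16", "utf-32"):
        print(form, [hex(u) for u in code_units(sample, form)])
    for name, data in encode_all(sample).items():
        print(f"{name:10s} {data.hex(' ')}")
```

In Python, the unmarked `utf-16` and `utf-32` codecs prepend a byte order mark and use the platform's native byte order when encoding, so their output can differ between machines; the explicitly `-be` and `-le` variants are deterministic.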
The UCS is an encoding system different from that specified in ISO/IEC 2022. The method to designate UCS from ISO/IEC 2022 is specified in 12.2.
A graphic character will be assigned only one code point in the standard, located either in the BMP or in one of the supplementary planes.
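As a rough illustration of how a single code point falls in exactly one plane, the following sketch derives the plane from the code point value. It relies on background knowledge that is not stated in this summary: the codespace runs from 0 to 10FFFF (hexadecimal), each plane spans 65 536 code points, and the BMP, SMP, SIP, TIP, and SSP are planes 0, 1, 2, 3, and 14 respectively. The function names are hypothetical.

```python
# Illustrative sketch only. Plane numbers are background knowledge about the
# UCS codespace, not values quoted from the text above.
PLANE_NAMES = {0: "BMP", 1: "SMP", 2: "SIP", 3: "TIP", 14: "SSP"}

MAX_CODE_POINT = 0x10FFFF


def plane_of(code_point: int) -> int:
    """Return the plane number (0 to 16) that contains code_point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError(f"outside the UCS codespace: {code_point:#x}")
    # Each plane holds 0x10000 code points, so the plane is the value
    # with its low 16 bits shifted away.
    return code_point >> 16


def plane_name(code_point: int) -> str:
    """Return the abbreviated plane name, or 'other' for unnamed planes."""
    return PLANE_NAMES.get(plane_of(code_point), "other")


if __name__ == "__main__":
    for cp in (0x0041, 0x1F600, 0x20000, 0xE0001):
        print(f"U+{cp:04X} is in plane {plane_of(cp)} ({plane_name(cp)})")
```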