
This is an audio version of the Wikipedia Article: https://en.wikipedia.org/wiki/Unicode 00:04:38 1 Origin and development 00:07:41 1.1 History 00:12:23 1.2 Architecture and terminology 00:13:33 1.2.1 Code point planes and blocks 00:14:45 1.2.2 General Category property 00:20:45 1.2.3 Abstract characters 00:23:02 1.3 Unicode Consortium 00:24:14 1.4 Versions 00:26:44 1.5 Scripts covered 00:29:30 2 Mapping and encodings 00:29:56 2.1 Unicode Transformation Format and Universal Coded Character Set 00:37:23 2.2 Ready-made versus composite characters 00:41:16 2.3 Ligatures 00:43:18 2.4 Standardized subsets 00:44:59 2.5 Code point lookup 00:45:46 3 Adoption 00:45:55 3.1 Operating systems 00:48:09 3.2 Input methods 00:49:09 3.3 Email 00:50:41 3.4 Web 00:52:54 3.5 Fonts 00:54:36 3.6 Newlines 00:56:30 4 Issues 00:56:40 4.1 Philosophical and completeness criticisms 01:00:04 4.2 Mapping to legacy character sets 01:02:57 4.3 Indic scripts 01:05:47 4.4 Combining characters 01:07:11 4.5 Anomalies 01:09:17 5 See also Listening is a more natural way of learning, when compared to reading. Written language only began at around 3200 BC, but spoken language has existed long ago. Learning by listening is a great way to: - increases imagination and understanding - improves your listening skills - improves your own spoken accent - learn while on the move - reduce eye strain Now learn the vast amount of general knowledge available on Wikipedia through audio (audio article). You could even learn subconsciously by playing the audio while you are sleeping! If you are planning to listen a lot, you could try using a bone conduction headphone, or a standard speaker instead of an earphone. Listen on Google Assistant through Extra Audio: https://assistant.google.com/services... Other Wikipedia audio articles at: https://www.youtube.com/results?searc... Upload your own Wikipedia articles through: https://github.com/nodef/wikipedia-tts Speaking Rate: 0.7672205641216903 Voice name: en-US-Wavenet-F "I cannot teach anybody anything, I can only make them think." - Socrates SUMMARY ======= Unicode is a computing industry standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard is maintained by the Unicode Consortium, and as of March 2019 the most recent version, Unicode 12.0, contains a repertoire of 137,993 characters covering 150 modern and historic scripts, as well as multiple symbol sets and emoji. The character repertoire of the Unicode Standard is synchronized with ISO/IEC 10646, and both are code-for-code identical. The Unicode Standard consists of a set of code charts for visual reference, an encoding method and set of standard character encodings, a set of reference data files, and a number of related items, such as character properties, rules for normalization, decomposition, collation, rendering, and bidirectional display order (for the correct display of text containing both right-to-left scripts, such as Arabic and Hebrew, and left-to-right scripts).Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including modern operating systems, XML, Java (and other programming languages), and the .NET Framework. Unicode can be implemented by different character encodings. The Unicode standard defines UTF-8, UTF-16, and UTF-32, and several other encodings are in use. The most commonly used encodings are UTF-8, UTF-16, and UCS-2, a precursor of UTF-16. UTF-8, the dominant encoding on the World Wide Web (used in over 92% of websites), uses one byte for the first 128 code points, and up to 4 bytes for other characters. The first 128 Unicode code points are the ASCII characters, which means that any ASCII text is also a UTF-8 text. UCS-2 uses two bytes (16 bits) for each character but can only encode the first 65,536 code points, the so-called Basic Multilingual Plane (BMP). With 1,114,112 code points on 17 planes being possible, and with over 137,000 code points defined so far, UCS-2 is only able to represent less than half of all encoded Unicode characters. Therefore, UCS-2 is outdated, though still widely used in software. UTF-16 extends UCS-2, by using the same 16-bit encoding as UCS-2 for the Basic Multilingual Plane, and a 4-byte encoding for the other planes. As long as it contains no code points in the reserved range U+D800–U+DFFF, a UCS-2 text is a valid UTF-16 text. UTF-32 (also referred to as UCS-4) uses four bytes for each character. Like UCS-2, the number of bytes per character is fixed, facilitating character indexing; but unlike UCS-2, UTF-32 is able to encode all Unicode code points. However, because each character uses four bytes, UTF-32 takes significantly more space than other encodings, a ...
Không có nhận xét nào:
Đăng nhận xét