by 小可愛智智 » 9 May 2012, 23:39:45 (p#2311207)
by 中東呼吸中級呼綜合症 » 9 May 2012, 23:41:49 (p#2311209)
by 小可愛智智 » 9 May 2012, 23:45:26 (p#2311211)
田園都市線中央林間駅 :Please make sure characters you type are within the UTF-8 zone.
by 中東呼吸中級呼綜合症 » 9 May 2012, 23:50:47 (p#2311215)
miklcct :田園都市線中央林間駅 :Please make sure characters you type are within the UTF-8 zone.
ref: http://tools.ietf.org/html/rfc3629#page-4
by 小可愛智智 » 9 May 2012, 23:52:54 (p#2311217)
田園都市線中央林間駅 :miklcct :田園都市線中央林間駅 :Please make sure characters you type are within the UTF-8 zone.
ref: http://tools.ietf.org/html/rfc3629#page-4
It is inside the CJK Ideographs Extensions, thus not an acceptable character set and is out of UTF-8 zones.
by 中東呼吸中級呼綜合症 » 9 May 2012, 23:56:19 (p#2311221)
miklcct :田園都市線中央林間駅 :miklcct :田園都市線中央林間駅 :Please make sure characters you type are within the UTF-8 zone.
ref: http://tools.ietf.org/html/rfc3629#page-4
It is inside the CJK Ideographs Extensions, thus not an acceptable character set and is out of UTF-8 zones.
Then which zone of the Unicode character set is acceptable in this forum?
by 小可愛智智 » 9 May 2012, 23:58:40 (p#2311226)
田園都市線中央林間駅 :miklcct :田園都市線中央林間駅 :miklcct :U+244D3 can be represented in UTF-8 (UTF-8 covers the range U+0 to U+10FFFF)
ref: http://tools.ietf.org/html/rfc3629#page-4
It is inside the CJK Ideographs Extensions, thus not an acceptable character set and is out of UTF-8 zones.
Then which zone of the Unicode character set is acceptable in this forum?
As mentioned, only UTF-8 characters are accepted.
3. UTF-8 definition
UTF-8 is defined by the Unicode Standard [UNICODE]. Descriptions and
formulae can also be found in Annex D of ISO/IEC 10646-1 [ISO.10646]
In UTF-8, characters from the U+0000..U+10FFFF range (the UTF-16
accessible range) are encoded using sequences of 1 to 4 octets. The
only octet of a "sequence" of one has the higher-order bit set to 0,
the remaining 7 bits being used to encode the character number. In a
sequence of n octets, n>1, the initial octet has the n higher-order
bits set to 1, followed by a bit set to 0. The remaining bit(s) of
that octet contain bits from the number of the character to be
encoded. The following octet(s) all have the higher-order bit set to
1 and the following bit set to 0, leaving 6 bits in each to contain
bits from the character to be encoded.
The table below summarizes the format of these different octet types.
The letter x indicates bits available for encoding bits of the
character number.
Char. number range  |        UTF-8 octet sequence
   (hexadecimal)    |              (binary)
--------------------+---------------------------------------------
0000 0000-0000 007F | 0xxxxxxx
0000 0080-0000 07FF | 110xxxxx 10xxxxxx
0000 0800-0000 FFFF | 1110xxxx 10xxxxxx 10xxxxxx
0001 0000-0010 FFFF | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
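To make the table concrete, here is a rough Python sketch that looks up how many octets a given character number needs. The function name utf8_length is made up for this illustration and is not from the RFC; the sketch only checks the overall U+0000..U+10FFFF range and ignores everything else.

```python
def utf8_length(code_point: int) -> int:
    """Return the number of octets the table assigns to a character number.

    Hypothetical helper for illustration; only the four ranges from the
    table are checked.
    """
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("character number is outside U+0000..U+10FFFF")
    if code_point <= 0x7F:
        return 1  # 0xxxxxxx
    if code_point <= 0x7FF:
        return 2  # 110xxxxx 10xxxxxx
    if code_point <= 0xFFFF:
        return 3  # 1110xxxx 10xxxxxx 10xxxxxx
    return 4      # 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx


# U+244D3 falls in the last row of the table, so it needs four octets.
print(utf8_length(0x244D3))
```

By this reading, a character in the supplementary planes such as U+244D3 is inside the range the table covers.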
Encoding a character to UTF-8 proceeds as follows:
1. Determine the number of octets required from the character number
and the first column of the table above. It is important to note
that the rows of the table are mutually exclusive, i.e., there is
only one valid way to encode a given character.
2. Prepare the high-order bits of the octets as per the second
column of the table.
3. Fill in the bits marked x from the bits of the character number,
expressed in binary. Start by putting the lowest-order bit of
the character number in the lowest-order position of the last
octet of the sequence, then put the next higher-order bit of the
character number in the next higher-order position of that octet,
etc. When the x bits of the last octet are filled in, move on to
the next to last octet, then to the preceding one, etc. until all
x bits are filled in.
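The three steps above can be sketched in Python roughly as follows. This is only an illustration under my own naming (encode_utf8 is a hypothetical helper, not something from the RFC or the forum software). The surrogate check is not in the text quoted above; it comes from elsewhere in RFC 3629, which prohibits encoding U+D800..U+DFFF.

```python
def encode_utf8(code_point: int) -> bytes:
    """Encode one character number to UTF-8 following the three steps.

    Hypothetical helper for illustration only.
    """
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("character number is outside U+0000..U+10FFFF")
    if 0xD800 <= code_point <= 0xDFFF:
        # Assumption taken from RFC 3629 itself, not from the quoted text.
        raise ValueError("surrogate code points are not encoded in UTF-8")

    # Step 1: determine the number of octets from the first table column.
    if code_point <= 0x7F:
        return bytes([code_point])  # single octet, high-order bit is 0
    if code_point <= 0x7FF:
        count, lead = 2, 0b11000000
    elif code_point <= 0xFFFF:
        count, lead = 3, 0b11100000
    else:
        count, lead = 4, 0b11110000

    # Steps 2 and 3: start from the last octet, put the lowest 6 bits
    # under the 10xxxxxx prefix, then move to the preceding octet.
    octets = []
    remaining = code_point
    for _ in range(count - 1):
        octets.append(0b10000000 | (remaining & 0b00111111))
        remaining >>= 6
    # Whatever bits are left go into the x bits of the initial octet.
    octets.append(lead | remaining)
    return bytes(reversed(octets))


print(encode_utf8(0x244D3).hex())  # f0a49393
```

For U+244D3 this gives the four octets F0 A4 93 93, which matches what Python's built-in chr(0x244D3).encode("utf-8") produces.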
by 中東呼吸中級呼綜合症 » 10 May 2012, 00:26:11 (p#2311247)