To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????n}???????????n{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 鼎訥??璋?雄?醫?┌n}鼎訥??璋?雄?醫?┌n{^ 100100110100001111100110011000110011111100111111111000001111011000111111100101110101100100111111111001111100111000111111100001001010000101101110011111011001001101000011111001100110001100111111001111111110000011110110001111111001011101011001001111111110011111001110001111111000010010100001011011100111101101011110 9343e6633f3fe0f63f97593fe7ce3f84a16e7d9343e6633f3fe0f63f97593fe7ce3f84a16e7b5e
EUC-JP 鼎訥??璋?雄?醫?┌n}鼎訥??璋?雄?醫?┌n{^ 110001011010010011101011110001000011111100111111111000001111100000111111110011011011101000111111111011101101000000111111101010001010001101101110011111011100010110100100111010111100010000111111001111111110000011111000001111111100110110111010001111111110111011010000001111111010100010100011011011100111101101011110 c5a4ebc43f3fe0f83fcdba3feed03fa8a36e7dc5a4ebc43f3fe0f83fcdba3feed03fa8a36e7b5e
UTF-8 鼎訥렎렱璋렢雄렪醫쾌┌n}鼎訥렎렱璋렢雄렪醫쾌┌n{^ 1110100110111100100011101110100010101000101001011110101110100000100011101110101110100000101100011110011110010010100010111110101110100000101000101110100110011011100001001110101110100000101010101110100110000110101010111110110010111110100011001110001010010100100011000110111001111101111010011011110010001110111010001010100010100101111010111010000010001110111010111010000010110001111001111001001010001011111010111010000010100010111010011001101110000100111010111010000010101010111010011000011010101011111011001011111010001100111000101001010010001100011011100111101101011110 e9bc8ee8a8a5eba08eeba0b1e7928beba0a2e99b84eba0aae986abecbe8ce2948c6e7de9bc8ee8a8a5eba08eeba0b1e7928beba0a2e99b84eba0aae986abecbe8ce2948c6e7b5e
UHC 鼎訥렎렱璋렢雄렪醫쾌┌n}鼎訥렎렱璋렢雄렪醫쾌┌n{^ 11110000101000111101001011101101100011101010010010001110101111101110110111110000100011101011001111101010101010011000111010111000111011001010001011000100111010001010011010100011011011100111110111110000101000111101001011101101100011101010010010001110101111101110110111110000100011101011001111101010101010011000111010111000111011001010001011000100111010001010011010100011011011100111101101011110 f0a3d2ed8ea48ebeedf08eb3eaa98eb8eca2c4e8a6a36e7df0a3d2ed8ea48ebeedf08eb3eaa98eb8eca2c4e8a6a36e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)