What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??ぉ??ぃ??ぉ?⇒赴??ぉ??ぃ??ぉ?⇒赴^ 0011111100111111100000101010011100111111001111111000001010100001001111110011111110000010101001110011111110000001110010111001010110001011001111110011111110000010101001110011111100111111100000101010000100111111001111111000001010100111001111111000000111001011100101011000101101011110 3f3f82a73f3f82a13f3f82a73f81cb958b3f3f82a73f3f82a13f3f82a73f81cb958b5e
EUC-JP ??ぉ??ぃ??ぉ?⇒赴??ぉ??ぃ??ぉ?⇒赴^ 0011111100111111101001001010100100111111001111111010010010100011001111110011111110100100101010010011111110100010110011011100100111101011001111110011111110100100101010010011111100111111101001001010001100111111001111111010010010101001001111111010001011001101110010011110101101011110 3f3fa4a93f3fa4a33f3fa4a93fa2cdc9eb3f3fa4a93f3fa4a33f3fa4a93fa2cdc9eb5e
UTF-8 룶쥚ぉ룶쥚ぃ룶쥚ぉ룶⇒赴룶쥚ぉ룶쥚ぃ룶쥚ぉ룶⇒赴^ 11101011101000111011011011101100101001011001101011100011100000011000100111101011101000111011011011101100101001011001101011100011100000011000001111101011101000111011011011101100101001011001101011100011100000011000100111101011101000111011011011100010100001111001001011101000101101011011010011101011101000111011011011101100101001011001101011100011100000011000100111101011101000111011011011101100101001011001101011100011100000011000001111101011101000111011011011101100101001011001101011100011100000011000100111101011101000111011011011100010100001111001001011101000101101011011010001011110 eba3b6eca59ae38189eba3b6eca59ae38183eba3b6eca59ae38189eba3b6e28792e8b5b4eba3b6eca59ae38189eba3b6eca59ae38183eba3b6eca59ae38189eba3b6e28792e8b5b45e
UHC 룶쥚ぉ룶쥚ぃ룶쥚ぉ룶⇒赴룶쥚ぉ룶쥚ぃ룶쥚ぉ룶⇒赴^ 10001111101010111010001010001111101010101010100110001111101010111010001010001111101010101010001110001111101010111010001010001111101010101010100110001111101010111010001010100001110111011011100110001111101010111010001010001111101010101010100110001111101010111010001010001111101010101010001110001111101010111010001010001111101010101010100110001111101010111010001010100001110111011011100101011110 8faba28faaa98faba28faaa38faba28faaa98faba2a1ddb98faba28faaa98faba28faaa38faba28faaa98faba2a1ddb95e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
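
A conversion of this kind can be approximated in a few lines of Python. This is only a sketch and not the converter's actual code. The codec names are Python's, and mapping SJIS-WIN to "cp932" and UHC to "cp949" is an assumption, as are the helper names encode_row and encode_table. Characters a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the 3f bytes in the table.

```python
# Labels follow the table; the codec names on the right are Python's.
# Assumption: "cp932" stands in for SJIS-WIN and "cp949" for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("ぉ^").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.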