What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN ?厓∮W^?厓∮\}v?厓∮W^?厓∮\}vB 00111111111110101000110110000111100100110101011101011110001111111111101010001101100001111001001101011100011111010111011000111111111110101000110110000111100100110101011101011110001111111111101010001101100001111001001101011100011111010111011001000010 3ffa8d8793575e3ffa8d87935c7d763ffa8d8793575e3ffa8d87935c7d7642
EUC-JP ?厓?W^?厓?\}v?厓?W^?厓?\}vB 00111111100011111011010011000111001111110101011101011110001111111000111110110100110001110011111101011100011111010111011000111111100011111011010011000111001111110101011101011110001111111000111110110100110001110011111101011100011111010111011001000010 3f8fb4c73f575e3f8fb4c73f5c7d763f8fb4c73f575e3f8fb4c73f5c7d7642
UTF-8 룶厓∮W^룶厓∮\}v룶厓∮W^룶厓∮\}vB 1110101110100011101101101110010110001110100100111110001010001000101011100101011101011110111010111010001110110110111001011000111010010011111000101000100010101110010111000111110101110110111010111010001110110110111001011000111010010011111000101000100010101110010101110101111011101011101000111011011011100101100011101001001111100010100010001010111001011100011111010111011001000010 eba3b6e58e93e288ae575eeba3b6e58e93e288ae5c7d76eba3b6e58e93e288ae575eeba3b6e58e93e288ae5c7d7642
UHC 룶厓∮W^룶厓∮\}v룶厓∮W^룶厓∮\}vB 1000111110101011111001001110110110100010101100010101011101011110100011111010101111100100111011011010001010110001010111000111110101110110100011111010101111100100111011011010001010110001010101110101111010001111101010111110010011101101101000101011000101011100011111010111011001000010 8fabe4eda2b1575e8fabe4eda2b15c7d768fabe4eda2b1575e8fabe4eda2b15c7d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
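
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the charset labels to Python codec names (`latin_1`, `cp932`, `euc_jp`, `utf_8`, `cp949`) is an assumption, and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?" (0x3f) here. Python's codecs may differ from this tool's converter in edge cases, so results for unusual characters are not guaranteed to match the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin_1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf_8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あB"):
        print(label, bits, hex_string)
```

For the ASCII letter "B", every charset in this sketch yields the single byte 42 (binary 01000010), which matches the trailing 42 in each row of the table above.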