Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f
SJIS-WIN | 愼?弔峰??缺愼? | 1001110011000100001111111001001010100010100101011111010000111111001111111110001110011110100111001100010000111111 | 9cc43f92a295f43f3fe39e9cc43f
EUC-JP | 愼?弔峰??缺愼? | 1101100011000110001111111100010010100100110010101111011000111111001111111110010111111110110110001100011000111111 | d8c63fc4a4caf63f3fe5fed8c63f
UTF-8 | 愼렡弔峰렱곈缺愼렣 | 111001101000010010111100111010111010000010100001111001011011110010010100111001011011001110110000111010111010000010110001111010101011001110001000111001111011110010111010111001101000010010111100111010111010000010100011 | e684bceba0a1e5bc94e5b3b0eba0b1eab388e7bcbae684bceba0a3
UHC | 愼렡弔峰렱곈缺愼렣 | 111000111110010110001110101100101111000011000000110111001110100010001110101111101011000011101001110011001100000011100011111001011000111010110100 | e3e58eb2f0c0dce88ebeb0e9ccc0e3e58eb4

SJIS-WIN, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
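
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names paired with each charset label are assumptions (CP932 for SJIS-WIN follows the note above; cp949 is assumed for UHC), and `CHARSETS`, `to_bits`, `encode_rows`, and `SAMPLE` are hypothetical names. Characters a charset cannot represent are replaced with "?" (3f), as in the table. Python's mapping tables may differ from the ones this page uses, so individual bytes are not guaranteed to match.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # assumption: SJIS-WIN = CP932, per the note above
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: UHC treated as code page 949
]


def to_bits(raw: bytes) -> str:
    """Render each byte as 8 binary digits, concatenated without separators."""
    return "".join(f"{byte:08b}" for byte in raw)


def encode_rows(text: str):
    """Return (label, displayed text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS:
        # Unencodable characters become "?" (0x3f) instead of raising an error.
        raw = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip.
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, to_bits(raw), raw.hex()))
    return rows


# Sample input rebuilt from the UTF-8 hex string shown in the table.
SAMPLE = bytes.fromhex(
    "e684bceba0a1e5bc94e5b3b0eba0b1eab388e7bcbae684bceba0a3"
).decode("utf-8")

if __name__ == "__main__":
    for label, _shown, _bits, hex_str in encode_rows(SAMPLE):
        print(label, hex_str)
```

The sample mixes CJK ideographs with Hangul syllables, which is why the Japanese charsets and ISO-8859-1 fall back to "?" for some or all characters while UTF-8 and UHC encode the whole string.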